Implementing HER with Replay Buffer/Reverb

Question

lhendre opened this issue a year ago · comments

Hi,
I'm running some experiments on a project that uses Acme. We want to run additional experiments with HER (Hindsight Experience Replay). I have been working on adding it, both with our own infrastructure and by trying to use Reverb, but I have been running into issues. Are you aware of any examples that implement HER with Acme?

Lucas Hendren · Answer 1 · Sun Jul 23 2023 17:32:37 GMT+0800 (China Standard Time)

As a quick addition: for the "with our own infrastructure" approach, we are creating our own replay buffer and using it in our own environment loop. There are some small modifications in other files to support it, but that's where the bulk of the work is done. For Reverb, I was looking at modifying the DQN agent
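
To make the custom-buffer route concrete, here is a minimal sketch of HER's "future" relabeling strategy in plain Python. It does not use Acme or Reverb, and it is not taken from either library. Every name in it (`Transition`, `sparse_reward`, `her_relabel`, `HindsightBuffer`) is hypothetical, and it assumes a goal-conditioned environment that reports the goal actually achieved after each step and uses a sparse 0/-1 reward.

```python
import random
from collections import deque
from dataclasses import dataclass, replace
from typing import Callable, Deque, List, Optional, Sequence, Tuple

Goal = Tuple[float, ...]


@dataclass(frozen=True)
class Transition:
    # Hypothetical container; field names are illustrative, not Acme's.
    observation: Tuple[float, ...]
    action: int
    reward: float
    next_observation: Tuple[float, ...]
    achieved_goal: Goal  # goal actually reached after taking the action
    desired_goal: Goal   # goal the policy was conditioned on


def sparse_reward(achieved: Goal, desired: Goal) -> float:
    """Assumed sparse reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if achieved == desired else -1.0


def her_relabel(
    episode: Sequence[Transition],
    reward_fn: Callable[[Goal, Goal], float] = sparse_reward,
    k: int = 4,
    rng: Optional[random.Random] = None,
) -> List[Transition]:
    """Return the episode plus k hindsight copies of every transition.

    Each copy swaps the desired goal for a goal achieved at the same or a
    later step of the same episode ("future" strategy) and recomputes the
    reward against that substituted goal.
    """
    rng = rng or random.Random()
    out: List[Transition] = []
    for t, tr in enumerate(episode):
        out.append(tr)  # keep the original transition
        for _ in range(k):
            future = episode[rng.randrange(t, len(episode))]
            new_goal = future.achieved_goal
            out.append(
                replace(
                    tr,
                    desired_goal=new_goal,
                    reward=reward_fn(tr.achieved_goal, new_goal),
                )
            )
    return out


class HindsightBuffer:
    """Fixed-capacity FIFO buffer that relabels whole episodes on insert."""

    def __init__(self, capacity: int, k: int = 4, seed: Optional[int] = None):
        self._storage: Deque[Transition] = deque(maxlen=capacity)
        self._k = k
        self._rng = random.Random(seed)

    def add_episode(self, episode: Sequence[Transition]) -> None:
        self._storage.extend(her_relabel(episode, k=self._k, rng=self._rng))

    def sample(self, batch_size: int) -> List[Transition]:
        # Uniform sampling with replacement.
        return [self._rng.choice(self._storage) for _ in range(batch_size)]

    def __len__(self) -> int:
        return len(self._storage)
```

In an environment loop built along these lines, the transitions of an episode would be collected in a list and passed to `add_episode` once the episode ends, since relabeling needs to see the goals achieved later in the episode. The learner would then condition on `desired_goal` from the sampled transition, for example by concatenating it with the observation.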