Hindsight Experience Replay (HER)

This repository contains a tensorflow HER implementation and a bit flipping environment as described in OpenAI's paper

The implementation includes :

In Hindsight Experience Replay.ipynb :
1. A DQN and a DDQN agent (which also work on other traditional gym environments)
2. A bit flipping environment
3. Pre-trained models for 30-bits, 40-bits and 50-bits flipping environments
In ChaseEnv_DDPG.ipynb :
1. A DDPG agent
2. A ChaseEnv environment, where a chaser is initialized at a random position in a 2d plane and has to reach a goal in another random position within a certain threshold.

Benchmarks

Check the "Training" cell to adjust training parameters and enable/disable HER.

Here is a link to a robot arm reach environment created in Unity, trained with ML-Agents.

This environment is trained using DDPG with and without HER, and the comparison is plotted. DDPG+HER performs better.

A tensorflow implementation of hindsight experience replay

Language:Jupyter Notebook 100.0%