RyanRizzo96 / Energy-Based-Hindsight-Experience-Prioritization

Exploring different buffer sampling techniques to improve Hindisght Experience Replay on continuous control robotic application tasks. Continous action spaces & sparse rewards.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Energy-Based-Prioritization

Comparison betwween Hindsight Experience Replay and Energy-Based Hindisght Experience Prioritization, averaged across 5 random seeds each, trained on 16 CPUs. (1 epoch = 40,000 iterations)

Results improve on state-of-the-art HER implementation by Andrychowiz et.al. (2017)

image

image


Supported by the AWS Cloud Credits for Research program.

About

Exploring different buffer sampling techniques to improve Hindisght Experience Replay on continuous control robotic application tasks. Continous action spaces & sparse rewards.


Languages

Language:Python 98.8%Language:Shell 1.2%