There are 0 repository under generalized-advantage-estimation topic.
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
A collection of Reinforcement Learning implementations with PyTorch
Phasic-Policy-Gradient
Recurrent Policies for Handling Partially Observable Environments
An implementation from the state-of-the-art family of reinforcement learning algorithms Proximal Policy Optimization using normalized Generalized Advantage Estimation and optional batch mode training. The loss function incorporates an entropy bonus.
Example PPO implementation with ReLAx
Example TRPO implementation with ReLAx