wumo / drl

Deep reinforcement leanring algorithms implemented in PyTorch

drl

Deep reinforcement leanring algorithms implemented in PyTorch.

My contributions are:

New implementations of PPO and VPG according to spinningup.
Optimize the replay buffer of DQN to reduce the memory footprint (from 16GB memory requiremnt to 1.8GB).

Deep reinforcement leanring algorithms implemented in PyTorch

Language:Python 100.0%