Deep Deterministic Policy Gradient on PyTorch |
---|
Overview |
====== |
The is the implementation of Deep Deterministic Policy Gradient (DDPG) using PyTorch. Part of the utilities functions such as replay buffer and random process are from keras-rl repo. Contributes are very welcome. |
Dependencies |
====== |
* Python 3.4 |
* PyTorch 0.1.9 |
Run |
====== |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
TODO |