Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.
Home Page:https://akashe.io/blog/2020/10/14/policy-gradient-methods/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool