akashe/DeepReinforcementLearning

deep-rl-implementations algorithms pytorch-implementation lunarlander-v2 pendulum-v0 vpg sac ddpg td3 ppo ppo-pytorch

Deep RL algorithms implemented using Pytorch

Algo list:

Article on deeper Look into policy gradients

Experimental Results:

Algorithm	Discrete Env: LunarLander-v2	Continuous Env: Pendulum-v0
DQN		-
VPG		-
DDPG	-
TD3	-
SAC	-
PPO	-

Usage:

Just run the file/algorithm directly. There is no common structures between algorithms as I implemented them as I learnt them. Different algorithms are inspired from different sources.

Resources:

Future projects:

If time available I will add a simple program for elevator using RL.
Better graphs

About

Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.

https://akashe.io/blog/2020/10/14/policy-gradient-methods/

deep-rl-implementations algorithms pytorch-implementation lunarlander-v2 pendulum-v0 vpg sac ddpg td3 ppo ppo-pytorch

Languages

Language:Python 74.6%Language:Jupyter Notebook 25.4%