Yilingeacc / reinforcement-learning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

reinforcement learning

reinforcement learning algorithms implementation with pytorch

reference:

https://github.com/AI4Finance-LLC/ElegantRL (DRL algo impl)

https://github.com/starry-sky6688/StarCraft (MARL algo impl)

content

1.Tabular:

  • MazeEnv: my gmy-like environment, for tabular algos

  • MonteCarlo

  • off-policy MonteCarlo (with important sampling)

  • Sarsa

  • QLearning

  • DoubleQLearning

  • n-step Sarsa

  • Sarsa(lambda)

2.Deep Q Network

  • Deep Q Network

  • DDQN(Double DQN)

  • Dueling DQN

  • D3QN(Dueling DDQN)

3.Policy Gradient

  • REINFORCE

  • REINFORCE with baseline

4.Deterministic Policy Gradient

  • DDPG(Deep Deterministic Policy Gradient)

  • TD3(Twin Delayed DDPG)

5.Actor Critic

  • PPO(Proximal Policy Optimization) (PPO-Clip)

6.Multi-Agent RL

(Env: Multi-Agent partical world)

  • Qmix

About


Languages

Language:Python 100.0%