ShengtongZhu / reinforcement-learning

Reinforcement Learning Examples Of Policy Gradients, PPO+GAE, and DDQN Using OpenAI Gym and PyTorch

Home Page:https://medium.com/geekculture/a-simple-guide-to-reinforcement-learning-with-the-super-mario-bros-environment-495a13974a54

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Reinforcement Learning Examples

medium Python3.8.6 PyTorch1.8.1

Pong environment

Animation

Policy Gradients
Checkpoint weights


Lunar Lander environment

Animation

Deep Q-Network
Checkpoint weights

Policy Gradients
Checkpoint weights


Cartpole environment

Animation

Policy Gradients
Checkpoint weights

Deep Q-Network
Checkpoint weights


Mario environment

Animation


Policy Gradients
Checkpoint weights

Plot of average reward per 10 episodes


Double Deep Q-Network
Checkpoint weights

Plot of average reward per 10 episodes


PPO+GAE
Checkpoint weights

Plot of average reward per 10 episodes


Highway environments

video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

Double Deep Q-Network
Checkpoint weights


video.mp4

PPO+GAE
Checkpoint weights


PyBullet Walker2D environment

video.mp4

PPO+GAE
Checkpoint weights

Plot of average reward per 50 episodes

About

Reinforcement Learning Examples Of Policy Gradients, PPO+GAE, and DDQN Using OpenAI Gym and PyTorch

https://medium.com/geekculture/a-simple-guide-to-reinforcement-learning-with-the-super-mario-bros-environment-495a13974a54

License:Apache License 2.0


Languages

Language:Python 100.0%