1jsingh / rl_pong

Train a RL agent to play Pong using Proximal Policy Optimization (PPO)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

About

Train a RL agent to play Pong using Proximal Policy Optimization (PPO)

Output demo

The player on the left is normal computer player while the one on the right is the implemented RL agent.

Using REINFORCE

output_demo_reinforce

Using PPO

output_demo_ppo

About

Train a RL agent to play Pong using Proximal Policy Optimization (PPO)

License:MIT License


Languages

Language:Jupyter Notebook 99.5%Language:Python 0.5%