1jsingh / rl_pong

Train an RL agent to play Pong using Proximal Policy Optimization (PPO)

About

The player on the left is the built-in computer opponent, while the player on the right is the trained RL agent.

MIT License

Languages: Jupyter Notebook 99.5%, Python 0.5%