Abhipanda4 / PPO-PyTorch

Implementation of Proximal Policy Optimization (PPO)

This is a PyTorch implementation of Proximal Policy Optimization (PPO) as described in this paper.

The implementation in this repo served as a reference for this one.
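
For readers new to PPO, the core of the algorithm is its clipped surrogate objective. The snippet below is a minimal, framework-free sketch of that loss for a small batch, written in plain Python so the arithmetic is easy to follow. It is not taken from this repository's code: the function name `ppo_clip_loss`, its parameter names, and the default `clip_eps=0.2` are assumptions chosen for illustration, and a PyTorch version would compute the same quantities on tensors.

```python
import math


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Illustrative clipped surrogate loss (hypothetical helper, not the repo's API).

    new_log_probs: log pi_new(a|s) for each sampled action
    old_log_probs: log pi_old(a|s) recorded when the data was collected
    advantages:    advantage estimate for each sample
    clip_eps:      clipping range epsilon (0.2 is an assumed default)
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio r = pi_new / pi_old, computed from log-probabilities.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to the interval [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    # Negate the mean objective so that minimizing the loss maximizes the objective.
    return -total / len(advantages)
```

When the new and old policies agree, every ratio is 1 and the loss reduces to the negated mean advantage. When the ratio moves outside the clipping range in the direction that would increase the objective, the clipped term caps that sample's contribution.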

To run a demo, clone the repo and use the command: python simulate.py

The training plots are shown below:

[Figures: reward plot, actor loss plot, critic loss plot]
