keon / policy-gradient

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Policy Gradient

Minimal implementation of Stochastic Policy Gradient Algorithm in Keras

Pong Agent

pg

This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.

score

About

Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras

License:MIT License


Languages

Language:Python 100.0%