pytorch-policy-gradient-example
Train an agent for CartPole-v0 using naive Policy Gradient.
Inspired by Andrej Karpathy's blog.
Code partly from Pytorch DQN Tutorial
Solved in 500 episodes (Avg Reward):
A toy example of Policy Gradient implemented in Pytorch
Train an agent for CartPole-v0 using naive Policy Gradient.
Inspired by Andrej Karpathy's blog.
Code partly from Pytorch DQN Tutorial
Solved in 500 episodes (Avg Reward):
A toy example of Policy Gradient implemented in Pytorch
MIT License