Reinforcement Learning
This repository will contain a number of pytorch implementations of RL algorithms.
PPO
PPO is implemented in the rl.py file. Working examples include ppo_cartpole.py, ppo_maze.py, and ppo_doom_hallway.py.
Holodeck Maze World
0 epochs:
70 epochs: