(Synchronous Multi-Actor) Advantage Actor Critic
- Restricted to single core multi-actor for simple concise code
- PPO
- TD(n)
- git clone https://github.com/0xC0DEF/A2C
- cd A2C
- open Snake.ipynb and run all cell (start training)
- open and run Test.ipynb to test learning agent (You can use Test.ipynb during training)