Actor-Critic Reinforcement Learning

This project intends to provide a documented and extensible implementation of the A2C and ACKTR algorithms by OpenAI.

Based on the paper by Wu, Mansimov, Liao, Grosse, and Ba (2017): https://arxiv.org/pdf/1708.05144.pdf
Original implementation: https://github.com/openai/baselines

Documentation

The API documentation and a short guide can be found on Read the Docs.

The following dependencies need to be installed besides TensorFlow and NumPy (click links for further details):

$ pip install gym

KFAC for TensorFlow. You need the latest version (0.1.1), which currently is not hosted on PyPI. Install with:

$ pip install git+https://github.com/tensorflow/kfac

To use the Atari environments you need:

$ pip install atari-py

$ pip install opencv-python

This project is only tested on Linux with Python 3.6.5.

Run the following to train an Atari model (see a2c_acktr.py for further details):

$ python -m actorcritic.examples.atari.a2c_acktr

Implementation of A2C and ACKTR in TensorFlow.

MIT License

Language:Python 100.0%