Learning of feature points without additional supervision improves reinforcement learning from images

This is a PyTorch implementation of the FPAC method proposed in the paper "Learning of feature points without additional supervision improves reinforcement learning from images" by Rinu Boney, Alexander Ilin, and Juho Kannala.

Dependencies

The main dependencies are PyTorch, NumPy, and dm_control.

Install the required dependencies by creating an anaconda environment from conda_env.yml:

conda env create -f conda_env.yml

and then activate the installed fpac environment:

conda activate fpac

Training FPAC on the DeepMind Control Suite requires a valid MuJoCo installation. Refer to https://github.com/deepmind/dm_control#requirements-and-installation for instructions on installing MuJoCo.
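
Before starting a long training run, it can help to confirm that dm_control and MuJoCo work together. The following is a minimal sketch of such a check, not part of this repository: the function name check_dm_control is hypothetical, and the cartpole swingup task is used only because it is one of the benchmark tasks listed below.

```python
def check_dm_control(domain="cartpole", task="swingup"):
    """Load a dm_control task and take one step; raises if the setup is broken."""
    # Imported lazily so that a failure points at the dm_control/MuJoCo
    # installation rather than at this file's import.
    import numpy as np
    from dm_control import suite

    env = suite.load(domain_name=domain, task_name=task)
    spec = env.action_spec()
    env.reset()
    # A zero action is enough to exercise the physics once.
    time_step = env.step(np.zeros(spec.shape, dtype=spec.dtype))
    # Return the observation names so the caller can see that stepping worked.
    return sorted(time_step.observation.keys())
```

Calling check_dm_control() inside the activated fpac environment should return a list of observation names; an ImportError or a MuJoCo error at this point suggests the installation needs attention before running train.py.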

Experiments on PlaNet Benchmark

The FPAC results reported in the paper for the six tasks of the PlaNet benchmark can be reproduced by running:

python train.py domain=ball_in_cup task=catch relative_xy=False
python train.py domain=cartpole task=swingup
python train.py domain=cheetah task=run train_episodes=1000
python train.py domain=finger task=spin
python train.py domain=reacher task=easy lr=1e-3
python train.py domain=walker task=walk train_episodes=1000
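
To run the six commands above one after another, a small launcher script may be convenient. The sketch below is an illustration rather than part of the repository: the RUNS table, build_command and main are hypothetical names, and only the key=value overrides are taken from the commands listed above. It assumes train.py is in the current directory.

```python
import shlex
import subprocess
import sys

# The six PlaNet benchmark runs, with the overrides from the commands above.
RUNS = [
    {"domain": "ball_in_cup", "task": "catch", "relative_xy": "False"},
    {"domain": "cartpole", "task": "swingup"},
    {"domain": "cheetah", "task": "run", "train_episodes": "1000"},
    {"domain": "finger", "task": "spin"},
    {"domain": "reacher", "task": "easy", "lr": "1e-3"},
    {"domain": "walker", "task": "walk", "train_episodes": "1000"},
]


def build_command(overrides, script="train.py"):
    """Turn a dict of overrides into an argument list for the training script."""
    return [sys.executable, script] + [f"{k}={v}" for k, v in overrides.items()]


def main(dry_run=True):
    """Print each command and, unless dry_run is set, run it to completion."""
    for overrides in RUNS:
        cmd = build_command(overrides)
        print(shlex.join(cmd))
        if not dry_run:
            # check=True stops the sweep at the first failing run.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    main(dry_run=True)
```

With dry_run=True the script only prints the commands; switching it to False runs them sequentially, which can take a long time.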

MIT License
