boscotsang / random-network-distillation

Code for the paper "Exploration by Random Network Distillation"

Home Page: https://arxiv.org/abs/1810.12894

Status: Archive (code is provided as-is, no updates expected)

Yuri Burda*, Harri Edwards*, Amos Storkey, Oleg Klimov
*equal contribution

OpenAI
University of Edinburgh

Installation and Usage

The following command should train an RND agent on Montezuma's Revenge:

python run_atari.py --gamma_ext 0.999

To use more than one GPU or machine, launch the script with MPI. For example, mpiexec -n 8 python run_atari.py --num_env 128 --gamma_ext 0.999 should start 8 processes with 128 environments each, so experience is collected from 1024 parallel environments on an 8-GPU machine.
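The environment count in the MPI example is the number of MPI processes multiplied by the --num_env value. The sketch below only illustrates that arithmetic. The helper name total_parallel_envs is hypothetical and is not part of this repository, and it assumes each MPI process runs its own --num_env environments, as the example above implies.

```python
def total_parallel_envs(num_workers: int, envs_per_worker: int) -> int:
    """Return the number of environments across all MPI processes.

    Assumption: every process started by mpiexec runs its own
    `envs_per_worker` environments (the value passed as --num_env).
    This helper is illustrative and does not exist in the repository.
    """
    if num_workers < 1 or envs_per_worker < 1:
        raise ValueError("both arguments must be positive integers")
    return num_workers * envs_per_worker


# mpiexec -n 8 python run_atari.py --num_env 128 --gamma_ext 0.999
print(total_parallel_envs(num_workers=8, envs_per_worker=128))  # 1024
```

To target a different total, adjust either the -n value given to mpiexec or the --num_env value so that their product matches.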
