gail gan imitation-learning off-policy pytorch reinforcement-learning

SAM: Sample-efficient Adversarial Mimic

PyTorch implementation of our work: "Sample-Efficient Imitation Learning via Generative Adversarial Nets". A TensorFlow implementation is also available at sam-tf.

Published in AISTATS 2019 | arXiv link | Video demos | Expert demonstrations

How to

Launching scripts are available in /tasks. To run a task, use:

python spawner.py \
    --config tasks/sam_fixed_local_mujoco.yaml \
    --no-sweep \
    --call \
    --visdom_server=<visdom_server> \
    --visdom_port=<visdom_port> \
    --visdom_username=<visdom_username> \
    --visdom_password=<visdom_password>

Visdom can be disabled (and the associated options can therefore be omitted) by setting the argument enable_visdom to false in the YAML configuration file. The command triggers the creation of a tmux session in which jobs run in distinct windows. For example, if the configuration file specifies 3 environments and a number of random seeds equal to 4, the tmux session will have 12 windows, each containing its unique pair of environment and random seed.

Acknowledgments

Interaction simulated with the openai/gym API.
Some utilities were inspired from openai/baselines.

About

PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"

gail gan imitation-learning off-policy pytorch reinforcement-learning

Languages

Language:Python 99.7%Language:Shell 0.3%