gkswamy98 / pillbox

Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.

Home Page:https://gokul.dev/mmil/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pillbox 💊

Contains PyTorch implementation of the AdVIL, AdRIL, and DAeQuIL algorithms.

Setup

To install dependencies, run:

conda env create -f environment.yml

Running Experiments

To train an expert, run:

python experts/train.py -e env_name

To train a learner, run:

python learners/train.py -a algo_name -e env_name -n num_runs

This package supports training via:

  • Behavioral Cloning
  • AdVIL
  • SQIL
  • GAIL
  • AdRIL

We also support a comparison of the following algorithms that require an interactive expert on both OpenAI Gym and a custom environment:

  • DAgger
  • DAeQuIL

To explore these algorithms and environments, run:

jupyter notebook

and open up learners/imm.ipynb.

Visualizing Results

Run:

jupyter notebook

and open up vis.ipynb.

Citing our Work

@article{swamy2021moments,
  author       = {Gokul Swamy and Sanjiban Choudhury and J. Andrew Bagnell and Zhiwei Steven Wu},
  title        = {Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap},
  conference   = {Proceedings of the 38th International Conference on Machine Learning},
  url          = {https://arxiv.org/abs/2103.03236},
}

About

Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.

https://gokul.dev/mmil/


Languages

Language:Jupyter Notebook 97.5%Language:Python 2.5%