MIMI: Mutual Information-Maximizing Interface

MIMI is an algorithm for training an interface to map user command signals to system actions through unsupervised human-in-the-loop reinforcement learning.
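To make the "mutual information" in the name concrete, here is a minimal sketch of a plug-in mutual information estimate between paired discrete samples, such as user commands and the actions an interface chose for them. It is an illustration only: the function name, the discrete setting, and the choice of variables are assumptions, and this is not the estimator used in the mimi package.

```python
import math
from collections import Counter


def empirical_mutual_information(xs, ys):
    """Plug-in estimate of I(X; Y) in bits from paired discrete samples.

    Hypothetical helper for illustration; not part of the mimi package.
    """
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("xs and ys must be non-empty and the same length")
    n = len(xs)
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    count_x = Counter(xs)         # marginal counts of x
    count_y = Counter(ys)         # marginal counts of y
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with counts
        mi += (c / n) * math.log2(c * n / (count_x[x] * count_y[y]))
    return mi
```

Under this toy measure, an interface whose actions are fully determined by the user's commands scores high, while one whose actions ignore the commands scores zero.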

Usage

Clone mimi into your home directory (~)
Download data.zip and decompress it into mimi/
Set up an Anaconda virtual environment with conda create -n mimienv python=3.6
Install dependencies with pip install -r requirements.txt and pip install pyglet==1.5.11
Replace your_install_dir/gym/envs/box2d/lunar_lander.py with deps/box2d/lunar_lander.py
Install the mimi package with python setup.py install
The Jupyter notebooks in mimi/notebooks provide an entry point to the codebase, where you can experiment with the environments and reproduce the figures from the paper.
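The lunar_lander.py replacement step above requires knowing where gym is installed. The sketch below shows one way to locate that directory and swap the file while keeping a backup. It assumes your_install_dir is the directory that contains the installed gym package; the helper names find_package_dir, replace_with_backup, and install_lunar_lander are hypothetical and not part of mimi.

```python
import importlib.util
import os
import shutil


def find_package_dir(package_name):
    """Return the directory of an installed package without importing it."""
    spec = importlib.util.find_spec(package_name)
    if spec is None or not spec.submodule_search_locations:
        raise ImportError("package not found: " + package_name)
    return list(spec.submodule_search_locations)[0]


def replace_with_backup(src, dst):
    """Copy src over dst, saving the original as dst + '.bak' once."""
    backup = dst + ".bak"
    if os.path.exists(dst) and not os.path.exists(backup):
        shutil.copy2(dst, backup)
    shutil.copy2(src, dst)
    return dst


def install_lunar_lander(repo_dir):
    """Assumed layout: repo_dir/deps/box2d/lunar_lander.py, as in the step above."""
    src = os.path.join(repo_dir, "deps", "box2d", "lunar_lander.py")
    dst = os.path.join(find_package_dir("gym"), "envs", "box2d", "lunar_lander.py")
    return replace_with_backup(src, dst)
```

With the mimienv environment active, calling install_lunar_lander(os.path.expanduser("~/mimi")) would perform the replacement, and the .bak file makes it easy to restore the stock environment afterwards.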

Citation

If you find this software useful in your work, we kindly request that you cite the following paper:

@article{mimi2022,
  title={First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization},
  author={Reddy, Siddharth and Levine, Sergey and Dragan, Anca D.},
  journal={arXiv preprint arXiv:2205.12381},
  year={2022}
}

Latent Space Exploration (Experimental)

Explore the latent space of a generative model of MNIST images using hand gestures:

Clone this repo
Set dvae_dir in mimi/utils.py
Download the MNIST dataset to mimi/data/mnist/mnist.npz
Run notebooks/mnist.ipynb
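Before running the notebook, it can help to confirm that the MNIST file is where the steps above expect it and to see which arrays it contains. The sketch below relies only on the fact that an .npz file is a zip archive of .npy files; the function name list_npz_arrays is hypothetical, and the array names the notebook expects are not stated here, so none are assumed.

```python
import os
import zipfile


def list_npz_arrays(path):
    """Return the sorted array names stored in an .npz archive."""
    if not os.path.isfile(path):
        raise FileNotFoundError(path)
    suffix = ".npy"
    with zipfile.ZipFile(path) as archive:
        # Each array is stored as a member named "<array name>.npy"
        return sorted(
            name[: -len(suffix)]
            for name in archive.namelist()
            if name.endswith(suffix)
        )
```

For example, list_npz_arrays("mimi/data/mnist/mnist.npz"), run from the directory that contains mimi/, lists the stored arrays, or raises FileNotFoundError if the download step was skipped.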

About

Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"

MIT License