Center for Human-Compatible AI's repositories
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
adversarial-policies
Find best-response to a fixed policy in multi-agent RL
human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
evaluating-rewards
Library to compare and evaluate reward functions
overcooked-demo
Web application where humans can play Overcooked with AI agents.
tensor-trust
A prompt injection game to collect data for robust ML research
ranking-challenge
Testing ranking algorithms to improve social cohesion
tensor-trust-data
Dataset for the Tensor Trust project
learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
overcooked-hAI-exp
Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)
nn-clustering-pytorch
Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.
recon-email
Script for automatically creating the reconnaissance email.
multiagent-competition
Code for the paper "Emergent Complexity via Multi-agent Competition"
reward-preprocessing
Preprocessing reward functions to make them more interpretable
assistance-games
Supporting code for Assistance Games as a Framework paper
stable-baselines3
PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
katago-driver-bug-repro
Docker files to help reproduce bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/211270/3
pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
sgf-viewer
A simple webpage that can visualize a sgf string encoded as a url fragment.
slack-diskbot
low disk space alerts posted to Slack