Center for Human-Compatible AI's repositories
overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
adversarial-policies
Find the best response to a fixed policy in multi-agent RL
human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
evaluating-rewards
Library to compare and evaluate reward functions
tensor-trust
A prompt injection game to collect data for robust ML research
overcooked-demo
Web application where humans can play Overcooked with AI agents.
tensor-trust-data
Dataset for the Tensor Trust project
ranking-challenge
Testing ranking algorithms to improve social cohesion
leela-interp
Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
nn-clustering-pytorch
Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.
recon-email
Script for automatically creating the reconnaissance email.
reward-preprocessing
Preprocessing reward functions to make them more interpretable
multiagent-competition
Code for the paper "Emergent Complexity via Multi-agent Competition"
assistance-games
Supporting code for the "Assistance Games as a Framework" paper
stable-baselines3
PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
ranking-challenge-perspective
Prosocial Ranking Challenge Perspective Ranker
rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
katago-driver-bug-repro
Docker files to help reproduce the bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/211270/3
pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
rc-submission-civirank
PRC: Civirank submission
rc-submission-dante
PRC: Testing ranking algorithms to improve social cohesion
sgf-viewer
A simple webpage that can visualize an SGF string encoded as a URL fragment.