Center for Human-Compatible AI

HumanCompatibleAI

Organization data from Github https://github.com/HumanCompatibleAI

CHAI seeks to develop the conceptual and technical wherewithal to reorient the general thrust of AI research towards provably beneficial systems.

https://humancompatible.ai

@HumanCompatibleAI

Center for Human-Compatible AI's repositories

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonMIT1625 15 349

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

Language:Jupyter NotebookMIT883 20 74

adversarial-policies

Find best-response to a fixed policy in multi-agent RL

Language:PythonMIT290 10 22

human_aware_rl

Code for "On the Utility of Learning about Humans for Human-AI Coordination"

Language:Python109 6 17

evaluating-rewards

Library to compare and evaluate reward functions

Language:PythonApache-2.067 7 5

tensor-trust

A prompt injection game to collect data for robust ML research

Language:PythonBSD-2-Clause65 6 164

overcooked-demo

Web application where humans can play Overcooked with AI agents.

Language:JavaScript59 6 19

seals

Benchmark environments for reward modelling and imitation learning algorithms.

Language:PythonMIT46 9 14

tensor-trust-data

Dataset for the Tensor Trust project

Language:Jupyter Notebook45 4 2

eirli

An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21

Language:Python36 8 2

ranking-challenge

Testing ranking algorithms to improve social cohesion

Language:Python30 7 7

leela-interp

Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"

Language:Jupyter NotebookGPL-3.02500

nn-clustering-pytorch

Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.

Language:Python6 30

recon-email

Script for automatically creating the reconnaissance email.

Language:HTML5 20

reward-preprocessing

Preprocessing reward functions to make them more interpretable

Language:Python5 30

multiagent-competition

Code for the paper "Emergent Complexity via Multi-agent Competition"

Language:Python4 60

assistance-games

Supporting code for Assistance Games as a Framework paper

Language:PythonMIT3 70

reducing-exploitability

Language:PythonMIT3 2 4

stable-baselines3

PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.

Language:PythonMIT3 10

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language:PythonMIT2 30

ranking-challenge-perspective

Prosocial Ranking Challenge Perspective Ranker

Language:Jupyter NotebookMIT100

reward-function-interpretability

Language:Jupyter Notebook1 3 5

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonMIT100

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonMIT1 20

katago-driver-bug-repro

Docker files to help reproduce bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/211270/3

Language:Dockerfile050

pytorch-summary

Model summary in PyTorch similar to `model.summary()` in Keras

Language:PythonMIT020

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonApache-2.0020

rc-submission-civirank

PRC: Civirank submission

000

rc-submission-dante

PRC: Testing ranking algorithms to improve social cohesion

Language:JavaScript000

sgf-viewer

A simple webpage that can visualize a sgf string encoded as a url fragment.

Language:CSS030