Center for Human-Compatible AI (HumanCompatibleAI)

Center for Human-Compatible AI

HumanCompatibleAI

Geek Repo

CHAI seeks to develop the conceptual and technical wherewithal to reorient the general thrust of AI research towards provably beneficial systems.

Home Page:https://humancompatible.ai

Github PK Tool:Github PK Tool

Center for Human-Compatible AI's repositories

imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Language:PythonLicense:MITStargazers:1136Issues:18Issues:329

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

Language:Jupyter NotebookLicense:MITStargazers:635Issues:19Issues:57

adversarial-policies

Find best-response to a fixed policy in multi-agent RL

Language:PythonLicense:MITStargazers:264Issues:14Issues:21

human_aware_rl

Code for "On the Utility of Learning about Humans for Human-AI Coordination"

evaluating-rewards

Library to compare and evaluate reward functions

Language:PythonLicense:Apache-2.0Stargazers:60Issues:8Issues:5

overcooked-demo

Web application where humans can play Overcooked with AI agents.

seals

Benchmark environments for reward modelling and imitation learning algorithms.

Language:PythonLicense:MITStargazers:41Issues:10Issues:14

eirli

An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21

tensor-trust

A prompt injection game to collect data for robust ML research

Language:PythonLicense:BSD-2-ClauseStargazers:34Issues:5Issues:164

ranking-challenge

Testing ranking algorithms to improve social cohesion

tensor-trust-data

Dataset for the Tensor Trust project

Language:Jupyter NotebookStargazers:23Issues:4Issues:0

learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Language:PythonLicense:MITStargazers:18Issues:6Issues:0

overcooked-hAI-exp

Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)

nn-clustering-pytorch

Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.

Language:PythonStargazers:6Issues:4Issues:0

recon-email

Script for automatically creating the reconnaissance email.

Language:HTMLStargazers:5Issues:3Issues:0

multiagent-competition

Code for the paper "Emergent Complexity via Multi-agent Competition"

Language:PythonStargazers:4Issues:7Issues:0

reward-preprocessing

Preprocessing reward functions to make them more interpretable

Language:PythonStargazers:4Issues:2Issues:0

assistance-games

Supporting code for Assistance Games as a Framework paper

Language:PythonLicense:MITStargazers:3Issues:7Issues:0

stable-baselines3

PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:3Issues:2Issues:0

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language:PythonLicense:MITStargazers:2Issues:3Issues:0

minerl

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

Language:PythonLicense:NOASSERTIONStargazers:2Issues:2Issues:0

rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

katago-driver-bug-repro

Docker files to help reproduce bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/211270/3

Language:DockerfileStargazers:0Issues:5Issues:0

pytorch-summary

Model summary in PyTorch similar to `model.summary()` in Keras

Language:PythonLicense:MITStargazers:0Issues:3Issues:0

ray

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:3Issues:0

sgf-viewer

A simple webpage that can visualize a sgf string encoded as a url fragment.

Language:CSSStargazers:0Issues:3Issues:0

slack-diskbot

low disk space alerts posted to Slack

Language:PythonStargazers:0Issues:4Issues:0