cartgr

followers

following

stars

https://cartgr.github.io/website/

Carter Blair's starred repositories

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause260400

LLMs-for-Social-Robotics

Code and data for our IROS paper: "Are Large Language Models Aligned with People's Social Intuitions for Human–Robot Interactions?"

Language:Jupyter Notebook200

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonApache-2.0177900

awesome-rlhf

An index of algorithms for reinforcement learning from human feedback (rlhf))

Apache-2.07600

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

Shimmy

An API conversion tool for popular external reinforcement learning environments

Language:PythonMIT12600

Value-Augmented-Sampling

Language:PythonMIT1300

altar_game

A simple altar game based on Phaser3

Language:JavaScript100

genetic-mdp

Maximum diversity problem solver in Python using a genetic algorithm

Language:Python200

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonMIT120800

DESlib

A Python library for dynamic classifier and ensemble selection

Language:PythonBSD-3-Clause47600

concordia

A library for generative social simulation

Language:PythonApache-2.043800

neural-mmo

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Language:PythonMIT156800

awesome-rl-envs

nash-mtl

Official implementation of "Multi-Task Learning as a Bargaining Game" [ICML 2022]

Language:Python19800

state_of_nature

Harvard Joint CS + Government Thesis Project 2018-2019: Escaping the State of Nature

Language:Python500

hanabi-learning-environment

hanabi_learning_environment is a research platform for Hanabi experiments.

Language:PythonApache-2.063900

ai-safety-gridworlds

This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.

Language:PythonApache-2.060300

openrlbenchmark

Language:PythonMIT18100

morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.

Language:PythonMIT25600

choices13k

Language:Jupyter Notebook5100

CENTaUR

Language:Python1700

PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Language:PythonMIT11900

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT840800

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

Language:Jupyter NotebookMIT65900

awesome-phd-advice

Collection of advice for prospective and current PhD students

MIT145600

Melting-Pot-Contest-2023

Language:PythonApache-2.04100

marlgrid

Gridworld for MARL experiments

Language:PythonApache-2.013700

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonMIT631600

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language:PythonApache-2.056200