Carter Blair's starred repositories

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonLicense:BSD-3-ClauseStargazers:2604Issues:0Issues:0

LLMs-for-Social-Robotics

Code and data for our IROS paper: "Are Large Language Models Aligned with People's Social Intuitions for Human–Robot Interactions?"

Language:Jupyter NotebookStargazers:2Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:1779Issues:0Issues:0

awesome-rlhf

An index of algorithms for reinforcement learning from human feedback (rlhf))

License:Apache-2.0Stargazers:76Issues:0Issues:0

deep_learning_curriculum

Language model alignment-focused deep learning curriculum

Stargazers:1166Issues:0Issues:0

Shimmy

An API conversion tool for popular external reinforcement learning environments

Language:PythonLicense:MITStargazers:126Issues:0Issues:0
Language:PythonLicense:MITStargazers:13Issues:0Issues:0

altar_game

A simple altar game based on Phaser3

Language:JavaScriptStargazers:1Issues:0Issues:0

genetic-mdp

Maximum diversity problem solver in Python using a genetic algorithm

Language:PythonStargazers:2Issues:0Issues:0

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:1208Issues:0Issues:0

DESlib

A Python library for dynamic classifier and ensemble selection

Language:PythonLicense:BSD-3-ClauseStargazers:476Issues:0Issues:0

concordia

A library for generative social simulation

Language:PythonLicense:Apache-2.0Stargazers:438Issues:0Issues:0

neural-mmo

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Language:PythonLicense:MITStargazers:1568Issues:0Issues:0

nash-mtl

Official implementation of "Multi-Task Learning as a Bargaining Game" [ICML 2022]

Language:PythonStargazers:198Issues:0Issues:0

state_of_nature

Harvard Joint CS + Government Thesis Project 2018-2019: Escaping the State of Nature

Language:PythonStargazers:5Issues:0Issues:0

hanabi-learning-environment

hanabi_learning_environment is a research platform for Hanabi experiments.

Language:PythonLicense:Apache-2.0Stargazers:639Issues:0Issues:0

ai-safety-gridworlds

This is a suite of reinforcement learning environments illustrating various safety properties of intelligent agents.

Language:PythonLicense:Apache-2.0Stargazers:603Issues:0Issues:0
Language:PythonLicense:MITStargazers:181Issues:0Issues:0

morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.

Language:PythonLicense:MITStargazers:256Issues:0Issues:0
Language:Jupyter NotebookStargazers:51Issues:0Issues:0
Language:PythonStargazers:17Issues:0Issues:0

PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Language:PythonLicense:MITStargazers:119Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8408Issues:0Issues:0

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

Language:Jupyter NotebookLicense:MITStargazers:659Issues:0Issues:0

awesome-phd-advice

Collection of advice for prospective and current PhD students

License:MITStargazers:1456Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:41Issues:0Issues:0

marlgrid

Gridworld for MARL experiments

Language:PythonLicense:Apache-2.0Stargazers:137Issues:0Issues:0

Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Language:PythonLicense:MITStargazers:6316Issues:0Issues:0

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:562Issues:0Issues:0