Beast code in Giters

PyTorch implementation for "Training and Inference on Any-Order Autoregressive Models the Right Way", NeurIPS 2022 Oral, TPM 2023 Best Paper Honorable Mention

Language: Python | 1100

Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

Language: Python | 1400

HyperSPN

PyTorch implementation for "HyperSPNs: Compact and Expressive Probabilistic Circuits", NeurIPS 2021

Language: Python | 1300

PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Language: Python | License: MIT | 12600

probabilistic-circuits

A curated collection of papers on probabilistic circuits, computational graphs encoding tractable probability distributions.

Language: CSS | 4700

gym-cooking

🏆 Code for "Too Many Cooks: Bayesian Inference for Coordinating Multi-Agent Collaboration", winner of the CogSci 2020 Computational Modeling Prize in High Cognition and a NeurIPS 2020 CoopAI Workshop Best Paper.

Language: Python | License: MIT | 18400

SPN_Variational_Inference

PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020

Language: Python | 1500

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | 883000

cheatsheets

Official Matplotlib cheat sheets

Language: Python | License: BSD-2-Clause | 733900

SSDC

Smoothing Structured Decomposable Circuits

Language: C | 600

SPFlow

Sum Product Flow: An Easy and Extensible Library for Sum-Product Networks

Language: Python | License: NOASSERTION | 28600

AndyShih12

Andy Shih's starred repositories

DreamPropeller

IF

Reflected-Diffusion

paradigms

llama

LongHorizonTemperatureScaling

RL4LMs

vizier

mac