Andy Shih's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:84Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:7655Issues:0Issues:0

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Language:PythonLicense:MITStargazers:155Issues:0Issues:0

paradigms

PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight

Language:PythonLicense:MITStargazers:121Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:55774Issues:0Issues:0

LongHorizonTemperatureScaling

PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023

Language:PythonStargazers:18Issues:0Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2182Issues:0Issues:0

vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Language:PythonLicense:Apache-2.0Stargazers:1452Issues:0Issues:0

mac

PyTorch implementation for "Training and Inference on Any-Order Autoregressive Models the Right Way", NeurIPS 2022 Oral, TPM 2023 Best Paper Honorable Mention

Language:PythonStargazers:11Issues:0Issues:0

Conventions-ModularPolicy

PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021

Language:PythonStargazers:14Issues:0Issues:0

HyperSPN

PyTorch implementation for "HyperSPNs: Compact and Expressive Probabilistic Circuits", NeurIPS 2021

Language:PythonStargazers:13Issues:0Issues:0

PantheonRL

PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.

Language:PythonLicense:MITStargazers:126Issues:0Issues:0

probabilistic-circuits

A curated collection of papers on probabilistic circuits, computational graphs encoding tractable probability distributions.

Language:CSSStargazers:47Issues:0Issues:0

gym-cooking

🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Computational Modeling Prize in High Cognition, and a NeurIPS 2020 CoopAI Workshop Best Paper.

Language:PythonLicense:MITStargazers:184Issues:0Issues:0

SPN_Variational_Inference

PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020

Language:PythonStargazers:15Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8830Issues:0Issues:0

cheatsheets

Official Matplotlib cheat sheets

Language:PythonLicense:BSD-2-ClauseStargazers:7339Issues:0Issues:0

SSDC

Smoothing Structured Decomposable Circuits

Language:CStargazers:6Issues:0Issues:0

SPFlow

Sum Product Flow: An Easy and Extensible Library for Sum-Product Networks

Language:PythonLicense:NOASSERTIONStargazers:286Issues:0Issues:0