Sandy Tanwisuth (sandguine)

User data from GitHub: https://github.com/sandguine

Company: University of California, Berkeley

Location: San Francisco, Bay Area

Home Page: sandytanwisuth.web.app

GitHub: @sandguine

Twitter: @sandguine

Sandy Tanwisuth's repositories

concordia

A library for generative social simulation

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-model-based-RL

A curated list of awesome model-based RL resources (continually updated)

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

contrastive_metrics

Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"

Stargazers: 0 | Issues: 0 | Issues: 0

distributional-sr

Official implementation of the δ-model presented in the paper "A Distributional Analogue to the Successor Representation".

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

effective-horizon

Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"

Stargazers: 0 | Issues: 0 | Issues: 0

hanabi.github.io

A list of Hanabi strategies

License: CC-BY-SA-4.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hidden-context

Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

icvf_release

Public code for "Reinforcement Learning from Passive Data via Latent Intentions"

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

JaxMARL-minimal-information

Multi-Agent Reinforcement Learning with JAX

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

lab2d

A customisable 2D platform for agent-based AI research

Language: C++ | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

maxtext

A simple, performant and scalable Jax LLM!

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Neural-Network-Zero-to-Hero

Writing key libraries and core architectures from scratch, following the tutorials of the Neural Networks: Zero to Hero course by Andrej Karpathy.

License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

paper-reviewer-matcher

Linear programming solver for paper-reviewer matching and mind-matching

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pax

Scalable Opponent Shaping Experiments in JAX

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

purejaxrl

Really Fast End-to-End Jax RL Implementations

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pycid

Library for graphical models of decision making, based on pgmpy and networkx

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SAELens

Training Sparse Autoencoders on Language Models

Language: HTML | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0