sandguine

followers

following

stars

University of California, Berkeley

San Francisco, Bay Area

sandytanwisuth.web.app

Sandy Tanwisuth's repositories

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonMIT100

cicero

running cicero on google colab

Language:Jupyter Notebook100

concordia

A library for generative social simulation

Language:PythonApache-2.0100

Melting-Pot-Contest-2023

Language:PythonApache-2.0100

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language:PythonApache-2.0100

AbArts

Pilot mTurk project on abstract arts valuation

Language:HTML030

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.0000

fast-marl

FAST iteration of MARL research ideas: A starting point for Multi-Agent Reinforcement Learning

Language:Python000

hidden-context

Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"

000

Intrepid

INTeractive learning via REPresentatIon Discovery

Language:PythonMIT000

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonApache-2.0000

lab2d

A customisable 2D platform for agent-based AI research

Language:C++Apache-2.0000

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonMIT000

Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Apache-2.0000

maxtext

A simple, performant and scalable Jax LLM!

Apache-2.0000

Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

000

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookMIT000

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT000

Neural-Network-Zero-to-Hero

Writing keys libraries and core architectures from scratch. Following the tutorials of Neural Network Zero to Hero class from Andrej Karphathy.

MIT010

nn-zero-to-hero

Neural Networks: Zero to Hero

Language:Jupyter NotebookMIT000

optuna

A hyperparameter optimization framework

Language:PythonNOASSERTION000

pax

Scalable Opponent Shaping Experiments in JAX

Language:PythonApache-2.0000

popgym

Partially Observable Process Gym

Language:PythonMIT000

pycid

Library for graphical models of decision making, based on pgmpy and networkx

Apache-2.0000

pytorch-Deep-Learning

Deep Learning (with PyTorch)

Language:Jupyter NotebookNOASSERTION000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonApache-2.0000

redpoint_hacks

Language:PythonMIT000

SAELens

Training Sparse Autoencoders on Language Models

MIT000

sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

Language:PythonMIT010

Voyager-Contracts

CAIF

MIT000