Sandy Tanwisuth (sandguine)

sandguine

Geek Repo

Company:University of California, Berkeley

Location:San Francisco, Bay Area

Home Page:sandytanwisuth.web.app

Twitter:@sandguine

Github PK Tool:Github PK Tool

Sandy Tanwisuth's repositories

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

cicero

running cicero on google colab

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

concordia

A library for generative social simulation

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

meltingpot

A suite of test scenarios for multi-agent reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

AbArts

Pilot mTurk project on abstract arts valuation

Language:HTMLStargazers:0Issues:3Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

fast-marl

FAST iteration of MARL research ideas: A starting point for Multi-Agent Reinforcement Learning

Language:PythonStargazers:0Issues:0Issues:0

hidden-context

Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"

Stargazers:0Issues:0Issues:0

Intrepid

INTeractive learning via REPresentatIon Discovery

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

lab2d

A customisable 2D platform for agent-based AI research

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

maddpg

Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

License:Apache-2.0Stargazers:0Issues:0Issues:0

maxtext

A simple, performant and scalable Jax LLM!

License:Apache-2.0Stargazers:0Issues:0Issues:0

Meta-TTS

Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.

Stargazers:0Issues:0Issues:0

micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Neural-Network-Zero-to-Hero

Writing keys libraries and core architectures from scratch. Following the tutorials of Neural Network Zero to Hero class from Andrej Karphathy.

License:MITStargazers:0Issues:1Issues:0

nn-zero-to-hero

Neural Networks: Zero to Hero

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

optuna

A hyperparameter optimization framework

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

pax

Scalable Opponent Shaping Experiments in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

popgym

Partially Observable Process Gym

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pycid

Library for graphical models of decision making, based on pgmpy and networkx

License:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-Deep-Learning

Deep Learning (with PyTorch)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SAELens

Training Sparse Autoencoders on Language Models

License:MITStargazers:0Issues:0Issues:0

sequential_social_dilemma_games

Repo for reproduction of sequential social dilemmas

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:0Issues:0