Costa Huang (vwxyzjn)

vwxyzjn

Geek Repo

Company:@huggingface

Location:Philadelphia, PA

Home Page:https://costa.sh

Twitter:@vwxyzjn

Github PK Tool:Github PK Tool

Costa Huang's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49461Issues:562Issues:209

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:23642Issues:230Issues:137

uv

An extremely fast Python package and project manager, written in Rust.

Language:RustLicense:Apache-2.0Stargazers:21736Issues:46Issues:3318

openui

OpenUI let's you describe UI using your imagination, then see it rendered live.

Language:TypeScriptLicense:Apache-2.0Stargazers:18874Issues:123Issues:159

mlx

MLX: An array framework for Apple silicon

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:12719Issues:101Issues:512

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:9076Issues:83Issues:36

cv

Print-friendly, minimalist CV page

Language:TypeScriptLicense:MITStargazers:8891Issues:21Issues:34

PokemonRedExperiments

Playing Pokemon Red with Reinforcement Learning

Language:Jupyter NotebookLicense:MITStargazers:6869Issues:69Issues:113

deskhop

Fast Desktop Switching Device

Language:CLicense:GPL-3.0Stargazers:6139Issues:53Issues:117

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5567Issues:63Issues:98

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4474Issues:47Issues:191

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:2054Issues:19Issues:81

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language:PythonLicense:NOASSERTIONStargazers:976Issues:147Issues:21

huak

My experimental Python package manager.

Language:RustLicense:MITStargazers:615Issues:7Issues:251

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:444Issues:33Issues:5
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:271Issues:7Issues:9

Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Language:PythonLicense:Apache-2.0Stargazers:211Issues:6Issues:29

flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

Language:PythonLicense:Apache-2.0Stargazers:201Issues:13Issues:11

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookLicense:MITStargazers:181Issues:5Issues:26

fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Language:PythonLicense:Apache-2.0Stargazers:162Issues:10Issues:32

seqax

seqax = sequence modeling + JAX

Language:PythonLicense:BSD-3-ClauseStargazers:131Issues:7Issues:2

oaib

Use the OpenAI Batch tool to make async batch requests to the OpenAI API.

Language:PythonLicense:MITStargazers:91Issues:4Issues:6
Language:PythonLicense:Apache-2.0Stargazers:56Issues:1Issues:0

sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Language:PythonLicense:Apache-2.0Stargazers:46Issues:5Issues:2

paged-attention-minimal

a minimal cache manager for PagedAttention, on top of llama3.

Language:PythonLicense:Apache-2.0Stargazers:31Issues:2Issues:1

PDFwriter

An OSX print to pdf-file printer driver

Language:Objective-CLicense:GPL-2.0Stargazers:30Issues:2Issues:0

cogment-lab

A toolkit for practical Human-AI cooperation research

Language:PythonLicense:Apache-2.0Stargazers:13Issues:3Issues:5

OLMo-core

PyTorch building blocks for OLMo

Language:PythonLicense:Apache-2.0Stargazers:8Issues:2Issues:0
Language:PythonLicense:Apache-2.0Stargazers:7Issues:9Issues:0