vwxyzjn

followers

following

stars

@huggingface

Philadelphia, PA

https://costa.sh

Costa Huang's starred repositories

grok-1

Grok open release

Language:PythonApache-2.049461 562 209

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT23642 230 137

uv

An extremely fast Python package and project manager, written in Rust.

Language:RustApache-2.021736 46 3318

openui

OpenUI let's you describe UI using your imagination, then see it rendered live.

Language:TypeScriptApache-2.018874 123 159

mlx

MLX: An array framework for Apple silicon

Language:C++MIT16609 142 522

mamba

Mamba SSM architecture

Language:PythonApache-2.012719 101 512

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonMIT9076 83 36

cv

Print-friendly, minimalist CV page

Language:TypeScriptMIT8891 21 34

PokemonRedExperiments

Playing Pokemon Red with Reinforcement Learning

Language:Jupyter NotebookMIT6869 69 113

deskhop

Fast Desktop Switching Device

Language:CGPL-3.06139 53 117

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonBSD-3-Clause5567 63 98

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonApache-2.04474 47 191

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonApache-2.02054 19 81

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language:PythonNOASSERTION976 147 21

huak

My experimental Python package manager.

Language:RustMIT615 7 251

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonApache-2.0444 33 5

aimo-progress-prize

Language:Jupyter NotebookApache-2.0271 7 9

Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Language:PythonApache-2.0211 6 29

flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

Language:PythonApache-2.0201 13 11

RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Language:Jupyter NotebookMIT181 5 26

fms-fsdp

🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.

Language:PythonApache-2.0162 10 32

seqax

seqax = sequence modeling + JAX

Language:PythonBSD-3-Clause131 7 2

oaib

Use the OpenAI Batch tool to make async batch requests to the OpenAI API.

Language:PythonMIT91 4 6

jaxued

Language:PythonApache-2.056 10

sebulba

🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX

Language:PythonApache-2.046 5 2

paged-attention-minimal

a minimal cache manager for PagedAttention, on top of llama3.

Language:PythonApache-2.031 2 1

PDFwriter

An OSX print to pdf-file printer driver

Language:Objective-CGPL-2.030 20

cogment-lab

A toolkit for practical Human-AI cooperation research

Language:PythonApache-2.013 3 5

OLMo-core

PyTorch building blocks for OLMo

Language:PythonApache-2.08 20

putting-dune

Language:PythonApache-2.07 90