Xuechen Li's repositories
ml-swissknife
An ML research codebase built with friends :)
LLM-Training-Puzzles
What would you do with 1000 H100s...
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
awesome-ai-agents
A list of AI autonomous agents
bitsandbytes
8-bit CUDA functions for PyTorch
build-your-own-x
Master programming by recreating your favorite technologies from scratch.
codegpt
VSCode extension that allows you to use GPT3 inside the IDE
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
flash-attention
Fast and memory-efficient exact attention
label-studio-ml-backend
Configs and boilerplates for Label Studio's Machine Learning backend
levanter
Legibile, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Megatron-LM
Ongoing research training transformer models at scale
open_flamingo
An open-source framework for training large multimodal models.
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
RL4LMs-lxuechen
A modular RL library to fine-tune language models to human preferences
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
shap
A game theoretic approach to explain the output of any machine learning model.
summarize-from-feedback
Code for "Learning to summarize from human feedback"
tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
thesis
my thesis
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
trl-lxuechen
Train transformer language models with reinforcement learning.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs