Xuechen Li's repositories
ml-swissknife
An ML research codebase built with friends :)
LLM-Training-Puzzles
What would you do with 1000 H100s...
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
awesome-ai-agents
A list of AI autonomous agents
bitsandbytes
8-bit CUDA functions for PyTorch
build-your-own-x
Master programming by recreating your favorite technologies from scratch.
codegpt
VSCode extension that allows you to use GPT3 inside the IDE
flash-attention
Fast and memory-efficient exact attention
label-studio-ml-backend
Configs and boilerplates for Label Studio's Machine Learning backend
levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
Megatron-LM
Ongoing research training transformer models at scale
open_flamingo
An open-source framework for training large multimodal models.
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
RL4LMs-lxuechen
A modular RL library to fine-tune language models to human preferences
self-instruct
Aligning pretrained language models with instruction data generated by themselves.
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
shap
A game theoretic approach to explain the output of any machine learning model.
summarize-from-feedback
Code for "Learning to summarize from human feedback"
tensor_parallel
Automatically split your PyTorch models on multiple GPUs for training & inference
thesis
my thesis
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
trl-lxuechen
Train transformer language models with reinforcement learning.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs