Tianjian Li's starred repositories
cartography
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
EKFAC-pytorch
Repository containing PyTorch code for EKFAC and K-FAC preconditioners.
llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps to showcase Meta Llama for WhatsApp & Messenger.
awesome-RLHF
A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)
Megatron-LM
Ongoing research training transformer models at scale
Intra-Distillation
This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
pytorch-pruning
PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
lm-evaluation-harness
A framework for few-shot evaluation of language models.
unify-parameter-efficient-tuning
Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)