Enming Yuan's starred repositories
system-design-101
Explains complex systems using visuals and simple terms. Helps you prepare for system design interviews.
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
flash-attention
Fast and memory-efficient exact attention
Startup-CTO-Handbook
The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams
Megatron-LM
Ongoing research training transformer models at scale
inshellisense
IDE-style command-line autocomplete
search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
ChatGPT-AutoExpert
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
alignment-handbook
Robust recipes to align language models with human and AI preferences
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) across 100+ datasets.
mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
fine-tune-mistral
Fine-tune Mistral-7B on 3090s, A100s, and H100s
tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & JavaScript
textbook_quality
Generate textbook-quality synthetic LLM pretraining data
NeMo-Aligner
Scalable toolkit for efficient model alignment
multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs