Enming Yuan's starred repositories

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29059 | Issues: 342 | Issues: 267

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 21413 | Issues: 197 | Issues: 3128

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 11653 | Issues: 104 | Issues: 835

Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | License: MIT | Stargazers: 9644 | Issues: 85 | Issues: 246

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | Stargazers: 9150 | Issues: 157 | Issues: 568

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8556 | Issues: 78 | Issues: 956

inshellisense

IDE style command line auto complete

Language: TypeScript | License: MIT | Stargazers: 8197 | Issues: 23 | Issues: 111

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language: TypeScript | License: Apache-2.0 | Stargazers: 7243 | Issues: 49 | Issues: 60

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7189 | Issues: 112 | Issues: 148

ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

Language: JavaScript | License: NOASSERTION | Stargazers: 6527 | Issues: 86 | Issues: 32

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | Stargazers: 5301 | Issues: 63 | Issues: 89

voila

Voilà turns Jupyter notebooks into standalone web applications

Language: Python | License: NOASSERTION | Stargazers: 5286 | Issues: 76 | Issues: 724

agents

An Open-source Framework for Autonomous Language Agents

Language: Python | License: Apache-2.0 | Stargazers: 4651 | Issues: 59 | Issues: 68

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4119 | Issues: 112 | Issues: 119

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | Stargazers: 3056 | Issues: 21 | Issues: 389

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python | License: Apache-2.0 | Stargazers: 2405 | Issues: 23 | Issues: 23

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 1785 | Issues: 19 | Issues: 77

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language: Python | License: MIT | Stargazers: 1326 | Issues: 116 | Issues: 15

basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

Language: Python | License: MIT | Stargazers: 1285 | Issues: 22 | Issues: 59

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language: Python | License: MIT | Stargazers: 1261 | Issues: 23 | Issues: 17

MOSS-RLHF

MOSS-RLHF

Language: Python | License: Apache-2.0 | Stargazers: 1200 | Issues: 33 | Issues: 50

fine-tune-mistral

Fine-tune mistral-7B on 3090s, a100s, h100s

Language: Python | License: MIT | Stargazers: 691 | Issues: 6 | Issues: 5

tokenmonster

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

Language: Go | License: MIT | Stargazers: 522 | Issues: 10 | Issues: 26

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language: Python | License: MIT | Stargazers: 458 | Issues: 8 | Issues: 5

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python | License: Apache-2.0 | Stargazers: 396 | Issues: 9 | Issues: 51

ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

multipack_sampler

Multipack distributed sampler for fast padding-free training of LLMs

Language: Python | License: MIT | Stargazers: 157 | Issues: 3 | Issues: 3