yuanenming

Enming Yuan's starred repositories

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: Python · License: NOASSERTION · 80694 1742 43532

system-design-101

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

License: NOASSERTION · 60829 810 44

HumanSystemOptimization

Healthy learning to age 150 - An incomplete guide to tuning the human system

12916 120 14

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python · License: MIT · 12103 90 340

Startup-CTO-Handbook

The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams

License: NOASSERTION · 10069 84 11

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook · License: MIT · 9738 84 247

Megatron-LM

Ongoing research training transformer models at scale

Language: Python · License: NOASSERTION · 9490 159 614

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python · License: MIT · 8790 81 36

inshellisense

IDE style command line auto complete

Language: TypeScript · License: MIT · 8246 23 117

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Language: TypeScript · License: Apache-2.0 · 7565 52 64

LWM

Language: Python · License: Apache-2.0 · 7028 66 68

ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

Language: JavaScript · License: NOASSERTION · 6551 86 32

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python · License: NOASSERTION · 5757 46 75

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python · License: BSD-3-Clause · 5384 64 96

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python · License: Apache-2.0 · 4261 111 124

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python · License: Apache-2.0 · 3379 24 425

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language: Python · License: Apache-2.0 · 2464 24 24

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Language: Python · License: Apache-2.0 · 1798 41 100

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language: Python · License: MIT · 1273 23 17

MOSS-RLHF

Language: Python · License: Apache-2.0 · 1235 34 51

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python · License: Apache-2.0 · 994 40 65

SPIN

The official implementation of Self-Play Fine-Tuning (SPIN)

Language: Python · License: Apache-2.0 · 902 12 30

fine-tune-mistral

Fine-tune Mistral-7B on 3090s, A100s, H100s

Language: Python · License: MIT · 696 6 5

deep-learning-pytorch-huggingface

Language: Jupyter Notebook · License: MIT · 583 11 38

tokenmonster

Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript

Language: Go · License: MIT · 531 10 26

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language: Python · License: MIT · 467 8 5

NeMo-Aligner

Scalable toolkit for efficient model alignment

Language: Python · License: Apache-2.0 · 452 12 59

ArXivQA

WIP - Automated Question Answering for ArXiv Papers with Large Language Models (https://arxiv.taesiri.xyz/)

Language: Python · 269 10 7

multipack_sampler

Multipack distributed sampler for fast padding-free training of LLMs

Language: Python · License: MIT · 159 3 3

evalverse-IFEval

Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/master/instruction_following_eval)

Language: Python · 9 40