Yu Zhang's starred repositories
flash-attention-minimal
Flash Attention in ~100 lines of CUDA (forward pass only)
Rewrite-the-Stars
[CVPR 2024] Rewrite the Stars
long-context-attention
Sequence-parallel attention for long-context LLM training and inference
hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
Counting-Stars
Counting-Stars (★)
flash_attn_jax
JAX bindings for Flash Attention v2
gpt-accelera
Simple and efficient pytorch-native transformer training and inference (batched)
GORU-tensorflow
Gated Orthogonal Recurrent Unit (GORU) implementation in TensorFlow
ParallelTokenizer
Run the tokenizer in parallel for substantial speedups
based-evaluation-harness
A framework for few-shot evaluation of language models.
LLMTest_NeedleInAHaystack_HFModel
Supports Hugging Face models for simple needle-in-a-haystack retrieval tests at various context lengths to measure LLM accuracy