Yu Zhang (yzhangcs)

Company: Soochow University

Location: Shanghai

Home Page: https://yzhang.site

Twitter: @yzhang_cs

Organizations
SUDA-LA

Yu Zhang's starred repositories

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python · License: Apache-2.0 · Stargazers: 8187 · Issues: 73 · Issues: 402

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language: Python · License: BSD-3-Clause · Stargazers: 3824 · Issues: 43 · Issues: 450

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLMs at various context lengths to measure accuracy

Language: Jupyter Notebook · License: NOASSERTION · Stargazers: 1401 · Issues: 15 · Issues: 25
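The test's core loop is simple to sketch: plant a "needle" sentence at a chosen depth inside long filler text, ask the model about it, and grade the answer over a grid of context lengths and depths. A minimal illustration, where `query_model` and `score_answer` are hypothetical stand-ins for the LLM call and the grading step, not functions from this repo:

```python
# Minimal sketch of a needle-in-a-haystack evaluation loop; not the repo's
# actual code. `query_model` and `score_answer` are hypothetical stand-ins.
NEEDLE = "The best thing to do in San Francisco is eat a sandwich in Dolores Park."
QUESTION = "What is the best thing to do in San Francisco?"

def build_haystack(filler: str, context_len: int, depth: float) -> str:
    """Truncate filler text to context_len characters and insert the
    needle at a relative depth in [0, 1]."""
    haystack = filler[:context_len]
    pos = int(len(haystack) * depth)
    return haystack[:pos] + " " + NEEDLE + " " + haystack[pos:]

def run_grid(filler, lengths=(1_000, 10_000, 100_000), depths=(0.0, 0.5, 1.0)):
    results = {}
    for n in lengths:
        for d in depths:
            context = build_haystack(filler, n, d)
            answer = query_model(context, QUESTION)         # hypothetical LLM call
            results[(n, d)] = score_answer(answer, NEEDLE)  # hypothetical grader
    return results
```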

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python · License: Apache-2.0 · Stargazers: 1049 · Issues: 42 · Issues: 72

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language: Python · License: Apache-2.0 · Stargazers: 953 · Issues: 8 · Issues: 9

tensor_parallel

Automatically split your PyTorch models across multiple GPUs for training & inference

Language: Python · License: MIT · Stargazers: 612 · Issues: 8 · Issues: 66
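Per the project README, sharding an off-the-shelf 🤗 Transformers model is a one-line wrapper; a sketch of that advertised usage (the model name and device list are placeholders):

```python
# Sketch of tensor_parallel's advertised one-liner (per the project README);
# the model name and device list are placeholders.
import tensor_parallel as tp
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("facebook/opt-13b")
model = AutoModelForCausalLM.from_pretrained("facebook/opt-13b")

# Split the model's weights column/row-wise across the listed GPUs.
model = tp.tensor_parallel(model, ["cuda:0", "cuda:1"])

inputs = tokenizer("Hello, my name is", return_tensors="pt")
outputs = model.generate(inputs.input_ids.to("cuda:0"), max_new_tokens=20)
print(tokenizer.decode(outputs[0]))
```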

ByteTransformer

Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052

Language: C++ · License: Apache-2.0 · Stargazers: 448 · Issues: 10 · Issues: 10

InfiniTransformer

Unofficial PyTorch / 🤗 Transformers (Gemma/Llama 3) implementation of "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention"

Language: Python · License: MIT · Stargazers: 327 · Issues: 8 · Issues: 24
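Infini-attention's key ingredient is a compressive memory that is read through a positive feature map and updated segment by segment with a linear (outer-product) rule; the memory readout is then mixed with ordinary local attention through a learned gate. A simplified single-head sketch of the memory path from the paper, not this repository's code:

```python
# Simplified single-head sketch of infini-attention's compressive memory
# (the paper's linear update rule); not this repository's code.
import torch
import torch.nn.functional as F

def sigma(x):
    return F.elu(x) + 1.0          # positive feature map used by the paper

def memory_retrieve(q, M, z):
    # q: (L, d_k); M: (d_k, d_v) memory; z: (d_k,) normalization term
    s = sigma(q)
    return (s @ M) / (s @ z).clamp_min(1e-6).unsqueeze(-1)

def memory_update(k, v, M, z):
    # Accumulate this segment's key-value associations into the memory.
    s = sigma(k)                   # k: (L, d_k); v: (L, d_v)
    return M + s.T @ v, z + s.sum(dim=0)
```

The full layer blends this readout with standard local attention as `sigmoid(beta) * A_mem + (1 - sigmoid(beta)) * A_local`, where `beta` is a learned per-head gate.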

long-context-attention

Sequence Parallel Attention for Long Context LLM Training and Inference

Language: Python · License: Apache-2.0 · Stargazers: 267 · Issues: 4 · Issues: 14
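What makes sequence-parallel (ring-style) attention possible is that softmax attention can be accumulated one KV block at a time with a running maximum and denominator, so each rank only ever needs the block it currently holds. A single-process sketch of that online-softmax accumulation (the ring communication between ranks is elided):

```python
# Single-process sketch of online-softmax accumulation over KV blocks; in
# ring attention the blocks live on different ranks and rotate between them.
import torch

def blockwise_attention(q, k_blocks, v_blocks):
    # q: (d,); each k block: (B, d); each v block: (B, d_v)
    m = torch.tensor(float("-inf"))           # running max of scores
    num = torch.zeros(v_blocks[0].shape[-1])  # running sum of weights * values
    den = torch.tensor(0.0)                   # running softmax denominator
    for k, v in zip(k_blocks, v_blocks):
        s = k @ q                             # this block's attention scores
        m_new = torch.maximum(m, s.max())
        scale = torch.exp(m - m_new)          # rescale the old accumulators
        p = torch.exp(s - m_new)
        num = num * scale + p @ v
        den = den * scale + p.sum()
        m = m_new
    return num / den                          # equals full softmax attention
```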

Rewrite-the-Stars

[CVPR 2024] Rewrite the Stars

Language: Python · License: Apache-2.0 · Stargazers: 239 · Issues: 2 · Issues: 17

minicons

Utility for behavioral and representational analyses of Language Models

Language: Python · License: MIT · Stargazers: 113 · Issues: 6 · Issues: 16
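Its scorer interface makes minimal-pair comparisons a few lines; a sketch following the usage shown in the project docs (the model name is a placeholder):

```python
# Sketch of minicons' scorer interface, following the project docs;
# the model name is a placeholder.
from minicons import scorer

lm = scorer.IncrementalLMScorer("distilgpt2", "cpu")

# Higher (less negative) log probability = the model prefers that sentence.
print(lm.sequence_score([
    "The keys to the cabinet are on the table.",
    "The keys to the cabinet is on the table.",
]))
```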

DiJiang

[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear attention mechanism.
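The gist, very loosely: swap the softmax kernel for positive features built from a discrete cosine transform, then use the linear-attention identity to avoid forming the quadratic score matrix. A rough sketch of that idea only; the paper learns its kernel, so this is not DiJiang's method verbatim:

```python
# Very loose sketch of DCT-based linear attention: DCT features plus the
# linear-attention identity phi(Q)(phi(K)^T V). Not the paper's trained
# kernelization; for illustration only.
import numpy as np
from scipy.fft import dct

def dct_linear_attention(Q, K, V):
    # Q, K: (n, d); V: (n, d_v)
    phi_q = np.exp(dct(Q, axis=-1, norm="ortho"))  # positive DCT features
    phi_k = np.exp(dct(K, axis=-1, norm="ortho"))
    num = phi_q @ (phi_k.T @ V)          # O(n * d * d_v), no n x n matrix
    den = phi_q @ phi_k.sum(axis=0)
    return num / den[:, None]
```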

mad-lab

A MAD laboratory to improve AI architecture designs 🧪

Language: Python · License: MIT · Stargazers: 82 · Issues: 1 · Issues: 2

LLM-Inheritune

This is the official repository for Inheritune.

LASP

Linear Attention Sequence Parallelism (LASP)

Language: Python · License: MIT · Stargazers: 61 · Issues: 2 · Issues: 0

LinearAttentionArena

Here we will test various linear attention designs.

Language: Python · License: Apache-2.0 · Stargazers: 53 · Issues: 8 · Issues: 0

infini-mini-transformer

A personal reimplementation of Google's Infini-transformer using a small 2B model. The project includes both model and training code.

gpt-accelera

Simple and efficient PyTorch-native transformer training and inference (batched)

Language: Python · License: BSD-3-Clause · Stargazers: 50 · Issues: 3 · Issues: 0

HGRN2

HGRN2: Gated Linear RNNs with State Expansion

Language: Python · Stargazers: 42 · Issues: 1 · Issues: 0
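State expansion replaces the element-wise hidden state of HGRN with an outer-product matrix state, giving the recurrence linear-attention-like capacity while keeping per-token cost constant. A loose single-step sketch of a gated linear recurrence with matrix state, in the spirit of HGRN2 rather than its exact parameterization:

```python
# Loose sketch of one step of a gated linear RNN with outer-product state
# expansion, in the spirit of HGRN2; not the paper's exact parameterization.
import torch

def gated_linear_rnn_step(S, f, v, q):
    # S: (d_k, d_v) matrix state; f: (d_k,) forget gate in (0, 1);
    # v: (d_v,) input; q: (d_k,) output query.
    # Tying the input gate to (1 - f) mirrors HGRN-style gating.
    S = f.unsqueeze(-1) * S + torch.outer(1.0 - f, v)
    return S, q @ S   # new state and the (d_v,) output
```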

rwkv

RWKV model implementation

Language: Python · License: MIT · Stargazers: 38 · Issues: 2 · Issues: 0

GORU-tensorflow

Gated Orthogonal Recurrent Unit implementation in TensorFlow

Language: Python · License: MIT · Stargazers: 35 · Issues: 5 · Issues: 2

rnn-icrag

Official repository of the paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"

Language: Python · Stargazers: 24 · Issues: 2 · Issues: 0

heinsen_attention

Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)

Language: Python · License: MIT · Stargazers: 22 · Issues: 3 · Issues: 1
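The constant-per-token cost comes from replacing the n x n score matrix with running sums: with exponential features, exp(q)^T exp(k_i) factorizes, so the numerator and denominator of softmax-style attention can be accumulated token by token. A simplified sketch of that recipe; the paper performs the accumulation in log space with logsumexp for numerical stability, so this plain version is illustrative only:

```python
# Simplified sketch of causal attention with constant cost per token via
# running accumulators over exponential features. The paper computes these
# sums in log space for numerical stability; this version is illustrative.
import torch

def streaming_attention(qs, ks, vs):
    # qs, ks: (n, d_k); vs: (n, d_v)
    num = torch.zeros(ks.shape[-1], vs.shape[-1])  # running sum of exp(k) v^T
    den = torch.zeros(ks.shape[-1])                # running sum of exp(k)
    outs = []
    for q, k, v in zip(qs, ks, vs):
        ek = torch.exp(k)
        num = num + torch.outer(ek, v)             # O(d_k * d_v) per token
        den = den + ek
        eq = torch.exp(q)
        outs.append((eq @ num) / (eq @ den))
    return torch.stack(outs)
```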