Ferdinand Mom's repositories
3outeille.github.io
My website
kernel-builder
👷 Build compute kernels
ColossalAI
Making large AI models cheaper, faster and more accessible
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
diloco_simple
PyTorch implementation of DiLoCo
DualPipe
A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
dust
A Nintendo DS emulator written in Rust for desktop devices and the web, with debugging features and a focus on accuracy
EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
fms-fsdp
Demonstrate throughput of PyTorch FSDP
gpt-oss-recipes
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
kernels
Load compute kernels from the Hub
lighteval
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally, alongside its recently released LLM data processing library datatrove and LLM training library nanotron.
litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Megatron-LM
Ongoing research training transformer models at scale
nccl
Optimized primitives for collective multi-GPU communication
nccl-tests
NCCL Tests
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
picotron-deepseek
Minimalistic 4D-parallelism distributed training framework for educational purposes
prime-rl
Decentralized RL Training at Scale
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
quack
A Quirky Assortment of CuTe Kernels
ring-attention-pytorch
Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in PyTorch
tilelang
Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels
torchtitan
A native PyTorch Library for large model training
veScale
A PyTorch Native LLM Training Framework