Beast code in Giters

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonApache-2.02735 19 758

SenseVoice

Multilingual Voice Understanding Model

Language:PythonNOASSERTION2698 36 115

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Language:PythonApache-2.02567 18 174

optax

Optax is a gradient processing and optimization library for JAX.

Language:PythonApache-2.01643 34 235

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonApache-2.01458 28 87

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonNOASSERTION1360 4 145

dclm

DataComp for Language Models

Language:HTMLMIT1116 38 49

deduplicate-text-datasets

Language:RustApache-2.01102 13 41

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

1049 12 4

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

Language:C++MIT953 26 60

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

731 24 9

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonMIT675 30 124

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonApache-2.0657 23 465

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language:PythonApache-2.0563 13 20

H3

Language Modeling with the H3 State Space Model

Language:AssemblyApache-2.0510 32 26

gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

Language:Jupyter NotebookApache-2.0504 11 9

open_lm

A repository for research on medium sized language models.

Language:PythonMIT469 21 67

FuseAI

FuseAI Project

Language:Python43700

torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

Language:PythonNOASSERTION326 21 180

bigcodebench

BigCodeBench: Benchmarking Code Generation Towards AGI

Language:PythonApache-2.0185 4 30

eval-scope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Language:PythonApache-2.0123 6 20

DRTK

Differentiable Rendering Toolkit

Language:CudaNOASSERTION29 5 2

Zyda_processing

Language:PythonApache-2.02200