aigaolc's starred repositories

LLM101n

LLM101n: Let's build a Storyteller

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:6373Issues:56Issues:139

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5925Issues:40Issues:85

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5241Issues:39Issues:37

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:4917Issues:52Issues:362

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language:Jupyter NotebookLicense:MITStargazers:4864Issues:150Issues:31

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:2735Issues:19Issues:758

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:2698Issues:36Issues:115

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:2567Issues:18Issues:174

optax

Optax is a gradient processing and optimization library for JAX.

Language:PythonLicense:Apache-2.0Stargazers:1643Issues:34Issues:235

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:1458Issues:28Issues:87

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonLicense:NOASSERTIONStargazers:1360Issues:4Issues:145

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:1116Issues:38Issues:49

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

Language:C++License:MITStargazers:953Issues:26Issues:60

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:675Issues:30Issues:124

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonLicense:Apache-2.0Stargazers:657Issues:23Issues:465

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language:PythonLicense:Apache-2.0Stargazers:563Issues:13Issues:20

H3

Language Modeling with the H3 State Space Model

Language:AssemblyLicense:Apache-2.0Stargazers:510Issues:32Issues:26

gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:504Issues:11Issues:9

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:469Issues:21Issues:67

FuseAI

FuseAI Project

Language:PythonStargazers:437Issues:0Issues:0

torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

Language:PythonLicense:NOASSERTIONStargazers:326Issues:21Issues:180

bigcodebench

BigCodeBench: Benchmarking Code Generation Towards AGI

Language:PythonLicense:Apache-2.0Stargazers:185Issues:4Issues:30

eval-scope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Language:PythonLicense:Apache-2.0Stargazers:123Issues:6Issues:20

DRTK

Differentiable Rendering Toolkit

Language:CudaLicense:NOASSERTIONStargazers:29Issues:5Issues:2
Language:PythonLicense:Apache-2.0Stargazers:22Issues:0Issues:0