aigaolc's starred repositories

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language:Jupyter NotebookLicense:MITStargazers:3796Issues:0Issues:0

DRTK

Differentiable Rendering Toolkit

Language:CudaLicense:NOASSERTIONStargazers:19Issues:0Issues:0

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language:PythonLicense:Apache-2.0Stargazers:531Issues:0Issues:0

maxtext

A simple, performant and scalable Jax LLM!

Language:PythonLicense:Apache-2.0Stargazers:1384Issues:0Issues:0

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5181Issues:0Issues:0

H3

Language Modeling with the H3 State Space Model

Language:AssemblyLicense:Apache-2.0Stargazers:505Issues:0Issues:0

FuseAI

FuseAI Project

Language:PythonStargazers:375Issues:0Issues:0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language:PythonLicense:MITStargazers:483Issues:0Issues:0

gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:288Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:1490Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:2414Issues:0Issues:0

data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Language:PythonLicense:Apache-2.0Stargazers:1785Issues:0Issues:0

YouTokenToMe

Unsupervised text tokenizer focused on computational efficiency

Language:C++License:MITStargazers:950Issues:0Issues:0

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Stargazers:895Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:6392Issues:0Issues:0

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5802Issues:0Issues:0

tensorrtllm_backend

The Triton TensorRT-LLM Backend

Language:PythonLicense:Apache-2.0Stargazers:608Issues:0Issues:0

optax

Optax is a gradient processing and optimization library for JAX.

Language:PythonLicense:Apache-2.0Stargazers:1574Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:5900Issues:0Issues:0

eval-scope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Language:PythonLicense:Apache-2.0Stargazers:115Issues:0Issues:0

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language:PythonLicense:NOASSERTIONStargazers:1123Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:24514Issues:0Issues:0

bigcodebench

BigCodeBench: The Next Generation of HumanEval

Language:PythonLicense:Apache-2.0Stargazers:132Issues:0Issues:0

dclm

DataComp for Language Models

Language:HTMLLicense:MITStargazers:414Issues:0Issues:0

open_lm

A repository for research on medium sized language models.

Language:PythonLicense:MITStargazers:385Issues:0Issues:0

torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

Language:PythonLicense:NOASSERTIONStargazers:311Issues:0Issues:0

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

Stargazers:97Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:21Issues:0Issues:0
Language:RustLicense:Apache-2.0Stargazers:1054Issues:0Issues:0

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:2472Issues:0Issues:0