Beast code in Giters

mzchtx's starred repositories

AISystem

AISystem mainly covers AI systems: full-stack, low-level AI technologies including AI chips, AI compilers, and AI inference and training frameworks

Language: Jupyter Notebook | Apache-2.0 | 1018000

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language: Python | MIT | 3589200

unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | Apache-2.0 | 1476800

skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Language: Python | Apache-2.0 | 642800

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | Apache-2.0 | 431900

simple-evals

Language: Python | MIT | 146400

veScale

A PyTorch Native LLM Training Framework

Language: Python | Apache-2.0 | 55700

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language: Jupyter Notebook | MIT | 1278900

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | Apache-2.0 | 2958500

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | NOASSERTION | 976800

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language: Python | NOASSERTION | 114200

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

MIT | 99500

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python | MIT | 1940800

LLM-Viewer

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Language: Python | MIT | 25700

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language: Shell | 703000

grok-1

Grok open release

Language: Python | Apache-2.0 | 4937900

optimum-nvidia

Language: Python | Apache-2.0 | 86100

SoraReview

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

48500

llama2.c

Inference Llama 2 in one file of pure C

Language: C | MIT | 1706700

ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Language: Python | Apache-2.0 | 631600

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | MIT | 658700

EAGLE

Official Implementation of EAGLE-1 and EAGLE-2

Language: Python | Apache-2.0 | 71700

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Jupyter Notebook | Apache-2.0 | 756600

tabby

Self-hosted AI coding assistant

Language: Rust | NOASSERTION | 2082800

LLMLingua

To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV cache, achieving up to 20x compression with minimal performance loss.

Language: Python | MIT | 438000

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language: Python | Apache-2.0 | 203400

PhotoMaker

PhotoMaker [CVPR 2024]

Language: Jupyter Notebook | NOASSERTION | 922700

mergekit

Tools for merging pretrained large language models.

Language: Python | LGPL-3.0 | 439000

TigerBot

TigerBot: A multi-language multi-task LLM

Language: Python | Apache-2.0 | 223000

RAG-Survey

166500