Unofish

followers

following

stars

Unofish's starred repositories

llama.cpp

LLM inference in C/C++

Language:C++MIT66533 549 3893

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.036697 348 1816

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonApache-2.032712 204 5029

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.029430 341 268

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION26728 220 251

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonApache-2.018987 173 1378

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language:PythonApache-2.017084 121 899

yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Language:PythonAGPL-3.09749 49 408

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonApache-2.07845 97 1601

miniforge

A conda-forge distribution.

Language:ShellNOASSERTION6336 55 369

Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:Python5754 56 278

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Language:PythonApache-2.04937 31 520

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonMIT4413 31 455

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonApache-2.03856 34 526

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language:PythonMIT2238 39 30

awesome-model-quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Language:PythonApache-2.01161 25 389

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

mlc-MiniCPM

MiniCPM on Android platform.

Language:PythonApache-2.0538 90

long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Language:PythonApache-2.0337 4 18

qmoe

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

Language:PythonApache-2.0260 6 5

LLM-QAT

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Language:PythonNOASSERTION246 5 30

nntrainer

NNtrainer is Software Framework for Training Neural Network Models on Devices.

Language:C++Apache-2.0145 14 679

ChatGLM_mutli_gpu_tuning

deepspeed+trainer简单高效实现多卡微调大模型

Language:PythonMIT116 3 13

LearnDeepSpeed

DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）

Language:PythonMIT106 10

BabyLlama

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Language:PythonMIT59 2 5

private_llm

Language:PythonApache-2.029 5 2

LLM-inference-optimization-paper

Summary of some awesome work for optimizing LLM inference

profiler-workshop

Example code for profiler workshop

Language:PythonMIT28 10

YanshiShield

Language:PythonApache-2.025 1 35