Unofish's starred repositories

llama.cpp

LLM inference in C/C++

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 36697 | Issues: 348 | Issues: 1816

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 32712 | Issues: 204 | Issues: 5029

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | License: Apache-2.0 | Stargazers: 29430 | Issues: 341 | Issues: 268

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 26728 | Issues: 220 | Issues: 251

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | Stargazers: 18987 | Issues: 173 | Issues: 1378

unsloth

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 17084 | Issues: 121 | Issues: 899

yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Language: Python | License: AGPL-3.0 | Stargazers: 9749 | Issues: 49 | Issues: 408

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language: Python | License: Apache-2.0 | Stargazers: 7845 | Issues: 97 | Issues: 1601

miniforge

A conda-forge distribution.

Language: Shell | License: NOASSERTION | Stargazers: 6336 | Issues: 55 | Issues: 369

Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Language: Python | License: Apache-2.0 | Stargazers: 4937 | Issues: 31 | Issues: 520

AutoGPTQ

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Language: Python | License: MIT | Stargazers: 4413 | Issues: 31 | Issues: 455

xtuner

An efficient, flexible, and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python | License: Apache-2.0 | Stargazers: 3856 | Issues: 34 | Issues: 526

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language: Python | License: MIT | Stargazers: 2238 | Issues: 39 | Issues: 30

awesome-model-quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and the project is continuously being improved. PRs adding works (papers, repositories) that the repo has missed are welcome.

mmengine

OpenMMLab Foundational Library for Training Deep Learning Models

Language: Python | License: Apache-2.0 | Stargazers: 1161 | Issues: 25 | Issues: 389

Awesome-LLM-Compression

Awesome LLM compression research papers and tools.

mlc-MiniCPM

MiniCPM on the Android platform.

Language: Python | License: Apache-2.0 | Stargazers: 538 | Issues: 9 | Issues: 0

long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Language: Python | License: Apache-2.0 | Stargazers: 337 | Issues: 4 | Issues: 18

qmoe

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

Language: Python | License: Apache-2.0 | Stargazers: 260 | Issues: 6 | Issues: 5

LLM-QAT

Code repo for the paper "LLM-QAT: Data-Free Quantization Aware Training for Large Language Models".

Language: Python | License: NOASSERTION | Stargazers: 246 | Issues: 5 | Issues: 30

nntrainer

NNtrainer is a software framework for training neural network models on devices.

Language: C++ | License: Apache-2.0 | Stargazers: 145 | Issues: 14 | Issues: 679

ChatGLM_mutli_gpu_tuning

Simple, efficient multi-GPU fine-tuning of large models with DeepSpeed + Trainer

Language: Python | License: MIT | Stargazers: 116 | Issues: 3 | Issues: 13

LearnDeepSpeed

DeepSpeed tutorial, annotated examples, and study notes (efficient training of large models)

Language: Python | License: MIT | Stargazers: 106 | Issues: 1 | Issues: 0

BabyLlama

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Language: Python | License: MIT | Stargazers: 59 | Issues: 2 | Issues: 5

Language: Python | License: Apache-2.0 | Stargazers: 29 | Issues: 5 | Issues: 2

LLM-inference-optimization-paper

A summary of notable work on optimizing LLM inference

profiler-workshop

Example code for the profiler workshop

Language: Python | License: MIT | Stargazers: 28 | Issues: 1 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 1 | Issues: 35