putizi-super's starred repositories
Awesome-ChatTTS
Officially recommended ChatTTS resource collection, gathering related resources from across the web along with frequently asked questions
Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
mixture-of-depths
An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Awesome-Chinese-LLM
A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and trained at low cost, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.
LLaMA3-Quantization
A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods.
Mixture-of-Depths
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
alpaca-lora
Instruct-tune LLaMA on consumer hardware
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Awesome-Knowledge-Distillation-of-LLMs
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
Awesome-LLM-Prune
Awesome list for LLM pruning.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, and a number of inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama3 for WhatsApp and Messenger.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
mmdetection
OpenMMLab Detection Toolbox and Benchmark
DeepSpeedExamples
Example models using DeepSpeed