Beast code in Giters

liucc's starred repositories

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookApache-2.018369 155 467

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language:Jupyter NotebookMIT9675 85 246

GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ

Language:PythonApache-2.02945 42 216

neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Language:PythonApache-2.02065 34 188

gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Language:PythonApache-2.01779 29 48

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

1703 59 12

smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonMIT1094 19 81

DiffuSeq

[ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Language:PythonMIT698 26 78

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language:PythonMIT613 16 69

SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Language:PythonMIT593 17 25

SpQR

Language:PythonApache-2.0513 22 21

MS-AMP

Microsoft Automatic Mixed Precision Library

Language:PythonMIT479 11 57

QuIP

Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"

Language:Python315 9 10

MambaIR

A simple baseline for image restoration with state-space model.

Language:PythonApache-2.0297 5 34

q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Language:PythonMIT283 17 36

Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Language:Cuda218 11 11

BlackMamba

Code repository for Black Mamba

Language:Python206 4 7

language-model-arithmetic

Controlled Text Generation via Language Model Arithmetic

Language:PythonMIT182 8 7

RPTQ4LLM

Reorder-based post-training quantization for large language model

Language:PythonMIT176 7 12

LLM-FP4

The official implementation of the EMNLP 2023 paper LLM-FP4

Language:PythonMIT145 5 9

Train_Transformers_with_INT4

Language:Python127 5 3

PTQ4DM

Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)

Language:Python111 5 8

PTQD

The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models

Language:Jupyter Notebook81 5 16

disentangle-semantics-syntax

Code for "A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations" (NAACL 2019)

Language:Python68 2 4

Awesome-Mamba-in-Low-Level-Vision

A paper list of recent mamba efforts for low-level vision.

MIT5600

QuantSR

[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.

Language:PythonApache-2.037 3 3

DDTB

Pytorch implementation of our paper accepted by ECCV2022 -- Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networks

Language:Python27 1 5

Discriminator-Cooperative-Unlikelihood-Prompt-Tuning

The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation

Language:Python25 1 3

DiffuSeq_StylePTB

Language:PythonMIT10 1 3

Residual_Memory_Transformer

This repository contains code, data, checkpoints, and training and evaluation instructions for the paper: Controllable Text Generation with Residual Memory Transformer

Language:Python9 1 1

lcc0504