liucc's starred repositories
alpaca-lora
Instruct-tune LLaMA on consumer hardware
GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
awesome-model-quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
BlackMamba
Code repository for Black Mamba
language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
disentangle-semantics-syntax
Code for "A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations" (NAACL 2019)
Awesome-Mamba-in-Low-Level-Vision
A paper list of recent mamba efforts for low-level vision.
Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Generation
Residual_Memory_Transformer
This repository contains code, data, checkpoints, and training and evaluation instructions for the paper: Controllable Text Generation with Residual Memory Transformer