llm-quantization

There are 0 repository under llm-quantization topic.

snu-mllab / GuidedQuant
Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025)
efficient-inference large-language-models llm-inference llm-quantization quantization
Language:Python 47
GongCheng1919 / bias-compensation
[CAAI AIR'24] Minimize Quantization Output Error with Bias Compensation
llm-compression post-training-quantization bias-compensation llm-quantization output-error-optimization
Language:Python 8
paraglondhe098 / sentiment-classification-llm
Implemented and fine-tuned BERT for a custom sequence classification task, leveraging LoRA adapters for efficient parameter updates and 4-bit quantization to optimize performance and resource utilization.
llm llm-fine-tuning llm-quantization quantization data-augmentation nlp nlp-augmentation lora peft-fine-tuning-llm qlora
Language:Jupyter Notebook 0
MagicTeaMC / AutoGGUF
Let me make GGUF files quickly
gguf llama-cpp llamacpp llm llm-quantization
Language:Python
nagababumo / Quantization-in-Depth
2-bit dequantization hugging-face hugging-face-hub llm-quantization pytorch quantization torch-quantization
Language:Jupyter Notebook

snu-mllab / GuidedQuant