[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs
Home Page:https://arxiv.org/abs/2402.05147
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool