Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool