Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
Home Page:https://arxiv.org/abs/2310.16795
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
win10ogod opened this issue 7 months ago · comments
Can it support Mixtral-8x7B quantization? Is there any teaching?