ModelCloud/GPTQModel Issues
[BUG] [CI] Flaky test_repacking.py
Closed 3
[BUG] Gemma 2 - 27B Regression
Updated 6
[FEATURE] Add Gemma 2
Closed 1
Buffers in marlin setting
Closed 4
[FEATURE] Intel/Habana HPU Support
Updated 1
Import error
Closed 9
An easy-to-use LLM quantization and inference toolkit based on the GPTQ algorithm (weight-only quantization).