ModelCloud / GPTQModel

An easy-to-use LLM quantization and inference toolkit based on GPTQ algorithm (weight-only quantization).

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ModelCloud/GPTQModel Issues