casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Home Page:https://casper-hansen.github.io/AutoAWQ/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

casper-hansen/AutoAWQ Watchers