Vahe1994 / SpQR

Why no save?

yhyu13 opened this issue

commented

Hi, thanks for sharing this great quantization technique.

But I am not sure I understand why saving is not supported at the moment. In main.py:

    if args.save or args.save_safetensors:
        raise NotImplementedError()

What are the obstacles that prevent compressed models from being saved and reloaded?

Do you know of any technique that allows dumping the model from VRAM to disk and reloading it from disk directly?
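
To be concrete about the kind of round trip I mean, here is a minimal sketch of the generic PyTorch pattern. The `nn.Linear` model and the `weights.pt` file name are hypothetical stand-ins; this is not SpQR's compressed format, and I am assuming nothing about how the quantized weights are stored internally.

```python
import os
import tempfile

import torch
from torch import nn

# Hypothetical stand-in model; not SpQR's model or its compressed representation.
device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(8, 4).to(device)

# Hypothetical output location inside a fresh temporary directory.
path = os.path.join(tempfile.mkdtemp(), "weights.pt")

# state_dict() maps parameter names to tensors; torch.save serializes them to disk.
torch.save(model.state_dict(), path)

# Rebuild the same architecture, then load the tensors directly onto the target device.
restored = nn.Linear(8, 4).to(device)
restored.load_state_dict(torch.load(path, map_location=device))

# Check that every tensor survived the round trip unchanged.
same = all(
    torch.equal(a, b)
    for a, b in zip(model.state_dict().values(), restored.state_dict().values())
)
```

This only covers ordinary dense tensors. Whether something equivalent can work for the compressed model is exactly what I am asking.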

commented

Sure, gonna stay tuned!