Vahe1994 / SpQR


Can I save the compressed model for direct inference only?

SparkJiao opened this issue · comments

Excellent work

May I know whether I can save the compressed model locally for later inference, e.g., combined with LoRA adapters?

I saw the NotImplementedError being raised, so I'm not sure whether there is something I should be aware of.

Thanks!
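The general pattern being asked about (persisting an already-compressed model's weights so it can be reloaded for inference only) could be sketched as below. This is a generic PyTorch sketch, not SpQR's actual API; the model, file name, and shapes are all illustrative placeholders.

```python
import torch
import torch.nn as nn

# Stand-in for a compressed/quantized model; SpQR's real model class
# and save path are assumptions here, not the repo's actual API.
model = nn.Linear(8, 4)
torch.save(model.state_dict(), "compressed_model.pt")

# Later, restore the weights for inference only.
restored = nn.Linear(8, 4)
restored.load_state_dict(torch.load("compressed_model.pt"))
restored.eval()

with torch.no_grad():
    out = restored(torch.randn(1, 8))
print(out.shape)
```

Note that this round-trips plain dense weights; a real SpQR checkpoint would need the repo's own (de)serialization of its sparse-quantized format, which is what the NotImplementedError guards.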

Hey!

Thanks for the interest in our work!

We've released only the evaluation code for now (you can use it to evaluate the compressed model's quality).

We'll add code for efficient inference soon (in ~2 weeks).

Thanks for your clarification!


Has the inference code been released by now? I failed to find it!