How is evaluation done without storing quantized weights?

Question

tahmiddialpad opened this issue a year ago

I would like to know how the quantized model's performance is evaluated while keeping this line blank?

How are the quantized weights being considered while evaluating the performance of the model since the value of quantizers here is empty?

Poedator · Answer 1 · Mon Jul 17 2023 20:33:37 GMT+0800 (China Standard Time)

Hello, @tahmiddialpad,

Thank you for your interest in SpQR compression.

When the quantize() function is called in line 217, it quantizes and then immediately dequantizes the model weights, so the evaluated model already carries the quantization error in its weights. See lines 178-189 in spqr_engine.py.
The line that you mentioned would return the quantizers needed to save the compressed model. This saving functionality is not yet implemented.