How is evaluation being done without storing quantized weights?
tahmiddialpad opened this issue
Hello, @tahmiddialpad,
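Thank you for your interest in SpQR compression.
When the quantize() function is called in line 217, it quantizes and then immediately dequantizes the model weights (see lines 178-189 in spqr_engine.py). The model being evaluated therefore holds ordinary floating-point weights that already carry the quantization error, so nothing needs to be stored in quantized form for evaluation.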
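To illustrate what "quantize and then immediately dequantize" means in general, here is a minimal sketch with a plain min-max uniform quantizer. It is not the SpQR algorithm and not the code from spqr_engine.py; the function name `quantize_dequantize`, its parameters, and the returned tuple are hypothetical and chosen only for this example.

```python
def quantize_dequantize(weights, bits=3):
    """Round weights onto a uniform grid, then map them straight back to floats.

    Illustrative min-max quantizer only (an assumption for this example),
    not SpQR's actual procedure.
    """
    levels = 2 ** bits - 1
    w_min, w_max = min(weights), max(weights)
    # Fall back to 1.0 so constant inputs do not cause a division by zero.
    scale = (w_max - w_min) / levels or 1.0
    # Integer codes in [0, levels]: this is what a saved model would store.
    codes = [round((w - w_min) / scale) for w in weights]
    # Floats again: this is what the evaluated model actually uses.
    dequantized = [c * scale + w_min for c in codes]
    return dequantized, (scale, w_min)


weights = [-1.0, -0.4, 0.1, 0.75, 1.0]
simulated, quantizer_params = quantize_dequantize(weights, bits=2)
```

Here `simulated` is still a list of floats, but it takes at most 2**bits distinct values, so running the model with it measures the accuracy of the quantized model. The `quantizer_params` tuple stands in for the information one would need to keep in order to save a compressed checkpoint.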
The line that you mentioned returns the quantizers that would be needed to save the compressed model. This saving functionality is not yet implemented.