Quantization for training / finetuning

Question

Quantization for training / finetuning

torphix opened this issue a year ago · comments

Hi!
Thanks for the lib and tutorial, it is very informative.

With respect to finetuning would it be worth quantizing the model first to fp16 or even int8 before beginning training?
As this might lead to better accuracy when compared to quantizing after the model has been finetuned?

Thanks