How to quantize and compress a trained model?
APeiZou opened this issue · comments
APeiZou commented
How can I quantize and compress a trained model?
THUMIG_discarded commented
You can load the model and call torchslim.quantizing.qat.QATSolver to run quantization-aware training (QAT); the resulting model can then be converted automatically into the TensorRT format.
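For intuition, QAT works by "fake-quantizing" weights and activations during the forward pass, so the network learns to tolerate the rounding error that low-precision inference will introduce. Below is a minimal, library-free sketch of that fake-quantization step; the function name and the symmetric per-tensor scheme are illustrative only, not torchslim's actual API:

```python
def fake_quantize(values, num_bits=8):
    """Simulate integer quantization on a list of floats:
    map each value onto a signed num_bits integer grid, then
    dequantize back to float. QAT inserts this kind of op into
    the forward pass so training sees the rounding error."""
    qmax = 2 ** (num_bits - 1) - 1          # e.g. 127 for int8
    max_abs = max(abs(v) for v in values) or 1.0
    scale = max_abs / qmax                   # one scale for the whole tensor
    # Quantize (round to the integer grid), clamp, then dequantize.
    return [max(-qmax - 1, min(qmax, round(v / scale))) * scale
            for v in values]

weights = [0.5, -1.27, 0.003, 1.27]
print(fake_quantize(weights))  # small values round away; extremes survive
```

In a real QAT solver the rounding is paired with a straight-through gradient estimator so backpropagation can flow through the non-differentiable `round`; for the actual training loop and TensorRT export, see the linked example and source below.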
A simple example is here:
https://github.com/THU-MIG/torch-model-compression/blob/main/examples/torchslim/pytorch-cifar/qat.py
The solver's source code is here:
https://github.com/THU-MIG/torch-model-compression/blob/main/torchslim/quantizing/qat.py