facebookresearch / diffq

DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight or group of weights, in order to achieve a given trade-off between model size and accuracy.

facebookresearch/diffq Issues

Will diffq make model faster?
Updated a year ago
Getting error by pip install diffq on Windows
Closed 2 years ago3
Quantized Model Output NaN / 0
Updated 2 years ago8
Why checkpoint.pth on the output folder is not in compliance with true model size?
Closed 3 years ago2
Number of parameters doubled
Closed 3 years ago1
require 'override' keyword
Updated 3 years ago3
where the activation/feature-map is quantized?
Updated 3 years ago1
Is it compatible with transformers library?
Updated 4 years ago2