Nearest Rounding Kernel
Tiiiger opened this issue · comments
Tianyi commented
Right now I have only implemented stochastic rounding CUDA kernel. We need to add nearest rounding CUDA kernel.
linzhiqiu commented
Done. Need testing
Tianyi commented
Need to modify quant_module. Please do.
Tianyi commented
Need to Implement nearest rounding for float kernel. See CUDA implementation. You can ignore clip_exponent for now. @linzhiqiu