int8 quantization not working
homink commented
Hi, it looks like int8 quantization has mistakenly been configured as int8_float32 for a long time. Could anyone please take a look at it?
I think the code should be changed as follows.
From:
if (support_int8) {
  compute_types.emplace("int8");
  compute_types.emplace("int8_float32");
  if (support_float16)
    compute_types.emplace("int8_float16");
  if (support_bfloat16)
    compute_types.emplace("int8_bfloat16");
}
To:
if (support_int8)
  compute_types.emplace("int8");
if (support_float16)
  compute_types.emplace("int8_float16");
if (support_bfloat16)
  compute_types.emplace("int8_bfloat16");
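
To make the difference between the two versions easier to check, here is a minimal, self-contained sketch. The function names `current_types` and `proposed_types` and the use of `std::set<std::string>` are assumptions made for illustration only; they are not taken from the library, which may store `compute_types` differently.

```cpp
#include <set>
#include <string>

// Hypothetical helper: mirrors the "From" snippet as written.
std::set<std::string> current_types(bool support_int8,
                                    bool support_float16,
                                    bool support_bfloat16) {
  std::set<std::string> compute_types;
  if (support_int8) {
    compute_types.emplace("int8");
    compute_types.emplace("int8_float32");
    if (support_float16)
      compute_types.emplace("int8_float16");
    if (support_bfloat16)
      compute_types.emplace("int8_bfloat16");
  }
  return compute_types;
}

// Hypothetical helper: mirrors the "To" snippet as written.
// Without braces, only the "int8" line depends on support_int8.
std::set<std::string> proposed_types(bool support_int8,
                                     bool support_float16,
                                     bool support_bfloat16) {
  std::set<std::string> compute_types;
  if (support_int8)
    compute_types.emplace("int8");
  if (support_float16)
    compute_types.emplace("int8_float16");
  if (support_bfloat16)
    compute_types.emplace("int8_bfloat16");
  return compute_types;
}
```

In this sketch, `proposed_types` no longer registers "int8_float32". Read literally, it also registers "int8_float16" and "int8_bfloat16" even when `support_int8` is false, because those checks are no longer nested inside the int8 check. If that is not intended, the braces would need to be kept.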