tflite to JSON to tflite quantization error

Question


tforgaard opened this issue a year ago

Hi!

When I convert my int8 quantized tflite model to JSON and back, the quantization scale parameters come out different (they match only to about 4 decimal places).

This makes the edgetpu compiler return an error when I try to compile the model:

edgetpu_compiler -m 13 -sa model_unroll.tflite
Edge TPU Compiler version 16.0.384591198
Started a compilation timeout timer of 180 seconds.
ERROR: :514 output->params.scale == 1. / 256 was not true.
ERROR: Node number 6 (LOGISTIC) failed to prepare.

Compilation failed: Model failed in Tflite interpreter. Please ensure model can be loaded/run in Tflite interpreter.
Compilation child process completed within timeout period.
Compilation failed!

Any idea of how to fix this?

[Image: inputs and outputs of the model before conversion]

[Image: inputs and outputs of the model after conversion]

Katsuya Hyodo · Answer 1 · Mon Feb 20 2023 20:13:14 GMT+0800 (China Standard Time)

This tool (environment) only provides a simple mechanism to call Google's official tool, flatc. I don't know what flatc did when you generated the JSON from tflite.

Have you checked the contents of the generated JSON file?
What kind of processing did you do to the JSON file?

Looking only at the results won't tell you much, so it is a good idea to read the flatbuffers source code.
https://github.com/google/flatbuffers

Theodor Forgaard · Answer 2 · Mon Feb 20 2023 20:28:15 GMT+0800 (China Standard Time)

I see now that the scale parameters in the JSON file only contain six decimals.
I originally tried to remove an unnecessary Unpack operation, but as a sanity check I also converted to JSON and back without doing any processing, and the problem still occurs.
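
To illustrate why six decimals are enough to trip the compiler's exact `output->params.scale == 1. / 256` check, here is a minimal Python sketch. It assumes the JSON writer emits fixed-point text with six decimal places (consistent with what is observed above); the helper `to_float32` is a hypothetical name used only for this illustration and is not part of flatbuffers or TFLite.

```python
import struct

def to_float32(x: float) -> float:
    """Round a Python float (double) to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]

# The LOGISTIC output scale the interpreter expects: exactly 1/256.
scale = to_float32(1.0 / 256)          # 0.00390625, exactly representable

# Assumed JSON export: fixed notation, six decimal places.
text = f"{scale:.6f}"

# Re-import: parse the text and store it as float32 again.
restored = to_float32(float(text))

print("written to JSON:", text)
print("restored == 1/256:", restored == scale)
print("absolute error:", abs(restored - scale))
```

The restored value differs from 1/256 by roughly 2.5e-7, which is tiny numerically but still fails an exact equality test.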

I think I found a related issue to work with: google/flatbuffers#5371

Katsuya Hyodo · Answer 3 · Mon Feb 20 2023 20:37:46 GMT+0800 (China Standard Time)

I see. The discussion in that issue seems to have stalled partway through. It doesn't look like a good idea to go through a JSON file to process the quantization parameters.

I wish there was a way to read and process the binary values of the flatbuffer directly.
Btw, two years ago I tried converting a flatbuffer byte array as follows.

https://github.com/PINTO0309/tflite2tensorflow/blob/c13504df2f82dc234f1009e34dbab9c8b65c7ce4/tflite2tensorflow/tflite2tensorflow.py#L5278-L5509

I am parsing the binary values read by TFLite runtime, but I have used JSON for intermediate files, so the same phenomenon is likely occurring.

Theodor Forgaard · Answer 4 · Mon Feb 20 2023 22:33:37 GMT+0800 (China Standard Time)

It might not be the best solution, but I think I'm gonna go for changing the flatbuffers source code to use more decimals when writing floats to JSON, as described here: google/flatbuffers#5371 (comment).

Katsuya Hyodo · Answer 5 · Mon Feb 20 2023 22:44:16 GMT+0800 (China Standard Time)

Excellent.

So you are saying that you intend to rewrite this line and recompile:

return FloatToString(t, 6);

Theodor Forgaard · Answer 6 · Mon Feb 20 2023 22:52:07 GMT+0800 (China Standard Time)

Yes, I will let you know if it works or not.

Theodor Forgaard · Answer 7 · Tue Feb 21 2023 02:21:37 GMT+0800 (China Standard Time)

Worked like a charm!

Katsuya Hyodo · Answer 8 · Tue Feb 21 2023 09:17:45 GMT+0800 (China Standard Time)

OK. I confirmed that 17 digits works for me too. I will update the repository.
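
As a rough check of the effect of that change, the sketch below compares six and 17 decimal places for a few typical int8 scale values. It is only an illustration under assumptions: fixed-point formatting is assumed for the JSON text, the sample scales are made up, and `to_float32` and `survives_roundtrip` are hypothetical helper names, not flatbuffers functions.

```python
import struct

def to_float32(x: float) -> float:
    """Round a Python float (double) to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]

def survives_roundtrip(scale: float, decimals: int) -> bool:
    """True if a float32 scale is unchanged after text export and re-import."""
    original = to_float32(scale)
    text = f"{original:.{decimals}f}"   # assumed fixed-point JSON formatting
    return to_float32(float(text)) == original

# Illustrative scale values only; real models will have their own.
scales = [1.0 / 256, 2.0 / 255, 6.0 / 255]

results = {d: [survives_roundtrip(s, d) for s in scales] for d in (6, 17)}
for decimals, outcome in results.items():
    print(decimals, "decimals:", outcome)
```

With these sample values, none of the scales survive at six decimals and all of them survive at 17. Very small scales have fewer significant digits in fixed notation, so it is worth re-checking the exported model rather than assuming every value round-trips.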

Katsuya Hyodo · Answer 9 · Tue Feb 21 2023 09:38:31 GMT+0800 (China Standard Time)

https://github.com/PINTO0309/tflite2json2tflite/releases/tag/1.1.0

https://github.com/PINTO0309/flatbuffers

Theodor Forgaard · Answer 10 · Tue Feb 21 2023 15:20:25 GMT+0800 (China Standard Time)

Nice! Thanks for updating the repo