Can you provide the specific time and accuracy of each epoch of tflite_cpu training the conv-bn-relu kernel?

lorena527 opened this issue a year ago

Can you provide the specific time and accuracy of each epoch of tflite_cpu training the conv-bn-relu kernel? I followed the nn-Meter method to train on a Huawei P30, but the conv-bn-relu kernel reached only 30% accuracy in the first epoch, and that epoch took two days.

lorena527 · Answer 1 · Sun Apr 23 2023 11:08:00 GMT+0800 (China Standard Time)

In my experiment, I set init_sample_num to 10000, finegrained_sample_num to 20, and iteration to 3.

Jiahang Xu · Answer 2 · Wed Apr 26 2023 16:55:48 GMT+0800 (China Standard Time)

Hi,

Thanks for raising this issue.

Our settings (init_sample_num, finegrained_sample_num, iteration) are basically the same as yours. In our previous experiments, the first round of training achieved a ±10% accuracy of at least 70%. In my opinion, an accuracy as low as yours suggests that something is wrong. You can try checking the profiling process on your phone. In our experiments, we fixed the frequency of specific cores on the phone and pinned the benchmark to those cores with the taskset command.
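For reference, here is a minimal sketch of how such an accuracy figure can be computed. It assumes the "10% accuracy" above means the fraction of predictions whose relative error is within 10% of the measured latency. The function name `within_tolerance_accuracy` and the sample latencies are hypothetical and are not taken from nn-Meter.

```python
def within_tolerance_accuracy(predicted, measured, tolerance=0.10):
    """Return the fraction of predictions within `tolerance` relative error.

    Assumption: this mirrors a "±10% accuracy" style metric. It is an
    illustrative helper, not nn-Meter's own implementation.
    """
    if len(predicted) != len(measured) or not measured:
        raise ValueError("need two non-empty lists of equal length")
    hits = sum(
        1 for p, m in zip(predicted, measured)
        if abs(p - m) <= tolerance * m
    )
    return hits / len(measured)


# Hypothetical latencies in milliseconds, for illustration only.
measured = [10.0, 20.0, 30.0, 40.0]
predicted = [10.5, 23.0, 29.0, 50.0]
acc = within_tolerance_accuracy(predicted, measured)
print(f"±10% accuracy: {acc:.0%}")
```

Under this reading, a first-round value near 30% would mean that most predictions miss the measured latency by more than 10%, which is consistent with noisy profiling data.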

You can try a simple check: run the same model 20 times under your settings and compare the latency across 10 of those runs. In our experience, TFLite profiling is quite stable, and the difference should generally not exceed 5%-10%.
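A rough sketch of this check, assuming you have already collected the per-run latencies yourself (for example, from your profiling logs). The helper `latency_spread`, the choice of (max - min) / median as the measure of difference, and the sample values are assumptions made for illustration.

```python
import statistics


def latency_spread(latencies_ms):
    """Return (max - min) / median for a list of measured latencies.

    Assumption: this ratio is one reasonable way to express the
    "difference" between runs; other measures would work as well.
    """
    if len(latencies_ms) < 2:
        raise ValueError("need at least two measurements")
    median = statistics.median(latencies_ms)
    return (max(latencies_ms) - min(latencies_ms)) / median


# Hypothetical latencies (ms) from 10 runs of the same model.
runs_ms = [12.1, 12.3, 12.0, 12.4, 12.2, 12.1, 12.3, 12.2, 12.0, 12.4]
spread = latency_spread(runs_ms)
is_stable = spread <= 0.10  # upper end of the 5%-10% range mentioned above
print(f"spread: {spread:.1%}, stable: {is_stable}")
```

If the spread is well above 10%, the core frequency or core pinning on the phone is worth rechecking before retraining the predictor.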

Hope this information could help.

Best regards,
Jiahang

lorena527 · Answer 3 · Thu May 11 2023 14:43:49 GMT+0800 (China Standard Time)

Dear Jiahang,

Thank you for your assistance with our project. I tried your method: I ran a model 20 times under the same settings and observed the latency fluctuation across 10 of the runs. The sampled data seems fine at the moment.

Thank you very much for your time and help.

Best regards,
nn