About the benchmark device

Question

About the benchmark device

austingg opened this issue 7 years ago · comments

Yubin Wang commented 7 years ago

hi @yonghenglh6 ,
which gpu do you use for the benchmark time of depthwise conv?

刘灏@megvii.com · Answer 1 · Tue Jun 27 2017 13:30:30 GMT+0800 (China Standard Time)

GeForce GTX 1080

Yubin Wang · Answer 2 · Tue Jun 27 2017 17:44:28 GMT+0800 (China Standard Time)

@yonghenglh6 what's your cudnn version? I use GTX1080 cudnn v5.1, the example net costs about 7ms for forward pass and 10 ms for backward pass (take bn into consideration).

Beside, the example network prototxt with its' name *** 128 *** , however its' input is 224, and on 224 case , the last avg pooling layer's kernel size should be 7 instead 4.

刘灏@megvii.com · Answer 3 · Tue Jun 27 2017 18:02:07 GMT+0800 (China Standard Time)

You are right at all.
I mismatch the performance with the my half mobilenet. I will fix it. Thanks

刘灏@megvii.com · Answer 4 · Tue Jun 27 2017 20:08:13 GMT+0800 (China Standard Time)

@austingg
It is fixed now. The speed-up performance is less attractive.

Yubin Wang · Answer 5 · Tue Jun 27 2017 20:10:53 GMT+0800 (China Standard Time)

@yonghenglh6 doesn't matter. We can make it faster step by step. And Now it is indeed faster than Depthwise with group.