Hi Gustav I have questions!

Question

Hi Gustav I have questions!

sankim90 opened this issue 6 years ago · comments

First, I was amazed at your work. It fits perfectly in my work.

I am working at JetsonTX2 & DrivePX2, and as you know, there is a speed issue.

I got information about the various works and github.

Q1. How can you achieve 30 fps at SSD mobilenet JetsonTX2?
AS mentioned (1), you manually assigning the CNN related nodes on GPU and the rest nodes on CPU at tensorflow? How?

Q2. Have you experimented with other frameworks?
I have experimented with openCV DNN (SSD-mobilenet), caffe (SSD-mobilenet), darknet (YOLO v2, v3) and tensorflow (SSD-mobilenet).

However, i got performance up to only 9 fps.

Do you think the above frameworks lacks the ability to optimize GPU / CPU allocation?

Thank you

Gustav von Zitzewitz · Answer 1 · Wed Jul 24 2019 15:12:40 GMT+0800 (China Standard Time)

Q1: The problem is that the tensorflow NMS implementation is not running fast on gpu, therefore i go through all layers/nodes and place the ones connected to the NMS on CPU, which does it much faster.

Q2: No, only darknet, which wperforms well also. But still slower than my approach.