Speed Question(Wrong Speed Test)

Question

Speed Question(Wrong Speed Test)

lxtGH opened this issue 5 years ago · comments

Hi!! Thanks for sharing your codes.
I have seen your result in your paper about bi-seg. I tried to reproduce the result of bi-seg, however I only got 71%IOU（single scale）, have you successfully got 74% IOU results on that? May be he use ms test.
what is your advantages compared with bi-seg? More Light (Less memory cost)?

Xiangtai Li · Answer 1 · Mon Nov 26 2018 21:43:31 GMT+0800 (China Standard Time)

Hi! @wutianyiRosun
There are two questions I want to ask:

I test your model with 1024 * 2048 as input images. However I only got 9.5 fps on single 1080TI.
I use the code like the following
start = time.time()
out = model(img)
torch.cuda.synchronize()
print('Speed: {} fps.'.format(1.0/(time.time()-start)))
In your paper, ICnet only 16 fps which is not consistent with his paper. (30 fps)
I don't know why. The gap between K80 and 1080TI is such large ??
Is my testing method wrong? I want to know the way you evalute your model for speed.
Also, I use my test method to test ESP-net which result in nearly the same as your paper report(49 fps)

Xiangtai Li · Answer 2 · Mon Jan 28 2019 22:18:51 GMT+0800 (China Standard Time)

I think it is wrong speed testing.

Rosun · Answer 3 · Tue Jan 29 2019 18:40:38 GMT+0800 (China Standard Time)

Hi, @lxtGH
A1: The speed we report is the average speed on the verification set. What is your cuda and cudnn version?
A2: The speed of ICNet is tested on K80, The paper of ICNet reported 30 fps, which is tested on 1080TI. The speed between them is about two to three times the difference.

Xiangtai Li · Answer 4 · Sat Feb 16 2019 11:52:10 GMT+0800 (China Standard Time)

Hi , @wutianyiRosun
Did you use this line ?
torch.cuda.synchronize() for speed test?

yyfyan · Answer 5 · Tue Feb 19 2019 15:29:48 GMT+0800 (China Standard Time)

@lxtGH @wutianyiRosun
I have a problem:
Pytorch DwConv's implement is not good,it's very slow. The CGNet'paper result is based on Pytorch?

Xiangtai Li · Answer 6 · Thu Feb 21 2019 13:58:20 GMT+0800 (China Standard Time)

@wutianyiRosun If you didn't use the line of code, your speed reported in your paper is wrong.
@yyfyan What speed results you get ?

Gen Li · Answer 7 · Wed Mar 06 2019 09:08:09 GMT+0800 (China Standard Time)

I got the same result as you, a single 1080Ti got about 9 FPS. @lxtGH
And if I didn't use torch.cuda.synchronize()
The FPS will increase to ~70Fps, this code is necessary.

Xiangtai Li · Answer 8 · Wed Mar 13 2019 14:03:15 GMT+0800 (China Standard Time)

@Reagan1311 Yes, GPU and CPU must synchronize (CPU must wait until the end of GPU forwarding )

XuanyiLi · Answer 9 · Tue Apr 02 2019 21:42:21 GMT+0800 (China Standard Time)

10FPS only! I test it on 1080Ti.
I suggest that you change your arxiv paper.

Lin-Zhuo Chen · Answer 10 · Tue Apr 02 2019 21:46:06 GMT+0800 (China Standard Time)

I met the same question. I hope I can get an answer. Thank you.

Rosun · Answer 11 · Wed Apr 03 2019 09:23:14 GMT+0800 (China Standard Time)

@meteorshowers, @LinZhuoChen We will update in the next few days.