BVLC / caffe

Caffe: a fast open framework for deep learning.

Home Page:http://caffe.berkeleyvision.org/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

caffe time -model -weights -gpu=0

everjcc opened this issue · comments

caffe time -gpu

Issue summary

caffe time -model=xxx -weighs=xxx -gpu=0
the log is:
I0312 15:29:30.427956 2367 caffe.cpp:406] Average time per layer:
I0312 15:29:30.427961 2367 caffe.cpp:409] data forward: 0.0018944 ms.
I0312 15:29:30.427969 2367 caffe.cpp:412] data backward: 0.0018848 ms.
I0312 15:29:30.427975 2367 caffe.cpp:409] conv1 forward: 0.10807 ms.
I0312 15:29:30.427982 2367 caffe.cpp:412] conv1 backward: 0.182646 ms.
I0312 15:29:30.427989 2367 caffe.cpp:409] relu1 forward: 0.0140288 ms.
I0312 15:29:30.427994 2367 caffe.cpp:412] relu1 backward: 0.0018432 ms.
I0312 15:29:30.428000 2367 caffe.cpp:409] norm1 forward: 0.0628864 ms.
I0312 15:29:30.428007 2367 caffe.cpp:412] norm1 backward: 0.105226 ms.
I0312 15:29:30.428014 2367 caffe.cpp:409] pool1 forward: 0.0158592 ms.
I0312 15:29:30.428020 2367 caffe.cpp:412] pool1 backward: 0.0018784 ms.
I0312 15:29:30.428027 2367 caffe.cpp:409] conv2 forward: 0.291235 ms.
I0312 15:29:30.428033 2367 caffe.cpp:412] conv2 backward: 0.515402 ms.
I0312 15:29:30.428040 2367 caffe.cpp:409] relu2 forward: 0.0101152 ms.
I0312 15:29:30.428048 2367 caffe.cpp:412] relu2 backward: 0.0018592 ms.
I0312 15:29:30.428056 2367 caffe.cpp:409] norm2 forward: 0.137219 ms.
I0312 15:29:30.428066 2367 caffe.cpp:412] norm2 backward: 0.256826 ms.
I0312 15:29:30.428073 2367 caffe.cpp:409] pool2 forward: 0.0133536 ms.
I0312 15:29:30.428084 2367 caffe.cpp:412] pool2 backward: 0.0024768 ms.
I0312 15:29:30.428092 2367 caffe.cpp:409] conv3 forward: 0.14239 ms.
I0312 15:29:30.428098 2367 caffe.cpp:412] conv3 backward: 0.3532 ms.
I0312 15:29:30.428107 2367 caffe.cpp:409] relu3 forward: 0.008976 ms.
I0312 15:29:30.428114 2367 caffe.cpp:412] relu3 backward: 0.0020128 ms.
I0312 15:29:30.428123 2367 caffe.cpp:409] conv4 forward: 0.117597 ms.
I0312 15:29:30.428130 2367 caffe.cpp:412] conv4 backward: 0.292886 ms.
I0312 15:29:30.428138 2367 caffe.cpp:409] relu4 forward: 0.0090048 ms.
I0312 15:29:30.428145 2367 caffe.cpp:412] relu4 backward: 0.001872 ms.
I0312 15:29:30.428153 2367 caffe.cpp:409] conv5 forward: 0.109824 ms.
I0312 15:29:30.428160 2367 caffe.cpp:412] conv5 backward: 0.368051 ms.
I0312 15:29:30.428165 2367 caffe.cpp:409] relu5 forward: 0.0088512 ms.
I0312 15:29:30.428174 2367 caffe.cpp:412] relu5 backward: 0.0018848 ms.
I0312 15:29:30.428182 2367 caffe.cpp:409] pool5 forward: 0.0117792 ms.
I0312 15:29:30.428189 2367 caffe.cpp:412] pool5 backward: 0.00256 ms.
I0312 15:29:30.428197 2367 caffe.cpp:409] fc6 forward: 0.417875 ms.
I0312 15:29:30.428205 2367 caffe.cpp:412] fc6 backward: 3.15267 ms.
I0312 15:29:30.428212 2367 caffe.cpp:409] relu6 forward: 0.0122656 ms.
I0312 15:29:30.428264 2367 caffe.cpp:412] relu6 backward: 0.0018912 ms.
I0312 15:29:30.428273 2367 caffe.cpp:409] drop6 forward: 0.0127136 ms.
I0312 15:29:30.428282 2367 caffe.cpp:412] drop6 backward: 0.001856 ms.
I0312 15:29:30.428292 2367 caffe.cpp:409] fc7 forward: 0.1988 ms.
I0312 15:29:30.428300 2367 caffe.cpp:412] fc7 backward: 2.72682 ms.
I0312 15:29:30.428308 2367 caffe.cpp:409] relu7 forward: 0.0122848 ms.
I0312 15:29:30.428316 2367 caffe.cpp:412] relu7 backward: 0.0019136 ms.
I0312 15:29:30.428328 2367 caffe.cpp:409] drop7 forward: 0.0126016 ms.
I0312 15:29:30.428339 2367 caffe.cpp:412] drop7 backward: 0.0018944 ms.
I0312 15:29:30.428347 2367 caffe.cpp:409] fc8 forward: 0.109283 ms.
I0312 15:29:30.428378 2367 caffe.cpp:412] fc8 backward: 2.68584 ms.
I0312 15:29:30.428388 2367 caffe.cpp:409] prob forward: 0.0146496 ms.
I0312 15:29:30.428395 2367 caffe.cpp:412] prob backward: 0.0018528 ms.
I0312 15:29:30.428421 2367 caffe.cpp:417] Average Forward pass: 55.8925 ms.
I0312 15:29:30.428429 2367 caffe.cpp:419] Average Backward pass: 65.4428 ms.
I0312 15:29:30.430272 2367 caffe.cpp:421] Average Forward-Backward: 127.954 ms.
I0312 15:29:30.430285 2367 caffe.cpp:423] Total Time: 1279.54 ms.
I0312 15:29:30.430291 2367 caffe.cpp:424] *** Benchmark ends ***

The sum of forward_time_per_layer is not equal to the average forward pass(2.01ms < 55.89ms) , please help me to solve it, thanks very much.