terrychenism / caffe-model

Caffe models (includimg classification, detection and segmentation) and deploy files for famouse networks

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Caffe-model

Caffe models (include classification, detection and segmentation) and deploy prototxt for resnet, resnext, inception_v3, inception_v4, inception_resnet, wider_resnet, densenet and other networks.

We recommend using these caffe models with py-RFCN-priv

Please install py-RFCN-priv for evaluating and finetuning.

Disclaimer

Most of the pre-train models are converted from other projects, the main contribution belongs to the original authors.

Project links:

mxnet-model-gallerytensorflow slimcraftGBDResNeXtDenseNetwide-residual-networkskeras deep-learning-modelsademxapp

CLS (Classification, more details are in cls)

Performance on imagenet validation.

Top-1/5 error of pre-train models in this repository (Pre-train models download urls).

Network 224/299
(single-crop)
224/299
(12-crop)
320/395
(single-crop)
320/395
(12-crop)
resnet18-priv 29.11/10.07 26.69/8.64 27.54/8.98 26.23/8.21
resnext26-32x4d-priv 24.93/7.75 23.54/6.89 24.20/7.21 23.19/6.60
resnet101-v2 21.95/6.12 19.99/5.04 20.37/5.16 19.29/4.57
resnet152-v2 20.85/5.42 19.24/4.68 19.66/4.73 18.84/4.32
resnet269-v2 19.71/5.00 18.25/4.20 18.70/4.33 17.87/3.85
resnet38a 20.66/5.27 ../.. 19.25/4.66 ../..
inception-v3 21.67/5.75 19.60/4.73 20.10/4.82 19.25/4.24
xception 20.90/5.49 19.68/4.90 19.58/4.77 18.91/4.39
inception-v4 20.03/5.09 18.60/4.30 18.68/4.32 18.12/3.92
inception-resnet-v2 19.86/4.83 18.46/4.08 18.75/4.02 18.15/3.71
resnext50-32x4d 22.37/6.31 20.53/5.35 21.10/5.53 20.37/5.03
resnext101-32x4d 21.30/5.79 19.47/4.89 19.91/4.97 19.19/4.59
resnext101-64x4d 20.60/5.41 18.88/4.59 19.26/4.63 18.48/4.31
wrn50-2
(resnet50-1x128d)
22.13/6.13 20.09/5.06 20.68/5.28 19.83/4.87
  • The resnet18-priv, resnext26-32x4d-priv are trained under pytorch by bupt-priv.
  • The pre-train models are tested on original caffe by evaluation_cls.py, but ceil_mode:false(pooling_layer) is used for the models converted from torch, the detail in https://github.com/BVLC/caffe/pull/3057/files. If you remove ceil_mode:false, the performance will decline about 1% top1.
  • 224x224(base_size=256) and 320x320(base_size=320) crop size for resnet-v2/resnext/wrn, 299x299(base_size=320) and 395x395(base_size=395) crop size for inception.

DET (Detection, more details are in det)

Object Detection Performance on PASCAL VOC.

Original faster rcnn train on VOC 2007+2012 trainval and test on VOC 2007 test.

Network mAP@50 train speed train memory test speed test memory
resnet18 70.02 9.5 img/s 1,235MB 17.5 img/s 989MB
resnet101 -- -- -- -- --
resnet101-v2 79.6 3.1 img/s 6,495MB 7.1 img/s 4,573MB
resnet152-v2 80.72 2.8 img/s 9,315MB 6.2 img/s 6,021MB
wrn50-2 78.59 2.1 img/s 4,895MB 4.9 img/s 3,499MB
resnext50-32x4d 77.99 3.6 img/s 5,315MB 7.4 img/s 4,305MB
resnext101-32x4d 79.98 2.7 img/s 7,836MB 6.3 img/s 5,705MB
resnext101-64x4d 80.71 2.0 img/s
(batch=96)
11,277MB 3.7 img/s 9,461MB
inception-v3 78.6 4.1 img/s 4,325MB 7.3 img/s 3,445MB
inception-v4 81.49 2.6 img/s 6,759MB 5.4 img/s 4,683MB
inception-resnet-v2 80.0 2.0 img/s
(batch=112)
11,497MB 3.2 img/s 8,409MB
densenet-161 -- -- -- -- --
densenet-201 77.53 3.9 img/s
(batch=72)
10,073MB 5.5 img/s 9,955MB
resnet38a 80.1 1.4 img/s 8,723MB 3.4 img/s 5,501MB
  • To reduce memory usage, we merge all the models batchnorm layer parameters into scale layer, more details please refer to faster-rcnn-resnet or pva-faster-rcnn;
  • We also split the deploy file to rpn deploy file and rcnn deploy file for adopting more testing tricks.
  • Performanc, speed and memory are calculated on py-RFCN-priv with Nvidia Titan pascal, we do not guarantee that the results can be reproduced under any other conditions;
  • All the models are trained on a single scale (600*1000) with image flipping and train-batch=128 for 80,000 iterations, tested on the same single scale with test-batch=300 and nms=0.3;

License

caffe-model is released under the MIT License (refer to the LICENSE file for details).

Acknowlegement

I greatly thank Yangqing Jia and BVLC group for developing Caffe.

And I would like to thank all the authors of every network.

About

Caffe models (includimg classification, detection and segmentation) and deploy files for famouse networks

License:MIT License


Languages

Language:Python 100.0%