Caffe & SSD
Please cite Caffe and SSD in your publications if they help your research:

    @inproceedings{liu2016ssd,
      title = {{SSD}: Single Shot MultiBox Detector},
      author = {Liu, Wei and Anguelov, Dragomir and Erhan, Dumitru and Szegedy, Christian and Reed, Scott and Fu, Cheng-Yang and Berg, Alexander C.},
      booktitle = {ECCV},
      year = {2016}
    }

    @article{jia2014caffe,
      title = {Caffe: Convolutional Architecture for Fast Feature Embedding},
      author = {Jia, Yangqing and Shelhamer, Evan and Donahue, Jeff and Karayev, Sergey and Long, Jonathan and Girshick, Ross and Guadarrama, Sergio and Darrell, Trevor},
      journal = {arXiv preprint arXiv:1408.5093},
      year = {2014}
    }
Component | Reference
---|---
mobilenetv1-ssd | Ref
mobilenetv2-ssd | Ref
mobilenetv2-ssdlite | Ref
FocalLoss | Ref
DepthwiseConvolution | Ref
ShuffleNet | Ref
- FocalLoss
- mobilenetv1-ssd
- mobilenetv2-ssd (ssdlite)
- DepthwiseConvolution
- ShuffleNet
A Caffe implementation of the FAIR paper "Focal Loss for Dense Object Detection" (Lin et al., ICCV 2017) for SSD.
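
For reference, the loss from the paper down-weights well-classified examples relative to the standard cross entropy:

$$\mathrm{FL}(p_t) = -\alpha_t \, (1 - p_t)^{\gamma} \, \log(p_t)$$

where $p_t$ is the predicted probability of the ground-truth class and $\alpha_t$ is the class-balancing weight; these correspond to the alpha and gamma fields of focal_loss_param below. To enable it, change the type of the mbox_loss layer in your training prototxt and turn off online hard example mining (mining_type: NONE), for example: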

    layer {
      name: "mbox_loss"
      type: "MultiBoxFocalLoss" # change the type
      bottom: "mbox_loc"
      bottom: "mbox_conf"
      bottom: "mbox_priorbox"
      bottom: "label"
      top: "mbox_loss"
      include {
        phase: TRAIN
      }
      propagate_down: true
      propagate_down: true
      propagate_down: false
      propagate_down: false
      loss_param {
        normalization: VALID
      }
      focal_loss_param { # set the alpha and gamma, default is alpha=0.25, gamma=2.0
        alpha: 0.25
        gamma: 2.0
      }
      multibox_loss_param {
        loc_loss_type: SMOOTH_L1
        conf_loss_type: SOFTMAX
        loc_weight: 1.0
        num_classes: 21
        share_location: true
        match_type: PER_PREDICTION
        overlap_threshold: 0.5
        use_prior_for_matching: true
        background_label_id: 0
        use_difficult_gt: true
        neg_pos_ratio: 3.0
        neg_overlap: 0.5
        code_type: CENTER_SIZE
        ignore_cross_boundary_bbox: false
        mining_type: NONE # do not use OHEM
      }
    }
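
For intuition, here is a numpy sketch of the softmax focal term applied per prior; the actual MultiBoxFocalLoss layer additionally handles prior matching, the localization term, and normalization, so this only illustrates the confidence part:

```python
import numpy as np

def softmax_focal_loss(conf, labels, alpha=0.25, gamma=2.0):
    """Focal version of the softmax confidence loss.

    conf: (num_priors, num_classes) raw class scores
    labels: (num_priors,) integer class ids (0 = background)
    """
    conf = conf - conf.max(axis=1, keepdims=True)                 # numerical stability
    prob = np.exp(conf) / np.exp(conf).sum(axis=1, keepdims=True)
    p_t = prob[np.arange(len(labels)), labels]                    # probability of the true class
    return float(np.mean(-alpha * (1.0 - p_t) ** gamma * np.log(p_t)))
```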
A Caffe implementation of the MobileNet-SSD detection network, with pretrained weights on VOC0712 and mAP=0.727.
Network | mAP | Download (train) | Download (deploy)
---|---|---|---
MobileNet-SSD | 72.7 | train | deploy
- Download source code and compile (follow the SSD README).
- Download the pretrained deploy weights from the link above.
- Put all the files into SSD_HOME/examples/ss/MobileNetv1-SSD.
- Run demo.py to show the detection result (a minimal pycaffe sketch follows this list).
- You can run merge_bn.py to generate a model with the BatchNorm layers folded into the convolutions; it runs much faster.
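
A minimal pycaffe detection sketch, assuming the standard MobileNet-SSD preprocessing and a merged (no-BN) deploy model; the file names are placeholders, and demo.py in this repo does the same with box drawing on top:

```python
import numpy as np
import cv2
import caffe

# Placeholder paths: use the deploy prototxt and merged caffemodel you downloaded.
net = caffe.Net('MobileNetSSD_deploy.prototxt', 'MobileNetSSD_deploy.caffemodel', caffe.TEST)

img = cv2.imread('example.jpg')
h, w = img.shape[:2]

# MobileNet-SSD preprocessing: 300x300 input, scaled as (x - 127.5) * 0.007843.
blob = cv2.resize(img, (300, 300)).astype(np.float32)
blob = (blob - 127.5) * 0.007843
blob = blob.transpose((2, 0, 1))[np.newaxis, ...]      # HWC -> NCHW

net.blobs['data'].data[...] = blob
# detection_out rows: [image_id, label, confidence, xmin, ymin, xmax, ymax] (relative coords).
detections = net.forward()['detection_out'][0, 0]

for _, label, conf, xmin, ymin, xmax, ymax in detections:
    if conf < 0.5:
        continue
    print(int(label), round(float(conf), 2),
          int(xmin * w), int(ymin * h), int(xmax * w), int(ymax * h))
```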
- Convert your own dataset to an LMDB database (follow the SSD README) and create symlinks to it in the current directory:

      ln -s PATH_TO_YOUR_TRAIN_LMDB trainval_lmdb
      ln -s PATH_TO_YOUR_TEST_LMDB test_lmdb

- Create the labelmap.prototxt file and put it into the current directory.
- Use gen_model.sh to generate your own training prototxt.
- Download the training weights from the link above and run train.sh. After about 30,000 iterations the loss should be 1.5 - 2.5.
- Run test.sh to evaluate the result.
- Run merge_bn.py to generate your own no-BN caffemodel if necessary (the folding it performs is sketched after this list):

      python merge_bn.py --model example/MobileNetSSD_deploy.prototxt --weights snapshot/mobilenet_iter_xxxxxx.caffemodel

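
For reference, a numpy sketch of the batch-norm folding a script like merge_bn.py performs, under the usual Convolution + BatchNorm + Scale arrangement; the function and argument names are illustrative, not the script's API:

```python
import numpy as np

def fold_bn(weight, bias, mean, var, gamma, beta, eps=1e-5):
    """Fold a BatchNorm + Scale pair into the preceding convolution.

    weight: (out_ch, in_ch, kh, kw) conv kernels, bias: (out_ch,)
    mean, var: BatchNorm running statistics (already divided by Caffe's
               moving-average factor, the third BatchNorm blob)
    gamma, beta: Scale layer parameters
    """
    std = np.sqrt(var + eps)                               # per-channel std
    w_folded = weight * (gamma / std)[:, None, None, None]
    b_folded = (bias - mean) * gamma / std + beta
    return w_folded, b_folded                              # conv(x, w', b') == scale(bn(conv(x, w, b)))
```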
There are two primary differences between this model and the TensorFlow MobileNet-SSD:
- The ReLU6 layers are replaced by ReLU.
- For the conv11_mbox_prior layer, the anchors are [(0.2, 1.0), (0.2, 2.0), (0.2, 0.5)] versus TensorFlow's [(0.1, 1.0), (0.2, 2.0), (0.2, 0.5)].

I trained this model from a MobileNet classifier (caffemodel and prototxt) converted from TensorFlow. I first trained the model on MS-COCO and then fine-tuned it on VOC0712. Without MS-COCO pretraining, it only reaches mAP=0.68.
A Caffe implementation of SSD detection on MobileNetv2, converted from TensorFlow.
Prerequisites: TensorFlow and the Caffe SSD branch are properly installed on your computer.
- First, download the original model from TensorFlow, then cd into SSD_HOME/examples/ss/Mobilenetv2-SSDLite.
- Use gen_model.py to generate train.prototxt and deploy.prototxt (or use the default prototxt):

      python gen_model.py -s deploy -c 91 >deploy.prototxt

- Use dump_tensorflow_weights.py to dump the weights of the conv and batchnorm layers.
- Use load_caffe_weights.py to load the dumped weights into deploy.caffemodel (see the sketch after this list).
- Use the code in src to accelerate your training if you have cuDNN 7, or add "engine: CAFFE" to your depthwise convolution layers to work around the memory issue.
- The original TensorFlow model is trained on the MS-COCO dataset; if you need a deploy.caffemodel for the VOC dataset, use coco2voc.py to get deploy_voc.caffemodel.
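
A minimal sketch of what the weight-loading step amounts to in pycaffe, assuming the TensorFlow weights were dumped as one .npy file per parameter blob, keyed by the Caffe layer name and blob index (the dump layout and file names here are assumptions, not the scripts' actual format):

```python
import numpy as np
import caffe

# Build the target net from the generated deploy.prototxt.
net = caffe.Net('deploy.prototxt', caffe.TEST)

for name in net.params:
    for i, blob in enumerate(net.params[name]):
        # Assumed dump layout: dumps/<layer_name>_<blob_index>.npy
        arr = np.load('dumps/{}_{}.npy'.format(name, i))
        blob.data[...] = arr.reshape(blob.data.shape)

net.save('deploy.caffemodel')
```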
- Generate the trainval_lmdb and test_lmdb from your dataset.
- Write a labelmap.prototxt for your classes.
- Use gen_model.py to generate the prototxt files, replacing CLASS_NUM with the number of classes (including background) in your own dataset:

      python gen_model.py -s train -c CLASS_NUM >train.prototxt
      python gen_model.py -s test -c CLASS_NUM >test.prototxt
      python gen_model.py -s deploy -c CLASS_NUM >deploy.prototxt

- Copy coco/solver_train.prototxt and coco/train.sh to your project and start training.
There are some differences between the Caffe and TensorFlow implementations:
- The 'SAME' padding in TensorFlow sometimes uses the asymmetric padding [0, 0, 1, 1], i.e. top=0, left=0, bottom=1, right=1. In Caffe there is no parameter for that kind of asymmetric padding (see the sketch after this list).
- MobileNet on TensorFlow uses the ReLU6 activation, y = min(max(x, 0), 6), but stock Caffe has no ReLU6 layer. Replacing ReLU6 with ReLU causes a small accuracy drop in ssd-mobilenetv2, but a very large drop in ssdlite-mobilenetv2. There is a ReLU6 layer implementation in my fork of SSD.
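
A small sketch of how TensorFlow computes 'SAME' padding for one spatial dimension, showing where the asymmetric [0, 0, 1, 1] case comes from (pure Python, nothing repo-specific):

```python
import math

def same_padding(in_size, kernel, stride):
    """TensorFlow 'SAME' padding for one spatial dimension.

    Returns (pad_begin, pad_end); when the total padding is odd,
    the extra pixel goes at the end (bottom/right).
    """
    out_size = math.ceil(in_size / stride)
    pad_total = max((out_size - 1) * stride + kernel - in_size, 0)
    pad_begin = pad_total // 2
    pad_end = pad_total - pad_begin
    return pad_begin, pad_end

# A stride-2 3x3 convolution on a 300x300 input pads only the bottom and right:
print(same_padding(300, 3, 2))   # (0, 1) per dimension -> [0, 0, 1, 1]
```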
All that is needed is to change the type of the depthwise (grouped) convolution layers from "Convolution" to "DepthwiseConvolution"; a sketch of automating that edit follows.
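
A sketch of automating that replacement with the Caffe protobuf API, assuming the usual MobileNet convention that a depthwise layer has group equal to its num_output (the file names are placeholders):

```python
from caffe.proto import caffe_pb2
from google.protobuf import text_format

net = caffe_pb2.NetParameter()
with open('train.prototxt') as f:
    text_format.Merge(f.read(), net)

for layer in net.layer:
    cp = layer.convolution_param
    # Depthwise layers in MobileNet-style prototxts have group == num_output.
    if layer.type == 'Convolution' and cp.group > 1 and cp.group == cp.num_output:
        layer.type = 'DepthwiseConvolution'

with open('train_dw.prototxt', 'w') as f:
    f.write(text_format.MessageToString(net))
```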
This is a Caffe implementation of ShuffleNet. For details, please read the original paper: "ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices" by Xiangyu Zhang et al., 2017.

This code is based on camel007's implementation (https://github.com/camel007/Caffe-ShuffleNet), but I rewrote the CUDA file for acceleration. To use the ShuffleChannel layer, register its parameter in caffe.proto:

    message LayerParameter {
      ...
      optional ShuffleChannelParameter shuffle_channel_param = 164;
      ...
    }

    ...

    message ShuffleChannelParameter {
      optional uint32 group = 1 [default = 1]; // the number of groups
    }
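
The channel shuffle that the ShuffleChannel layer performs is just a reshape, a transpose of the two channel axes, and a reshape back; a numpy sketch of the forward pass:

```python
import numpy as np

def shuffle_channels(x, group):
    """Channel shuffle from the ShuffleNet paper.

    x: (N, C, H, W) feature map with C divisible by group.
    """
    n, c, h, w = x.shape
    assert c % group == 0
    # (N, C, H, W) -> (N, g, C/g, H, W) -> swap the two channel axes -> flatten back
    return (x.reshape(n, group, c // group, h, w)
             .transpose(0, 2, 1, 3, 4)
             .reshape(n, c, h, w))
```

In a ShuffleNet unit this shuffle is placed right after the first grouped 1x1 convolution, so that the following grouped convolution sees channels from every group.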