What does im -= self.cfgs['mean_pixel_value'] mean?

CangHaiQingYue opened this issue 7 years ago

In data_parser.py, I found the line im -= self.cfgs['mean_pixel_value'], with
mean_pixel_value: [103.939, 116.779, 123.68]
I don't understand what this op does. Is it for normalization?
Can I use tf.image.per_image_standardization() instead?

Harsimrat Sandhawalia · Answer 1 · Tue Dec 12 2017 19:01:40 GMT+0800 (China Standard Time)

A good starting point for data processing is what we commonly refer to as a normalised representation. One example is whitening, which sets the dataset statistics to mean 0 and variance 1. Here we use a simplified version of that and only set the dataset mean to 0.

tf.image.per_image_standardization() uses per-sample statistics to scale each image to mean 0 and variance 1. What we would like is mean 0 and variance 1 over the whole dataset, not for each sample separately.
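To make the difference concrete, here is a minimal NumPy sketch. The function names are hypothetical and not from data_parser.py, and per_image_standardize is only a rough stand-in for the TensorFlow op, assuming it divides by max(stddev, 1/sqrt(N)). It compares two images that differ only in brightness.

```python
import numpy as np

# Per-channel mean quoted in the question (one value per channel).
MEAN_PIXEL_VALUE = np.array([103.939, 116.779, 123.68], dtype=np.float32)

def subtract_dataset_mean(im):
    """Subtract one fixed per-channel mean that is shared by every image."""
    return im.astype(np.float32) - MEAN_PIXEL_VALUE

def per_image_standardize(im):
    """Rough NumPy stand-in for tf.image.per_image_standardization (assumption)."""
    im = im.astype(np.float32)
    # Lower bound on the divisor so a uniform image does not divide by zero.
    adjusted_std = max(im.std(), 1.0 / np.sqrt(im.size))
    return (im - im.mean()) / adjusted_std

rng = np.random.default_rng(0)
dark = rng.integers(0, 64, size=(4, 4, 3)).astype(np.float32)
bright = dark + 150.0  # same content, only brighter

# Per-image statistics erase the brightness difference between the two images.
same_after_standardize = np.allclose(
    per_image_standardize(dark), per_image_standardize(bright), atol=1e-4
)

# A fixed dataset mean keeps the 150-level offset between them.
offset_after_mean = subtract_dataset_mean(bright) - subtract_dataset_mean(dark)
```

With the fixed mean, the network still sees that one image is brighter than the other; with per-image standardization that information is gone.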

CangHaiQingYue · Answer 2 · Tue Dec 12 2017 19:07:11 GMT+0800 (China Standard Time)

Thanks, I read the VGG paper and found the reason.
This op speeds up convergence.

Priyanka Chaudhary · Answer 3 · Thu Dec 21 2017 22:23:56 GMT+0800 (China Standard Time)

I have a question on the same topic. I am using my own dataset for this project. Is mean_pixel_value: [103.939, 116.779, 123.68] specific to the BSDS dataset, or can it be used for any dataset?
Thank you.

Harsimrat Sandhawalia · Answer 4 · Thu Dec 21 2017 22:32:40 GMT+0800 (China Standard Time)

Hi, thanks for your question. The mean value is usually computed on the training set that was used to train the base (VGG) model. In this case it was computed over the entire ImageNet dataset, which the VGG base model was trained on. Because the pretrained weights expect inputs shifted by that mean, you don't have to change the mean pixel value when you train on your own dataset.
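A minimal sketch of reusing the ImageNet mean on your own images follows. It assumes the three values are in BGR channel order, as in the original Caffe VGG release; that ordering is an assumption here, so check how data_parser.py loads images. Both function names are hypothetical, and dataset_channel_mean is included only to compare your own data against the fixed values, not to replace them.

```python
import numpy as np

# ImageNet per-channel mean from the config; BGR order is an assumption.
VGG_MEAN_BGR = np.array([103.939, 116.779, 123.68], dtype=np.float32)

def preprocess_for_vgg(im_rgb):
    """Convert an HxWx3 RGB image to BGR and subtract the fixed ImageNet mean."""
    im_bgr = im_rgb[..., ::-1].astype(np.float32)  # reverse the channel axis
    return im_bgr - VGG_MEAN_BGR

def dataset_channel_mean(images):
    """Per-channel mean over a list of HxWx3 images (for comparison only)."""
    total = np.zeros(3, dtype=np.float64)
    count = 0
    for im in images:
        total += im.reshape(-1, 3).sum(axis=0)
        count += im.shape[0] * im.shape[1]
    return total / count

# An RGB image whose pixels equal the mean maps to (almost exactly) zero.
mean_rgb_image = np.full((2, 2, 3), [123.68, 116.779, 103.939], dtype=np.float32)
centered = preprocess_for_vgg(mean_rgb_image)

# Two constant images with values 0 and 2 have a per-channel mean of 1.
own_mean = dataset_channel_mean([np.zeros((2, 2, 3)), np.full((2, 2, 3), 2.0)])
```

If your images are already loaded in BGR order (for example by OpenCV), skip the channel reversal and subtract the mean directly.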