During testing，processing of input image

Question

During testing，processing of input image

chuangchuangtan opened this issue 5 years ago · comments

hi，
I have a question about processing of input image.
For CAM and Has, they resize the input images to 256×256 and average 10 crops (4 corners plus center, and same with horizontal flip) to obtain class probabilities and localization maps.
For SPG, they directly resize the input images to 224×224.
But for ADL,you resize the input images to 256×256 and central crop to 224x224.
Is this appropriate? Thank you

Junsuk Choe · Answer 1 · Tue Oct 08 2019 11:43:38 GMT+0800 (China Standard Time)

You are right. Each method uses different pre-processing method during testing. Previous papers (including ADL) directly compare their methods with other method despite the different pre-processing method.

The reason why I use the pre-processing method is that I think similar pre-processing methods have to be applied during training and testing. So, I use 256x256 -> 224x224 random crop (training) and 256x256 -> 224x224 center crop (testing).

chuangchuangtan · Answer 2 · Tue Oct 08 2019 16:37:33 GMT+0800 (China Standard Time)

It's necessary that following the baseline methods for a fair comparison.

Junsuk Choe · Answer 3 · Tue Oct 08 2019 16:59:55 GMT+0800 (China Standard Time)

It is difficult to compare with previous methods under fair condition in that each method use different configurations (different batch size, training epochs, pre-processing scheme etc.).

The problem is that each method does not clearly show the configurations. I think that this is why many papers do not reproduce each method, but just bring the scores from the original papers.

But I totally agree with you. In the future, we will release the scores of each method produced under the fair condition.