Worse results by reproducing MoCo, InsDIS and CMC on ImageNet100

Question

Worse results by reproducing MoCo, InsDIS and CMC on ImageNet100

nkliuyifang opened this issue 4 years ago · comments

Thanks for your nice paper and publlic code!

I have reproduced results of MoCo and InsDIS on ImageNet100 following your steps.
I got 67.44 for MoCo and 66.02 for InsDIS, which are worse than the expected 73.4 and 69.1.

Could you please help me about this?

Best
Mengyuan

Yonglong Tian · Answer 1 · Mon Jan 06 2020 23:14:24 GMT+0800 (China Standard Time)

Hi, @nkliuyifang ,

Can you share more details? e.g., training command line, testing command line.

How is the result for CMC on your end? I want to see if this is a common issue for all of the three or only InsDis and MoCo.

nkliuyifang · Answer 2 · Tue Jan 07 2020 15:25:25 GMT+0800 (China Standard Time)

Hi, @HobbitLong ,

The best result of CMC using "--view YCbCr" and "--nce_k 16384" is 70.54

The corresponding training command line is:

The corresponding testing command line is:

All related results of InsDIS, MoCo and CMC on ImageNet100 dataset is listed as follows, where the training set of ImageNet100 is used for both unsupervised learning and fine-tuning, and the testing set of ImageNet100 is used for testing classifiaction results.

Yonglong Tian · Answer 3 · Wed Jan 08 2020 00:40:53 GMT+0800 (China Standard Time)

Hi, @nkliuyifang,

I can see several gaps as below, which apply to all MoCo, InsDis, and CMC:
(1) For pre-training on ImageNet100, use batch_size 128 instead of 256. This is equivalent to increase the learning rate a bit. For full ImageNet, keep using 256.
(2) Switch from NCE loss to softmax-ce for pre-training
(3) Use a learning rate of 10 for linear probing stage, as can be found in README

nkliuyifang · Answer 4 · Wed Jan 08 2020 09:30:49 GMT+0800 (China Standard Time)

Hi, @HobbitLong,

Thanks! I will modify these settings and try again.

nkliuyifang · Answer 5 · Thu Jan 09 2020 16:57:39 GMT+0800 (China Standard Time)

I have reached the expected result. Thanks!