The final test accuracy

Question

The final test accuracy

ustctf-zz opened this issue 6 years ago · comments

Hi, Thanks for this implementation ! I'm wondering how to obtain the quite strong test set result on CIFAR-10, as reported in the original densenet paper (e.g., error rate <=3.5 on C-10+, with depth =190, growth_rate = 40). When I run the script as:

CUDA_VISIBLE_DEVICES=0,1,2,3 python demo.py --depth 190 --efficient False --data ./data --save ./ckpts

The final test error is reported as 0.0535. I'm wondering whether the high error is due to no data augmentation is conducted in the default setting. May I know whether it is C10+ dataset or C10?

Best

Danlu Chen · Answer 1 · Thu Apr 26 2018 20:08:12 GMT+0800 (China Standard Time)

Could you please show the configuration (e.g., pytorch version, python version)?

Geoff Pleiss · Answer 2 · Thu Apr 26 2018 20:30:48 GMT+0800 (China Standard Time)

I noticed recently that I was missing the appropriate initialization of the models. Try it again and let me know what the error is.

Geoff Pleiss · Answer 3 · Fri Apr 27 2018 20:54:09 GMT+0800 (China Standard Time)

With the new initialization in c609c0c I was able to match the errors reported in the original paper. Closing this issue for now. If you're still noticing issues feel free to reopen.

Fei Tian · Answer 4 · Sat Apr 28 2018 11:48:00 GMT+0800 (China Standard Time)

@gpleiss

Sorry but I got the same test set error rate (>5) using your new initialization.

The command:

CUDA_VISIBLE_DEVICES=0,1,2,3 python demo.py --depth 190 --efficient False --data ./data --save ./ckpts

Would you please show your running script to match the claimed error rate? What is the exact error rate reported finally?

BTW @taineleau
the python version: Python 3.6.1 |Anaconda custom (64-bit)|
pytorch version: 0.3.1.post2
gpu: 4 titanXp
system: Ubuntu 14.04
cuda: 8.0

Thanks.

Geoff Pleiss · Answer 5 · Sat Apr 28 2018 21:44:58 GMT+0800 (China Standard Time)

python demo.py --depth 100 --efficient False --data ./data --save ./ckpts --batch_size 64 --valid_size 0

These are the same hyperparameters as in the paper. I got a final test error of 0.047

Fei Tian · Answer 6 · Sun May 27 2018 21:05:20 GMT+0800 (China Standard Time)

Thanks @gpleiss , I can reproduce now and with --depth 190, --growth_rate 40, the test error is 0.035.