Swin-L config without Objects365 pretraining and Objects365 pretraining setting

Question

Swin-L config without Objects365 pretraining and Objects365 pretraining setting

jay-lcchen opened this issue 10 months ago · comments

Hello,

Thank you for sharing the great work! We really find it useful.

We have one question regarding how to train your model using Swin-L, without Objects365 pretraining (i.e, only ImageNet-21K pretraining). Do you mind sharing the config or any settings for us to try?

Additionally, the script to pretrain the Swin-L on Objects365 (deta_swin_pre.sh) is missing. Is it also possible that you can share that script and Objects365's setting (e.g., all images are used? and so on?).

Thanks,

Jeffrey Ouyang-Zhang · Answer 1 · Sat Oct 28 2023 02:43:07 GMT+0800 (China Standard Time)

Hi Jay,

I'm glad you like the work!

Swin-L without Objects365 pretraining: we have not tried this, but I think you can just replace the Objects365 checkpoint with the IN21K checkpoint from https://github.com/microsoft/Swin-Transformer
the script to pretrain the Swin-L on Objects365: We trained on Objects365 training set (no merge with val) on 24epoch schedule (details in the paper). The only change to the code are increasing num_classes and training with higher resolution images. Happy to answer any other questions I can.

jay-lcchen · Answer 2 · Sat Oct 28 2023 13:08:22 GMT+0800 (China Standard Time)

Awesome. Thank you, Jeffrey, for the quick response and for the clarification.
It is clear to me and we will run the experiments as suggested and see how it goes.

Thanks,