yaoyao-liu / meta-transfer-learning

TensorFlow and PyTorch implementation of "Meta-Transfer Learning for Few-Shot Learning" (CVPR2019)

Home Page: https://lyy.mpi-inf.mpg.de/mtl/

the choice of optimizer

Sword-keeper opened this issue

Hi, I just found that in the pre-training phase you use the SGD optimizer, but in the meta-training phase you use the Adam optimizer. Why did you choose different optimizers for the different phases?

We chose the optimizers based on empirical results. You may change the optimizer and re-run the experiments to see the difference.
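If you want to try that, a minimal sketch of swapping optimizers between phases might look like the following. This is not the repository's exact code; the model, learning rates, and momentum value are illustrative placeholders.

```python
import torch
import torch.nn as nn

# Placeholder model standing in for the actual backbone.
model = nn.Sequential(nn.Conv2d(3, 64, 3), nn.ReLU(), nn.Flatten())

# Pre-training phase: SGD with momentum, a common choice for
# large-batch supervised pre-training.
pretrain_optimizer = torch.optim.SGD(model.parameters(), lr=0.1, momentum=0.9)

# Meta-training phase: Adam adapts per-parameter step sizes, which can
# suit the smaller, noisier meta-gradient updates.
meta_optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
```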

Hi, in the torch code I found that you set train_aug=False in the meta-training phase, but in the pre-training phase you set train_aug=True. Is train_aug designed only for the pre-training phase? I set train_aug=True in the meta-training phase and ran several epochs; the result was lower than with train_aug=False.

We apply data augmentation during pre-training to alleviate overfitting. You may apply data augmentation during meta-training as well. Please note that you cannot apply data augmentation to the episode test set (the test set for each small task).
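As a rough illustration of that rule, here is a sketch of two torchvision-style transform pipelines: an augmented one for training images and a deterministic one for the per-episode test (query) images. The crop sizes and normalization statistics are placeholders, not the repository's values.

```python
from torchvision import transforms

# Augmented pipeline: use for pre-training images (and, if you experiment
# with it, for the support images during meta-training).
train_aug_transform = transforms.Compose([
    transforms.RandomResizedCrop(84),
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

# Deterministic pipeline: always use this for the query/test images of
# each episode, since randomly augmenting the evaluation samples would
# distort the per-task measurement.
test_transform = transforms.Compose([
    transforms.Resize(92),
    transforms.CenterCrop(84),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])
```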