juntang-zhuang / Adabelief-Optimizer

Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"


Changing init learning rate

Kraut-Inferences opened this issue · comments

Does modifying the initial learning rate hurt the algorithm in any way? I want to use exponential decay but don't know whether it would improve performance.

From my experience with a ViT model on ImageNet, AdaBelief improves over Adam when both use a default cosine learning rate schedule. I think it should work with other models as well.
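
If it helps, here is a minimal sketch of pairing AdaBelief with an exponential-decay schedule using PyTorch's built-in scheduler. It assumes the `adabelief-pytorch` package is installed; the model, learning rate, and decay factor below are placeholders, not recommended settings.

```python
# Minimal sketch: AdaBelief with an exponential learning-rate decay.
# Assumes `pip install adabelief-pytorch`; hyperparameters are illustrative only.
import torch
import torch.nn as nn
from adabelief_pytorch import AdaBelief

model = nn.Linear(10, 2)  # placeholder model
optimizer = AdaBelief(model.parameters(), lr=1e-3)

# Exponential decay: the learning rate is multiplied by `gamma` each epoch.
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.95)

for epoch in range(10):
    # ... replace with your real training loop ...
    optimizer.zero_grad()
    loss = model(torch.randn(4, 10)).sum()
    loss.backward()
    optimizer.step()
    scheduler.step()  # decay the learning rate once per epoch
```

Swapping `ExponentialLR` for `torch.optim.lr_scheduler.CosineAnnealingLR` gives the cosine schedule mentioned above.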

Thank you.