Question about the adaptive optimizer
chenwydj opened this issue · comments
Wuyang commented
Thanks for this great work!
I failed to find more details about the adaptive optimizer mentioned in the paper. Could you point me any reference or github link about this adaptive optimizer?
Thank you!
Liyuan Liu commented
Hi thanks for reaching out.
In the paper, we use adaptive optimizer to refer a class of optimizers including Adam, Adamax, RMSProp, RAdam, etc. In our experiments, we are using RAdam as the optimizer.