OverLordGoldDragon / keras-adamw

Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers

AdaBelief

KochiseBennett opened this issue

Thank you very much for your work on this project! It really is an excellent contribution to provide an up-to-date AdamW implementation that allows layer-dependent learning rates. I'm wondering what your thoughts are on AdaBelief, and whether you'd want to add it as an option to this package.
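(For reference, the layer-dependent learning rates I mean are the package's `lr_multipliers` argument. A minimal sketch in the spirit of the README; exact keyword arguments may differ between versions:)

```python
from tensorflow.keras.layers import Input, Dense, LSTM
from tensorflow.keras.models import Model
from keras_adamw import AdamW

ipt = Input(shape=(120, 4))
x = LSTM(60, activation='relu', name='lstm_1')(ipt)
out = Dense(1, activation='sigmoid')(x)
model = Model(ipt, out)

# Train the 'lstm_1' weights at half the base learning rate
lr_multipliers = {'lstm_1': 0.5}
optimizer = AdamW(lr=1e-4, model=model, lr_multipliers=lr_multipliers,
                  use_cosine_annealing=True, total_iterations=24)
model.compile(optimizer, loss='binary_crossentropy')
```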

Glad you found it useful.

No plans for any new optimizers, I'm afraid, but the layerwise LRs should be easily transferable to others. Further, I've moved to PyTorch and won't be developing any more TensorFlow packages (though I may still fix compatibility bugs for later TF versions).
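For anyone wanting to attempt the transfer, here's a rough sketch of grafting layerwise multipliers onto an AdaBelief-style update in plain TF2; the multiplier dict and helper below are illustrative, not part of this package:

```python
import tensorflow as tf

beta1, beta2, eps, base_lr = 0.9, 0.999, 1e-8, 1e-3
LR_MULTIPLIERS = {'dense_1': 0.25}  # hypothetical per-layer multipliers

def multiplier_for(name, default=1.0):
    """Return the multiplier whose key appears in the variable's name."""
    for key, mult in LR_MULTIPLIERS.items():
        if key in name:
            return mult
    return default

def adabelief_step(variables, grads, m, s, t):
    """One AdaBelief-style update with per-variable LR multipliers.

    m, s: lists of tf.Variable slots matching `variables`. AdaBelief
    tracks s_t = beta2*s + (1 - beta2)*(g - m)^2 -- the gradient's
    deviation from its running mean -- in place of Adam's g^2 term.
    """
    for var, g, m_i, s_i in zip(variables, grads, m, s):
        m_i.assign(beta1 * m_i + (1 - beta1) * g)
        s_i.assign(beta2 * s_i + (1 - beta2) * tf.square(g - m_i))
        m_hat = m_i / (1 - beta1 ** t)  # bias correction
        s_hat = s_i / (1 - beta2 ** t)
        lr = base_lr * multiplier_for(var.name)  # layerwise LR applied here
        var.assign_sub(lr * m_hat / (tf.sqrt(s_hat) + eps))

# Usage sketch:
#   variables = model.trainable_variables
#   m = [tf.Variable(tf.zeros_like(v)) for v in variables]
#   s = [tf.Variable(tf.zeros_like(v)) for v in variables]
#   adabelief_step(variables, grads, m, s, t=step)  # t starts at 1
```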