Weight Decay

Question

Weight Decay

vateye opened this issue 2 years ago · comments

Hi, as stated in the issue, the ALPRO does use weight decay. But I did not find the process that passing the parameter "weight_decay" during the optimizer initialization.

optimizer = OptimCls(model.parameters(), lr=opts.learning_rate, betas=opts.betas)

Dongxu · Answer 1 · Thu Apr 21 2022 23:00:47 GMT+0800 (China Standard Time)

Thanks a lot for pointing this out. It seems the current repo indeed does not pass in the weight decay. This may be an issue during open-sourcing. We'll update the repo with required fix.

Vateye · Answer 2 · Fri May 27 2022 21:52:15 GMT+0800 (China Standard Time)

Thanks a lot for pointing this out. It seems the current repo indeed does not pass in the weight decay. This may be an issue during open-sourcing. We'll update the repo with required fix.

Hi, any following for this question?

Dongxu · Answer 3 · Mon May 30 2022 23:12:00 GMT+0800 (China Standard Time)

Hi @vateye, an easy fix would be to pass the weight_decay to the optimizer.

We will resolve this issue in future releases but would expect some delay.

Nice catch and thanks for your kind understanding.