Question on the function of the normalized linear layer

PeiqinZhuang opened this issue 3 years ago · comments

Hi, I just read the released code and found the implementation of the normalized linear layer and the scale factor. Does this design serve a special purpose, such as training stability?

Long(Tony) Lian · Answer 1 · Fri Jul 30 2021 17:53:45 GMT+0800 (China Standard Time)

Hi @PeiqinZhuang,

Thanks for your question. We use the normalized linear layer and the scale factor to simplify the margin calculation in the LDAM component of our framework. The normalized linear layer was first proposed in LDAM. Since we mainly demonstrate our method together with LDAM, we use LDAM's normalized linear layer. If you combine our method with decoupling methods (cRT, t-norm, LWS), you don't need the normalized linear layer. For more details, please refer to the LDAM code: https://github.com/kaidic/LDAM-DRW/blob/master/cifar_train.py#L106
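To illustrate why normalization plus a scale factor makes the margin simple to apply, here is a minimal, framework-free sketch in plain Python. It is not the code from either repository: the function names, the margin values, and the scale of 30.0 are assumptions chosen for the example. With both the features and the class weights L2-normalized, each logit is a cosine similarity in [-1, 1], so a fixed per-class margin can be subtracted directly from the target logit; the scale factor then stretches the logits back to a range where softmax cross-entropy gives useful gradients.

```python
import math


def l2_normalize(v):
    """Return v divided by its L2 norm (unchanged if the norm is zero)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def normed_linear(features, class_weights):
    """Cosine similarity between one feature vector and each class weight vector."""
    f = l2_normalize(features)
    return [sum(a * b for a, b in zip(f, l2_normalize(w))) for w in class_weights]


def margin_logits(cos_logits, target, margins, scale):
    """Subtract the target class's margin from its logit, then scale all logits."""
    out = list(cos_logits)
    out[target] -= margins[target]
    return [scale * z for z in out]


def cross_entropy(logits, target):
    """Numerically stable softmax cross-entropy for a single example."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[target]


# Toy example: 2-D features, two classes.
features = [3.0, 4.0]
class_weights = [[1.0, 0.0], [0.0, 2.0]]
margins = [0.5, 0.1]  # hypothetical per-class margins (larger for the rarer class)
scale = 30.0          # hypothetical scale factor

cos_logits = normed_linear(features, class_weights)
scaled = margin_logits(cos_logits, target=1, margins=margins, scale=scale)
loss = cross_entropy(scaled, target=1)
print(cos_logits, scaled, loss)
```

Because the cosine logits are bounded, a margin such as 0.5 means the same thing for every sample regardless of feature or weight magnitude. Without normalization the margin would have to be tuned against the unbounded logit scale.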