weight init

Question

weight init

skrish13 opened this issue 7 years ago · comments

For resnet and other architectures in general, the weight init has to be done separately right? The default init for nn modules of pytorch is uniform? From my understanding, there are better inits like glorot uniform, he init etc which we explicitly need to do. Because other libraries like Keras, Lasagne has the best init as the default, unlike PyTorch.

References:
[1] - ResNet official: https://github.com/pytorch/vision/blob/master/torchvision/models/resnet.py#L113-L119
[2] - VGG official: https://github.com/pytorch/vision/blob/master/torchvision/models/vgg.py#L46-L59

[3] - Conv2D nn module: https://github.com/pytorch/pytorch/blob/master/torch/nn/modules/conv.py#L40-L47

Changqian · Answer 1 · Tue Jul 18 2017 10:38:07 GMT+0800 (China Standard Time)

There is special weight init in pytorch. You can find it in the document. In this repo, I don't care the init because I load the official weight.

Sri Krishna · Answer 2 · Tue Jul 18 2017 18:20:57 GMT+0800 (China Standard Time)

Could you explain what you mean by 'official weight'. (Since there are other modules than just pre-trained layers in your code)

Changqian · Answer 3 · Tue Aug 22 2017 13:56:36 GMT+0800 (China Standard Time)

Just the model weights Pytorch offer.