Calculate correctly the fan-in for DDPG model

Question

Calculate correctly the fan-in for DDPG model

dantp-ai opened this issue 5 years ago · comments

fan_in = layer.weight.data.size()[0]. This is wrong, because fan-in is defined as the maximum number of input units to the layer. The weight matrix is transposed (!), thus we need to access the second component of the size, i.e. fan_in = layer.weight.data.size()[1]

See example of correct implementation using fan-in here: https://pytorch.org/docs/stable/_modules/torch/nn/init.html#kaiming_normal_
specifically def _calculate_fan_in_and_fan_out(tensor)

Lifeng Wei · Answer 1 · Fri Feb 22 2019 17:12:26 GMT+0800 (China Standard Time)

This finding is really interesting! But actually the special and careful initialization has no influence on the performance.....

Daniel Plop · Answer 2 · Fri Feb 22 2019 21:38:46 GMT+0800 (China Standard Time)

Evidence for your claim?

Lifeng Wei · Answer 3 · Sat Feb 23 2019 04:53:06 GMT+0800 (China Standard Time)

I just tried to run the experiment without this initialization and nothing changes. Here is my report for this environment, you can have a look if interested. https://github.com/ZeratuuLL/Reinforcement-Learning/blob/master/Continuous%20Control/Report_Reacher.pdf

ronny-udacity · Answer 4 · Fri Dec 10 2021 05:52:17 GMT+0800 (China Standard Time)

Linked to PR #15