fff-rs / juice

The Hacker's Machine Learning Engine

Implement dropout

drahnr opened this issue · comments

Should be pretty straightforward; a warm-up for #10:

  • expand the cudnn bindings in rcudnn
  • use the rcudnn bindings in coaster-nn
  • create an appropriate interface in coaster
  • use that interface to define a layer in juice
  • implement tests
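Independent of the cuDNN bindings, the layer ultimately has to compute inverted dropout: drop each unit with probability p during training and scale the survivors by 1/(1-p) so expected activations are unchanged. A minimal CPU sketch follows; all names are illustrative and not part of the coaster/juice API, and a tiny deterministic LCG stands in for a real RNG so the example needs no external crates.

```rust
/// Tiny deterministic LCG so the example needs no external crates.
/// (A real implementation would use a proper RNG, e.g. cuDNN's dropout state.)
fn lcg_next(state: &mut u64) -> f32 {
    *state = state
        .wrapping_mul(6364136223846793005)
        .wrapping_add(1442695040888963407);
    // Use the top 24 bits to form a float in [0, 1).
    ((*state >> 40) as f32) / (1u32 << 24) as f32
}

/// Forward pass of inverted dropout: drop each unit with probability `p`
/// and scale the survivors by 1/(1-p). Returns the output and the keep-mask
/// (the mask must be stored for the backward pass).
fn dropout_forward(input: &[f32], p: f32, seed: u64) -> (Vec<f32>, Vec<bool>) {
    let mut state = seed;
    let scale = 1.0 / (1.0 - p);
    let mask: Vec<bool> = input.iter().map(|_| lcg_next(&mut state) >= p).collect();
    let output = input
        .iter()
        .zip(&mask)
        .map(|(&x, &keep)| if keep { x * scale } else { 0.0 })
        .collect();
    (output, mask)
}

fn main() {
    let input = vec![1.0, 2.0, 3.0, 4.0];
    let (output, mask) = dropout_forward(&input, 0.5, 42);
    // Dropped units are exactly zero; kept units are scaled by 1/(1-0.5) = 2.
    for ((&x, &y), &keep) in input.iter().zip(&output).zip(&mask) {
        if keep {
            assert!((y - 2.0 * x).abs() < 1e-6);
        } else {
            assert_eq!(y, 0.0);
        }
    }
    println!("mask = {:?}", mask);
    println!("output = {:?}", output);
}
```

Note that at inference time dropout becomes the identity; with the inverted scheme no rescaling of weights is needed when the mask is disabled.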

Paper: http://www.cs.toronto.edu/~rsalakhu/papers/srivastava14a.pdf

Why is its backprop commented out?

If I read the paper correctly, the backpropagation is just a unit factor, which can be skipped.
I am on my phone so I cannot review the code right now, but IIRC the backprop skips all dropped elements, which enables a good speedup.

Actually, that is incorrect: backprop should only propagate through the thinned network (section 5.1 of the linked paper), so unless the weights are zero, backprop may not be skipped.

Reviewing the paper, the thinned network essentially amounts to setting the gradient of the dropped units to zero, which is easily done.
The normalization should be realized by an additional mechanism or variation parameter, which can be introduced in a separate PR.
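Zeroing the gradient of dropped units means the backward pass just reapplies the keep-mask from the forward pass, with the same 1/(1-p) scale if inverted dropout is used. A hedged sketch, with illustrative names that are not juice's actual API:

```rust
/// Backward pass of inverted dropout: propagate gradients only through the
/// units kept in the forward pass (the "thinned network" of section 5.1),
/// zeroing the rest, and reapply the same 1/(1-p) scale.
fn dropout_backward(grad_output: &[f32], mask: &[bool], p: f32) -> Vec<f32> {
    let scale = 1.0 / (1.0 - p);
    grad_output
        .iter()
        .zip(mask)
        .map(|(&g, &keep)| if keep { g * scale } else { 0.0 })
        .collect()
}

fn main() {
    // The mask here would be the one saved by the forward pass.
    let grad_out = vec![0.1, 0.2, 0.3, 0.4];
    let mask = vec![true, false, true, false];
    let grad_in = dropout_backward(&grad_out, &mask, 0.5);
    // Dropped units get zero gradient; kept units are scaled by 2.0.
    assert_eq!(grad_in, vec![0.2, 0.0, 0.6, 0.0]);
    println!("{:?}", grad_in);
}
```

Since multiplication by the mask is elementwise, this is cheap, and skipping the dropped elements entirely is exactly the speedup mentioned above.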