parametersNoGrad
nicholas-leonard opened this issue · comments
Nicholas Léonard commented
What is parametersNoGrad?
mszlazak commented
I don't see why it's there or what it does. That cloning code can be reduced: you can just use Torch 7's `clone()`, as in the following. BTW, `Core` is just `core_network` (i.e., the LSTM plus).
```lua
local core = Core(opt)
-- Flatten the parameters once; every clone will share this storage.
local param, gradParam = core:getParameters()
local p, gradP = core:parameters()
rnn.core = {}
for i = 1, opt.seqLength do
  local clone = core:clone()
  local cloneP, cloneGradP = clone:parameters()
  -- Point each clone's parameters and gradients at the original's
  -- storage, so all time-step clones share the same weights.
  for j = 1, #p do
    cloneP[j]:set(p[j])
    cloneGradP[j]:set(gradP[j])
  end
  rnn.core[#rnn.core + 1] = clone
  collectgarbage()
end
```
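The trick above works because `Tensor:set()` makes a tensor a view over another tensor's storage rather than copying it, so a write through one view is visible through the other. A minimal sketch of that mechanism (assuming a Torch 7 environment with `nn` installed; the tiny `nn.Linear` stands in for the real LSTM core and is illustrative only):

```lua
require 'torch'
require 'nn'

-- Hypothetical tiny "core" standing in for the LSTM; illustrative only.
local core = nn.Linear(4, 4)
local p, gradP = core:parameters()

local clone = core:clone()
local cloneP, cloneGradP = clone:parameters()
for j = 1, #p do
  cloneP[j]:set(p[j])         -- share weight storage, don't copy it
  cloneGradP[j]:set(gradP[j]) -- share gradient storage too
end

-- An update through the original is now visible through the clone,
-- since both tensors are views over the same storage.
p[1]:fill(0.5)
print(cloneP[1][1][1])
```

After the `set()` loop, accumulating gradients in any clone during backprop accumulates them into the single shared `gradParam` buffer, which is exactly what truncated BPTT over `opt.seqLength` unrolled steps needs.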
Nicholas Léonard commented
That is what I thought. Thank you for the explanation.
Hello :) commented
@mszlazak Thank you very much for posting your insight 👍
It's really neat :)
Trying to get it to work right now :)
To be fair, I don't think Wojciech wrote the `g_cloneManyTimes` function, so it's not really his problem?
mszlazak commented
NP
Wojciech Zaremba commented
Cleaned according to your suggestion.