torch / nngraph

Graph Computation for nn

type casting?

clementfarabet opened this issue

How do you guys do type casting? The current :type() method seems to only do a partial conversion; running the following code prints out lots of DoubleTensors.

setprintlevel(20)

require 'nngraph'

local i1 = nn.Identity()()
local i2 = nn.Identity()()

local o1 = nn.Tanh()( nn.Linear(10,10)(i1) )
local o2 = nn.Tanh()( nn.Linear(10,10)(i2) )

local m = nn.gModule({i1,i2}, {o1,o2})

m:forward({torch.randn(10), torch.randn(10)})
m:backward({torch.randn(10), torch.randn(10)}, {torch.randn(10), torch.randn(10)})

m:float()

print{m}

From my understanding, Adam's open PR #56 should fix this.

As far as I can see, all the doubles are input and gradOutput variables. They are going to be reset at the next forward and backward. They are just pointers to module.output from the previous run, and since module.output is now a new tensor, these stale references remain. But at every forward they are reset.

https://github.com/torch/nngraph/blob/master/gmodule.lua#L300
https://github.com/torch/nngraph/blob/master/gmodule.lua#L379
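
To illustrate that claim, here is a minimal sketch continuing the example above (not code from the thread): according to the explanation, one more forward/backward with FloatTensor inputs after m:float() should overwrite the lingering DoubleTensor references.

-- (sketch) after casting, run another pass with float inputs so the
-- cached input/gradOutput pointers from the earlier double pass are replaced
m:float()
m:forward({torch.randn(10):float(), torch.randn(10):float()})
m:backward({torch.randn(10):float(), torch.randn(10):float()},
           {torch.randn(10):float(), torch.randn(10):float()})
print{m}  -- the stale DoubleTensors should now be gone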

I think the type casting is actually fine, I guess ;) But I will go through Adam's and a couple of other PRs today.

Unfortunately, this does not seem to be completely fixed; clementfarabet's example still produces DoubleTensors.
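
A quick way to see which cached fields are still doubles is to walk the graph nodes (a sketch, assuming the gModule exposes its nodes via forwardnodes as in the gmodule.lua links above):

-- (sketch) print the type of each module's cached output/gradInput
for _, node in ipairs(m.forwardnodes) do
   local mod = node.data.module
   if mod then
      print(torch.type(mod.output), torch.type(mod.gradInput))
   end
end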

Personally, I use it on a model that I convert from CUDA to float and then clone. The cached input and gradOutput buffers take up a lot of memory in my case, and therefore I get CUDA memory errors.

Edit: Actually calling :clearState() solves my immediate problem.
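
For reference, a minimal sketch of that workaround (the model variable name is illustrative, not from the thread): calling :clearState() drops the cached input/output/gradOutput buffers, so converting and cloning no longer carries the large CUDA tensors along.

-- (sketch) free cached buffers before converting and cloning
model:clearState()
local floatModel = model:float()   -- convert from CUDA to float (in place)
local copy = floatModel:clone()    -- clone no longer copies the big cached buffers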