lua/5.1/torch error
MironaGamil opened this issue
@LuoweiZhou I get this error when I train. Any help, please?
The training command is: th train_new.lua -gpuid 0 -finetune_cnn_after 10000 -max_iters 60000 -cnn_weight_decay 0.001 -cnn_learning_rate 0.00001 -learning_rate_decay_every 10000 -learning_rate_decay_start 10000
THCudaCheck FAIL file=/tmp/luarocks_cutorch-scm-1-2683/cutorch/lib/THC/generic/THCStorage.cu line=66 error=2 : out of memory
/home/mri/torch/install/bin/lua: /home/mri/torch/install/share/lua/5.1/torch/File.lua:351: cuda runtime error (2) : out of memory at /tmp/luarocks_cutorch-scm-1-2683/cutorch/lib/THC/generic/THCStorage.cu:66
stack traceback:
[C]: in function 'read'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:351: in function </home/mri/torch/install/share/lua/5.1/torch/File.lua:245>
[C]: in function 'read'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:351: in function 'readObject'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:369: in function 'readObject'
/home/mri/torch/install/share/lua/5.1/nn/Module.lua:192: in function 'read'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:351: in function 'readObject'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:369: in function 'readObject'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:369: in function 'readObject'
/home/mri/torch/install/share/lua/5.1/nn/Module.lua:192: in function 'read'
/home/mri/torch/install/share/lua/5.1/torch/File.lua:351: in function 'readObject'
/home/mri/torch/install/share/lua/5.1/nn/Module.lua:140: in function 'clone'
train_new.lua:152: in main chunk
[C]: in function 'dofile'
.../torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:150: in main chunk
[C]: ?
Try reducing the batch size
@LuoweiZhou I tried reducing the batch size down to 2, and the same error still occurred.
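The traceback above ends in 'clone' at train_new.lua:152, reached through File.lua 'readObject', which suggests the GPU may run out of memory while the model is being copied rather than while a batch is processed. That would be consistent with a smaller batch size not helping. One way to check is to print how much GPU memory is free before launching training. The following is a minimal sketch, not part of the repository: format_mem is a hypothetical helper name, and the script assumes a CUDA-enabled Torch install where cutorch.getMemoryUsage returns free and total bytes for the given device.

```lua
-- Hypothetical helper: turn byte counts into a short readable summary.
local function format_mem(free_bytes, total_bytes)
  local mb = 1024 * 1024
  return string.format('%d MB free of %d MB',
    math.floor(free_bytes / mb), math.floor(total_bytes / mb))
end

-- cutorch only loads on a CUDA-enabled Torch install, so guard the require.
local ok = pcall(require, 'cutorch')
if ok and cutorch then
  -- Query the currently selected device (cutorch device ids are 1-based).
  local dev = cutorch.getDevice()
  local free_bytes, total_bytes = cutorch.getMemoryUsage(dev)
  print('GPU ' .. dev .. ': ' .. format_mem(free_bytes, total_bytes))
else
  print('cutorch not available; skipping GPU memory check')
end
```

If the reported free memory is already low before train_new.lua starts, another process may be holding the GPU, or the card may simply be too small for the model plus its clones.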