rocmarchive / cunn_hip

CreateKernel() failure in CUNN

bensander opened this issue

root@3c838b17492a:~/cunn_hip# luajit -l cunn -e 'cunn.test()'
not found: THNN_CudaRReLU_updateOutput/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaRReLU_updateOutput
not found: THNN_CudaRReLU_updateGradInput/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaRReLU_updateGradInput
not found: THNN_CudaSparseLinear_updateOutput/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaSparseLinear_updateOutput
not found: THNN_CudaSparseLinear_accGradParameters/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaSparseLinear_accGradParameters
not found: THNN_CudaSparseLinear_legacyUpdateOutput/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaSparseLinear_legacyUpdateOutput
not found: THNN_CudaSparseLinear_legacyAccGradParameters/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaSparseLinear_legacyAccGradParameters
not found: THNN_CudaSparseLinear_zeroGradParameters/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaSparseLinear_zeroGradParameters
not found: THNN_CudaSparseLinear_updateParameters/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaSparseLinear_updateParameters
not found: THNN_CudaVolumetricMaxUnpooling_updateOutput/root/torch/install/share/lua/5.1/nn/THNN.lua:108: /root/torch/install/lib/lua/5.1/libTHCUNN.so: undefined symbol: THNN_CudaVolumetricMaxUnpooling_updateOutput
seed: 415006929
Running 169 tests
1/169 SpatialDilatedConvolution_backward_batch ........................ [ERROR]
2/169 SpatialUpSamplingBilinear_backward_batch ........................ [WAIT]HSADevice::CreateKernel(): Unable to create kernel

The full test command often runs out of memory under luajit. Try this one instead, which runs only the requested test:

luajit -l cunn -e 'cunn.test("SpatialDilatedConvolution_backward_batch")'
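
If several tests need checking, one possible way to keep each run small is to launch every test in its own luajit process, so that memory is handed back when the process exits. The Python sketch below wraps the single-test command above. `build_command` and `run_tests_separately` are hypothetical helper names, not part of cunn, and the sketch assumes `luajit` is on the PATH.

```python
import subprocess


def build_command(test_name, luajit="luajit"):
    """Return the argv that runs one cunn test in its own luajit process."""
    # Same invocation as the single-test command above, with the name substituted.
    return [luajit, "-l", "cunn", "-e", f'cunn.test("{test_name}")']


def run_tests_separately(test_names, luajit="luajit"):
    """Run each test in a fresh process and return {test name: exit code}.

    The exit code is None when the luajit binary cannot be started.
    """
    results = {}
    for name in test_names:
        try:
            completed = subprocess.run(build_command(name, luajit))
            results[name] = completed.returncode
        except FileNotFoundError:
            # Assumption: a missing interpreter is reported, not raised.
            results[name] = None
    return results
```

Calling `run_tests_separately(["SpatialDilatedConvolution_backward_batch", "SpatialUpSamplingBilinear_backward_batch"])` would run the two tests named in the log one after the other. Whether the exit code reflects a test failure depends on how `cunn.test` reports errors, which is not confirmed here.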

This is the kernel that can't be created:
1/169 SpatialDilatedConvolution_backward_batch ........................ [WAIT]HSADevice::CreateKernel(): Unable to create kernel 'void hip_impl::grid_launch_hip_impl_<void im2col(ihipStream_t*, float const*, int, int, int, int, int, int, int, int, int, int, int, float*)::{lambda(hip_impl::Empty_launch_parm const&, int const&, float const* const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, float* const&)#1}, hip_impl::Empty_launch_parm, int&, float const*&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, void im2col(ihipStream_t*, float const*, int, int, int, int, int, int, int, int, int, int, int, float*)::{lambda(hip_impl::Empty_launch_parm const&, int const&, float const* const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, float* const&)#1}, void im2col(ihipStream_t*, float const*, int, int, int, int, int, int, int, int, int, int, int, float*)::{lambda(hip_impl::Empty_launch_parm const&, int const&, float const* const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, float* const&)#1}, float*&>(hip_impl::(anonymous namespace)::New_grid_launch_tag, dim3, hip_impl::(anonymous namespace)::New_grid_launch_tag, int, hc::accelerator_view const&, void im2col(ihipStream_t*, float const*, int, int, int, int, int, int, int, int, int, int, int, float*)::{lambda(hip_impl::Empty_launch_parm const&, int const&, float const* const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, float* const&)#1}, hip_impl::Empty_launch_parm&&, int&, float const*&, int const&, int const&, int const&, int const&, int const&, int const&, 
int const&, int const&, int const&, int const&, void im2col(ihipStream_t*, float const*, int, int, int, int, int, int, int, int, int, int, int, float*)::{lambda(hip_impl::Empty_launch_parm const&, int const&, float const* const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, float* const&)#1}&&, void im2col(ihipStream_t*, float const*, int, int, int, int, int, int, int, int, int, int, int, float*)::{lambda(hip_impl::Empty_launch_parm const&, int const&, float const* const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, int const&, float* const&)#1}&&, float*&)::{lambda(dim3::tiled_index<3> const&)#1}::__cxxamp_trampoline(int, float const*, int, int, int, int, int, int, int, int, int, int, int, int, float*)'

I believe this is the mangled name of that kernel. It is the only "im2col" symbol in the attached ISA, so I am still guessing a bit:
_ZZN8hip_impl21grid_launch_hip_impl_IZ6im2colIfEvP12ihipStream_tPKT_iiiiiiiiiiiPS4_EUlRKNS_17Empty_launch_parmERKiRKPKfSC_SC_SC_SC_SC_SC_SC_SC_SC_SC_SC_SC_RKPfE_JS8_RiRSE_SC_SC_SC_SC_SC_SC_SC_SC_SC_SC_SL_SL_RSH_EEEvNS_12_GLOBAL__N_119New_grid_launch_tagE4dim3SQ_iRKN2hc16accelerator_viewES4_DpOT0_ENUlRKNSR_11tiled_indexILi3EEEE_19__cxxamp_trampolineEiSE_iiiiiiiiiiiiSH_
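
A rough cross-check of that guess is possible without a demangler. In the Itanium C++ ABI mangling, each plain identifier is written as its length followed by its characters, so `im2col` appears as `6im2col`. The Python sketch below looks for the identifiers from the error message inside a candidate symbol. The helper names are hypothetical, and a substring match is only a plausibility check that can produce false positives, not a full demangling.

```python
def length_prefixed(name):
    """Encode an identifier the way Itanium mangling writes a source name."""
    return f"{len(name)}{name}"


def contains_identifiers(mangled, names):
    """Map each identifier to True if its length-prefixed form is in `mangled`."""
    return {name: length_prefixed(name) in mangled for name in names}


# A short fragment copied from the symbol above, used only for illustration.
fragment = "21grid_launch_hip_impl_IZ6im2colIfEvP12ihipStream_t"
report = contains_identifiers(
    fragment, ["grid_launch_hip_impl_", "im2col", "ihipStream_t", "col2im"]
)
for name, found in report.items():
    print(name, "found" if found else "missing")
```

Running the same check on the full symbol with the names from the `CreateKernel()` message (for example `__cxxamp_trampoline` and `tiled_index`) would show whether every identifier in the demangled kernel name is present.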