This code provides working global memory and shared memory implementations.
Convolution by separable kernels is also implemented.
Convolution in CUDA C++, with performance tests.
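
As a rough illustration of why separable kernels matter, here is a minimal host-side reference sketch in plain C++ (it should also compile as host code under nvcc). It is not taken from this repository: the function names convolveDirect, convolveSeparable, maxAbsDiff and clampIndex are hypothetical, and clamp-to-edge border handling is an assumption. It shows that a 2D kernel formed as the outer product of a column filter and a row filter can be applied as a row pass followed by a column pass, cutting the work per pixel from (2r+1)^2 to 2*(2r+1) multiply-adds.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Clamp-to-edge border handling: an assumption made for this sketch.
static inline int clampIndex(int i, int n) { return std::min(std::max(i, 0), n - 1); }

// Direct 2D convolution with the outer-product kernel K[i][j] = col[i] * row[j].
// Cost per pixel: (2r+1)^2 multiply-adds.
std::vector<float> convolveDirect(const std::vector<float>& in, int w, int h,
                                  const std::vector<float>& row,
                                  const std::vector<float>& col) {
    assert(row.size() % 2 == 1 && row.size() == col.size());
    const int r = static_cast<int>(row.size()) / 2;
    std::vector<float> out(in.size());
    for (int y = 0; y < h; ++y) {
        for (int x = 0; x < w; ++x) {
            float sum = 0.0f;
            for (int dy = -r; dy <= r; ++dy)
                for (int dx = -r; dx <= r; ++dx)
                    sum += col[dy + r] * row[dx + r] *
                           in[clampIndex(y - dy, h) * w + clampIndex(x - dx, w)];
            out[y * w + x] = sum;
        }
    }
    return out;
}

// Separable version: a row pass into a temporary image, then a column pass.
// Cost per pixel: 2 * (2r+1) multiply-adds.
std::vector<float> convolveSeparable(const std::vector<float>& in, int w, int h,
                                     const std::vector<float>& row,
                                     const std::vector<float>& col) {
    assert(row.size() % 2 == 1 && row.size() == col.size());
    const int r = static_cast<int>(row.size()) / 2;
    std::vector<float> tmp(in.size()), out(in.size());
    for (int y = 0; y < h; ++y) {          // row pass: filter along x
        for (int x = 0; x < w; ++x) {
            float sum = 0.0f;
            for (int dx = -r; dx <= r; ++dx)
                sum += row[dx + r] * in[y * w + clampIndex(x - dx, w)];
            tmp[y * w + x] = sum;
        }
    }
    for (int y = 0; y < h; ++y) {          // column pass: filter along y
        for (int x = 0; x < w; ++x) {
            float sum = 0.0f;
            for (int dy = -r; dy <= r; ++dy)
                sum += col[dy + r] * tmp[clampIndex(y - dy, h) * w + x];
            out[y * w + x] = sum;
        }
    }
    return out;
}

// Largest absolute element-wise difference between two images of equal size.
float maxAbsDiff(const std::vector<float>& a, const std::vector<float>& b) {
    assert(a.size() == b.size());
    float m = 0.0f;
    for (std::size_t i = 0; i < a.size(); ++i) m = std::max(m, std::fabs(a[i] - b[i]));
    return m;
}
```

A CPU reference of this kind is typically used to validate GPU output: run both versions on the same image and check that maxAbsDiff stays within a small floating-point tolerance. On the GPU, each of the two passes maps naturally to its own kernel, which is where the choice between global and shared memory comes in.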