OpenCL GPU acceleration with caffeOnACL

Question

OpenCL GPU acceleration with caffeOnACL

JammyZhou opened this issue 7 years ago · comments

To enable the build for GPU support with caffeOnACL, I did two things below:

Comment out “CPU_ONLY := 1” in Makefile.config
Comment out “COMMON_FLAGS += -DCPU_ONLY” in Makefile

But the build failed with error below. It looks like the GPU support is not ready yet for cafffeOnACL, since the CUDA related files and code are still there in that path. Did I miss something?

CXX src/caffe/solvers/nesterov_solver.cpp
In file included from ./include/caffe/common.hpp:19:0,
                 from ./include/caffe/blob.hpp:8,
                 from ./include/caffe/net.hpp:10,
                 from ./include/caffe/solver.hpp:7,
                 from ./include/caffe/sgd_solvers.hpp:7,
                 from src/caffe/solvers/nesterov_solver.cpp:3:
./include/caffe/util/device_alternate.hpp:38:23: fatal error: cublas_v2.h: No such file or directory
 #include <cublas_v2.h>
                       ^
compilation terminated.
Makefile:622: recipe for target '.build_release/src/caffe/solvers/nesterov_solver.o' failed
make: *** [.build_release/src/caffe/solvers/nesterov_solver.o] Error 1

Honggui · Answer 1 · Sun Aug 13 2017 16:23:09 GMT+0800 (China Standard Time)

Hi Jammy，
We use “CPU_ONLY” mode to support ACL。Although we compile source code with CPU_ONLY，we can use Caffe::set_mode(Caffe::GPU) to use ARM's GPU in our application. You could refer ./examples/cpp_classification/xxx_gpu.cpp as an example.
Regards,
Honggui

Jammy Zhou · Answer 2 · Tue Aug 15 2017 16:43:15 GMT+0800 (China Standard Time)

Hi Honggui,

Thanks for your reply. I can confirm that OpenCL can be used by classification_profiling_gpu.bin, but I ran into some error below, which is similar with mxnetOnACL as I reported in OAID/MXNet-HRT#3. Do you have some insights about it?

classification_profiling_gpu.bin: tools/intern/llvmufgen/HalfSupport.cpp:163: llvm::Value* {anonymous}::HalfSupportPass::getValueAs(llvm::Value*) [with bool ToHalf = false]: Assertion `(isa<Constant>(val) || is<Half>(val) != 0) && "Requested value isn't half."' failed.
Stack dump:
0.	Running pass 'HalfSupportPass' on module 'BuildGroup_2'.
Aborted

Jammy Zhou · Answer 3 · Fri Aug 25 2017 14:06:57 GMT+0800 (China Standard Time)

The problem is caused by missing cl_khr_fp16 support on my ARM platform