ROCm / HIP

HIP: C++ Heterogeneous-Compute Interface for Portability

Home Page:https://rocmdocs.amd.com/projects/HIP/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

[Issue]: lld: error: undefined hidden symbol: __ockl_get_group_id, __ockl_get_local_size, __ockl_get_local_id

mostwanted002 opened this issue · comments

Problem Description

Linker ldd throwing error while compiling any OpenCL kernel from hashcat

Current driver version: 23.40.02-240111a-399551C-AMD-Software-Adrenalin-Edition

# .\hashcat.exe -b -m1000
hashcat (v6.2.6-848-gc1a10518f+) starting in benchmark mode

Benchmarking uses hand-optimized kernel code by default.
You can use it in your cracking session by setting the -O option.
Note: Using optimized kernel code limits the maximum supported password length.
To disable the optimized kernel code in benchmark mode, use the -w option.

The device #3 specifically listed was skipped because it is an alias of device #1

The device #4 specifically listed was skipped because it is an alias of device #2

HIP API (HIP 5.7.32000)
=======================
* Device #1: AMD Radeon RX 6900 XT, 16240/16368 MB, 40MCU
* Device #2: AMD Radeon RX 6900 XT, 16240/16368 MB, 40MCU

OpenCL API (OpenCL 2.1 AMD-APP (3608.0)) - Platform #1 [Advanced Micro Devices, Inc.]
=====================================================================================
* Device #3: AMD Radeon RX 6900 XT, skipped
* Device #4: AMD Radeon RX 6900 XT, skipped

Benchmark relevant options:
===========================
* --backend-devices-virtual=1
* --optimized-kernel-enable

-----------------------
* Hash-Mode 1000 (NTLM)
-----------------------

hiprtcCompileProgram(): HIPRTC_ERROR_COMPILATION

lld: error: undefined hidden symbol: __ockl_get_group_id
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_memset)
>>> referenced 7 more times

lld: error: undefined hidden symbol: __ockl_get_local_size
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_memset)
>>> referenced 7 more times

lld: error: undefined hidden symbol: __ockl_get_local_id
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-7723ee\input\LLVMBitcode.bc.o:(gpu_memset)
>>> referenced 7 more times

* Device #1: Kernel ./OpenCL/shared.cl build failed.

* Device #1: Kernel ./OpenCL/shared.cl build failed.

Started: Thu Feb 08 10:26:10 2024
Stopped: Thu Feb 08 10:26:12 2024

The backend information works perfectly and reports the detected HIP version.

# .\hashcat.exe -I
hashcat (v6.2.6-848-gc1a10518f+) starting in backend information mode

The device #3 specifically listed was skipped because it is an alias of device #1

The device #4 specifically listed was skipped because it is an alias of device #2

HIP Info:
=========

HIP.Version.: 5.7.32000

Backend Device ID #1 (Alias: #3)
  Name...........: AMD Radeon RX 6900 XT
  Processor(s)...: 40
  Clock..........: 2400
  Memory.Total...: 16368 MB
  Memory.Free....: 16240 MB
  Local.Memory...: 64 KB
  PCI.Addr.BDFe..: 0000:12:00.0

Backend Device ID #2 (Alias: #4)
  Name...........: AMD Radeon RX 6900 XT
  Processor(s)...: 40
  Clock..........: 2255
  Memory.Total...: 16368 MB
  Memory.Free....: 16240 MB
  Local.Memory...: 64 KB
  PCI.Addr.BDFe..: 0000:0f:00.0

OpenCL Info:
============

OpenCL Platform ID #1
  Vendor..: Advanced Micro Devices, Inc.
  Name....: AMD Accelerated Parallel Processing
  Version.: OpenCL 2.1 AMD-APP (3608.0)

  Backend Device ID #3 (Alias: #1)
    Type...........: GPU
    Vendor.ID......: 1
    Vendor.........: Advanced Micro Devices, Inc.
    Name...........: AMD Radeon RX 6900 XT
    Version........: OpenCL 2.0 AMD-APP (3608.0)
    Processor(s)...: 40
    Clock..........: 2400
    Memory.Total...: 16368 MB (limited to 13912 MB allocatable in one block)
    Memory.Free....: 16256 MB
    Local.Memory...: 64 KB
    OpenCL.Version.: OpenCL C 2.0
    Driver.Version.: 3608.0 (PAL,LC)
    PCI.Addr.BDF...: 12:00.0

  Backend Device ID #4 (Alias: #2)
    Type...........: GPU
    Vendor.ID......: 1
    Vendor.........: Advanced Micro Devices, Inc.
    Name...........: AMD Radeon RX 6900 XT
    Version........: OpenCL 2.0 AMD-APP (3608.0)
    Processor(s)...: 40
    Clock..........: 2255
    Memory.Total...: 16368 MB (limited to 13912 MB allocatable in one block)
    Memory.Free....: 16256 MB
    Local.Memory...: 64 KB
    OpenCL.Version.: OpenCL C 2.0
    Driver.Version.: 3608.0 (PAL,LC)
    PCI.Addr.BDF...: 0f:00.0

It used to work fine with HIP SDK 5.5 for Windows

Operating System

Windows 11 10.0.22631

CPU

AMD Ryzen 9 5950X 16-Core Processor

GPU

AMD Radeon Pro W6800

ROCm Version

ROCm 5.7.1

ROCm Component

clang-ocl, HIP

Steps to Reproduce

  1. Clone the hashcat repo: https://github.com/mostwanted002/hashcat/
  2. Checkout the branch hip_rtc_windows_env_implementation
  3. Build for windows with make win
  4. Execute any of the benchmark .\hashcat.exe -b -m1000

(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

No response

Additional Information

No response

Could you please set the following environment variables, rerun the test, and attach the output. Thanks.

export AMD_COMGR_EMIT_VERBOSE_LOGS=1
export AMD_COMGR_REDIRECT_LOGS=stderr
export AMD_COMGR_SAVE_TEMPS=1

hashcat (v6.2.6-848-gc1a10518f+) starting in benchmark mode

Benchmarking uses hand-optimized kernel code by default.
You can use it in your cracking session by setting the -O option.
Note: Using optimized kernel code limits the maximum supported password length.
To disable the optimized kernel code in benchmark mode, use the -w option.

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_ADD_PRECOMPILED_HEADERS
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-cl-kernel-arg-info" "-D__OPENCL_VERSION__=200" "-D__IMAGE_SUPPORT__=1" "-Xclang" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-mcode-object-version=5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_COMPILE_SOURCE_TO_BC
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-cl-kernel-arg-info" "-D__OPENCL_VERSION__=200" "-D__IMAGE_SUPPORT__=1" "-Xclang" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-mcode-object-version=5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "-I" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\include" "-include-pch" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\include\opencl1.2-c.pch" "-Xclang" "-fno-validate-pch" "-x" "cl" "-std=cl1.2" "-cl-no-stdinc" "-c" "-emit-llvm" "-fshort-wchar" "-O3" "-cl-kernel-arg-info" "-D__OPENCL_VERSION__=200" "-D__IMAGE_SUPPORT__=1" "-Xclang" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-mcode-object-version=5" "-nogpulib" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\input\CompileSource" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\output\CompileSource.bc"
     Driver Job Args: clang "-cc1" "-mcode-object-version=5" "-mllvm" "--amdhsa-code-object-version=5" "-triple" "amdgcn-amd-amdhsa" "-emit-llvm-bc" "-emit-llvm-uselists" "-clear-ast-before-backend" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "CompileSource" "-mrelocation-model" "pic" "-pic-level" "2" "-fhalf-no-semantic-interposition" "-mframe-pointer=none" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-fvisibility=hidden" "-fapply-global-visibility-to-externs" "-target-cpu" "gfx1030" "-mllvm" "-treat-scalable-fixed-error-as-warning" "-debugger-tuning=gdb" "-resource-dir" "lib\clang\17.0.0" "-include-pch" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\include\opencl1.2-c.pch" "-I" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\include" "-D" "__OPENCL_VERSION__=200" "-D" "__IMAGE_SUPPORT__=1" "-O3" "-std=cl1.2" "-fdebug-compilation-dir=G:\hashcat" "-ferror-limit" "19" "-fmessage-length=280" "-cl-kernel-arg-info" "-nogpulib" "-fwchar-type=short" "-fno-signed-wchar" "-fno-threadsafe-statics" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-fno-validate-pch" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-faddrsig" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\output\CompileSource.bc" "-x" "cl" "C:\Users\mayan\AppData\Local\Temp\comgr-f1d7be\input\CompileSource"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_ADD_DEVICE_LIBRARIES
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "code_object_v5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_LINK_BC_TO_BC
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "code_object_v5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
             Linking: LLVM Binary
             Linking: opencl_lib.bc
             Linking: ocml_lib.bc
             Linking: ockl_lib.bc
             Linking: oclc_isa_version_1030.bc
             Linking: oclc_correctly_rounded_sqrt_off_lib.bc
             Linking: oclc_daz_opt_off_lib.bc
             Linking: oclc_finite_only_off_lib.bc
             Linking: oclc_unsafe_math_off_lib.bc
             Linking: oclc_wavefrontsize64_off_lib.bc
             Linking: oclc_abi_version_500_lib.bc
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_CODEGEN_BC_TO_RELOCATABLE
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-cl-kernel-arg-info" "-mllvm" "-amdgpu-internalize-symbols" "-mcode-object-version=5"
                Path:
            Language: AMD_COMGR_LANGUAGE_NONE
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "-c" "-mllvm" "-amdgpu-internalize-symbols" "-O3" "-cl-kernel-arg-info" "-mllvm" "-amdgpu-internalize-symbols" "-mcode-object-version=5" "-nogpulib" "C:\Users\mayan\AppData\Local\Temp\comgr-c6ac7b\input\linked.bc" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-c6ac7b\output\linked.bc.o"
     Driver Job Args: clang "-cc1" "-mcode-object-version=5" "-mllvm" "--amdhsa-code-object-version=5" "-triple" "amdgcn-amd-amdhsa" "-emit-obj" "-clear-ast-before-backend" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "linked.bc" "-mrelocation-model" "pic" "-pic-level" "2" "-fhalf-no-semantic-interposition" "-mframe-pointer=none" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-fvisibility=hidden" "-fapply-global-visibility-to-externs" "-target-cpu" "gfx1030" "-mllvm" "-treat-scalable-fixed-error-as-warning" "-debugger-tuning=gdb" "-resource-dir" "lib\clang\17.0.0" "-O3" "-fdebug-compilation-dir=G:\hashcat" "-ferror-limit" "19" "-fmessage-length=280" "-cl-kernel-arg-info" "-nogpulib" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-mllvm" "-amdgpu-internalize-symbols" "-mllvm" "-amdgpu-internalize-symbols" "-faddrsig" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-c6ac7b\output\linked.bc.o" "-x" "ir" "C:\Users\mayan\AppData\Local\Temp\comgr-c6ac7b\input\linked.bc"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_LINK_RELOCATABLE_TO_EXECUTABLE
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options:
                Path:
            Language: AMD_COMGR_LANGUAGE_NONE
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "C:\Users\mayan\AppData\Local\Temp\comgr-0bb887\input\linked.bc.o" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-0bb887\output\a.so"
     Driver Job Args: lld "C:\Users\mayan\AppData\Local\Temp\comgr-0bb887\input\linked.bc.o" "-shared" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-0bb887\output\a.so"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_ADD_PRECOMPILED_HEADERS
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-cl-kernel-arg-info" "-D__OPENCL_VERSION__=200" "-D__IMAGE_SUPPORT__=1" "-Xclang" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-mcode-object-version=5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_COMPILE_SOURCE_TO_BC
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-cl-kernel-arg-info" "-D__OPENCL_VERSION__=200" "-D__IMAGE_SUPPORT__=1" "-Xclang" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-mcode-object-version=5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "-I" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\include" "-include-pch" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\include\opencl1.2-c.pch" "-Xclang" "-fno-validate-pch" "-x" "cl" "-std=cl1.2" "-cl-no-stdinc" "-c" "-emit-llvm" "-fshort-wchar" "-O3" "-cl-kernel-arg-info" "-D__OPENCL_VERSION__=200" "-D__IMAGE_SUPPORT__=1" "-Xclang" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-mcode-object-version=5" "-nogpulib" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\input\CompileSource" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\output\CompileSource.bc"
     Driver Job Args: clang "-cc1" "-mcode-object-version=5" "-mllvm" "--amdhsa-code-object-version=5" "-triple" "amdgcn-amd-amdhsa" "-emit-llvm-bc" "-emit-llvm-uselists" "-clear-ast-before-backend" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "CompileSource" "-mrelocation-model" "pic" "-pic-level" "2" "-fhalf-no-semantic-interposition" "-mframe-pointer=none" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-fvisibility=hidden" "-fapply-global-visibility-to-externs" "-target-cpu" "gfx1030" "-mllvm" "-treat-scalable-fixed-error-as-warning" "-debugger-tuning=gdb" "-resource-dir" "lib\clang\17.0.0" "-include-pch" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\include\opencl1.2-c.pch" "-I" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\include" "-D" "__OPENCL_VERSION__=200" "-D" "__IMAGE_SUPPORT__=1" "-O3" "-std=cl1.2" "-fdebug-compilation-dir=G:\hashcat" "-ferror-limit" "19" "-fmessage-length=280" "-cl-kernel-arg-info" "-nogpulib" "-fwchar-type=short" "-fno-signed-wchar" "-fno-threadsafe-statics" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-fno-validate-pch" "-cl-ext=+cl_khr_fp64,+cl_khr_global_int32_base_atomics,+cl_khr_global_int32_extended_atomics,+cl_khr_local_int32_base_atomics,+cl_khr_local_int32_extended_atomics,+cl_khr_int64_base_atomics,+cl_khr_int64_extended_atomics,+cl_khr_3d_image_writes,+cl_khr_byte_addressable_store,+cl_khr_fp16,+cl_khr_gl_sharing,+cl_amd_device_attribute_query,+cl_amd_media_ops,+cl_amd_media_ops2,+cl_khr_d3d10_sharing,+cl_khr_d3d11_sharing,+cl_khr_dx9_media_sharing,+cl_khr_image2d_from_buffer,+cl_khr_subgroups,+cl_khr_gl_event,+cl_khr_mipmap_image,+cl_khr_mipmap_image_writes,+cl_amd_copy_buffer_p2p,+cl_amd_planar_yuv" "-mllvm" "-amdgpu-prelink" "-faddrsig" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\output\CompileSource.bc" "-x" "cl" "C:\Users\mayan\AppData\Local\Temp\comgr-44d52d\input\CompileSource"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_ADD_DEVICE_LIBRARIES
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "code_object_v5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_LINK_BC_TO_BC
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "code_object_v5"
                Path:
            Language: AMD_COMGR_LANGUAGE_OPENCL_1_2
             Linking: LLVM Binary
             Linking: opencl_lib.bc
             Linking: ocml_lib.bc
             Linking: ockl_lib.bc
             Linking: oclc_isa_version_1030.bc
             Linking: oclc_correctly_rounded_sqrt_off_lib.bc
             Linking: oclc_daz_opt_off_lib.bc
             Linking: oclc_finite_only_off_lib.bc
             Linking: oclc_unsafe_math_off_lib.bc
             Linking: oclc_wavefrontsize64_off_lib.bc
             Linking: oclc_abi_version_500_lib.bc
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_CODEGEN_BC_TO_RELOCATABLE
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-cl-kernel-arg-info" "-mllvm" "-amdgpu-internalize-symbols" "-mcode-object-version=5"
                Path:
            Language: AMD_COMGR_LANGUAGE_NONE
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "-c" "-mllvm" "-amdgpu-internalize-symbols" "-O3" "-cl-kernel-arg-info" "-mllvm" "-amdgpu-internalize-symbols" "-mcode-object-version=5" "-nogpulib" "C:\Users\mayan\AppData\Local\Temp\comgr-8d859e\input\linked.bc" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-8d859e\output\linked.bc.o"
     Driver Job Args: clang "-cc1" "-mcode-object-version=5" "-mllvm" "--amdhsa-code-object-version=5" "-triple" "amdgcn-amd-amdhsa" "-emit-obj" "-clear-ast-before-backend" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "linked.bc" "-mrelocation-model" "pic" "-pic-level" "2" "-fhalf-no-semantic-interposition" "-mframe-pointer=none" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-fvisibility=hidden" "-fapply-global-visibility-to-externs" "-target-cpu" "gfx1030" "-mllvm" "-treat-scalable-fixed-error-as-warning" "-debugger-tuning=gdb" "-resource-dir" "lib\clang\17.0.0" "-O3" "-fdebug-compilation-dir=G:\hashcat" "-ferror-limit" "19" "-fmessage-length=280" "-cl-kernel-arg-info" "-nogpulib" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-mllvm" "-amdgpu-internalize-symbols" "-mllvm" "-amdgpu-internalize-symbols" "-faddrsig" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-8d859e\output\linked.bc.o" "-x" "ir" "C:\Users\mayan\AppData\Local\Temp\comgr-8d859e\input\linked.bc"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_LINK_RELOCATABLE_TO_EXECUTABLE
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options:
                Path:
            Language: AMD_COMGR_LANGUAGE_NONE
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "C:\Users\mayan\AppData\Local\Temp\comgr-aa9a9d\input\linked.bc.o" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-aa9a9d\output\a.so"
     Driver Job Args: lld "C:\Users\mayan\AppData\Local\Temp\comgr-aa9a9d\input\linked.bc.o" "-shared" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-aa9a9d\output\a.so"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

The device #3 specifically listed was skipped because it is an alias of device #1

The device #4 specifically listed was skipped because it is an alias of device #2

HIP API (HIP 5.7.32000)
=======================
* Device #1: AMD Radeon RX 6900 XT, 16240/16368 MB, 40MCU
* Device #2: AMD Radeon RX 6900 XT, 16240/16368 MB, 40MCU

OpenCL API (OpenCL 2.1 AMD-APP (3608.0)) - Platform #1 [Advanced Micro Devices, Inc.]
=====================================================================================
* Device #3: AMD Radeon RX 6900 XT, skipped
* Device #4: AMD Radeon RX 6900 XT, skipped

Benchmark relevant options:
===========================
* --backend-devices-virtual=1
* --optimized-kernel-enable

-----------------------
* Hash-Mode 1000 (NTLM)
-----------------------

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_COMPILE_SOURCE_WITH_DEVICE_LIBS_TO_BC
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-mcumode" "--hip-version=5.7.32000" "-DHIP_VERSION_MAJOR=5" "-DHIP_VERSION_MINOR=7" "-DHIP_VERSION_PATCH=32000" "-D__HIPCC_RTC__" "-include" "hiprtc_runtime.h" "-std=c++14" "-nogpuinc" "-Wno-gnu-line-marker" "-Wno-missing-prototypes" "-target" "x86_64-pc-windows-msvc" "-fms-extensions" "-fms-compatibility" "--gpu-max-threads-per-block=32" "-nocudainc" "-nocudalib" "-D INCLUDE_PATH=G:\\hashcat/OpenCL/" "-D XM2S(x)=#x" "-D M2S(x)=XM2S(x)" "-D" "KERNEL_STATIC" "-D" "LOCAL_MEM_TYPE=1" "-D" "VENDOR_ID=256" "-D" "CUDA_ARCH=1030" "-D" "HAS_ADD=0" "-D" "HAS_ADDC=0" "-D" "HAS_SUB=0" "-D" "HAS_SUBC=0" "-D" "HAS_VADD=1" "-D" "HAS_VADDC=1" "-D" "HAS_VADD_CO=1" "-D" "HAS_VADDC_CO=1" "-D" "HAS_VSUB=1" "-D" "HAS_VSUBB=1" "-D" "HAS_VSUB_CO=1" "-D" "HAS_VSUBB_CO=1" "-D" "HAS_VPERM=1" "-D" "HAS_VADD3=1" "-D" "HAS_VBFE=1" "-D" "HAS_BFE=0" "-D" "HAS_LOP3=0" "-D" "HAS_MOV64=0" "-D" "HAS_PRMT=0" "-D" "VECT_SIZE=1" "-D" "DEVICE_TYPE=4" "-D" "DGST_R0=0" "-D" "DGST_R1=3" "-D" "DGST_R2=2" "-D" "DGST_R3=1" "-D" "DGST_ELEM=4" "-D" "KERN_TYPE=1000" "-D" "ATTACK_EXEC=11" "-D" "ATTACK_KERN=3" "-D" "ATTACK_MODE=3" "-w" "-Xclang" "-disable-llvm-passes"
                Path:
            Language: AMD_COMGR_LANGUAGE_HIP
    Compilation Args:  "--offload-arch=gfx1030" "-I" "C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\include" "-x" "hip" "-std=c++11" "-target" "x86_64-unknown-linux-gnu" "--cuda-device-only" "-isystem" "/include" "-isystem" "C:\Program Files\AMD\ROCm\5.7\/include" "-isystem" "llvm/lib/clang/17.0.0" "-isystem" "llvm/lib/clang/17.0.0/include" "-c" "-emit-llvm" "-fshort-wchar" "--rocm-path=C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\rocm" "-O3" "-mcumode" "--hip-version=5.7.32000" "-DHIP_VERSION_MAJOR=5" "-DHIP_VERSION_MINOR=7" "-DHIP_VERSION_PATCH=32000" "-D__HIPCC_RTC__" "-include" "hiprtc_runtime.h" "-std=c++14" "-nogpuinc" "-Wno-gnu-line-marker" "-Wno-missing-prototypes" "-target" "x86_64-pc-windows-msvc" "-fms-extensions" "-fms-compatibility" "--gpu-max-threads-per-block=32" "-nocudainc" "-nocudalib" "-D INCLUDE_PATH=G:\hashcat/OpenCL/" "-D XM2S(x)=#x" "-D M2S(x)=XM2S(x)" "-D" "KERNEL_STATIC" "-D" "LOCAL_MEM_TYPE=1" "-D" "VENDOR_ID=256" "-D" "CUDA_ARCH=1030" "-D" "HAS_ADD=0" "-D" "HAS_ADDC=0" "-D" "HAS_SUB=0" "-D" "HAS_SUBC=0" "-D" "HAS_VADD=1" "-D" "HAS_VADDC=1" "-D" "HAS_VADD_CO=1" "-D" "HAS_VADDC_CO=1" "-D" "HAS_VSUB=1" "-D" "HAS_VSUBB=1" "-D" "HAS_VSUB_CO=1" "-D" "HAS_VSUBB_CO=1" "-D" "HAS_VPERM=1" "-D" "HAS_VADD3=1" "-D" "HAS_VBFE=1" "-D" "HAS_BFE=0" "-D" "HAS_LOP3=0" "-D" "HAS_MOV64=0" "-D" "HAS_PRMT=0" "-D" "VECT_SIZE=1" "-D" "DEVICE_TYPE=4" "-D" "DGST_R0=0" "-D" "DGST_R1=3" "-D" "DGST_R2=2" "-D" "DGST_R3=1" "-D" "DGST_ELEM=4" "-D" "KERN_TYPE=1000" "-D" "ATTACK_EXEC=11" "-D" "ATTACK_KERN=3" "-D" "ATTACK_MODE=3" "-w" "-Xclang" "-disable-llvm-passes" "C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\input\shared_kernel" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\output\shared_kernel.bc"
     Driver Job Args: clang "-cc1" "-triple" "amdgcn-amd-amdhsa" "-aux-triple" "x86_64-pc-windows-msvc" "-emit-llvm-bc" "-emit-llvm-uselists" "-clear-ast-before-backend" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "shared_kernel" "-mrelocation-model" "pic" "-pic-level" "2" "-fhalf-no-semantic-interposition" "-mframe-pointer=none" "-fno-rounding-math" "-mconstructor-aliases" "-aux-target-cpu" "x86-64" "-fcuda-is-device" "-mllvm" "-amdgpu-internalize-symbols" "--gpu-max-threads-per-block=32" "-fcuda-allow-variadic-functions" "-fvisibility=hidden" "-fapply-global-visibility-to-externs" "-target-cpu" "gfx1030" "-target-feature" "+cumode" "-mllvm" "-treat-scalable-fixed-error-as-warning" "-debugger-tuning=gdb" "-resource-dir" "lib\clang\17.0.0" "-internal-isystem" "lib\clang\17.0.0\include\cuda_wrappers" "-isystem" "/include" "-isystem" "C:\Program Files\AMD\ROCm\5.7\/include" "-isystem" "llvm/lib/clang/17.0.0" "-isystem" "llvm/lib/clang/17.0.0/include" "-include" "hiprtc_runtime.h" "-D" "HIP_VERSION_MAJOR=5" "-D" "HIP_VERSION_MINOR=7" "-D" "HIP_VERSION_PATCH=32000" "-D" "__HIPCC_RTC__" "-D" " INCLUDE_PATH=G:\hashcat/OpenCL/" "-D" " XM2S(x)=#x" "-D" " M2S(x)=XM2S(x)" "-D" "KERNEL_STATIC" "-D" "LOCAL_MEM_TYPE=1" "-D" "VENDOR_ID=256" "-D" "CUDA_ARCH=1030" "-D" "HAS_ADD=0" "-D" "HAS_ADDC=0" "-D" "HAS_SUB=0" "-D" "HAS_SUBC=0" "-D" "HAS_VADD=1" "-D" "HAS_VADDC=1" "-D" "HAS_VADD_CO=1" "-D" "HAS_VADDC_CO=1" "-D" "HAS_VSUB=1" "-D" "HAS_VSUBB=1" "-D" "HAS_VSUB_CO=1" "-D" "HAS_VSUBB_CO=1" "-D" "HAS_VPERM=1" "-D" "HAS_VADD3=1" "-D" "HAS_VBFE=1" "-D" "HAS_BFE=0" "-D" "HAS_LOP3=0" "-D" "HAS_MOV64=0" "-D" "HAS_PRMT=0" "-D" "VECT_SIZE=1" "-D" "DEVICE_TYPE=4" "-D" "DGST_R0=0" "-D" "DGST_R1=3" "-D" "DGST_R2=2" "-D" "DGST_R3=1" "-D" "DGST_ELEM=4" "-D" "KERN_TYPE=1000" "-D" "ATTACK_EXEC=11" "-D" "ATTACK_KERN=3" "-D" "ATTACK_MODE=3" "-I" "C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\include" "-D" "HIP_VERSION_MAJOR=5" "-D" "HIP_VERSION_MINOR=7" "-D" "HIP_VERSION_PATCH=32000" "-D" "__HIPCC_RTC__" "-D" " INCLUDE_PATH=G:\hashcat/OpenCL/" "-D" " XM2S(x)=#x" "-D" " M2S(x)=XM2S(x)" "-D" "KERNEL_STATIC" "-D" "LOCAL_MEM_TYPE=1" "-D" "VENDOR_ID=256" "-D" "CUDA_ARCH=1030" "-D" "HAS_ADD=0" "-D" "HAS_ADDC=0" "-D" "HAS_SUB=0" "-D" "HAS_SUBC=0" "-D" "HAS_VADD=1" "-D" "HAS_VADDC=1" "-D" "HAS_VADD_CO=1" "-D" "HAS_VADDC_CO=1" "-D" "HAS_VSUB=1" "-D" "HAS_VSUBB=1" "-D" "HAS_VSUB_CO=1" "-D" "HAS_VSUBB_CO=1" "-D" "HAS_VPERM=1" "-D" "HAS_VADD3=1" "-D" "HAS_VBFE=1" "-D" "HAS_BFE=0" "-D" "HAS_LOP3=0" "-D" "HAS_MOV64=0" "-D" "HAS_PRMT=0" "-D" "VECT_SIZE=1" "-D" "DEVICE_TYPE=4" "-D" "DGST_R0=0" "-D" "DGST_R1=3" "-D" "DGST_R2=2" "-D" "DGST_R3=1" "-D" "DGST_ELEM=4" "-D" "KERN_TYPE=1000" "-D" "ATTACK_EXEC=11" "-D" "ATTACK_KERN=3" "-D" "ATTACK_MODE=3" "-internal-isystem" "lib\clang\17.0.0\include" "-internal-isystem" "C:\Program Files\Microsoft Visual Studio\2022\Community\VC\Tools\MSVC\14.38.33130\include" "-internal-isystem" "C:\Program Files\Microsoft Visual Studio\2022\Community\VC\Tools\MSVC\14.38.33130\atlmfc\include" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\ucrt" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\shared" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\um" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\winrt" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\cppwinrt" "-internal-isystem" "lib\clang\17.0.0\include" "-internal-isystem" "C:\Program Files\Microsoft Visual Studio\2022\Community\VC\Tools\MSVC\14.38.33130\include" "-internal-isystem" "C:\Program Files\Microsoft Visual Studio\2022\Community\VC\Tools\MSVC\14.38.33130\atlmfc\include" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\ucrt" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\shared" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\um" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\winrt" "-internal-isystem" "C:\Program Files (x86)\Windows Kits\10\Include\10.0.22621.0\cppwinrt" "-O3" "-Wno-gnu-line-marker" "-Wno-missing-prototypes" "-w" "-std=c++14" "-fdeprecated-macro" "-fno-autolink" "-fdebug-compilation-dir=G:\hashcat" "-ferror-limit" "19" "-fmessage-length=280" "-fhip-new-launch-api" "-nogpulib" "-fwchar-type=short" "-fno-signed-wchar" "-fms-extensions" "-fms-compatibility" "-fms-compatibility-version=19.38.33134" "-fdelayed-template-parsing" "-fcxx-exceptions" "-fexceptions" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-disable-llvm-passes" "-cuid=ae5fdedfe6983874" "-fcuda-allow-variadic-functions" "-faddrsig" "-D__GCC_HAVE_DWARF2_CFI_ASM=1" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\output\shared_kernel.bc" "-x" "hip" "C:\Users\mayan\AppData\Local\Temp\comgr-0e2bbf\input\shared_kernel"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_LINK_BC_TO_BC
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options:
                Path:
            Language: AMD_COMGR_LANGUAGE_HIP
             Linking: linked
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_CODEGEN_BC_TO_RELOCATABLE
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options: "-O3" "-mllvm" "-amdgpu-internalize-symbols"
                Path:
            Language: AMD_COMGR_LANGUAGE_NONE
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "-c" "-mllvm" "-amdgpu-internalize-symbols" "-O3" "-mllvm" "-amdgpu-internalize-symbols" "-nogpulib" "C:\Users\mayan\AppData\Local\Temp\comgr-8dc889\input\LLVMBitcode.bc" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-8dc889\output\LLVMBitcode.bc.o"
     Driver Job Args: clang "-cc1" "-triple" "amdgcn-amd-amdhsa" "-emit-obj" "-clear-ast-before-backend" "-disable-llvm-verifier" "-discard-value-names" "-main-file-name" "LLVMBitcode.bc" "-mrelocation-model" "pic" "-pic-level" "2" "-fhalf-no-semantic-interposition" "-mframe-pointer=none" "-ffp-contract=on" "-fno-rounding-math" "-mconstructor-aliases" "-fvisibility=hidden" "-fapply-global-visibility-to-externs" "-target-cpu" "gfx1030" "-mllvm" "-treat-scalable-fixed-error-as-warning" "-debugger-tuning=gdb" "-resource-dir" "lib\clang\17.0.0" "-O3" "-fdebug-compilation-dir=G:\hashcat" "-ferror-limit" "19" "-fmessage-length=280" "-nogpulib" "-fcolor-diagnostics" "-vectorize-loops" "-vectorize-slp" "-mllvm" "-amdgpu-internalize-symbols" "-mllvm" "-amdgpu-internalize-symbols" "-faddrsig" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-8dc889\output\LLVMBitcode.bc.o" "-x" "ir" "C:\Users\mayan\AppData\Local\Temp\comgr-8dc889\input\LLVMBitcode.bc"
        ReturnStatus: AMD_COMGR_STATUS_SUCCESS

amd_comgr_do_action:
          ActionKind: AMD_COMGR_ACTION_LINK_RELOCATABLE_TO_EXECUTABLE
             IsaName: amdgcn-amd-amdhsa--gfx1030
             Options:
                Path:
            Language: AMD_COMGR_LANGUAGE_NONE
    Compilation Args:  "-target" "amdgcn-amd-amdhsa" "-mcpu=gfx1030" "C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-093102\output\a.so"
     Driver Job Args: lld "C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o" "-shared" "-o" "C:\Users\mayan\AppData\Local\Temp\comgr-093102\output\a.so"
lld: error: undefined hidden symbol: __ockl_get_group_id
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_memset)
>>> referenced 7 more times

lld: error: undefined hidden symbol: __ockl_get_local_size
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_memset)
>>> referenced 7 more times

lld: error: undefined hidden symbol: __ockl_get_local_id
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_decompress)
>>> referenced by C:\Users\mayan\AppData\Local\Temp\comgr-093102\input\LLVMBitcode.bc.o:(gpu_memset)
>>> referenced 7 more times
        ReturnStatus: AMD_COMGR_STATUS_ERROR

hiprtcCompileProgram(): HIPRTC_ERROR_COMPILATION


* Device #1: Kernel ./OpenCL/shared.cl build failed.

* Device #1: Kernel ./OpenCL/shared.cl build failed.

Started: Fri Feb 09 04:24:22 2024
Stopped: Fri Feb 09 04:24:24 2024

It seems there is an issue with the app. It compiles HIP kernel with option "-nocudalib". This is an alias to "-nogpulib" for comgr/clang. This causes comgr/clang not linking device libraries. Removing this option should fix the issue.

I'll give it a go post and update.

yup! that fixed it. Thanks @yxsamliu !