eth-cscs / alps-uenv

Recipes for software stacks on Alps vClusters.

Home Page: https://eth-cscs.github.io/alps-uenv/

nvhpc issue with intel-oneapi-tbb

toxa81 opened this issue

Desired functionality: intel-oneapi-mkl is installed in the Spack stack and is available to any programming environment.

First attempt: install intel-oneapi-mkl only in gcc-env

Result: the squashfs is built, but when I build QE with nvhpc, intel-oneapi-mkl is fetched again and installed for %nvhpc.

Second attempt: add intel-oneapi-mkl to nvhpc-env

    nvhpc-env:
      compiler:
          - toolchain: llvm
            spec: nvhpc@22.11
          - toolchain: gcc
            spec: gcc@11
      unify: true
      specs:
      - libxc%nvhpc
      - cuda@11.8%gcc
      - intel-oneapi-mkl+cluster%nvhpc

Result: the installation of the dependency intel-oneapi-tbb is broken. The failure is not reproducible in a standalone container with clean spack and nvhpc installations.

[+] /user-environment/linux-sles15-zen3/gcc-11.3.0/pkgconf-1.8.0-e3ts5emzovmwgyiq262u76slp62yevny
spack -e '/dev/shm/pe-dft/packages/gcc-env' install   --only-concrete --only=package --no-add /6fufv6rzstbwbvhe6hie4j3jvb5ai22d # intel-oneapi-tbb@2021.8.0%gcc@11.3.0+envmods build_system=generic arch=linux-sles15-zen3
==> Installing intel-oneapi-tbb-2021.8.0-6fufv6rzstbwbvhe6hie4j3jvb5ai22d
==> No binary for intel-oneapi-tbb-2021.8.0-6fufv6rzstbwbvhe6hie4j3jvb5ai22d found: installing from source
==> Fetching https://registrationcenter-download.intel.com/akdlm/irc_nas/19143/l_tbb_oneapi_p_2021.8.0.25334_offline.sh
==> No patches needed for intel-oneapi-tbb
==> intel-oneapi-tbb: Executing phase: 'install'
==> Error: ProcessError: Command exited with status 1:
    'bash' 'l_tbb_oneapi_p_2021.8.0.25334_offline.sh' '-s' '-a' '-s' '--action' 'install' '--eula' 'accept' '--install-dir' '/user-environment/linux-sles15-zen3/gcc-11.3.0/intel-oneapi-tbb-2021.8.0-6fufv6rzstbwbvhe6hie4j3jvb5ai22d'

1 error found in build log:
     5     Wait while the installer is preparing...
     6     Done.
     7     Launching the installer...
     8     Start installation flow...
     9     Installation of component has failed.
     10    Component id: intel.oneapi.lin.oneapi-common.vars, name: oneAPI Common, version: 2023.0.0-25325.
  >> 11    Error: Sequence execution failed.
     12
     13    An error has been encountered during the installation process. Detailed installation log files are located under '/tmp/unknown_user/intel_oneapi_installer/2023.02.03.10.16.28.
           127'.
     14    Please submit this error and the log files using one of the following support options:
     15     - Report your issue on the Intel Community Forum - https://community.intel.com/t5/Intel-oneAPI-Registration/bd-p/registration-download-licensing-instal
     16     - If you have Priority Support, submit a Service request at Online Service Center - https://supporttickets.intel.com/servicecenter?lang=en-US
     17    Installer completed with code 1

See build log for details:
  /tmp/antonk/spack-stage/spack-stage-intel-oneapi-tbb-2021.8.0-6fufv6rzstbwbvhe6hie4j3jvb5ai22d/spack-build-out.txt

==> Error: intel-oneapi-tbb-2021.8.0-6fufv6rzstbwbvhe6hie4j3jvb5ai22d: Package was not installed
==> Error: Installation request failed.  Refer to reported errors for failing package(s).
make[1]: *** [gcc-env/Makefile:23: gcc-env/generated/install/intel-oneapi-tbb-2021.8.0-6fufv6rzstbwbvhe6hie4j3jvb5ai22d] Error 1
make[1]: *** Waiting for unfinished jobs....
spack -e '/dev/shm/pe-dft/packages/nvhpc-env' install   --only-concrete --only=package --no-add /lmdnphjb3zjaouhaxy2zy6nhmtkmrjnm # intel-oneapi-tbb@2021.8.0%nvhpc@22.11+envmods build_system=generic arch=linux-sles15-zen3

Third attempt: add the spec intel-oneapi-mkl+cluster%gcc to nvhpc-env
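
For comparison with the second attempt, the environment for this attempt would look roughly like the sketch below. It assumes the %gcc spec takes the place of the %nvhpc one; the issue does not say whether the earlier line was removed or kept alongside it.

```yaml
# Sketch of nvhpc-env for the third attempt.
# Assumption: the %gcc spec replaces the %nvhpc spec from the second attempt.
nvhpc-env:
  compiler:
    - toolchain: llvm
      spec: nvhpc@22.11
    - toolchain: gcc
      spec: gcc@11
  unify: true
  specs:
    - libxc%nvhpc
    - cuda@11.8%gcc
    - intel-oneapi-mkl+cluster%gcc
```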

Result:

==> Error: Name clashes detected in module files:

file: /user-environment/modules/intel-oneapi-mkl/2023.0.0-gcc
spec: intel-oneapi-mkl@2023.0.0%gcc@11.3.0+cluster+envmods~ilp64+shared build_system=generic arch=linux-sles15-zen3
spec: intel-oneapi-mkl@2023.0.0%gcc@11.3.0+cluster+envmods~ilp64+shared build_system=generic arch=linux-sles15-zen3

The projection in modules.yaml has intel-oneapi-mkl: '{name}/{version}-{compiler.name}'. Removing the projection leads to the same error, which is strange, as it works for cuda.
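
As a point of reference, the projection would sit in modules.yaml roughly as sketched here. The surrounding keys (modules, default, tcl, projections) follow Spack's module configuration layout and are assumed, since only the intel-oneapi-mkl entry is quoted above. Both specs in the error message print identically, so this projection maps both to intel-oneapi-mkl/2023.0.0-gcc. The commented line is an untested idea for separating them by hash, not a confirmed fix.

```yaml
# Assumed placement; only the intel-oneapi-mkl projection is quoted in the issue.
modules:
  default:
    tcl:
      projections:
        intel-oneapi-mkl: '{name}/{version}-{compiler.name}'
        # Untested alternative: append a short hash so that specs which differ
        # only in their hash map to different module file names.
        # intel-oneapi-mkl: '{name}/{version}-{compiler.name}-{hash:7}'
```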

There is nothing to build there; it is just a binary package.