Rawhide builds time out
junghans opened this issue
During espressomd detection:
f37b48de85b5:pid43.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid43: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[43]: (nic/PSM)[43]: PSM3 can't open nic unit: 0 (err=23)
f37b48de85b5:pid43.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid43: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[43]: (nic/PSM)[43]: PSM3 can't open nic unit: 0 (err=23)
f37b48de85b5:pid43.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid43: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[43]: (nic/PSM)[43]: PSM3 can't open nic unit: 0 (err=23)
f37b48de85b5:pid43.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid43: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[43]: (nic/PSM)[43]: PSM3 can't open nic unit: 0 (err=23)
--------------------------------------------------------------------------
Open MPI failed an OFI Libfabric library call (fi_endpoint). This is highly
unusual; your job may behave unpredictably (and/or abort) after this.
Local host: f37b48de85b5
Location: mtl_ofi_component.c:512
Error: Invalid argument (22)
--------------------------------------------------------------------------
[votca@f37b48de85b5 ~]$ python3 -c "import espressomd"
f37b48de85b5:pid50.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid50: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[50]: (nic/PSM)[50]: PSM3 can't open nic unit: 0 (err=23)
f37b48de85b5:pid50.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid50: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[50]: (nic/PSM)[50]: PSM3 can't open nic unit: 0 (err=23)
f37b48de85b5:pid50.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid50: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[50]: (nic/PSM)[50]: PSM3 can't open nic unit: 0 (err=23)
f37b48de85b5:pid50.python3: Failed to get eth0 (unit 0) cpu set
f37b48de85b5:pid50: PSM3 can't open nic unit: 0 (err=23)
PMIx Log Report:[50]: (nic/PSM)[50]: PSM3 can't open nic unit: 0 (err=23)
[f37b48de85b5:00051] [[3492,0],0] ORTE_ERROR_LOG: Out of resource in file util/show_help.c at line 501
/cc @jngrad
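The log above shows the failure coming from Open MPI's OFI MTL (mtl_ofi_component.c) after PSM3 cannot open the NIC. One way to check whether that code path is the culprit is to exclude the ofi MTL through Open MPI's MCA environment variable and retry the import. This is only a diagnostic sketch: the helper name run_without_ofi is hypothetical, and it assumes the Open MPI build in the container honors OMPI_MCA_* variables. It is not a confirmed fix for this issue.

```shell
#!/bin/sh
# Hypothetical helper: run a command with Open MPI's ofi MTL excluded.
# "^ofi" is Open MPI's exclusion syntax for MCA component lists.
run_without_ofi() {
    env OMPI_MCA_mtl='^ofi' "$@"
}

# Example (assumes espressomd is installed in the container):
# run_without_ofi python3 -c "import espressomd"
```

If the PSM3 messages and the fi_endpoint error disappear with this setting, the problem is likely in the libfabric provider selection inside the container rather than in espressomd itself.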
-- Found Python: /usr/bin/python3.11 (found version "3.11.0") found components: Interpreter
3b537707efea:pid763: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid763: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid763: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid763.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid763.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid763.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid763: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid763.python3.11: Failed to get eth0 (unit 0) cpu set
--------------------------------------------------------------------------
Open MPI failed an OFI Libfabric library call (fi_endpoint). This is highly
unusual; your job may behave unpredictably (and/or abort) after this.
Local host: 3b537707efea
Location: mtl_ofi_component.c:512
Error: Invalid argument (22)
--------------------------------------------------------------------------
3b537707efea:pid770: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid770.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid770.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid770: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid770.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid770: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid770: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid770.python3.11: Failed to get eth0 (unit 0) cpu set
--------------------------------------------------------------------------
Open MPI failed an OFI Libfabric library call (fi_endpoint). This is highly
unusual; your job may behave unpredictably (and/or abort) after this.
Local host: 3b537707efea
Location: mtl_ofi_component.c:512
Error: Invalid argument (22)
--------------------------------------------------------------------------
-- Found ESPRESSO: TRUE
3b537707efea:pid778: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid778.python3.11: Failed to get eth0 (unit 0) cpu set
3b537707efea:pid778: PSM3 can't open nic unit: 0 (err=23)
3b537707efea:pid778.python3.11: Failed to get eth0 (unit 0) cpu set
I am a bit confused: it says Found ESPRESSO: TRUE and then fails afterwards.
Ok, actually the problem is import espressopp.
/CC @XzzX, can you have a look at that?
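To tell apart an import that fails from one that hangs until the build times out, each module can be probed separately under a time limit. The sketch below is an assumption-laden helper, not part of the VOTCA build: probe_import and its 60-second default are hypothetical, and it relies on timeout from GNU coreutils, which exits with status 124 when the limit is hit.

```shell
#!/bin/sh
# Hypothetical helper: try to import a Python module, giving up after a limit.
probe_import() {
    module="$1"
    limit="${2:-60}"   # seconds; arbitrary default chosen for illustration
    timeout "$limit" python3 -c "import $module" >/dev/null 2>&1
    status=$?
    case "$status" in
        0)   echo "$module: ok" ;;
        124) echo "$module: timed out after ${limit}s" ;;
        *)   echo "$module: failed (exit $status)" ;;
    esac
    return "$status"
}

# Probe both modules mentioned in this thread; "|| true" keeps going on failure.
probe_import espressomd || true
probe_import espressopp || true
```

A "failed" result points at a missing or broken module, while "timed out" points at an import that blocks, for example during MPI initialization.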
$ docker run -it ghcr.io/votca/buildenv/fedora:rawhide /bin/bash
<in docker>
$ python3 -c "import espressopp"
ModuleNotFoundError: No module named 'espressopp'
Have a look at votca/buildenv#186
python3 -c "import espressopp"
ModuleNotFoundError: No module named 'espressopp'
I disabled the espressopp build, but you can do:
$ docker run -it ghcr.io/votca/buildenv/fedora:rawhide /bin/bash
<in docker>
$ git clone https://github.com/espressopp/espressopp && \
cmake -S espressopp -B espressopp/build -DCMAKE_INSTALL_PREFIX=/usr && \
cmake --build espressopp/build && \
sudo cmake --install espressopp/build && \
sudo pip3 install pyh5md
$ python3 -c "import espressopp"