You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
A GNU debug version of a ufs-weather-model regression test in development for GEFS fails in initialization with error of
[../../../../../opal/mca/btl/tcp/btl_tcp_endpoint.c:730:mca_btl_tcp_endpoint_start_connect] bind on local address (removed) failed: Address already in use (98).
This seems similar to an existing open-mpi issue (open-mpi/ompi#7246) and something to do with use of all available ports.
It would be nice to confirm that is indeed the issue and resolve if possible (maybe change # of tasks or ports?)
To Reproduce
Try to run gnu cpld_debug_gefs regression test on Hera:
git clone https://github.com/NickSzapiro-NOAA/ufs-weather-model/tree/RT_bmark_gefs
cd ufs-weather-model
git checkout RT_bmark_gefs
git submodule update --init --recursive
cd tests
./rt.sh -a {ACCT} -n "cpld_debug_gefs gnu"
Expected behavior
Regression test should run to completion
System:
Hera
Additional context
As this seems like an issue involving open-mpi, NOAA RDHPCS help desk suggested making an issue here
The text was updated successfully, but these errors were encountered:
After changing tasks/memory, I get this error now instead:
7 682: The OSC pt2pt component does not support MPI_THREAD_MULTIPLE in this release.
682: Workarounds are to run on a single node, or to use a system with an RDMA 682: capable network such as Infiniband.
6: [h2c02:3430684] *** An error occurred in MPI_Win_create
6: [h2c02:3430684] *** reported by process [3376939008,6] 6: [h2c02:3430684] *** on communicator MPI COMMUNICATOR 74 DUP FROM 73
6: [h2c02:3430684] *** MPI_ERR_WIN: invalid window
6: [h2c02:3430684] *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
Describe the bug
A GNU debug version of a ufs-weather-model regression test in development for GEFS fails in initialization with error of
This seems similar to an existing open-mpi issue (open-mpi/ompi#7246) and something to do with use of all available ports.
It would be nice to confirm that is indeed the issue and resolve if possible (maybe change # of tasks or ports?)
To Reproduce
Try to run gnu cpld_debug_gefs regression test on Hera:
Expected behavior
Regression test should run to completion
System:
Hera
Additional context
As this seems like an issue involving open-mpi, NOAA RDHPCS help desk suggested making an issue here
The text was updated successfully, but these errors were encountered: