S4 intel environment is not working for MPI related ctests #1356
Comments
Please report this error to the SSEC helpdesk - we use their intel module, we don't compile the intel compiler or MPI library ourselves. However, a 3-minute internet search revealed this: openucx/ucx#4742
Might be a good idea to let Jesse know - maybe he can set those vars or something equivalent in the Intel module so that we don't have to set them.
I emailed Jesse.
@InnocentSouopgui-NOAA, thanks for taking care of this
Jesse recently modified the setting of FI_PROVIDER in the intel/2023.2 lua module script (
With further testing I have discovered that the
Note: I still see 9 test failures, but in this run of ctest there were no MPI crashes:
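For anyone re-running the tests after the module change, it may help to record which libfabric provider setting the job actually sees. The snippet below is only a sketch and not part of the original report: the function name `report_fi_provider` is hypothetical, and the only thing taken from the thread is the `FI_PROVIDER` variable name.

```shell
# Hypothetical helper (not from the original report): print the libfabric
# provider setting visible to the current shell, so the job log records it.
report_fi_provider() {
  # FI_PROVIDER is the variable named in the comment above; it may be unset.
  echo "FI_PROVIDER=${FI_PROVIDER:-<unset>}"
}

report_fi_provider
```

Calling this in the SLURM script after loading the modules and before running ctest would show whether the module sets the variable on the compute nodes.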
Describe the bug
The spack-stack-1.8.0 intel compiler unified-env appears to be working for building jedi-bundle, but I'm having trouble getting the MPI-related ctests to work. When I run the SLURM script below, I get many test failures with messages like this:
It's quite possible that I've got something configured incorrectly, for example by using srun instead of mpiexec for the ctest system. This choice was based on trying a simple test on the compute nodes where I ran the following SLURM script:
Which gave the following result:
To Reproduce
Steps to reproduce the behavior:
Set up the intel environment by sourcing the following settings:
Run ecbuild for jedi-bundle, followed by make:
Run the ctests on the compute nodes using the following SLURM script:
Expected behavior
All tests including the MPI related ctests complete successfully (pass).
System:
What system(s) are you running the code on?
S4, Intel
Additional context