Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with UCC Initialization: "mca_coll_ucc_save_coll_handlers" #183

Open
j34ni opened this issue Oct 18, 2024 · 1 comment
Open

Issue with UCC Initialization: "mca_coll_ucc_save_coll_handlers" #183

j34ni opened this issue Oct 18, 2024 · 1 comment

Comments

@j34ni
Copy link
Contributor

j34ni commented Oct 18, 2024

Description:

Users may encounter this kind of error related to UCC initialization in OpenMPI:

Error: coll_ucc_module.c:474 - mca_coll_ucc_module_enable() mca_coll_ucc_save_coll_handlers failed

This was documented in the OpenMPI repository (see: open-mpi/ompi#9885).

Apparently the problem arises because the default priority for UCC is 10, which conflicts with the priorities of basic and libnbc.

Workaround:
Users can bypass this issue by setting a higher priority for UCC:

export OMPI_MCA_coll_ucc_priority=100

Recommendation:
Should we not inform users about this issue (and workaround), and possibly look into adjusting the default configurations to avoid such conflict?

@dalcinl
Copy link
Contributor

dalcinl commented Oct 18, 2024

If there is an upstream patch accepted, once that patch is merged, then we can add that patch here and produce a new build.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants