sync_slurm_acct issue with too many cancel jobs #127

Open
stdweird opened this issue Mar 2, 2022 · 2 comments · May be fixed by #137

@stdweird
Member

stdweird commented Mar 2, 2022

  • the current limit at which a batch of jobs is not cancelled (and the sync stops) is not configurable
  • probably needs 2 limits: one on the number of unique users involved and a second on the total number of cancel commands. also, the log message needs to be fixed (it reports the number of cancel commands, not the number of users involved)
  • if possible, we should still add new users even if this limit is reached (so the sync still fails, and nobody is removed and no jobs are cancelled, but new accounts are not blocked from getting access)
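
A rough sketch of what the two limits could look like, in Python. Everything named here (`Cancel`, `check_cancel_limits`, `plan_sync`, and the default values 10 and 50) is hypothetical and not taken from the existing sync_slurm_acct code; it only illustrates counting users and commands separately and keeping the add commands when a limit is hit.

```python
from collections import namedtuple

# Hypothetical container: one pending cancel per (user, cluster) pair.
Cancel = namedtuple("Cancel", ["user", "cluster"])


def check_cancel_limits(cancels, max_users, max_commands):
    """Return (ok, message); the message reports users and commands separately."""
    n_users = len({c.user for c in cancels})
    n_commands = len(cancels)
    if n_users > max_users:
        return False, "%d users affected, limit is %d" % (n_users, max_users)
    if n_commands > max_commands:
        return False, "%d cancel commands, limit is %d" % (n_commands, max_commands)
    return True, "%d cancel commands for %d users" % (n_commands, n_users)


def plan_sync(adds, removes, cancels, max_users=10, max_commands=50):
    """Always keep the adds; drop removes and cancels when a limit is exceeded.

    The default limits are placeholders and would come from configuration.
    """
    ok, message = check_cancel_limits(cancels, max_users, max_commands)
    return {
        "adds": list(adds),
        "removes": list(removes) if ok else [],
        "cancels": list(cancels) if ok else [],
        "ok": ok,  # the caller can still report failure when this is False
        "message": message,
    }
```

With this shape the sync can still exit with an error when `ok` is false, while the `adds` list is run regardless.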
@stdweird
Member Author

stdweird commented Mar 2, 2022

it should also only generate scancel commands for users with running jobs; to check, e.g. sacct -L -s r -u vscxyz
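
One way this filter could be sketched in Python, assuming the sacct call above is extended with `-X --noheader --parsable2 --format=User` so that it prints one user name per line. The function names are hypothetical and not part of the existing code.

```python
import subprocess


def parse_running_users(sacct_output):
    """Collect user names from sacct output that has one user per line."""
    return {line.strip() for line in sacct_output.splitlines() if line.strip()}


def users_with_running_jobs(users):
    """Ask sacct which of the given users have running jobs on any cluster."""
    users = sorted(set(users))
    if not users:
        return set()
    cmd = [
        "sacct", "-L", "-s", "r", "-X",
        "--noheader", "--parsable2", "--format=User",
        "-u", ",".join(users),
    ]
    out = subprocess.run(cmd, check=True, capture_output=True, text=True).stdout
    return parse_running_users(out) & set(users)


def filter_cancels(cancel_users, running_users):
    """Keep only the users for whom a scancel would actually do something."""
    return [user for user in cancel_users if user in running_users]
```

Passing all candidate users to a single sacct call keeps the number of accounting queries at one per sync run, which is a design choice of this sketch rather than a requirement.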

@stdweird
Member Author

stdweird commented Mar 2, 2022

also, when users are being moved to a different VO, perhaps first remove them from the old VO, and then add them to the new one (now, it will add you to the new VO and then try the delete, which might fail)
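
The ordering could be sketched like this in Python, assuming the pending changes are available as `(action, user, vo)` tuples. That representation, the function name, and the user and VO names in the example are all made up for illustration.

```python
# Hypothetical priority: all removals run before any add.
ACTION_ORDER = {"remove": 0, "add": 1}


def order_for_moves(commands):
    """Sort (action, user, vo) tuples so removals come first.

    sorted() is stable, so commands with the same action keep their
    original relative order.
    """
    return sorted(commands, key=lambda command: ACTION_ORDER[command[0]])


commands = [
    ("add", "vsc40001", "gvo00002"),     # moved user: new VO
    ("remove", "vsc40001", "gvo00001"),  # moved user: old VO
    ("add", "vsc40002", "gvo00002"),     # brand new user
]
ordered = order_for_moves(commands)
```

Here the removal of the moved user from the old VO ends up ahead of both adds, and the two adds stay in their original order.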

@wdpypere wdpypere linked a pull request Jun 30, 2022 that will close this issue