-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support partially allocated jobs across scheduler reload #6445
base: master
Are you sure you want to change the base?
Commits on Nov 19, 2024
-
job-manager: support partial-ok in hello request
Problem: RFC 27 allows the scheduler to send a partial-ok flag in the hello request, and then receive partially allocated jobs in hello responses. If the hello request includes this flag, pass it on to housekeeping. For each partially released housekeeping job, include the 'free' idset in the response per RFC 27.
Configuration menu - View commit details
-
Copy full SHA for ad86f6a - Browse repository at this point
Copy the full SHA ad86f6aView commit details -
libschedutil: add SCHEDUTIL_HELLO_PARTIAL_OK flag
Problem: libschedutil provides no way for the scheduler to indicate that the partial-ok flag should be set in the hello request. Add the SCHEDUTIL_HELLO_PARTIAL_OK flag which is passed to schedutil_create().
Configuration menu - View commit details
-
Copy full SHA for c7fc95d - Browse repository at this point
Copy the full SHA c7fc95dView commit details
Commits on Nov 20, 2024
-
libschedutil: support hello 'free' key
Problem: when processing hello responses, all schedulers now need to process R - free for partial releases. As a convenience, change the libschedutil hello callback to subtract the free idset from the R it fetched from the KVS. Note that the scheduling key, if present, remains the full object which is opaque to flux-core.
Configuration menu - View commit details
-
Copy full SHA for 8d1a441 - Browse repository at this point
Copy the full SHA 8d1a441View commit details -
sched-simple: support partial hello responses
Problem: sched-simple does not support partial hello responses. Set the SCHEDUTIL_HELLO_PARTIAL_OK flag. Add a 'test-hello-nopartial' module option to get the old behavior. Set test-hello-nopartial in the current test of partial housekeeping release.
Configuration menu - View commit details
-
Copy full SHA for 8e7e06b - Browse repository at this point
Copy the full SHA 8e7e06bView commit details -
testsuite: cover hello with partial allocation
Problem: there is no coverage of reloading the scheduler with partially released jobs in housekeeping. Add a test.
Configuration menu - View commit details
-
Copy full SHA for 409670a - Browse repository at this point
Copy the full SHA 409670aView commit details -
sched-simple: improve error log message
Problem: when the hello protocol cannot process a job, it logs the name of the wrong rlist function. Make the log message a little more high level.
Configuration menu - View commit details
-
Copy full SHA for 24436e3 - Browse repository at this point
Copy the full SHA 24436e3View commit details