- 21 Apr, 2017 2 commits
-
-
Morris Jette authored
Fix to backfill scheduling with respect to QOS and association limits. Jobs submitted to multiple partitions are most likely to be affected. bugs 3680 and 3689
-
Danny Auble authored
-
- 20 Apr, 2017 1 commit
-
-
Danny Auble authored
are free.
-
- 16 Mar, 2017 3 commits
-
-
Danny Auble authored
Association. This reverts commits 92d2c645 and 37be42ec. This caused incorrect behavior; the original code was correct. This also corrects documentation additions in commit 4cfe6bde. This code caused the first clause never to be correct, and if you were over the limit you would get the third clause reporting a huge number available where it should be a negative number. In reality, the first clause should have been triggered and handled correctly.
-
Danny Auble authored
This reverts commit af52111c.
-
Josh Samuelson authored
Association. This reverts commits 92d2c645 and 37be42ec. This caused incorrect behavior; the original code was correct. This also corrects documentation additions in commit 4cfe6bde. This code caused the first clause never to be correct, and if you were over the limit you would get the third clause reporting a huge number available where it should be a negative number. In reality, the first clause should have been triggered and handled correctly.
-
- 10 Mar, 2017 2 commits
-
-
Danny Auble authored
-
Dominik Bartkiewicz authored
-
- 07 Mar, 2017 3 commits
-
-
Morris Jette authored
capmc_resume (Cray resume node script) - Do not disable changing a node's active features if SyscfgPath is configured in the knl.conf file. bug 3533
-
Morris Jette authored
If a job is cancelled by the user while its allocated nodes are being reconfigured (i.e. the capmc_resume program is rebooting nodes for the job) and the node reconfiguration fails (i.e. the reboot fails), then don't requeue the job but leave it in a cancelled state. Note the JOB_RECONFIG_FAIL state flag is currently only used by capmc_resume, but could be used for other programs responsible for node reboots. bug 3392
-
Morris Jette authored
bug 3538
-
- 03 Mar, 2017 1 commit
-
-
Danny Auble authored
-
- 02 Mar, 2017 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
bug 3516
-
Morris Jette authored
from 10 to 100. bug 3516
-
- 01 Mar, 2017 1 commit
-
-
Alejandro Sanchez authored
-
- 27 Feb, 2017 2 commits
-
-
Morris Jette authored
This will be triggered after either a burst buffer job_begin function or select plugin job_begin function fails. Without this change, the "squeue -i" and "scontrol show job" commands can report old job state information. bug 3504
-
Tim Wickberg authored
burst_buffer/cray - Prevent slurmctld daemon abort if "paths" operation fails. The job will now be held instead. bug 3504
-
- 23 Feb, 2017 5 commits
-
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Danny Auble authored
reason to 32 chars.
-
Morris Jette authored
-
Morris Jette authored
For job resize, correct logic to build "resize" script with new values. Previously the scripts were based upon the original job size. bug 3498
-
- 22 Feb, 2017 3 commits
-
-
Morris Jette authored
If a node boot is in progress when the slurmctld daemon is restarted, then allow sufficient time for the reboot to complete and do not prematurely set the node DOWN as "Not responding". bug 3494
-
Morris Jette authored
Could result in squeue abort. Coverity error CID 44969
-
Morris Jette authored
Reduces possibility of old data if job_id or user_id option specified with iterate option. Coverity error CID 44783
-
- 16 Feb, 2017 4 commits
-
-
Josh Samuelson authored
association GrpWall limit.
-
Danny Auble authored
limits.
-
Josh Samuelson authored
Bug 3476
-
Danny Auble authored
old ones. This is cosmetic only, no code change. Bug 3476
-
- 15 Feb, 2017 2 commits
-
-
Danny Auble authored
Bug 3472
-
Tim Wickberg authored
regcomp() is not safe to use across a fork in older glibc versions. Reinitialize the keyvalue_re structure after the fork through an atfork() handler. Bug 3276.
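The approach described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the actual Slurm change: only the keyvalue_re name comes from the message, while the pattern, compile_keyvalue_re(), child_reinit(), keyvalue_init() and keyvalue_matches_in_child() are hypothetical, and the handler is assumed to be registered with POSIX pthread_atfork().

```c
#include <pthread.h>
#include <regex.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/wait.h>
#include <unistd.h>

/* Name taken from the commit message; the pattern is a hypothetical stand-in. */
static regex_t keyvalue_re;
static const char *keyvalue_pattern = "^[[:alnum:]_]+=.*$";

/* Compile the pattern into keyvalue_re; abort if the pattern is invalid. */
static void compile_keyvalue_re(void)
{
	if (regcomp(&keyvalue_re, keyvalue_pattern,
		    REG_EXTENDED | REG_NOSUB) != 0)
		abort();
}

/*
 * Child-side fork handler. The copy of keyvalue_re inherited from the
 * parent is treated as untrustworthy, so it is recompiled from scratch.
 * The inherited buffers are deliberately not freed (small one-time leak).
 */
static void child_reinit(void)
{
	compile_keyvalue_re();
}

/* Compile once and register the handler; returns 0 on success. */
int keyvalue_init(void)
{
	compile_keyvalue_re();
	return pthread_atfork(NULL, NULL, child_reinit);
}

/*
 * Fork, run regexec() in the child against the freshly rebuilt regex,
 * and report the result: 1 on match, 0 on no match, -1 on error.
 */
int keyvalue_matches_in_child(const char *line)
{
	int status;
	pid_t pid = fork();

	if (pid < 0)
		return -1;
	if (pid == 0)
		_exit(regexec(&keyvalue_re, line, 0, NULL, 0) == 0 ? 0 : 1);
	if (waitpid(pid, &status, 0) < 0)
		return -1;
	return WIFEXITED(status) && WEXITSTATUS(status) == 0;
}
```

Calling keyvalue_init() once at startup is enough; every later fork() runs child_reinit() in the child before fork() returns there. Note that regcomp() is not async-signal-safe, so in a multithreaded parent this pattern trades one hazard for a smaller one rather than eliminating it.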
-
- 14 Feb, 2017 6 commits
-
-
Morris Jette authored
Honor --ntasks-per-node and --ntasks options when used with job constraints that contain node counts. bug 3458
-
Danny Auble authored
-
Danny Auble authored
This reverts commit 8ea967d5.
-
Danny Auble authored
-
Morris Jette authored
Defer interactive job allocation until ALL allocated nodes are ready rather than after PrologSlurmctld (if any) completes.
-
Dominik Bartkiewicz authored
Bug 3467.
-
- 13 Feb, 2017 2 commits
-
-
Morris Jette authored
burst_buffer/cray - Do not execute "pre_run" operation until after all nodes are booted and ready for use. bug 3461
-
Danny Auble authored
-