- 17 Dec, 2015 2 commits
-
-
Morris Jette authored
When extra pools were reported by Cray API, it was not multiplying the buffer count by granularity to get the total size, but was just getting the counts.
-
Danny Auble authored
-
- 16 Dec, 2015 8 commits
-
-
Alejandro Sanchez authored
job's TimeLimit if the job did't request --time
-
David Matthews authored
-
Brian Christiansen authored
Bug 2130
-
Morris Jette authored
Move slurmctld mail handler to separate thread for improved performance. Original logic did fork/exec without separate thread and if the slurmctld memory size is huge, then the time required for fork() to complete can be significant. bug 2252
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Avoid hard-link/copy of script/environment files for job arrays. Use the master job record file for all tasks of the job array. back port of commit 41eead64 bug 2220
-
Morris Jette authored
Refactor logic to synchronize active bach jobs and their script/environment files, reducing overhead dramatically for large numbers of active jobs. backport of commit 41eead64 bug 2220
-
- 15 Dec, 2015 9 commits
-
-
Morris Jette authored
-
Morris Jette authored
If a job can never be started due to unsatisfied job dependencies, report the full original job dependency specification rather than the dependencies remaining to be satisfied (typically NULL). bug 2197
-
Danny Auble authored
in use.
-
Danny Auble authored
Conflicts: src/slurmctld/proc_req.c
-
Danny Auble authored
_slurm_rpc_complete_job_allocation. This is a rewrite of 438365ec which didn't catch the job_ptr wasn't in a lock so the memory issue could still of existed. This hopefully fixes all the spots the job_ptr wasn't in the lock. Fixes bug 2146
-
Danny Auble authored
This reverts commit 438365ec.
-
Brian Christiansen authored
Bug 2261
-
Danny Auble authored
15.08.5. Bug 2243
-
David Bigagli authored
bug 2171
-
- 14 Dec, 2015 3 commits
-
-
Morris Jette authored
Decrease parallelism in job cancel request to prevent denial of service when cancelling huge numbers of jobs. bug 2256
-
Morris Jette authored
Prevent triggering gang scheduling within a partition if configured with PreemptType=partition_prio and PreemptMode=suspend,gang. The essence of this fix is to change a "<=" to "<" in cons_res/job_test.c: - if ((p_ptr->part_ptr->priority <= jp_ptr->part_ptr->priority) && + if ((p_ptr->part_ptr->priority < jp_ptr->part_ptr->priority) && but logic was also added to insure that a partition configuration with PreemptMode did not override PreemptType != partition_prio. bug 2232
-
David Bigagli authored
-
- 11 Dec, 2015 8 commits
-
-
Tim Wickberg authored
No changes to logic
-
Morris Jette authored
-
Tim Wickberg authored
Previously an error() would be logged when the attempt to open the job script using the new directory format failed but the successive fallback to the old directory structure was successful, leading to confusion when troubleshooting. Move emitted warnings to debug(), and only error() after failing to open in both directory structures. Add a note about backwards compatibility to both functions - we cannot remove these fallbacks as directory structure for pending jobs does not change on Slurm version update, and people may need to chain multiple version update together to get to a current slurm version which would correctly update slurmctld state files but leave pending jobs in the old directory structure. Bug #2244.
-
Morris Jette authored
If a job is requeued while in the process of being launch, remove it's job ID from slurmd's record of active jobs in order to avoid generating a duplicate job ID error when launched for the second time (which would drain the node). bug 2240
-
Morris Jette authored
In slurmctld log file, log duplicate job ID found by slurmd. Previously was being logged as prolog/epilog failure. bug 2240
-
David Bigagli authored
-
Morris Jette authored
-
Morris Jette authored
-
- 10 Dec, 2015 7 commits
-
-
Danny Auble authored
The problem comes from situations where a node range is given, leaving the task count at NO_VAL is the desired behavior in this case.
-
Danny Auble authored
so we can set it correctly before tasks are set.
-
Tim Wickberg authored
-
Danny Auble authored
-
Tim Wickberg authored
messages like FAILURE: 1 scontrol show assoc_mgr did not show correct information (6 != 5). to happen erroneously.
-
David Bigagli authored
-
Tim Wickberg authored
The error messages associated with these are: /home/tim/slurm/src/common/callerid.c:337:21: warning: address of array 'entryp->d_name' will always evaluate to 'true' [-Wpointer-bool-conversion] else if (!entryp->d_name) ~~~~~~~~~^~~~~~ 1 warning generated. /home/tim/slurm/src/common/gres.c:831:19: warning: comparison of unsigned expression < 0 is always false [-Wtautological-compare] if ((tmp_uint64 < 0) || (tmp_uint64 >= NO_VAL)) { ~~~~~~~~~~ ^ ~ 1 warning generated. /home/tim/slurm/src/slurmctld/node_mgr.c:1684:26: warning: if statement has empty body [-Wempty-body] if (first_new == NULL); ^ /home/tim/slurm/src/slurmctld/node_mgr.c:1684:26: note: put the semicolon on a separate line to silence this warning /home/tim/slurm/src/slurmctld/node_mgr.c:1756:26: warning: if statement has empty body [-Wempty-body] if (first_new == NULL); ^ /home/tim/slurm/src/slurmctld/node_mgr.c:1756:26: note: put the semicolon on a separate line to silence this warning /home/tim/slurm/src/slurmctld/node_mgr.c:1830:26: warning: if statement has empty body [-Wempty-body] if (first_new == NULL); ^ /home/tim/slurm/src/slurmctld/node_mgr.c:1830:26: note: put the semicolon on a separate line to silence this warning /home/tim/slurm/src/plugins/switch/generic/switch_generic.c:837:27: warning: address of array 'if_rec->ifa_addr->sa_data' will always evaluate to 'true' [-Wpointer-bool-conversion] if (!if_rec->ifa_addr->sa_data) ~~~~~~~~~~~~~~~~~~~^~~~~~~
-
- 09 Dec, 2015 3 commits
-
-
Morris Jette authored
news
-
Danny Auble authored
-
Danny Auble authored
-