- 11 Aug, 2016 4 commits
-
-
Tim Wickberg authored
Bug 2983.
-
Tim Wickberg authored
Continuation of 18947480. Commit cff2effc made cpu_freq_init unconditionally set up the structures, so this reinitialization was discarding the true original info, which prevents the frequency and governor from being restored properly after the step completes.
-
Morris Jette authored
Don't abort step launch if job reaches expected end time while node is configuring/booting (NOTE: The job end time will be adjusted after node becomes ready for use). bug 2985
-
Morris Jette authored
The sleep starts at 0.1 second and doubles each time to a maximum value. Previous maximum was 2 seconds, increased to 5 seconds to account for very large KNL boot times (tens of minutes) and to limit overhead on slurmctld daemon.
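
The doubling-with-cap behavior described above can be sketched as follows. This is a minimal illustration, not the code from the commit: the constant and function names are hypothetical, and only the 0.1 second start and the 5 second cap come from the commit message.

```c
/* Hypothetical constants: the 0.1 s start and the 5 s cap come from the
 * commit message; the names are illustrative, not taken from Slurm. */
#define DELAY_START_USEC 100000L   /* 0.1 second */
#define DELAY_MAX_USEC   5000000L  /* 5 seconds (previously 2 seconds) */

/* Return the delay that follows current_usec: double it, up to the cap. */
long next_delay_usec(long current_usec)
{
	long next = current_usec * 2;
	return (next > DELAY_MAX_USEC) ? DELAY_MAX_USEC : next;
}

/* Sum of the delays slept over the first `polls` polling attempts. */
long total_wait_usec(int polls)
{
	long delay = DELAY_START_USEC;
	long total = 0;

	for (int i = 0; i < polls; i++) {
		total += delay;
		delay = next_delay_usec(delay);
	}
	return total;
}
```

With these values the delays run 0.1, 0.2, 0.4, 0.8, 1.6 and 3.2 seconds, then stay at 5 seconds for every later poll, so a node that takes tens of minutes to boot is polled at most once every 5 seconds.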
-
- 10 Aug, 2016 7 commits
-
-
Morris Jette authored
If the KNL node modes reported by cnselect and capmc differ, log that using "info" rather than "debug".
-
Danny Auble authored
frequency other than AcctGatherNodeFreq.
-
Danny Auble authored
Filesystem instead of Lustre.
-
Iakovos Panourgias authored
Network dataset.
-
Morris Jette authored
Locking slurmstepd in memory can result in exhausting real memory in some cases, resulting in failure of the slurmstepd process. This reverts commit 03cf4a5d, but the logic will be returned using a configuration parameter in Slurm version 17.02. bug 2334
-
Morris Jette authored
This should improve performance and prevent failure if a local group ID lookup fails. bug 2928
-
Alejandro Sanchez authored
Minor correction to commit 30baec8d
-
- 09 Aug, 2016 12 commits
-
-
Morris Jette authored
Prevent slurmd abort if hwloc library fails to populate the "children" arrays (observed with hwloc version "dev-333-g85ea6e4").
-
Morris Jette authored
-
Morris Jette authored
-
Tim Wickberg authored
Bug 2955.
-
Morris Jette authored
Document the partition limits considered in the EnforcePartLimits logic. bug 2920
-
Morris Jette authored
Correction to commit b45a55bc. bug 2920
-
Morris Jette authored
Make EnforcePartLimits support logic work with any ordering of partitions in job submit request. Developed jointly with Alejandro Sanchez <alex@schedmd.com> bug 2920
-
Dominik Bartkiewicz authored
The calculation used the node count in place of the CPU count, which results in incorrect estimates. CID 44784.
-
Dominik Bartkiewicz authored
CID 44787.
-
Tim Wickberg authored
Bug 2950. Also identified as CID 56684 (copy+paste error).
-
Morris Jette authored
-
Dominik Bartkiewicz authored
CID 45023 and 45024.
-
- 08 Aug, 2016 3 commits
-
-
Morris Jette authored
Fix task:CPU binding logic for some processors. This bug was introduced in version 16.05.1 to address a KNL binding problem. bug 2972
-
Dominik Bartkiewicz authored
Needed due to part_filter_set() calls; without write lock this can race returning inconsistent results to 'sinfo'. Bug 2958.
-
Morris Jette authored
Regression test fixes if SelectTypePlugin not managing memory and no node memory size set (defaults to 1 MB per node).
-
- 07 Aug, 2016 1 commit
-
-
Morris Jette authored
Fix race condition in the account_gather plugin that could result in job stuck in COMPLETING state. bug 2973
-
- 04 Aug, 2016 1 commit
-
-
Morris Jette authored
These problems have been around for a while, but Coverity just started to report them for some reason.
-
- 03 Aug, 2016 3 commits
-
-
Morris Jette authored
Prior logic used to create an advanced reservation based upon a core count would ignore the specialized cores. Then when a job tried to use the reservation, it would consider the specialized cores and not be able to use the core count used in the reservation creation. This change considers specialized cores when creating the reservation.
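
The mismatch described above comes down to simple per-node arithmetic, sketched below. The structure and function names are hypothetical and do not come from the Slurm source. The sketch only assumes that specialized cores are subtracted from each node's total before cores are counted toward a reservation.

```c
/* Hypothetical per-node description; not a Slurm data structure. */
struct node_cores {
	int total_cores;  /* all cores on the node */
	int spec_cores;   /* cores set aside as specialized cores */
};

/* Cores on one node that may be counted toward a reservation. */
int reservable_cores(const struct node_cores *n)
{
	int usable = n->total_cores - n->spec_cores;
	return (usable > 0) ? usable : 0;
}

/* Number of leading nodes needed to cover `want` cores,
 * or -1 if the given nodes cannot supply that many. */
int nodes_needed(const struct node_cores *nodes, int node_cnt, int want)
{
	int have = 0;

	for (int i = 0; i < node_cnt; i++) {
		have += reservable_cores(&nodes[i]);
		if (have >= want)
			return i + 1;
	}
	return -1;
}
```

For example, with two 16-core nodes that each set aside 2 specialized cores, a 16-core reservation needs both nodes. Counting all 16 cores of the first node, as the prior logic effectively did, would make one node look sufficient and leave a job unable to use the reserved core count.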
-
Morris Jette authored
A line was removed that should not have been.
-
Morris Jette authored
Move the function that identifies specialized cores from static to extern for future use by advanced reservation logic.
-
- 02 Aug, 2016 2 commits
-
-
Morris Jette authored
Problem introduced in commit 03cf4a5d
-
Sergey Meirovich authored
If slurmstepd had been swapped out before upgrade happened it could easily lead to SIGBUS at any time after upgrade. Prevent that by mlocking it. bug 2334
-
- 29 Jul, 2016 6 commits
-
-
Moe Jette authored
SLURM_JOB_RESERVATION environment variables are set for the salloc command. Document the same environment variables for the salloc, sbatch and srun commands in their man pages.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
Also we are making extern lower case in the api to match this and sacct.
-
Danny Auble authored
-
Danny Auble authored
that had a partition in them.
-
- 28 Jul, 2016 1 commit
-
-
Danny Auble authored
-