- 16 Jul, 2016 1 commit
-
-
Morris Jette authored
Start power save thread only after the partition information is read in order to avoid trying to interpret the SuspendExcParts configuration information before the partition information is available, which would result in a slurmctld abort.
-
- 15 Jul, 2016 2 commits
-
-
Jacek Budzowski authored
bug 2900
-
Danny Auble authored
Before it was showing it as TBD since pending steps and the extern step have the same stepid.
-
- 14 Jul, 2016 2 commits
-
-
Morris Jette authored
Fix gang scheduling and license release logic if single node job killed on bad node. Notifying gang and releasing licences is normally done when the epilog completion happens, but if the node(s) assigned to a job are all down, that does not happen. This results in the licenses being reserved indefinitely and the gang scheduler being left with a bad (old) job pointer that can result in various failure modes bug 2867
-
Danny Auble authored
anyway to attempt to log the backtraces of the potential unkillable processes.
-
- 13 Jul, 2016 1 commit
-
-
Danny Auble authored
processes.
-
- 12 Jul, 2016 6 commits
-
-
Nicolas Joly authored
Bug 2892.
-
Danny Auble authored
Bug 2874 We will most likely redo this logic (as it appears to be duplicated) in a following patch.
-
Morris Jette authored
Don't generate an error when a batch job is submitted that must wait for stage-in before starting.
-
Danny Auble authored
-
Danny Auble authored
Bug 2886
-
Jacek Budzowski authored
Was incorrectly translating request to job.extern if part of a comma-separate list. Bug 2890.
-
- 11 Jul, 2016 1 commit
-
-
Danny Auble authored
(regression in 16.05.2). related commit 5d3e5e1e Bug 2612 and 2886
-
- 08 Jul, 2016 7 commits
-
-
Morris Jette authored
Document limitations in burst buffer use by the salloc command (possible access problems from a login node). bug 2883
-
Janne Blomqvist authored
task/cgroup plugin is configured with ConstrainRAMSpace=yes, then set soft memory limit to allocated memory limit (previously no soft limit was set). bug 2679
-
Morris Jette authored
-
Danny Auble authored
of 0. This might be the cause of run away jobs. I couldn't see how an end_time could be 0, but if it was it would just exit and never set time_end to anything. At least if it happens now we can have an idea that it is possible and we will have an idea this is the place it happens.
-
Danny Auble authored
This will keep from referencing the task array that might not be set up correctly in src/common/plugstack.c _spank_handle_init().
-
Morris Jette authored
-
Morris Jette authored
-
- 07 Jul, 2016 6 commits
-
-
Morris Jette authored
-
Morris Jette authored
Prevent possible incorrect counting of GRES of a given type if a node has the multiple "types" of a given GRES "name", which could over-subscribe GRES of a given type. bug 2836
-
Danny Auble authored
cleaning up on a restart.
-
Danny Auble authored
sleep is killed.
-
Danny Auble authored
-
Morris Jette authored
-
- 06 Jul, 2016 5 commits
-
-
Danny Auble authored
for steps.
-
Danny Auble authored
for a step.
-
Danny Auble authored
hostlists when brackets == 0.
-
Morris Jette authored
-
Morris Jette authored
Fix for invalid array pointer when creating advanced reservation when job allocations span heterogeneous nodes (differing core or socket counts). bug 2876
-
- 05 Jul, 2016 2 commits
-
-
Morris Jette authored
Prevent backfill scheduler from starting a second "singleton" job if another one started during a backfill sleep. related to bug 2808
-
Danny Auble authored
-
- 04 Jul, 2016 1 commit
-
-
Morris Jette authored
Previous logic required listing each task of job array individually. For example --dependency="afterany:123_4:123_5" can not be expressed as --dependency="afterany:123_[4-5]". bug 2644
-
- 02 Jul, 2016 3 commits
-
-
Danny Auble authored
we have the pids added to the system correctly. Most likely related to bug 2874
-
Danny Auble authored
-
Danny Auble authored
-
- 01 Jul, 2016 2 commits
-
-
Danny Auble authored
back to the slurmctld.
-
Danny Auble authored
-
- 30 Jun, 2016 1 commit
-
-
Brian Christiansen authored
prctld(PR_GET_NAME) >= 2.6.11
-