- 05 Dec, 2018 4 commits
-
-
Tim Wickberg authored
Bug 6155
-
Tim Wickberg authored
Bug 6155
-
Felip Moll authored
When bf_continue is set, and locks are released during a backfill cycle, other operations can make new resorces available while part way through the queue. When backfill continues the cycle and evaluates new jobs, it may allocate some of these newly available resources to lower priority jobs, rather than to higher priority jobs that were already considered in this backfill cycle. This patch introduces bf_ignore_newly_avail_nodes to SchedulerParameters to solve this issue. This option will ignore nodes made available when the backfill scheduler yields when resuming the backfill cycle. Bug 5279.
-
Danny Auble authored
slurmd yet delivering it's TRES list. Bug 6122 Co-authored-by: Marshall Garey <marshall@schedmd.com>
-
- 04 Dec, 2018 7 commits
-
-
Nate Rini authored
Bug 6008
-
Morris Jette authored
then an error is generated if more than one of those specifications contains KNL NUMA or MCDRAM modes. Bug 5846
-
Morris Jette authored
are down nodes. Bug 5846
-
Morris Jette authored
NODE_SET_REBOOT to continue. Bug 5846
-
Morris Jette authored
node change when possible. Bug 5846
-
Marshall Garey authored
Plugins reading in their own config files rely on the SLURM_CONF environment variable pointing to the appropriate directory, otherwise they will fall back to the build in sysconfdir path. Set the environment variable early enough so that the -f flag operates correctly, but not before conf->conffile has definitely been set. Remove the setenv call that happens before the first slurmstepd is fork()'d as it is now redundant. Bug 4774.
-
Alejandro Sanchez authored
sbatch sets these, but salloc did not. This should make srun behavior between the two consistent. Bug 3861.
-
- 03 Dec, 2018 2 commits
-
-
Marshall Garey authored
time that wasn't existent instead of just updating lines that have time with a lesser time.
-
Dominik Bartkiewicz authored
Slurm is going to replace internally. Bug 5800
-
- 29 Nov, 2018 2 commits
-
-
Dominik Bartkiewicz authored
Bug 6121
-
Nate Rini authored
Bug 6008
-
- 28 Nov, 2018 5 commits
-
-
Alejandro Sanchez authored
Bug 6077
-
Danny Auble authored
node system. Bug 6037
-
Marshall Garey authored
Bug 6037
-
Artem Y. Polyakov authored
Bug 5983
-
Artem Y. Polyakov authored
In case of the error code paths (like collective timeout) it is possible that a callback provided by PMIx will be called twice leading to a segmentation fault. This commit fixes it by properly accounting callback invocations. Bug 5983
-
- 27 Nov, 2018 7 commits
-
-
Danny Auble authored
Bug 5935
-
Boris Karasev authored
This could have caused core dumps if communication failed for one reason or another. Signed-off-by: Boris Karasev <karasev.b@gmail.com> Bug 5935
-
Morris Jette authored
This patch does 2 things: 1. When a step fails on some node, then mark it as complete on those nodes. This is needed so that when the step ends on the other nodes, slurmctld recognized the step as completely done. 2. If the step does not have the --no-kill option set, then when some node fails, send a request to terminate the step on ALL of its nodes. Bug 5805
-
Nate Rini authored
when env is overwritten by the command line. Bug 5977
-
Danny Auble authored
-
Danny Auble authored
Bug 6016
-
Broderick Gardner authored
Bug 6092
-
- 26 Nov, 2018 1 commit
-
-
Felip Moll authored
configuration. Bug 5872
-
- 23 Nov, 2018 2 commits
-
-
Lewis Lakerink authored
Bug 6015.
-
Tim Wickberg authored
Was attributed to the wrong author by mistake. This reverts commit 8b5acf90.
-
- 21 Nov, 2018 3 commits
-
-
Nate Rini authored
Bug 5977
-
Marshall Garey authored
started on a node. Bug 5865
-
Broderick Gardner authored
Bug 6061
-
- 20 Nov, 2018 3 commits
-
-
Danny Auble authored
bug 6016
-
Chris Samuel authored
Bug 6015
-
Morris Jette authored
Bug 6015
-
- 15 Nov, 2018 1 commit
-
-
Marshall Garey authored
The srun man page states that --prolog=none or --epilog=none would prevent the SrunProlog or SrunEpilog scripts defined in slurm.conf from running. However, with SLURM_TEST_EXEC set in the environment, srun would look for a script named "none" and run it if found. If not found, then srun would print an error message, but the SrunProlog or SrunEpilog would (correctly) not run. * Prevent srun from running a script named "none" that is in the path. * This also prevents the error message if it is not found. Bug 5948.
-
- 12 Nov, 2018 1 commit
-
-
Brian Christiansen authored
Moved to https://github.com/SchedMD/slurm-gcp Left README with link to new location. Will remove gcp dir in master/19.05. Bug 5842
-
- 10 Nov, 2018 1 commit
-
-
Nate Rini authored
Bug 5985
-
- 09 Nov, 2018 1 commit
-
-
Felip Moll authored
scheduler. Main scheduler detects if a job asked for more than one partition and splits one scheduling tasks for each partition. If the job asked for a reservation and the intersection between the nodes in the reservation and the partition does not suffice to satisfy the job, then skip scheduling this partition. Otherwise this may cause useless scheduling effort and possibly overwriting incorrectly the PD reason of a job. bug 5895
-