- 20 Dec, 2018 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
- 19 Dec, 2018 4 commits
-
-
Nate Rini authored
bug 6197
-
Morris Jette authored
The previous select/cons_tres logic would under some circumstances allocate a job more cores/CPUs than requested. One specific example is a cluster having nodes with 4 cores, 2 hyperthreads each. A job request for 20 tasks would launch 8 tasks on 8 CPUs on each of 2 nodes and 4 tasks on 6 CPUs on a third node (i.e. a total of 22 CPUs when only 20 are needed).
-
Morris Jette authored
-
Danny Auble authored
# Conflicts: # src/common/gres.c
-
- 18 Dec, 2018 8 commits
-
-
Dominik Bartkiewicz authored
Bug 5682 and 4584 Backport of ba07a6e09b6071
-
Morris Jette authored
The commit introduced other problems and still needs more work.
-
Dominik Bartkiewicz authored
other than knl. Bug 5603
-
Morris Jette authored
The previous select/cons_tres logic would under some circumstances allocate a job more cores/CPUs than requested. One specific example is a cluster having nodes with 4 cores, 2 hyperthreads each. A job request for 20 tasks would launch 8 tasks on 8 CPUs on each of 2 nodes and 4 tasks on 6 CPUs on a third node (i.e. a total of 22 CPUs when only 20 are needed).
-
Tim Wickberg authored
Only use removed prior to 17.11 in 04bc96f6. Bug 6261.
-
Danny Auble authored
Bug 5603
-
Moe Jette authored
This patch also simplifies the code using bit_cnt instead of _bitstr_bits(). This isn't really necessary, but does look cleaner. Bug 6216
- 17 Dec, 2018 8 commits
-
-
Michael Hinton authored
Bug 6420
-
Morris Jette authored
without this logic the job could get more gres on a node than there are CPUs for
-
Danny Auble authored
Continuation of 2 commits ago. Bug 6159
-
Broderick Gardner authored
Continuation of last commit Bug 6159
-
Broderick Gardner authored
It was using the local cluster, sending the query to the wrong table Bug 6159
-
Morris Jette authored
-
Michael Hinton authored
Update GRES docs regarding ignore records and links. Update tests to get rid of extra fields for GRES ignores lines. Add tests to check for improved ignore syntax. Test the gres.conf examples. Add mps to regression slurm.conf, to test mps record parsing. Bug 5520
-
Tim Wickberg authored
Add --version option to slurmd, and document that new option.
-
- 15 Dec, 2018 5 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
This has been disabled since before Slurm 1.0.0.
-
Tim Wickberg authored
No functional change, all these are in comments.
-
Morris Jette authored
Insure that output appears in a fixed order for parsing by test.
-
Morris Jette authored
This supports heterogeneous environments (i.e. different MPS counts on different GPUs within a node)
-
- 14 Dec, 2018 8 commits
-
-
Nate Rini authored
bug 6116
-
Nate Rini authored
bug 6116
-
Brian Christiansen authored
Bug 6237
-
Morris Jette authored
if the gres count on a node with topology changes when the slurmctld restarts then the gres data structures were left in an inconsistent state. Namely the bitmaps would reflect the old size while the count reflects the new size, which resulted in asserts. In addition, the gres/mps data structure sizes need to match the gpu count on each node. This new logic will synchronize mps data structures on gpu count changes.
-
Dominik Bartkiewicz authored
if OverTimeLimit is Unlimited. Bug 6093
-
Michael Hinton authored
-
Tim Wickberg authored
-
Morris Jette authored
-
- 13 Dec, 2018 3 commits
-
-
Morris Jette authored
Add support for co-scheduling of gres/gpu and gres/mps. GPUs that are allocated to one are avoided for the other GRES type. Add gres/mps documentation Recover job gres/mps state on slurmctld restart. Wwe need to use job gres/mps state to recover node info since we will not know the count of mps on each device file until the node registers
-
Michael Hinton authored
Check for cgroup usage and change GPU indexes accordingly. Fix formatting errors in docs. bug 5520
-
Danny Auble authored
-
- 11 Dec, 2018 2 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
-