- 16 Nov, 2015 1 commit
-
-
Morris Jette authored
Backfill scheduler: Test association and QOS node limits before reserving resources for pending job. bug 2129
-
- 13 Nov, 2015 8 commits
-
-
Brian Christiansen authored
-
Brian Christiansen authored
Prevents the following sequence from causing a segfault: $ scontrol create partitionname=stuff nodes=ALL $ sbatch --wrap="hostname" -o/dev/null -p stuff Submitted batch job 1047468 $ scontrol delete partitionname=stuff $ scontrol update jobid=1047468 partition=stuff
-
Danny Auble authored
-
Danny Auble authored
tree.
-
Danny Auble authored
-
Brian Christiansen authored
Bug 2006
-
Danny Auble authored
step.
-
Morris Jette authored
-
- 12 Nov, 2015 3 commits
-
-
Mark Roberts authored
-
Morris Jette authored
-
Morris Jette authored
Previously only supported by SlurmUser and root.
-
- 11 Nov, 2015 4 commits
-
-
Morris Jette authored
Previously only reserved space for one task of pending job array.
-
Morris Jette authored
Support taking node out of FUTURE state with "scontrol reconfig" command. Previous logic would keep node in FUTURE state if that was the original configuration when slurmctld started. If job was running on the node, it will stay running, but the node make not be visible.
-
David Bigagli authored
-
Morris Jette authored
Make SLURM_ARRAY_TASK_MIN, SLURM_ARRAY_TASK_MAX, and SLURM_ARRAY_TASK_STEP environment variables available to PrologSlurmctld and EpilogSlurmctld.
-
- 10 Nov, 2015 5 commits
-
-
Hongjia Cao authored
-
Danny Auble authored
We needed to send a finish from each node in the step whether it had any activity or not. This way the controller knew things were done on the node and the data was sent to the database. Bug 2097
-
Danny Auble authored
-
Morris Jette authored
Burst_buffer/cray: Don't stall scheduling of other jobs while a stage-in is in progress. bug 2114
-
Morris Jette authored
Fix to purge terminated jobs with burst buffer errors. bug 2123
-
- 09 Nov, 2015 2 commits
-
-
Morris Jette authored
The prolog_running counter can now exceed 1. New logic raises limit from 1 to 4 before preventing job recovery on restart.
-
David Bigagli authored
-
- 07 Nov, 2015 1 commit
-
-
Morris Jette authored
Added burst_buffer.conf flag parameter of "TeardownFailure" which will teardown and remove a burst buffer after failed stage-in or stage-out. By default, the buffer will be preserved for analysis and manual teardown. bug 2116
-
- 06 Nov, 2015 2 commits
-
-
David Bigagli authored
-
Danny Auble authored
Bug 2106 What was happening was the calculation wasn't happening for memory or nodes, just cpus and gres.
-
- 05 Nov, 2015 1 commit
-
-
Kilian Cavalotti authored
-
- 04 Nov, 2015 6 commits
-
-
Morris Jette authored
-
Alejandro Sanchez authored
pre-increment.
-
Morris Jette authored
-
Morris Jette authored
The "dw_wlm_cli paths" command returns an error if no #DW options.
-
Brian Christiansen authored
commit:508f866e
-
Brian Christiansen authored
Bug 2095
-
- 03 Nov, 2015 4 commits
-
-
Danny Auble authored
-
Ryan Cox authored
-
Morris Jette authored
Add logic to call the "setup" function.
-
Danny Auble authored
in an hour counting multiple times.
-
- 02 Nov, 2015 1 commit
-
-
Morris Jette authored
Return permission denied if regular user tries to release job held by an administrator. bug 2087
-
- 30 Oct, 2015 1 commit
-
-
Deric Sullivan authored
Fix creation of advanced reservation of cores on nodes which are DOWN. There seems to be a bug with reservations using a node list (e.g. Nodes=something + CoreCnt=something). The result is a reservation made that's arguably broken; listing the reservation (scontrol show reservation) will show "Nodes=" (blank) and "CoreCnt=0". It's very easy to reproduce, just by doing the following against a node in a DOWN (also tested with POWER_UP) state: scontrol create ReservationName=tmp_res StartTime=now EndTime=now+600 Nodes=<some_non_idle_node> CoreCnt=1 Users=<some_valid_user> scontrol show reservation. bug 2078
-
- 29 Oct, 2015 1 commit
-
-
Danny Auble authored
-