- 12 Nov, 2015 3 commits
-
-
Mark Roberts authored
-
Morris Jette authored
-
Morris Jette authored
Previously only supported by SlurmUser and root.
-
- 11 Nov, 2015 4 commits
-
-
Morris Jette authored
Previously only reserved space for one task of pending job array.
-
Morris Jette authored
Support taking node out of FUTURE state with "scontrol reconfig" command. Previous logic would keep node in FUTURE state if that was the original configuration when slurmctld started. If job was running on the node, it will stay running, but the node make not be visible.
-
David Bigagli authored
-
Morris Jette authored
Make SLURM_ARRAY_TASK_MIN, SLURM_ARRAY_TASK_MAX, and SLURM_ARRAY_TASK_STEP environment variables available to PrologSlurmctld and EpilogSlurmctld.
-
- 10 Nov, 2015 5 commits
-
-
Hongjia Cao authored
-
Danny Auble authored
We needed to send a finish from each node in the step whether it had any activity or not. This way the controller knew things were done on the node and the data was sent to the database. Bug 2097
-
Danny Auble authored
-
Morris Jette authored
Burst_buffer/cray: Don't stall scheduling of other jobs while a stage-in is in progress. bug 2114
-
Morris Jette authored
Fix to purge terminated jobs with burst buffer errors. bug 2123
-
- 09 Nov, 2015 2 commits
-
-
Morris Jette authored
The prolog_running counter can now exceed 1. New logic raises limit from 1 to 4 before preventing job recovery on restart.
-
David Bigagli authored
-
- 07 Nov, 2015 1 commit
-
-
Morris Jette authored
Added burst_buffer.conf flag parameter of "TeardownFailure" which will teardown and remove a burst buffer after failed stage-in or stage-out. By default, the buffer will be preserved for analysis and manual teardown. bug 2116
-
- 06 Nov, 2015 2 commits
-
-
David Bigagli authored
-
Danny Auble authored
Bug 2106 What was happening was the calculation wasn't happening for memory or nodes, just cpus and gres.
-
- 05 Nov, 2015 1 commit
-
-
Kilian Cavalotti authored
-
- 04 Nov, 2015 6 commits
-
-
Morris Jette authored
-
Alejandro Sanchez authored
pre-increment.
-
Morris Jette authored
-
Morris Jette authored
The "dw_wlm_cli paths" command returns an error if no #DW options.
-
Brian Christiansen authored
commit:508f866e
-
Brian Christiansen authored
Bug 2095
-
- 03 Nov, 2015 4 commits
-
-
Danny Auble authored
-
Ryan Cox authored
-
Morris Jette authored
Add logic to call the "setup" function.
-
Danny Auble authored
in an hour counting multiple times.
-
- 02 Nov, 2015 1 commit
-
-
Morris Jette authored
Return permission denied if regular user tries to release job held by an administrator. bug 2087
-
- 30 Oct, 2015 1 commit
-
-
Deric Sullivan authored
Fix creation of advanced reservation of cores on nodes which are DOWN. There seems to be a bug with reservations using a node list (e.g. Nodes=something + CoreCnt=something). The result is a reservation made that's arguably broken; listing the reservation (scontrol show reservation) will show "Nodes=" (blank) and "CoreCnt=0". It's very easy to reproduce, just by doing the following against a node in a DOWN (also tested with POWER_UP) state: scontrol create ReservationName=tmp_res StartTime=now EndTime=now+600 Nodes=<some_non_idle_node> CoreCnt=1 Users=<some_valid_user> scontrol show reservation. bug 2078
-
- 29 Oct, 2015 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
submit_time. 0 means the job isn't eligible yet so we need to treat it so.
-
Danny Auble authored
-
Morris Jette authored
PrologFlags work: Setting a flag of "Contain" implicitly sets the "Alloc" flag. Fix code path which could prevent execution of the Prolog when the "Alloc" or "Contain" flag were set. bug 2074
-
- 28 Oct, 2015 2 commits
-
-
Morris Jette authored
If a user tries to create/destroy a persistent burst buffer and that is disabled (the default), return a permission denied error rather than an invalid request error. cray bug 832678
-
Tim Wickberg authored
Bug 2071
-
- 27 Oct, 2015 4 commits
-
-
Morris Jette authored
Log the minimum and maximum exit code (or signal) and show the state as "Mixed" if there are multiple termination states (e.g. cancelled, exit 0, exit 1, SIGBUS, etc.) bug 2058
-
Danny Auble authored
it needed to check for INFINITE64 instead of zero.
-
Danny Auble authored
-
Danny Auble authored
This just removes xfrees where the xfree had already happened.
-