- 12 Nov, 2015 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
Previously only supported by SlurmUser and root.
-
- 11 Nov, 2015 7 commits
-
-
Morris Jette authored
Previously only reserved space for one task of pending job array.
-
-
Morris Jette authored
Support taking node out of FUTURE state with "scontrol reconfig" command. Previous logic would keep node in FUTURE state if that was the original configuration when slurmctld started. If job was running on the node, it will stay running, but the node make not be visible.
-
David Bigagli authored
-
Morris Jette authored
Previous logic would create a environment and script file for each task of a job array (hard link to original actually). Due to file system limitations and clutter, this was less than ideal. This patch eliminates the redundant files, using only the original file created for the job array. This should also make support for burst buffers easier in the future for job arrays.
-
Morris Jette authored
-
Morris Jette authored
Make SLURM_ARRAY_TASK_MIN, SLURM_ARRAY_TASK_MAX, and SLURM_ARRAY_TASK_STEP environment variables available to PrologSlurmctld and EpilogSlurmctld.
-
- 10 Nov, 2015 6 commits
-
-
Hongjia Cao authored
-
Danny Auble authored
We needed to send a finish from each node in the step whether it had any activity or not. This way the controller knew things were done on the node and the data was sent to the database. Bug 2097
-
Danny Auble authored
-
Danny Auble authored
to get the batch step, no real code change outside of using strcasecmp instead of strcmp.
-
Morris Jette authored
Burst_buffer/cray: Don't stall scheduling of other jobs while a stage-in is in progress. bug 2114
-
Morris Jette authored
Fix to purge terminated jobs with burst buffer errors. bug 2123
-
- 09 Nov, 2015 4 commits
-
-
Morris Jette authored
The prolog_running counter can now exceed 1. New logic raises limit from 1 to 4 before preventing job recovery on restart.
-
David Bigagli authored
-
Thomas Cadeau authored
-
David Bigagli authored
the error happened.
-
- 07 Nov, 2015 2 commits
-
-
Morris Jette authored
Correct preservation of job ID. This effects emulation mode only. bug 2113
-
Morris Jette authored
Added burst_buffer.conf flag parameter of "TeardownFailure" which will teardown and remove a burst buffer after failed stage-in or stage-out. By default, the buffer will be preserved for analysis and manual teardown. bug 2116
-
- 06 Nov, 2015 8 commits
-
-
Danny Auble authored
Conflicts: doc/man/man5/slurm.conf.5
-
Morris Jette authored
If a stage-out fails, Slurm leaves the burst buffer in place and logs something like "error: bb_set_use_time: job 98 with allocated burst buffers not found" every minute thereafer. This changes the logic to only log the event one time. bug 2112
-
Morris Jette authored
This is a revision to commit e21b666c which did not fix the problem for all configurations. bug 2086
-
David Bigagli authored
-
Alejandro Sanchez authored
-
Danny Auble authored
Bug 2106 What was happening was the calculation wasn't happening for memory or nodes, just cpus and gres.
-
Morris Jette authored
bug 2086
-
Morris Jette authored
-
- 05 Nov, 2015 5 commits
-
-
Morris Jette authored
Job needs exit code of zero for COMPLETED state.
-
Morris Jette authored
bug 2093
-
Josko Plazonic authored
-
Danny Auble authored
-
Kilian Cavalotti authored
-
- 04 Nov, 2015 6 commits
-
-
Morris Jette authored
Conflicts: META NEWS
-
Morris Jette authored
-
Morris Jette authored
-
Alejandro Sanchez authored
-
Danny Auble authored
-
Alejandro Sanchez authored
-