- 26 Jun, 2017 1 commit
-
-
Dominik Bartkiewicz authored
Improve backfill scheduling algorithm with respect to starting jobs as soon as possible while avoiding advanced reservations. bug 3757
-
- 24 Jun, 2017 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
- 23 Jun, 2017 1 commit
-
-
Tim Shaw authored
Bug 3581.
-
- 22 Jun, 2017 6 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Brian Christiansen authored
-
Danny Auble authored
The SLURM_ID_HASH used for Cray systems has changed to fully use the entire 64 bits of the hash. Previously the stepid was multiplied by 10,000,000,000 to make it easy to read both the jobid as well as the stepid in the hash separated by at least a couple of zeros, but this lead to overflow on the hash with steps like the batch and extern step where they used all 32 bits to represent the step. While the new method ruins the easy readability it fixes the more important overflow issue. This most likely will go unnoticed by most, just a note of the change.
-
Doug Jacobsen authored
Bug 3815 It would be nice to figure out a way to remove the check for version all together but I (Danny) couldn't figure out how that would be done since we need to know which libs/headers to use and on systems with multiple installed and no 'lua' lib (Ubuntu) you have to use the PKG_CHECK_EXISTS to set up the pkg name for PKG_CHECK_MODULES or you don't get things set up correct when trying to link.
-
Hongjia Cao authored
Bug 3919
-
- 21 Jun, 2017 1 commit
-
-
Dominik Bartkiewicz authored
bug 3757
-
- 20 Jun, 2017 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
more than 1 partition or when the partition is changed with scontrol. Bug 3849
-
- 19 Jun, 2017 4 commits
-
-
Danny Auble authored
submitted to a QOS/association. Bug 3849
-
Morris Jette authored
Correct error message when ClusterName in configuration files does not match the name in the slurmctld daemon's state save file.
-
Danny Auble authored
the requested value, instead of always setting one. This would make --hint=multithread not work at all. See Bug 3855 (commit 3c852da1) Issue originated from commit 82a959a8.
-
Morris Jette authored
Correct error message when ClusterName in configuration files does not match the name in the slurmctld daemon's state save file.
-
- 15 Jun, 2017 2 commits
-
-
Danny Auble authored
the requested value, instead of always setting one. This would make --hint=multithread not work at all. See Bug 3855 (commit 3c852da1) Issue originated from commit 82a959a8.
-
Dominik Bartkiewicz authored
bug 3447
-
- 14 Jun, 2017 2 commits
-
-
Danny Auble authored
Turns out if the extern step is created here and the job was killed long before hand the step is made erroneously and can cause an assert just lines later. Bug 3898
-
Tim Shaw authored
set correctly. Bug 3858
-
- 13 Jun, 2017 2 commits
-
-
Tim Wickberg authored
-
Danny Auble authored
What this does is populate the node_hash_table as nodes are being read in instead of after the node_record_table_ptr has been fully populated. This speeds up a start of a slurmd with a system of 10000 nodes from > 1 minute to less than a second. In 17.11 we will remove the linear xstrcmp check as it should no longer be needed. Bug 3885
-
- 12 Jun, 2017 4 commits
-
-
Danny Auble authored
batch job takes longer than it takes to finish. Bug 3833
-
Danny Auble authored
time. Bug 3833
-
Morris Jette authored
An array was only being partially cleared due to bad logic bug 3876
-
Tim Wickberg authored
Bug 3874.
-
- 08 Jun, 2017 2 commits
-
-
Dominik Bartkiewicz authored
Improve selection of jobs to preempt when there are multiple partitions with jobs subject to preemption. bug 3824
-
Dominik Bartkiewicz authored
Prevent segfault from pointer dereference to the QOS that is being deleted. Fix to commit 3e8aa451.
-
- 07 Jun, 2017 1 commit
-
-
Tim Wickberg authored
-
- 03 Jun, 2017 1 commit
-
-
Danny Auble authored
Fix regression from commit c05dcb8a (bug 1923) that doesn't take into consideration a blank char * as a valid option. This fixes the scenario like sacctmgr list associations user='' which would only print account associations. Bug 3862
-
- 02 Jun, 2017 2 commits
-
-
Danny Auble authored
a good return code. This also fixes the situation where the step was ending but not yet ended so it sends the KILL_TASK_FAILED error instead of JOB_NOTRUNNING. Also it removes the abort in favor of exit which it should had been anyways. Bug 3758
-
Gary B Skouson authored
which the backfill test window expands. This can be used on a system with a modest number of running jobs (hundreds of jobs) to help prevent expected start times of pending jobs to get pushed forward in time. On systems with large numbers of running jobs, performance of the backfill scheduler will suffer and fewer jobs will be evaluated. Bug 3790
-
- 01 Jun, 2017 7 commits
-
-
Danny Auble authored
This reverts commit da414931.
-
Danny Auble authored
which the backfill test window expands. This can be used on a system with a modest number of running jobs (hundreds of jobs) to help prevent expected start times of pending jobs to get pushed forward in time. On systems with large numbers of running jobs, performance of the backfill scheduler will suffer and fewer jobs will be evaluated. Bug 3790
-
Mark Klein authored
Bug 3671
-
Mark Klein authored
Inadvertently set to one when requested. Bug 3855.
-
Tim Wickberg authored
Bug 3857.
-
Doug Jacobsen authored
Bug 3808
-
Pablo Escobar authored
bug 3846
-