- 15 Dec, 2011 1 commit
-
-
Morris Jette authored
Prevent resetting a held job's priority when updating other job parameters. Patch from Alejandro Lucero Palau, BSC.
-
- 14 Dec, 2011 2 commits
-
-
Morris Jette authored
Patch from John Thiltges, University of Nebraska-Lincoln.
-
Danny Auble authored
-
- 13 Dec, 2011 1 commit
-
-
Danny Auble authored
-
- 09 Dec, 2011 8 commits
-
-
Danny Auble authored
-
Danny Auble authored
starts in accounting.
-
Danny Auble authored
-
Morris Jette authored
Add an srun shepard process to cancel a job and/or step of the srun process is killed abnormally (e.g. SIGKILL).
-
Danny Auble authored
-
Danny Auble authored
starts in accounting.
-
Danny Auble authored
-
Morris Jette authored
-
- 08 Dec, 2011 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
- 06 Dec, 2011 3 commits
-
-
Morris Jette authored
One of our testers discovered a regression in version 2.3.1. If a job is pending due to PartitionNodeLimit and the limit is relieved with a 'sacctmgr modify qos name=<qos name> set flags=partitionmaxnodes' new jobs exceeding the partition limit (but not the QOS limit) are allowed to run. However, the pending job is never allowed to run. Attached is a patch to address this problem. FYI, this problem doesn't exist in version 2.4. Patch from Bill Brophy, Bull.
-
Morris Jette authored
-
Morris Jette authored
Let a job be submitted even if it exceeds a QOS limit. Job will be left in a pending state until the QOS limit or job parameters change. Patch by Phil Eckert, LLNL.
-
- 05 Dec, 2011 2 commits
-
-
Morris Jette authored
Patch by Alexander Bersenev (Institute of Mathematics and Mechanics, Russia).
-
Morris Jette authored
-
- 02 Dec, 2011 1 commit
-
-
Danny Auble authored
There was also some bad code that would reset the conn_type of a block to SMALL no matter what type of SMALL it was.
-
- 01 Dec, 2011 1 commit
-
-
jette authored
This was due to a bug in select/cons_res with some configuration optiions and job options, especially if there is more than one thread per core and the job option includes "--threads-per-core=1". Fixes problem reported by CSCS.
-
- 30 Nov, 2011 3 commits
-
-
Morris Jette authored
-
Danny Auble authored
qos on the cluster to fill that in correctly.
-
Danny Auble authored
incorrectly when getting fairshare from the parent.
-
- 28 Nov, 2011 1 commit
-
-
Danny Auble authored
-
- 23 Nov, 2011 2 commits
-
-
Morris Jette authored
Add ReconfigFlags configuration option to control actions of "scontrol reconfig". Patch from Don Albert, Bull.
-
Danny Auble authored
wasn't started at the time the eligible message was sent but started before the db_index was returned information like start time would be lost.
-
- 22 Nov, 2011 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 21 Nov, 2011 4 commits
-
-
Morris Jette authored
Add numeric range checks to scontrol update options. Patch from Phil Eckert, LLNL.
-
Danny Auble authored
to use srun instead of runjob.
-
Morris Jette authored
-
Morris Jette authored
-
- 08 Nov, 2011 3 commits
-
-
Danny Auble authored
for this yet though.
-
Morris Jette authored
Note this is an old bug. The new code keeps slurmstepd alive and it keeps trying to send step completion message to slurmctld.
-
Morris Jette authored
Note this is an old bug. The new code keeps slurmstepd alive and it keeps trying to send step completion message to slurmctld.
-
- 07 Nov, 2011 1 commit
-
-
Morris Jette authored
This make the same patch to select/linear as Carles Fenoy's patch to select/cons_res plugin.
-
- 04 Nov, 2011 2 commits
-
-
Morris Jette authored
Patch 4f68cde5bd6b4fcf839f6694457373c81d9548ba from chaos/slurm by Don Lipari, LLNL
-
Morris Jette authored
Patch 4f68cde5bd6b4fcf839f6694457373c81d9548ba from chaos/slurm by Don Lipari, LLNL
-