- 17 Dec, 2012 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Chris Read authored
-
- 14 Dec, 2012 4 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Chris Reed authored
Without this patch, use of sched/builtin would always result in FIFO scheduling, even if priority/multifactor was configured
-
Danny Auble authored
-
- 13 Dec, 2012 3 commits
-
-
jette authored
-
Danny Auble authored
each block independently.
-
Danny Auble authored
-
- 12 Dec, 2012 1 commit
-
-
Morris Jette authored
-
- 07 Dec, 2012 1 commit
-
-
Morris Jette authored
Correction to hostlist sorting for hostnames that contain two numeric components and the first numeric component has various sizes (e.g. "rack9blade1" should come before "rack10blade1")
-
- 06 Dec, 2012 1 commit
-
-
Morris Jette authored
-
- 05 Dec, 2012 3 commits
-
-
Danny Auble authored
job on future step creation attempts.
-
Danny Auble authored
also cause it to run if the realtime server ever goes away.
-
Morris Jette authored
Especially for newly started jobs, the PrologSlurmctld can change a job's QOS based upon resource allocation.
-
- 04 Dec, 2012 1 commit
-
-
Danny Auble authored
DB2 so hard.
-
- 30 Nov, 2012 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
on them. This should only happen in extreme conditions.
-
- 29 Nov, 2012 7 commits
-
-
Danny Auble authored
with associations get the deleted associations as well.
-
Francois Diakhate authored
request resources that reach a 'Max' limit.
-
Danny Auble authored
-
Danny Auble authored
user mark the state canceled instead of completed.
-
Morris Jette authored
-
Danny Auble authored
so it gets sent again. This isn't a major problem since the start will happen when the job ends, but this does make things cleaner.
-
Morris Jette authored
-
- 28 Nov, 2012 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
you query against that with -N and -E you will get all jobs during that time instead of only the ones running on -N. Signed-off-by: Danny Auble <da@schedmd.com>
-
- 27 Nov, 2012 5 commits
-
-
Danny Auble authored
-
Danny Auble authored
was already in error and isn't deallocating and underlying hardware goes bad one could get overlapping blocks in error making the code assert when a new job request comes in.
-
Danny Auble authored
overcommit.
-
Danny Auble authored
overcommit.
-
Morris Jette authored
Previously only requeued the job once
-
- 26 Nov, 2012 2 commits
-
-
Danny Auble authored
where needed)
-
jette authored
Otherwise an aborted slurmstepd can cause the srun process to hang indefinitely; a problem reported in trouble ticket 149.
-
- 22 Nov, 2012 1 commit
-
-
Danny Auble authored
introduce step accounting for a Cray.
-
- 21 Nov, 2012 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
This is needed if the munge deamon is under very heavy load (e.g. with 1000 slurmd daemons per compute node).
-
- 20 Nov, 2012 1 commit
-
-
Danny Auble authored
slurmctld restart.
-