- 30 Oct, 2013 2 commits
-
-
Morris Jette authored
Add support for -W block=true (wait for job completion) Clear PBS_NODEFILE environment variable Credit to NCSC
-
Morris Jette authored
-
- 28 Oct, 2013 5 commits
-
-
Morris Jette authored
Add support for dependencies of job array elements (e.g. "sbatch --depend=afterok:123_4 ..."). This does not support depenendencies of ALL job array elements, only individual job array elements.
-
Danny Auble authored
preforms the allocation, regression in 2.6.3.
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
-
- 25 Oct, 2013 4 commits
-
-
Morris Jette authored
Previously the SlurmdDebug value in slurm.conf was ignored if the previous value was not 3/init
-
Morris Jette authored
Effect is minimal without multiple partitions and larger system sizes. With 40 partitions and about 600 nodes each, time goes from about 13 secs to 4 secs).
-
Morris Jette authored
This avoids building hostlist information with NodeHostName and NodeAddr information unless explisitly requested and can improve performance for the default mode of operation by about 65%.
-
Morris Jette authored
Correct sbatch documentation and job_submit/pbs plugin "%j" is job ID, not "%J" (which is job_id.step_id).
-
- 24 Oct, 2013 1 commit
-
-
Morris Jette authored
Without this change a job with a reason of WAIT_PART_DOWN, WAIT_PART_INACTIVE, WAIT_PART_NODE_LIMIT, WAIT_PART_TIME_LIMIT, or WAIT_QOS_THRES would not be cleared when that reason no longer applied.
-
- 23 Oct, 2013 4 commits
-
-
Morris Jette authored
Add cgroup create retry logic in case one step is starting at the same time as another step is ending and the logic to create and delete cgroups overlaps. bug 447
-
Morris Jette authored
If a node has GRES and multiple threads per core the select/cons_res plugin can get stuck in an infinite loop. See bug 475 Contributed by: PREVOST Ludovic NEC HPC Europe
-
Morris Jette authored
-
Morris Jette authored
Previously a node failure would always requeue the job
-
- 22 Oct, 2013 4 commits
-
-
Morris Jette authored
Add cgroup create retry logic in case one step is starting at the same time as another step is ending and the logic to create and delete cgroups overlaps. bug 447
-
Morris Jette authored
If a node has GRES and multiple threads per core the select/cons_res plugin can get stuck in an infinite loop. See bug 475 Contributed by: PREVOST Ludovic NEC HPC Europe
-
Morris Jette authored
-
Morris Jette authored
Previously a node failure would always requeue the job
-
- 21 Oct, 2013 1 commit
-
-
Morris Jette authored
Restore default behavior of allocating cores to jobs on a cyclic basis across the sockets unless SelectTypeParameters=CR_CORE_DEFAULT_DIST_BLOCK or user specifies other distribution options. Reverts commit 7fcdc7e5 bug 466
-
- 20 Oct, 2013 1 commit
-
-
jette authored
If the backfill scheduler relinquishes locks and the normal job scheduler starts a job that the backfill scheduler was actively working, the backfill scheduler will try to re-schedule that same job, possibly resulting in an invalid memory reference or other badness.
-
- 19 Oct, 2013 2 commits
-
-
Morris Jette authored
Fix for --cpu_bind=map_cpu/mask_cpu/map_ldom/mask_ldom plus --mem_bind=map_mem/mask_mem options, broken in 2.6.2. See commit 718382da
-
David Bigagli authored
-
- 18 Oct, 2013 1 commit
-
-
Danny Auble authored
-
- 17 Oct, 2013 3 commits
-
-
Danny Auble authored
-
Morris Jette authored
-
David Bigagli authored
-
- 16 Oct, 2013 3 commits
-
-
Chrysovalantis Paschoulas authored
-
Castain, Ralph H authored
-
jette authored
Formerly was NO_VAL and will be reported as "TBD" by scontrol and squeue commands.
-
- 15 Oct, 2013 3 commits
-
-
Trofinoff, Stephen authored
-
Martin Perry authored
-
Danny Auble authored
-
- 14 Oct, 2013 4 commits
-
-
Filip Skalski authored
-
Janne Blomqvist authored
Set RLIMIT_RSS, RLIMIT_AS and/or RLIMIT_DATA to enforce memory limit. See bug 454 for details
-
jette authored
The pending jobs will have their reservation info removed bug 455
-
jette authored
Previously it was always set to the job's CPU count
-
- 11 Oct, 2013 2 commits
-
-
Morris Jette authored
Initiate jobs pending to run in a reservation as soon as the reservation becomes active. Partial fix for bug 455
-
Morris Jette authored
Revert commit 626be3ea It was causing stack overflow and memory corruption
-