- 24 Oct, 2013 1 commit
-
-
Morris Jette authored
Without this change a job with a reason of WAIT_PART_DOWN, WAIT_PART_INACTIVE, WAIT_PART_NODE_LIMIT, WAIT_PART_TIME_LIMIT, or WAIT_QOS_THRES would not be cleared when that reason no longer applied.
-
- 22 Oct, 2013 4 commits
-
-
Morris Jette authored
Add cgroup create retry logic in case one step is starting at the same time as another step is ending and the logic to create and delete cgroups overlaps. bug 447
-
Morris Jette authored
If a node has GRES and multiple threads per core the select/cons_res plugin can get stuck in an infinite loop. See bug 475 Contributed by: PREVOST Ludovic NEC HPC Europe
-
Morris Jette authored
-
Morris Jette authored
Previously a node failure would always requeue the job
-
- 21 Oct, 2013 1 commit
-
-
Morris Jette authored
Restore default behavior of allocating cores to jobs on a cyclic basis across the sockets unless SelectTypeParameters=CR_CORE_DEFAULT_DIST_BLOCK or user specifies other distribution options. Reverts commit 7fcdc7e5 bug 466
-
- 20 Oct, 2013 1 commit
-
-
jette authored
If the backfill scheduler relinquishes locks and the normal job scheduler starts a job that the backfill scheduler was actively working, the backfill scheduler will try to re-schedule that same job, possibly resulting in an invalid memory reference or other badness.
-
- 19 Oct, 2013 2 commits
-
-
Morris Jette authored
Fix for --cpu_bind=map_cpu/mask_cpu/map_ldom/mask_ldom plus --mem_bind=map_mem/mask_mem options, broken in 2.6.2. See commit 718382da
-
David Bigagli authored
-
- 17 Oct, 2013 2 commits
-
-
Danny Auble authored
-
David Bigagli authored
-
- 16 Oct, 2013 1 commit
-
-
Chrysovalantis Paschoulas authored
-
- 15 Oct, 2013 3 commits
-
-
Trofinoff, Stephen authored
-
Martin Perry authored
-
Danny Auble authored
-
- 14 Oct, 2013 2 commits
-
-
Filip Skalski authored
-
jette authored
The pending jobs will have their reservation info removed bug 455
-
- 11 Oct, 2013 5 commits
-
-
Morris Jette authored
Initiate jobs pending to run in a reservation as soon as the reservation becomes active. Partial fix for bug 455
-
Morris Jette authored
Revert commit 626be3ea It was causing stack overflow and memory corruption
-
Martin Perry authored
-
Morris Jette authored
Previous logic only reported un-reserved node map. New logging adds information about each job testing and where/when it is scheduled resources.
-
Danny Auble authored
slurm.conf when using the DBD.
-
- 10 Oct, 2013 1 commit
-
-
jette authored
Induced by bf_continue option and deleting a partition.
-
- 09 Oct, 2013 3 commits
-
-
David Bigagli authored
to reflect only the latest supported format.
-
Morris Jette authored
if bf_continue option is configured and slurm is reconfigured during one of the sleep cycles, then the backfill scheduler will reference an invalid partition pointer.
-
Morris Jette authored
Previous logic would place more tasks on each node than specified by --ntasks-per-node, using fewer nodes than desired. This only happens with exclusive node allocations (e.g. in partition configuration Shared=Exclusive).
-
- 08 Oct, 2013 1 commit
-
-
Morris Jette authored
EpilogSlurmctld pthread is passed required arguments rather than a pointer to the job record, which under some conditions could be purged and result in an invalid memory reference.
-
- 07 Oct, 2013 1 commit
-
-
David Bigagli authored
-
- 03 Oct, 2013 5 commits
-
-
Rod Schultz authored
-
Morris Jette authored
-
David Bigagli authored
segfault.
-
Morris Jette authored
-
Morris Jette authored
-
- 02 Oct, 2013 3 commits
-
-
David Bigagli authored
is not parsed correctly.
-
Morris Jette authored
gres/gpu and gres/mic - Do not treat the existence of an empty gres.conf file as a fatal error. There may be no gres devices on that node so we do not require the file. Assume gres counts of zero if no file.
-
Morris Jette authored
bug 436
-
- 01 Oct, 2013 4 commits
-
-
Eric Winter authored
job CPU count not loaded correctly Partition time limit format wrong (minutes rather than hhmmss format).
-
Eric Winter authored
-
Eric Winter authored
-
Danny Auble authored
isn't doing the launching.
-