- 13 Jan, 2014 4 commits
-
-
Morris Jette authored
Add description of SLURM_TOPOLOGY_ADDR and SLURM_TOPOLOGY_ADDR_PATTERN environment variables to the topology web page.
-
Morris Jette authored
Do not reset a job's priority when the slurmctld restarts if previously set to some specific value. bug 561
-
Morris Jette authored
No change in functionality
-
John Morrissey authored
groups.
-
- 08 Jan, 2014 4 commits
-
-
David Bigagli authored
-
Christopher Samuel authored
-
David Bigagli authored
This reverts commit 3464295e.
-
David Bigagli authored
-
- 07 Jan, 2014 4 commits
-
-
Danny Auble authored
-
Morris Jette authored
Do not mark the node DOWN if its memory or tmp disk space is lower than configured, just log it using debug message type
-
Morris Jette authored
The previous changes in commit bc3d8828 failed to address some accounting issues addressed here.
-
Morris Jette authored
-
- 06 Jan, 2014 4 commits
-
-
Morris Jette authored
If a job is explicitly suspended, its priority is set to zero. This resets the priority when requeued and also documents that if the job is requeued (e.g. due to a node failure), then it is placed in a held state.
-
Morris Jette authored
Without this patch, the job's RunTime includes its RunTime from before it's prior suspend (i.e. the job's full RunTime rather than just the RunTime of the requeued job).
-
Morris Jette authored
-
Morris Jette authored
-
- 27 Dec, 2013 2 commits
-
-
Filip Skalski authored
Hello, I think I found another bug in the code (I'm using 2.6.3 but I checked the 2.6.5 and 14.03 versions and it's the same there). In file sched/backfill/backfill.c: 1) _add_reservation function, from lines 1172: if (placed == true) { j = node_space[j].next; if (j && (end_reserve < node_space[j].end_time)) { /* insert end entry record */ i = *node_space_recs; node_space[i].begin_time = end_reserve; node_space[i].end_time = node_space[j].end_time; node_space[j].end_time = end_reserve; node_space[i].avail_bitmap = bit_copy(node_space[j].avail_bitmap); node_space[i].next = node_space[j].next; node_space[j].next = i; (*node_space_recs)++; } break; } I draw a picture with `node_space` state after 2 iterations (see attachment). In case where the new reservation i...
-
Morris Jette authored
-
- 25 Dec, 2013 1 commit
-
-
David Bigagli authored
-
- 23 Dec, 2013 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
David Bigagli authored
-
- 21 Dec, 2013 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
- 20 Dec, 2013 6 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
runjob_mux.
-
Danny Auble authored
0c00d359
-
Danny Auble authored
for better debug
-
Danny Auble authored
midplane block that starts on a higher coordinate than it ends (i.e if a block has midplanes [0010,0013] 0013 is the start even though it is listed second in the hostlist).
-
- 19 Dec, 2013 1 commit
-
-
Morris Jette authored
It has been changed to improve the calculated value for pending jobs and use the actual node count value for jobs that have been started (including suspended, completed, etc.) bug 549
-
- 18 Dec, 2013 7 commits
-
-
Danny Auble authored
that spans multiple midplanes the cnodes are correctly accounted for that are in error.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
being in error.
-
Danny Auble authored
-
Morris Jette authored
Note that each job's node allocation is counted separately. bug 548
-
- 17 Dec, 2013 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
will return ENOTCONN and not initialize the addr_str causing valgrind errors.
-