- 31 Dec, 2013 2 commits
-
-
David Bigagli authored
-
David Bigagli authored
then in parallel.
-
- 30 Dec, 2013 1 commit
-
-
David Bigagli authored
Shut down clusters after test done. USe logger instead of print.
-
- 27 Dec, 2013 5 commits
-
-
Morris Jette authored
Rename "Exclusive OR" to "Matching OR" which better describes how it works.
-
Morris Jette authored
-
Filip Skalski authored
Hello, I think I found another bug in the code (I'm using 2.6.3 but I checked the 2.6.5 and 14.03 versions and it's the same there). In file sched/backfill/backfill.c: 1) _add_reservation function, from lines 1172: if (placed == true) { j = node_space[j].next; if (j && (end_reserve < node_space[j].end_time)) { /* insert end entry record */ i = *node_space_recs; node_space[i].begin_time = end_reserve; node_space[i].end_time = node_space[j].end_time; node_space[j].end_time = end_reserve; node_space[i].avail_bitmap = bit_copy(node_space[j].avail_bitmap); node_space[i].next = node_space[j].next; node_space[j].next = i; (*node_space_recs)++; } break; } I draw a picture with `node_space` state after 2 iterations (see attachment). In case where the new reservation is fully inside another reservation, then everything is OK. But if the new reservation spans multiple existing reservations then the `end entry record` is not created. This is because only the newly created `start entry record` is checked. Easy fix would be to change the if into a loop, for example: if (placed == true) { while((j = node_space[j].next) > 0) { if (end_reserve < node_space[j].end_time) { //same as above break; } } break; } 2) You could also change line 612: node_space = xmalloc(sizeof(node_space_map_t) * (max_backfill_job_cnt + 3)); To `(max_backfill_job_cnt * 2 + 1)` , since each reservation can add at most two entries (check at line 982 should never execute). At the moment, in a worst case scenario this only checks half of the max_backfill_job_cnt. NOTE: However this is all based on the assumption, that it is not done on purpose to speed up the calculations and trading some of the accuracy (especially point 2). Best regards, Filip Skalski
-
Morris Jette authored
-
Danny Auble authored
-
- 26 Dec, 2013 1 commit
-
-
David Bigagli authored
-
- 25 Dec, 2013 1 commit
-
-
David Bigagli authored
-
- 24 Dec, 2013 2 commits
-
-
David Bigagli authored
-
David Bigagli authored
-
- 23 Dec, 2013 8 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Conflicts: NEWS
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Conflicts: META
-
Morris Jette authored
-
David Bigagli authored
-
- 21 Dec, 2013 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- 20 Dec, 2013 9 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
runjob_mux.
-
Danny Auble authored
0c00d359
-
Danny Auble authored
for better debug
-
Danny Auble authored
midplane block that starts on a higher coordinate than it ends (i.e if a block has midplanes [0010,0013] 0013 is the start even though it is listed second in the hostlist).
-
Morris Jette authored
Add --test-only option to sbatch command to validate the script and options. The response includes expected start time and resources to be allocated. bug 550
-
- 19 Dec, 2013 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
It has been changed to improve the calculated value for pending jobs and use the actual node count value for jobs that have been started (including suspended, completed, etc.) bug 549
-
Nathan Yee authored
Purge vestigial accounts
-
- 18 Dec, 2013 4 commits
-
-
Morris Jette authored
This also removes the call to function X_AC_SLURM_SIMULATION as the logic is executed as part of X_AC_DEBUG. This builds on commit 67fb99a0
-
Alejandro Lucero authored
-
Danny Auble authored
Fix duplication of node name in error message.
-
Michael Jennings authored
-