- 26 Nov, 2013 3 commits
-
-
Morris Jette authored
Logs errors related to apbasil use
-
Morris Jette authored
No change in logic, just move the logic that resets a batch job accounting information into its own function.
-
Morris Jette authored
-
- 25 Nov, 2013 5 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
jette authored
No change in underlying logic
-
jette authored
This fixes a problem where a job contains a license that is removed in a slurmctld reconfiguration. Without this change, the job would be left with a non-zero license_list pointer referencing memory that had been freed bug 527
-
jette authored
Increase the range of possible reservation time values to allow for a really long RPC delay (possibly due to slurmctld fail over from primary to backup controller). Also change to a #define value for clarity bug 527
-
- 24 Nov, 2013 3 commits
- 18 Nov, 2013 1 commit
-
-
Morris Jette authored
The time/resource allocation matrix is rebuilt on each job exit, which severely impacts performance at large counts of running jobs (say >10k jobs).
-
- 14 Nov, 2013 4 commits
-
-
Morris Jette authored
bug 511
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 13 Nov, 2013 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
This makes it simpler to enable detailed debugging for reservations. This includes more information than we probably want to see with the DebugFlag=reservation and would be only for developer debugging
-
Morris Jette authored
This might have worked fine for core reservations or when there are sufficient idle nodes to use, the the select_g_resv_test() function clears the node bitmap for nodes that it can not use and the reservation create logic did not restore that bitmap after a failed resource selection attempt. This logic restores the node bitmap on a failed call to select_g_resv_test() so we can add nodes to the bitmap of available nodes rather than having it repeatedly cleared. The logic also adds some performance enhancements that I will add to in the next commit.
-
Morris Jette authored
-
- 12 Nov, 2013 3 commits
-
-
Danny Auble authored
on a task level if any task hit it the check will be triggered)
-
Danny Auble authored
-
Danny Auble authored
use mem and memsw failcnt, check for existence Thanks Ryan. I'll let you know how it goes.
-
- 09 Nov, 2013 1 commit
-
-
Ryan Cox authored
-
- 08 Nov, 2013 6 commits
-
-
Ryan Cox authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
task/cgroup: notify if memory.memsw.failcnt > 0. indicates oom Thanks Ryan, this looks correct.
-
Ryan Cox authored
-
- 07 Nov, 2013 1 commit
-
-
Morris Jette authored
-
- 06 Nov, 2013 1 commit
-
-
Gennaro Oliva authored
-
- 05 Nov, 2013 1 commit
-
-
Morris Jette authored
Correction to hostlist parsing bug introduced in v2.6.4 for hostlists with more than one numeric range in brackets (e.g. rack[0-3]_blade[0-63]"). bug505
-
- 04 Nov, 2013 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Avoid decrementing grp_used_cpu_run_secs if we are not enforcing accounting limits. Without this change, the plugin will decrement times from zero cpu used times (i.e decrement an uninitialized variable).
-
David Bigagli authored
-
- 01 Nov, 2013 3 commits
-
-
Morris Jette authored
Add argument to priority plugin's priority_p_reconfig function to note when the association and QOS used_cpu_run_secs field has been reset. Without this flag, we remove time on "scontrol setdebug" or "scontrol setdebugflag" that can result in used_cpu_run_secs going negative or otherwise get bad values. Correction to logic added in commit 6d793189 bug 423
-
Morris Jette authored
-
Morris Jette authored
-