- 29 Aug, 2008 5 commits
-
-
Moe Jette authored
https://computing.llnl.gov/linux/slurm/power_save.html or "man slurm.conf" (SuspendProgram and related parameters) for more information. This is the final installment of the work: update some documenation, increase default ResumeRate, and reduce frequency of retrying batch launch state check.
-
Moe Jette authored
-
Moe Jette authored
that it doesn't get set DOWN right away for not responding since the last time it was powered up.
-
Moe Jette authored
Don't allocate nodes to a job step until it is responding (as before) AND the node is no longer in power save mode.
-
Moe Jette authored
"man slurm.conf" (SuspendProgram and related parameters) for more information. NOTE: the step create logic needs to be modify to return EAGAIN or the like until the node's state NOT_RESPONDING flag gets cleared.
-
- 27 Aug, 2008 1 commit
-
-
- 18 Aug, 2008 2 commits
-
-
Moe Jette authored
longer needed.
-
-
- 14 Aug, 2008 2 commits
-
-
-
Moe Jette authored
-
- 12 Aug, 2008 4 commits
- 11 Aug, 2008 6 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Moe Jette authored
ping the node immediately to clear the NOT_RESPONDING flag.
-
Joseph P. Donaghy authored
-
Danny Auble authored
-
Joseph P. Donaghy authored
-
- 09 Aug, 2008 1 commit
-
-
Moe Jette authored
-
- 08 Aug, 2008 7 commits
-
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
expression rather than one line per node. Frequency of log messages is dependent upon SlurmctldDebug value from 300 seconds at SlurmctldDebug<=3 to 1 second at SlurmctldDebug>=5.
-
Moe Jette authored
-
Moe Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
- 07 Aug, 2008 9 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Joseph P. Donaghy authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Joseph P. Donaghy authored
-
Joseph P. Donaghy authored
-
- 06 Aug, 2008 3 commits