- 27 Jun, 2003 3 commits
-
-
Mark Grondona authored
-
Moe Jette authored
but don't wait for any reply. We remove the DOWN node from the job's bitmap. As soon as the other nodes complete the KILL_JOB RPC, the job transistions to some COMPLETED state.
-
Moe Jette authored
-
- 26 Jun, 2003 1 commit
-
-
Moe Jette authored
-
- 25 Jun, 2003 2 commits
-
-
Moe Jette authored
If a node is down and not responding, don't bother to send a KILL_JOB RPC to it. If that is the only node associated with a job, don't have that job go through a COMPLETING state. It goes directly to a COMPLETED state. Also preserve the NO_RESPOND flag associated with a node if its state is changed via user request (e.g. scontrol).
-
Moe Jette authored
-
- 24 Jun, 2003 2 commits
- 23 Jun, 2003 4 commits
-
-
Moe Jette authored
or SlurmUser (which has not been identified at option parsing time anyway).
-
Moe Jette authored
out message was sent (e.g., slurmd down, msg sent to slurmd, slurmd up and registers, msg previously sent to slurmd times out).
-
Moe Jette authored
-
Mark Grondona authored
base 10 (slurm/197)
-
- 20 Jun, 2003 5 commits
-
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
o fail launch thread when job state is no longer "LAUNCHING"
-
Mark Grondona authored
-
Mark Grondona authored
-
- 18 Jun, 2003 1 commit
-
-
Moe Jette authored
rather than letting agent go off the end of an array.
-
- 17 Jun, 2003 1 commit
-
-
Mark Grondona authored
resulted in logfile messages going to a random fd, usually stderr of the job.
-
- 16 Jun, 2003 5 commits
- 14 Jun, 2003 3 commits
-
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
condition.
-
- 13 Jun, 2003 10 commits
-
-
Moe Jette authored
-
Mark Grondona authored
-
Mark Grondona authored
where signal is recieved after having allocated nodes but before real signal handlers are installed.
-
Mark Grondona authored
to close, do not wait for job indefinitely.
-
Mark Grondona authored
string (this is not needed).
-
Mark Grondona authored
failed (presumably due to unkillable processes) o retry failed JOB_KILL rpcs
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
rpc.
-
- 12 Jun, 2003 3 commits
-
-
Moe Jette authored
-
Mark Grondona authored
-
Moe Jette authored
-