- 02 Jul, 2003 2 commits
- 01 Jul, 2003 10 commits
-
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
the node is DOWN. This gets cleaned up when the KILL_JOB RPC gets issued.
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
job, step, node, and partition info. Add new function to free slurm_credential from RPC response.
-
Moe Jette authored
job, step, node, and partition info.
-
Mark Grondona authored
-
Mark Grondona authored
-
Moe Jette authored
-
- 30 Jun, 2003 4 commits
- 28 Jun, 2003 3 commits
-
-
Moe Jette authored
-
Moe Jette authored
-
Mark Grondona authored
-
- 27 Jun, 2003 5 commits
-
-
Mark Grondona authored
o session manager waits for processes in its session, not just children. This fixes a problem when tasks are being attached to by a debugger. o Add elanhosts.[ch] for Elan host config file parsing. Used in slurm to install ElanID/hostname pairs into the kernel. Note that the format of the elanhosts config file has changed.
-
Moe Jette authored
-
Mark Grondona authored
-
Moe Jette authored
but don't wait for any reply. We remove the DOWN node from the job's bitmap. As soon as the other nodes complete the KILL_JOB RPC, the job transistions to some COMPLETED state.
-
Moe Jette authored
-
- 26 Jun, 2003 1 commit
-
-
Moe Jette authored
-
- 25 Jun, 2003 2 commits
-
-
Moe Jette authored
If a node is down and not responding, don't bother to send a KILL_JOB RPC to it. If that is the only node associated with a job, don't have that job go through a COMPLETING state. It goes directly to a COMPLETED state. Also preserve the NO_RESPOND flag associated with a node if its state is changed via user request (e.g. scontrol).
-
Moe Jette authored
-
- 24 Jun, 2003 2 commits
- 23 Jun, 2003 4 commits
-
-
Moe Jette authored
or SlurmUser (which has not been identified at option parsing time anyway).
-
Moe Jette authored
out message was sent (e.g., slurmd down, msg sent to slurmd, slurmd up and registers, msg previously sent to slurmd times out).
-
Moe Jette authored
-
Mark Grondona authored
base 10 (slurm/197)
-
- 20 Jun, 2003 5 commits
-
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
o fail launch thread when job state is no longer "LAUNCHING"
-
Mark Grondona authored
-
Mark Grondona authored
-
- 18 Jun, 2003 1 commit
-
-
Moe Jette authored
rather than letting agent go off the end of an array.
-
- 17 Jun, 2003 1 commit
-
-
Mark Grondona authored
resulted in logfile messages going to a random fd, usually stderr of the job.
-