- 05 Sep, 2003 8 commits
-
-
Moe Jette authored
of socket communications. Previously was sometimes overwriting legitimate SLURM error code with fcntl error code of EINTR.
-
Moe Jette authored
sort of slurm error.
-
Moe Jette authored
-
Moe Jette authored
on a job kill. Let the KILL_JOB RPC do all of the cleanup. This removes a redundant RPC. - Moe
-
Moe Jette authored
-
Moe Jette authored
occur naturally if a srun, scontrol, scancel, sinfo, or squeue command is killed by the user with a communication to slurmctld in progress. This seems to occur fairly regularly as part of batch job termination.
-
Moe Jette authored
-
Moe Jette authored
send/receive, function (poll, timeout, send, recv, etc), and the error message are all reported.
-
- 04 Sep, 2003 3 commits
- 03 Sep, 2003 3 commits
-
-
Moe Jette authored
It was picking zero nodes and failing.
-
Moe Jette authored
-
Mark Grondona authored
problem when debugging remote tasks. (and error should have only printed once anyway)
-
- 02 Sep, 2003 1 commit
-
-
Mark Grondona authored
not SIGXCPU on reaching timelimit.
-
- 20 Aug, 2003 1 commit
-
-
jwindley authored
-
- 14 Aug, 2003 2 commits
- 13 Aug, 2003 13 commits
-
-
Mark Grondona authored
-
Moe Jette authored
to match that of a job's run time (TIME).
-
Mark Grondona authored
-
Moe Jette authored
hand if race condition starting all daemons).
-
Moe Jette authored
-
Mark Grondona authored
-
Moe Jette authored
-
Mark Grondona authored
o Export SLURM_NODELIST environment var in all cases
-
Mark Grondona authored
o Changed correspondingly calls to above o Removed unnecessary envc member of job structure, along with references to said member
-
Moe Jette authored
-
Moe Jette authored
-
Mark Grondona authored
is through with them. o added new function unsetenvp() to support above
-
Moe Jette authored
the old_job_allocation response so as to conform with job step's specification. For example, remove nodes that are excluded (-x), set the node count to that requested (-N, eliminate nodes as required), etc. This requires modifying the node list including the slurm_addr, and cpu_count arrays.
-
- 12 Aug, 2003 3 commits
- 08 Aug, 2003 2 commits
- 07 Aug, 2003 4 commits
-
-
Moe Jette authored
By default, logging is to syslog. Expanded usage description to be more complete.
-
Moe Jette authored
-
Moe Jette authored
without duplicates. For example, we want to convert a users request for nodes "lx1,lx2,lx1,lx3" to "lx[1-3]".
-
Moe Jette authored
entry counting function. As a job is completing the bitmap gets cleared as nodes complete, resulting in incorrect values.
-