Commit 8405b4eb authored by Morris Jette's avatar Morris Jette
Browse files

Add timeout on srun's I/O connect message to better handle some failure modes

If the slurmstepd connects task I/O, but aborts after srun accepts the connect
and before slurmstepd writes data then srun could possibly hand indefinitely.
This probably does not explain failures seen at CEA, but can't hurt matters.
then the sr
parent c25595ff
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment