Commit 38089f2b authored by Morris Jette's avatar Morris Jette
Browse files

Fix srun/sattach race condtion

The old logic would result in test16.4 failing some of the time.
  The failure was caused by the sattach command attaching to a
  job step before the original srun command received a
  RESPONSE_LAUNCH_TASKS message. That messsage  would then be sent
  to the salloc command. Since srun never got the message, it
  would hang. This change does not mark the job step as RUNNING
  until after the original srun gets sent the RESPONSE_LAUNCH_TASKS
  message and sattach requests are blocked until that time.
parent a82a70dc
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment