Add socket connect retry logic in case slurmd is down
Modify sbast logic to continue when slurmd daemon restarts Previously a file transmission in progress would be aborted when any of the slurmd daemons restarted. Now it reconnects, revalidates the credential, and resumes file transmission.
Please register or sign in to comment