Handle a newer major-version slurmctld communicating with an older major-version slurmd.
If any communication error occurs, the node is now marked unresponsive. Once the slurmd registers with the current, newer version, that state is cleared and the node returns to normal. Previously, if this state persisted for too long, the node would be marked down and its running jobs would be killed.