If a node is DOWN and a job is to complete on it, send the KILL_JOB RPC,
but don't wait for any reply. We remove the DOWN node from the job's bitmap. As soon as the other nodes complete the KILL_JOB RPC, the job transistions to some COMPLETED state.
Please register or sign in to comment