Commit 3f4b2d51 authored by Morris Jette's avatar Morris Jette
Browse files

Check node name on batch job completion

Add a check to make sure that the job completion RPC from a
slurmstepd match that node that the batch job is running on.
This would not be the case of for a job started on a node
if that node's slurmd fails, but the slurmstepd keeps running.
The job could then be requeued and generate a completion RPC
from both slurmstepd daemons (one per node). This logic will
ignore the job complete RPC from the node NOT currently
running the batch job.
parent 18237e97
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment