Commit 0c596661 authored by Morris Jette's avatar Morris Jette
Browse files

node_features/knl_cray: add UME monitoring

Add logic to monitor Uncorrectable Memory Errors (UME) and notify
  active jobs in case they run for a while afterwards. This copies
  logic from knl_generic to knl_cray. There may be a different UME
  monitoring system for Cray systems in the future. The original
  knl_generic development is in commit 56ff27da
bug 3341
parent a01f1bbf
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment