User Tools

Site Tools


library:computing:xios_impi_troubles

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
library:computing:xios_impi_troubles [2025/12/30 11:10]
84.88.52.107 old revision restored (2025/12/05 11:11)
library:computing:xios_impi_troubles [2026/01/13 21:01] (current)
84.88.52.107 old revision restored (2026/01/01 12:01)
Line 196: Line 196:
  
  
-**Actions taken:** Given that the error is a floating invalid we disabled the -fpe0 flag, but we still were having the same problem. Then we disabled compiler optimizations (use -O0) and the problem disappeared, but this obviously has an effect on performance. At second instance we activated -O2 optimizations and the model runs, so the performance loss is not that important as it would be running with -O0.+**Actions taken:** Given that the error is a floating invalid we disabled the -fpe0 flag, but we still were having the same problem. Then we disabled compiler optimizations (use -O0) and the problem disappeared, but this obviously has an effect on performance.
  
-**Diagnosis:** +**Diagnosis:**
  
-**Solution:** Use -O2 flag (instead of -O2).+**Solution:** Disabling compiler optimizations (activate -O0).
  
 ==== Issue 3: MPI kills XIOS when writing model output ==== ==== Issue 3: MPI kills XIOS when writing model output ====
Line 270: Line 270:
  
 [[https://github.com/01org/psm/blob/master/ptl_ips/ips_proto.c]] [[https://github.com/01org/psm/blob/master/ptl_ips/ips_proto.c]]
-==== Issue 4: EC-Earth enters in a dead lock when using fabric (OFA network fabrics) module ====+==== Issue 4: ====
  
 **Environment:** Auto-EC-Earth 3.2.2_develop_MN4 (EC-Earth 3.2 r4063-runtime-unification).  **Environment:** Auto-EC-Earth 3.2.2_develop_MN4 (EC-Earth 3.2 r4063-runtime-unification). 
Line 280: Line 280:
   * Flags: -O0 & -O3   * Flags: -O0 & -O3
  
-**Problem:** When loading module fabric, created by Ops to solve Issue 2 on NEMO (assertion invalid), EC-Earth enters in a deadlock. Our NEMO benchmark was rather running, but MPI_Finalize was not working and jobs never finished until wallclock time limit was reached. +**Problem:** 
  
 **Actions taken:** **Actions taken:**
library/computing/xios_impi_troubles.1767093038.txt.gz ยท Last modified: 2025/12/30 11:10 by 84.88.52.107