This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
library:computing:xios_impi_troubles [2026/05/15 06:43] 84.88.52.107 old revision restored (2025/11/08 08:27) |
library:computing:xios_impi_troubles [2026/06/15 22:20] (current) 84.88.52.107 old revision restored (2026/05/11 00:51) |
||
|---|---|---|---|
| Line 70: | Line 70: | ||
| ===== NEMO-XIOS @ MN4 ===== | ===== NEMO-XIOS @ MN4 ===== | ||
| - | ==== Issue 1: ==== | + | ==== Issue 1: NEMO fails to read input files ==== |
| **Environment: | **Environment: | ||
| Line 126: | Line 126: | ||
| </ | </ | ||
| - | **Actions taken:** Operations had observed this error using NEMO standalone and NeTCDF | + | **Actions taken:** Operations had observed this error using NEMO standalone and NetCDF |
| - | **Diagnosis: | + | After comparing the NetCDF 4.4.0 and NetCDF 4.2 configurations (using the nc-config and nf-config commands), we found that NetCDF 4.4.0 was compiled without support for nc4 or P-NetCDF (a library that provides parallel I/O support for classic NetCDF files), while the other NetCDF build supported these features. Operations then recompiled __NetCDF without linking P-NetCDF__, and this seemed to fix the problem. |
| - | **Solution: | + | To learn more about the source of this bug, we __compared the behavior of two NEMO executables__, |
| + | |||
| + | We __compared the functions included in both binaries__ with the nm command and found that __they were identical__. We then did a __more in-depth comparison of both binaries__ with objdump and found small differences, | ||
| + | |||
| + | **Diagnosis: | ||
| + | |||
| + | **Solution: | ||
| **More information: | **More information: | ||
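The two comparisons described under Issue 1 (library configurations via nc-config and nf-config, binary symbols via nm) both come down to diffing two text reports. The following is a minimal sketch of that idea, not the exact procedure that was run on MN4: the helper name `diff_reports`, the file names and the installation paths are all hypothetical placeholders.

```shell
# Hypothetical helper: print the lines that differ between two saved reports,
# ignoring line order. Returns diff's own status: 0 if equal, 1 if different.
diff_reports() {
    a_sorted=$(mktemp) && b_sorted=$(mktemp) || return 2
    sort "$1" > "$a_sorted"
    sort "$2" > "$b_sorted"
    diff "$a_sorted" "$b_sorted"
    status=$?
    rm -f "$a_sorted" "$b_sorted"
    return $status
}

# Usage sketch (placeholder paths, assuming each NetCDF build has its own prefix):
#   /path/to/netcdf-A/bin/nc-config --all > netcdf_a.txt
#   /path/to/netcdf-B/bin/nc-config --all > netcdf_b.txt
#   diff_reports netcdf_a.txt netcdf_b.txt
#
# The same helper can compare symbol names of two executables:
#   nm nemo_a.exe | awk '{print $NF}' > symbols_a.txt
#   nm nemo_b.exe | awk '{print $NF}' > symbols_b.txt
#   diff_reports symbols_a.txt symbols_b.txt
```

Sorting before diffing keeps ordering differences from showing up as changes. Keeping only the last field of the nm output drops the addresses, which can differ between builds even when the set of symbols is the same.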
| Line 198: | Line 204: | ||
| ==== Issue 3: ==== | ==== Issue 3: ==== | ||
| - | **Environment: | + | **Environment: |
| - | **Problem:** | + | |
| + | | ||
| + | | ||
| + | | ||
| + | * Flags: -O0 | ||
| - | **Actions taken:** | + | **Problem: |
| + | |||
| + | __ocean.output__: | ||
| + | |||
| + | |||
| + | __log.err__: | ||
| + | |||
| + | < | ||
| + | s11r1b56.58976Assertion failure at / | ||
| + | forrtl: error (76): Abort trap signal | ||
| + | Image PC Routine | ||
| + | nemo.exe | ||
| + | libpthread-2.22.s | ||
| + | libc-2.22.so | ||
| + | libc-2.22.so | ||
| + | libpsm2.so.2.1 | ||
| + | libpsm2.so.2.1 | ||
| + | libpsm2.so.2.1 | ||
| + | libpsm2.so.2.1 | ||
| + | libpsm2.so.2.1 | ||
| + | libpsm2.so.2.1 | ||
| + | libpsm2.so.2.1 | ||
| + | libtmip_psm2.so.1 | ||
| + | libmpi.so.12.0 | ||
| + | libmpi.so.12.0 | ||
| + | libmpi.so.12 | ||
| + | libmpi.so.12 | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | nemo.exe | ||
| + | libc-2.22.so | ||
| + | nemo.exe | ||
| + | </ | ||
| + | |||
| + | |||
| + | **Actions taken: | ||
| **Diagnosis: | **Diagnosis: | ||
| Line 208: | Line 261: | ||
| **Solution: | **Solution: | ||
| + | About Intel Communication Fabrics control: | ||
| + | |||
| + | [[https:// | ||
| ==== Issue 4: ==== | ==== Issue 4: ==== | ||
| - | **Environment: | + | **Environment: |
| + | |||
| + | * Compiler: Intel 2017.4 | ||
| + | * MPI: Intel 2017.3.196 | ||
| + | * NetCDF: 4.4.0 | ||
| + | * HDF5: 1.8.19 | ||
| + | * Flags: -O0 & -O3 | ||
| - | **Problem: | + | **Problem: |
| **Actions taken:** | **Actions taken:** | ||