This shows you the differences between two versions of the page.
library:computing:xios_impi_troubles [2017/08/04 15:13] 84.88.184.232 [Issue 2: XIOS crashes when writing model output] |
library:computing:xios_impi_troubles [2024/06/24 18:04] 84.88.52.107 old revision restored (2024/06/02 17:07) |
||
---|---|---|---|
Line 196: | Line 196: | ||
- | **Actions taken:** Given that the error was a floating invalid, we disabled the -fpe0 flag, but the problem persisted. We then disabled compiler optimizations (using -O0) and the problem disappeared, | + | **Actions taken:** Given that the error was a floating invalid, we disabled the -fpe0 flag, but the problem persisted. We then disabled compiler optimizations (using -O0) and the problem disappeared, |
- | **Diagnosis: | + | **Diagnosis: |
- | **Solution: | + | **Solution: |
- | ==== Issue 3: MPI kills XIOS when writing model output | + | ==== Issue 3: ==== |
**Environment: | **Environment: | ||
Line 212: | Line 212: | ||
* Flags: -O0 | * Flags: -O0 | ||
- | **Problem: | + | **Problem: |
__ocean.output__: | __ocean.output__: | ||
Line 257: | Line 257: | ||
**Actions taken:** A similar error was observed with NEMO standalone v3.6r6499. In that case, Ops told us to use the //fabric// module, which selects //ofi// as the internode fabric, similar to the solution used on MN3 (see above). Using this module solved the problem for NEMO standalone, although it had the side effect that jobs never finished. In coupled EC-Earth this module produced a deadlock, discussed below. | **Actions taken:** A similar error was observed with NEMO standalone v3.6r6499. In that case, Ops told us to use the //fabric// module, which selects //ofi// as the internode fabric, similar to the solution used on MN3 (see above). Using this module solved the problem for NEMO standalone, although it had the side effect that jobs never finished. In coupled EC-Earth this module produced a deadlock, discussed below. | ||
- | We tried an alternative solution, which was to __increment | + | We tried an alternative solution, which was to increment |
**Diagnosis: | **Diagnosis: |