1. 21 Jan, 2015 1 commit
  2. 07 Jan, 2015 3 commits
    • Aaron Knister's avatar
      Add pbs parser fix to NEWS · 729a58ac
      Aaron Knister authored
      729a58ac
    • Rémi Palancher's avatar
      avoid delay on commit for PMI task at rank 0 · bb6656dc
      Rémi Palancher authored
      Intel MPI, on MPI jobs initialisation through PMI, uses to call PMI_KVS_Put()
      many many times from task at rank 0, and each on these call is followed by
      PMI_KVS_Commit(). Slurm implementation of PMI_KVS_Commit() imposes a delay
      to avoid DDOS on original srun. This delay is proportional to the total number.
      It could be up to 3 secs for large jobs for ex. with 7168 tasks. Therefore,
      when Intel MPI calls PMI_KVS_Commit() 475 times (mesured on a test case) from
      task at rank 0, 28 minutes are spent in delay function.
      All other tasks in the job are waiting for a PMI_Barrier. Therefore, there is
      no risk for a DDOS from this single task 0. The patch alters the delaying time
      calculation to make sure task at rank 0 will does not be delayed. All other
      tasks are globally spreaded in the same time range as before.
      bb6656dc
    • Aaron Knister's avatar
      Add PMI2 fix to NEWS · 84d61f94
      Aaron Knister authored
      84d61f94
  3. 19 Dec, 2014 1 commit
  4. 12 Dec, 2014 1 commit
  5. 11 Dec, 2014 2 commits
  6. 08 Dec, 2014 1 commit
  7. 05 Dec, 2014 1 commit
  8. 04 Dec, 2014 3 commits
  9. 03 Dec, 2014 1 commit
    • Morris Jette's avatar
      Do not treat lack of mpi_fini as error · 6ee84e2d
      Morris Jette authored
      Log Cray MPI job calling exit() without mpi_fini(), but do not treat it as
      a fatal error. This partially reverts logic added in version 14.03.9.
      bug 1171
      6ee84e2d
  10. 02 Dec, 2014 3 commits
  11. 24 Nov, 2014 1 commit
  12. 21 Nov, 2014 2 commits
  13. 13 Nov, 2014 2 commits
  14. 12 Nov, 2014 2 commits
  15. 10 Nov, 2014 1 commit
    • Danny Auble's avatar
      Fix issue where exclusive allocations wouldn't lay tasks out correctly · 7461c119
      Danny Auble authored
      with CR_PACK_NODES.
      
      Really do commit d388dd67 a different way to get the same info and
      be able to lay out tasks correctly when --hint=nomultithread.
      
      tests on a 4 core 8 thread system are
      srun -n6 --hint=nomultithread --exclusive whereami | sort -h
      srun: cpu count 6
         0 snowflake0 - MASK:0x1
         1 snowflake0 - MASK:0x2
         2 snowflake0 - MASK:0x4
         3 snowflake0 - MASK:0x8
         4 snowflake1 - MASK:0x1
         5 snowflake1 - MASK:0x2
      
      and
      
      srun -n10 -N5 --hint=nomultithread --exclusive whereami | sort -h
      srun: cpu count 10
         0 snowflake0 - MASK:0x1
         1 snowflake0 - MASK:0x2
         2 snowflake0 - MASK:0x4
         3 snowflake0 - MASK:0x8
         4 snowflake1 - MASK:0x1
         5 snowflake1 - MASK:0x2
         6 snowflake1 - MASK:0x4
         7 snowflake2 - MASK:0x1
         8 snowflake3 - MASK:0x1
         9 snowflake4 - MASK:0x1
      7461c119
  16. 07 Nov, 2014 2 commits
  17. 06 Nov, 2014 4 commits
  18. 05 Nov, 2014 1 commit
  19. 04 Nov, 2014 2 commits
  20. 31 Oct, 2014 4 commits
  21. 30 Oct, 2014 1 commit
  22. 27 Oct, 2014 1 commit