1. 11 Sep, 2015 1 commit
    • Morris Jette's avatar
      handle job kill while step prolog running · bda0a436
      Morris Jette authored
      This prevents a step from being launched if the job is killed
      while the prolog is running. Reproducing the original failure
      requires use of srun to trigger the prolog and using scancel
      while that prolog is running.
      bug 1755
      bda0a436
  2. 17 Mar, 2015 1 commit
  3. 11 Mar, 2015 3 commits
  4. 05 Feb, 2015 3 commits
  5. 30 Jan, 2015 1 commit
  6. 21 Jan, 2015 1 commit
  7. 07 Jan, 2015 5 commits
    • David Bigagli's avatar
      Merge pull request #93 from aaronknister/slurm-14.03-nccs · 2fadf134
      David Bigagli authored
      Slurm 14.03 nccs
      2fadf134
    • Aaron Knister's avatar
      Add pbs parser fix to NEWS · 729a58ac
      Aaron Knister authored
      729a58ac
    • Rémi Palancher's avatar
      avoid delay on commit for PMI task at rank 0 · bb6656dc
      Rémi Palancher authored
      Intel MPI, on MPI jobs initialisation through PMI, uses to call PMI_KVS_Put()
      many many times from task at rank 0, and each on these call is followed by
      PMI_KVS_Commit(). Slurm implementation of PMI_KVS_Commit() imposes a delay
      to avoid DDOS on original srun. This delay is proportional to the total number.
      It could be up to 3 secs for large jobs for ex. with 7168 tasks. Therefore,
      when Intel MPI calls PMI_KVS_Commit() 475 times (mesured on a test case) from
      task at rank 0, 28 minutes are spent in delay function.
      All other tasks in the job are waiting for a PMI_Barrier. Therefore, there is
      no risk for a DDOS from this single task 0. The patch alters the delaying time
      calculation to make sure task at rank 0 will does not be delayed. All other
      tasks are globally spreaded in the same time range as before.
      bb6656dc
    • Aaron Knister's avatar
      Add PMI2 fix to NEWS · 84d61f94
      Aaron Knister authored
      84d61f94
    • Artem Polyakov's avatar
      PMI2 race condition fix. · 1c71199b
      Artem Polyakov authored
      1c71199b
  8. 05 Jan, 2015 1 commit
  9. 20 Dec, 2014 1 commit
  10. 19 Dec, 2014 1 commit
  11. 12 Dec, 2014 2 commits
  12. 11 Dec, 2014 3 commits
  13. 08 Dec, 2014 1 commit
  14. 05 Dec, 2014 1 commit
  15. 04 Dec, 2014 3 commits
  16. 03 Dec, 2014 2 commits
  17. 02 Dec, 2014 3 commits
  18. 01 Dec, 2014 1 commit
  19. 24 Nov, 2014 1 commit
  20. 21 Nov, 2014 3 commits
  21. 20 Nov, 2014 2 commits