1. 22 Dec, 2014 1 commit
    • avoid delay on commit for PMI task at rank 0 · fcc11e22
      Rémi Palancher authored
      Intel MPI, when initialising MPI jobs through PMI, calls PMI_KVS_Put()
      many times from the task at rank 0, and each of these calls is followed by
      PMI_KVS_Commit(). The Slurm implementation of PMI_KVS_Commit() imposes a delay
      to avoid a DDoS on the original srun. This delay is proportional to the total
      number of tasks. It can be up to 3 secs for large jobs, e.g. with 7168 tasks.
      Therefore, when Intel MPI calls PMI_KVS_Commit() 475 times (measured on a test
      case) from the task at rank 0, 28 minutes are spent in the delay function.
      All other tasks in the job are waiting for a PMI_Barrier. Therefore, there is
      no risk of a DDoS from this single task 0. The patch alters the delay time
      calculation to make sure the task at rank 0 is not delayed. All other
      tasks are globally spread over the same time range as before.
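      The behaviour described in this commit message can be illustrated with a
      small toy model. This is only a sketch under stated assumptions, not Slurm's
      actual code: the function names, the per_task_seconds parameter and the
      linear spreading rule are all hypothetical, chosen so that the window grows
      with the total number of tasks and rank 0 gets no delay.

```python
def commit_delay_seconds(rank, size, per_task_seconds):
    """Toy model of the patched delay; NOT Slurm's actual implementation.

    per_task_seconds is a hypothetical tuning value: the spreading window
    is assumed to be size * per_task_seconds.
    """
    if size <= 0 or not 0 <= rank < size:
        raise ValueError("rank must satisfy 0 <= rank < size")
    # The window is proportional to the total number of tasks.
    window = size * per_task_seconds
    if rank == 0:
        # Rank 0 is never delayed: the other tasks wait in the barrier,
        # so rank 0 alone cannot flood srun.
        return 0.0
    # Assumed linear spread of the remaining ranks over the same window.
    return window * rank / size


def total_delay_minutes(commits, delay_seconds):
    """Total time spent sleeping for a number of commits, in minutes."""
    return commits * delay_seconds / 60.0


if __name__ == "__main__":
    size = 7168
    per_task = 3.0 / size  # assumed so that the window is about 3 s
    print(commit_delay_seconds(0, size, per_task))
    print(commit_delay_seconds(size - 1, size, per_task))
    print(total_delay_minutes(475, 3.0))
```

      With a flat 3 s per call, 475 commits cost 23.75 minutes in this model. The
      28 minutes quoted above corresponds to roughly 3.5 s per call.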
  2. 20 Dec, 2014 1 commit
  3. 19 Dec, 2014 3 commits
  4. 17 Dec, 2014 2 commits
  5. 16 Dec, 2014 2 commits
  6. 12 Dec, 2014 3 commits
    • Prevent vestigial job array record · 42d75a09
      Morris Jette authored
      If a master job array record is complete, then consider all pending
      tasks as also complete. This problem happens when a master job array
      record is pending (has pending tasks) and is cancelled. The result
      previously was a job record not visible to squeue/scontrol, but occupying
      memory.
      The same type of problem happened with respect to a dependency on a job
      array which was cancelled.
    • Update news for next tag · e437094d
      Danny Auble authored
    • Update news for potential next tag · f84b4724
      Danny Auble authored
  7. 11 Dec, 2014 6 commits
  8. 09 Dec, 2014 2 commits
  9. 08 Dec, 2014 4 commits
  10. 05 Dec, 2014 4 commits
  11. 04 Dec, 2014 6 commits
  12. 03 Dec, 2014 3 commits
  13. 02 Dec, 2014 3 commits