1. 22 Dec, 2014 5 commits
    • Morris Jette's avatar
      Prevent refernce of NULL pointer · f6a5cff9
      Morris Jette authored
      f6a5cff9
    • Morris Jette's avatar
      Merge branch 'slurm-14.11' · 1449c3f1
      Morris Jette authored
      1449c3f1
    • Rémi Palancher's avatar
      avoid delay on commit for PMI task at rank 0 · fcc11e22
      Rémi Palancher authored
      Intel MPI, on MPI jobs initialisation through PMI, uses to call PMI_KVS_Put()
      many many times from task at rank 0, and each on these call is followed by
      PMI_KVS_Commit(). Slurm implementation of PMI_KVS_Commit() imposes a delay
      to avoid DDOS on original srun. This delay is proportional to the total number.
      It could be up to 3 secs for large jobs for ex. with 7168 tasks. Therefore,
      when Intel MPI calls PMI_KVS_Commit() 475 times (mesured on a test case) from
      task at rank 0, 28 minutes are spent in delay function.
      All other tasks in the job are waiting for a PMI_Barrier. Therefore, there is
      no risk for a DDOS from this single task 0. The patch alters the delaying time
      calculation to make sure task at rank 0 will does not be delayed. All other
      tasks are globally spreaded in the same time range as before.
      fcc11e22
    • Morris Jette's avatar
      Fix to set more job env vars · c2b6d81f
      Morris Jette authored
      This moves a bzero() call checked in with commit 30e45f8a
      I also noticed that test1.14 was generating errors like this
      "srun: error: cpus_per_node array is not set"
      This was due to previously uninitialized variables now being
      cleared by bzero (i.e. the old data was garbage, but avoided
      the error message). The properly cleared variables were introduced
      in commit 0252a63e
      bug 1306
      c2b6d81f
    • Morris Jette's avatar
      Fix to set more job env vars · 30e45f8a
      Morris Jette authored
      This is a correction to commit 0252a63e
      Previous logic failed to populate data structure as used in another RPC
      bug 1306
      30e45f8a
  2. 20 Dec, 2014 8 commits
  3. 19 Dec, 2014 22 commits
  4. 18 Dec, 2014 2 commits
    • Morris Jette's avatar
      burst_buffer/cray enhancements · e44bdb70
      Morris Jette authored
      Correct arguments to spawned programs
      Add asynchronous stage-in logic
      Set up bbs_pre_run call logic
      Fix locking deadlock in _teardown logic
      Capture stdout and stderr from spawned processes
      e44bdb70
    • Morris Jette's avatar
      clean up sacct test · 36b5ef81
      Morris Jette authored
      Allignment was bad and there was redundant test logic (duplicate errors)
      36b5ef81
  5. 17 Dec, 2014 3 commits