1. 13 Dec, 2016 1 commit
    • Tim Wickberg's avatar
      Do not attempt to lookup task program in slurmd. · e3ca013b
      Tim Wickberg authored
      Reverts most of commit 84023f27.
      
      Searching the PATH in slurmd can fail due to root_squash'd NFS
      filesystems, leading to the "wrong" program being launched.
      
      If you'd like the performance benefit from avoiding this lookup
      during each separate task launch, set SLURM_TEST_EXEC=1 instead
      which will perform the lookup once within srun, which then ensures
      the lookup happens under the users own environment and not that
      of the slurmd.
      
      Bug 2992.
      e3ca013b
  2. 09 Dec, 2016 2 commits
  3. 08 Dec, 2016 6 commits
    • Danny Auble's avatar
      add 16.05.8 for next release to NEWS · cb22a0c3
      Danny Auble authored
      cb22a0c3
    • Tim Wickberg's avatar
      Fix race condition with getgrouplist(). · 8cb636dd
      Tim Wickberg authored
      If the second call to getgrouplist() found additional groups,
      ngroups will be overwritten with this new larger value, while
      the gids list would be truncated. (ngroups is a value-result arg.)
      This will then lead to _gids_cache_lookup() returning the wrong
      number of groups including invalid parts of memory, which are likely
      to include some zeros.
      
      Those zeros could then make it to the setgroups() call and thus
      give the user access to the root group. Especially as setgroups
      will succeed as long as the array does not contain -1 as a gid.
      
      Bug 3320.
      8cb636dd
    • Tim Wickberg's avatar
      Fix NEWS line from 1ccf8a72. · b0838df0
      Tim Wickberg authored
      b0838df0
    • Danny Auble's avatar
    • Morris Jette's avatar
      Change task/cgroup error message · e6ef1f0c
      Morris Jette authored
      task/cgroup - Change error message if CPU binding can not take place to
          better identify the root cause of the problem. Specifically, if
          the hwloc_get_obj_below_by_type() function call completely fails
          that is likely due to task/affinity not being configured, so
          cpusets are not configured. Previous message was
          "task/cgroup: task[%u] infinite loop broken while trying to provision compute elements using %s (bitmap:%s)"
          The new message is
          "task/cgroup: hwloc_get_obj_below_by_type() failing, task/affinity plugin also required"
      e6ef1f0c
    • Dominik Bartkiewicz's avatar
      Fix printf format specified in elasticsearch plugin. %u not %hu. · fee2645d
      Dominik Bartkiewicz authored
      uint32_t needs %u on 32-bit platforms. Noticed by clang/travisci.
      fee2645d
  4. 07 Dec, 2016 2 commits
  5. 06 Dec, 2016 7 commits
  6. 05 Dec, 2016 2 commits
  7. 02 Dec, 2016 4 commits
  8. 01 Dec, 2016 2 commits
  9. 30 Nov, 2016 2 commits
    • Morris Jette's avatar
      cray/burst_buffer - Increase timer · b4763c75
      Morris Jette authored
      cray/burst_buffer - Increase time to synchronize operations between threads
          from 5 to 60 seconds ("setup" operation time observed over 17 seconds).
          This should fix a race condition between a thread performing a buffer
          creation (setup) and a thread looking for unexpected buffers. If a
          buffer is found during the time window allowed for creation, it's
          space will be counted twice. First by the status checking thread
          and second by the thread doing the creation. The deallocation only
          happens once, so the used space information can be left with an
          invalid value.
      bug 3295
      b4763c75
    • Tim Wickberg's avatar
      sbcast - prevent segfault in slurmd from multiple zlib compressed transfers · 8c5765c9
      Tim Wickberg authored
      static variable means multiple active decompression streams will corrupt
      zlib's internal state, which can lead to a segfault.
      
      Bug 3299.
      8c5765c9
  10. 29 Nov, 2016 3 commits
  11. 28 Nov, 2016 5 commits
  12. 22 Nov, 2016 4 commits