1. 30 Oct, 2017 1 commit
  2. 28 Oct, 2017 1 commit
    • Morris Jette's avatar
      CRAY - Fix abort · b319d5b1
      Morris Jette authored
      If configured with NodeFeatures=knl_cray and there are non-KNL
      nodes which include no features the slurmctld will abort without
      this patch when attempting strtok_r(NULL).
      
      bug 4294
      b319d5b1
  3. 27 Oct, 2017 1 commit
  4. 25 Oct, 2017 2 commits
    • Danny Auble's avatar
      Fix layouts code to only allow setting a boolean. · b9273782
      Danny Auble authored
      Before it would allow all sorts of things like
      adding/subtracting/multiplying/etc.  It would cause warnings such as
      
      /home/bart/slurm-tmp5/src/common/layouts_mgr.c: In function ‘_layouts_load_automerge’:
      /home/bart/slurm-tmp5/src/common/layouts_mgr.c:363:21: error: ‘*’ in boolean context, suggest ‘&&’ instead [-Werror=int-in-bool-context]
         *lvalue = *lvalue * *rvalue;     \
                   ~~~~~~~~^~~~
      /home/bart/slurm-tmp5/src/common/layouts_mgr.c:1034:4: note: in expansion of macro ‘_entity_update_kv_helper’
          _entity_update_kv_helper(type_t, operator); \
          ^~~~~~~~~~~~~~~~~~~~~~~~
      /home/bart/slurm-tmp5/src/common/layouts_mgr.c:1086:4: note: in expansion of macro ‘_layouts_load_merge’
          _layouts_load_merge(bool, s_p_get_boolean);
      [tag] [reply] [−] Private Comment 16
      
      Bug 4062
      b9273782
    • Felip Moll's avatar
      Work around issue with sysmacros.h and gcc7 / glibc 2.25. · 8706f388
      Felip Moll authored
      Setting -Werror tricks the test into failing on the
      
      "error: In the GNU C Library, "major" is defined by <sys/sysmacros.h>.
       For historical compatibility, it is  currently defined by <sys/types.h>
       as well, but we plan to remove this soon. To use "major", include
       <sys/sysmacros.h> directly. If you did not intend to use a system-defined
       macro "major", you should undefine it after including <sys/types.h>."
      
      error. Since the normal Slurm build uses -Werror, this warning on including
      both headers will then cause the build itself to fail.
      
      Bug 3982.
      8706f388
  5. 24 Oct, 2017 6 commits
  6. 19 Oct, 2017 5 commits
  7. 18 Oct, 2017 1 commit
  8. 14 Oct, 2017 1 commit
  9. 13 Oct, 2017 5 commits
  10. 10 Oct, 2017 4 commits
  11. 05 Oct, 2017 1 commit
    • Brian Christiansen's avatar
      Show correct MaxTRESPerNode limit assoc reasons · 6e806f2d
      Brian Christiansen authored
      Before:
      $ sbatch --wrap="sleep 300"
      Submitted batch job 228
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     228     debug     wrap    brian PD       0:00      1 (AssocMaxUnknownPerNode)
      
      Fixed:
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     229     debug     wrap    brian PD       0:00      1 (AssocMaxCpuPerNode)
      
      $ sacctmgr mod account stuff set maxtrespernode=cpu=-1,mem=1
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     229     debug     wrap    brian PD       0:00      1 (AssocMaxMemPerNode)
      
      $ sbatch --wrap="sleep 300" --gres=blah:2 -pgpu
      Submitted batch job 235
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     235       gpu     wrap    brian PD       0:00      1 (AssocMaxGRESPerNode)
      6e806f2d
  12. 04 Oct, 2017 1 commit
    • Morris Jette's avatar
      burst_buffer/cray plugin updated for Cray UP06 sofware · 859f6c82
      Morris Jette authored
      burst_buffer/cray plugin modified to work with changes in Cray UP06
         software release.
      Specific changes: Cray software now returns an error if a state_in
         or stage_out script is processed that doesn't actually request a
         stage in or out (previously silently ignored).
      Also the warning message about tearing down a buffer that is already
         gone changed.
      859f6c82
  13. 02 Oct, 2017 2 commits
  14. 29 Sep, 2017 2 commits
  15. 27 Sep, 2017 2 commits
  16. 19 Sep, 2017 3 commits
  17. 14 Sep, 2017 1 commit
    • Tim Wickberg's avatar
      Prevent a second PMI2_Init call from leaving a hung slurmstepd process. · b2aa25d5
      Tim Wickberg authored
      A second PMI2_Init() within the same step is invalid, and cannot succeed.
      
      Return an error code back to the client end, and close the fd to force the
      step to terminate immediately.
      
      Due to a bug in our libpmi code, just returning a cmd=response_to_init with
      an appropriate rc number will not tear down the connection properly, so
      send back something else that will trigger the error path.
      
      Bug 3520.
      b2aa25d5
  18. 13 Sep, 2017 1 commit