1. 24 Oct, 2017 3 commits
  2. 19 Oct, 2017 5 commits
  3. 18 Oct, 2017 1 commit
  4. 14 Oct, 2017 1 commit
  5. 13 Oct, 2017 5 commits
  6. 10 Oct, 2017 4 commits
  7. 05 Oct, 2017 1 commit
    • Show correct MaxTRESPerNode limit assoc reasons · 6e806f2d
      Brian Christiansen authored
      Before:
      $ sbatch --wrap="sleep 300"
      Submitted batch job 228
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     228     debug     wrap    brian PD       0:00      1 (AssocMaxUnknownPerNode)
      
      Fixed:
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     229     debug     wrap    brian PD       0:00      1 (AssocMaxCpuPerNode)
      
      $ sacctmgr mod account stuff set maxtrespernode=cpu=-1,mem=1
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     229     debug     wrap    brian PD       0:00      1 (AssocMaxMemPerNode)
      
      $ sbatch --wrap="sleep 300" --gres=blah:2 -pgpu
      Submitted batch job 235
      $ squeue
                   JOBID PARTITION     NAME     USER ST       TIME   CPUS NODELIST(REASON)
                     235       gpu     wrap    brian PD       0:00      1 (AssocMaxGRESPerNode)
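The squeue output above suggests that the pending reason is now chosen per TRES type rather than falling through to a generic "Unknown" value. A minimal sketch of that kind of lookup follows. It is not the Slurm implementation: the function name, the dictionary, and the "gres/<name>" input form are assumptions made for illustration, and only the four reason strings are taken from the commit message.

```python
# Hypothetical helper: map the TRES type behind a MaxTRESPerNode limit
# to the pending-reason string that squeue would display.
# Only the reason strings come from the commit message; the keys and
# the function name are illustrative assumptions.
REASON_BY_TRES = {
    "cpu": "AssocMaxCpuPerNode",
    "mem": "AssocMaxMemPerNode",
    "gres": "AssocMaxGRESPerNode",
}
FALLBACK_REASON = "AssocMaxUnknownPerNode"


def assoc_max_per_node_reason(tres_name: str) -> str:
    """Return the reason string for the TRES that hit its per-node limit."""
    # Assumed input form: GRES entries look like "gres/<name>", so only
    # the part before the slash selects the reason.
    tres_type = tres_name.split("/", 1)[0].lower()
    return REASON_BY_TRES.get(tres_type, FALLBACK_REASON)
```

Under these assumptions, `assoc_max_per_node_reason("gres/blah")` selects the GRES reason, and any TRES type missing from the table still falls back to the "Unknown" string seen in the "Before" output.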
  8. 04 Oct, 2017 1 commit
    • burst_buffer/cray plugin updated for Cray UP06 software · 859f6c82
      Morris Jette authored
      burst_buffer/cray plugin modified to work with changes in the Cray UP06
      software release.
      Specific changes: Cray software now returns an error if a stage_in
      or stage_out script is processed that doesn't actually request a
      stage in or out (previously this was silently ignored).
      Also, the warning message about tearing down a buffer that is already
      gone has changed.
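The commit message does not say how the plugin adapts to the new error. One plausible approach, shown here purely as an assumption, is to check whether a job script actually contains a staging directive before asking the Cray software to process it. The sketch assumes DataWarp-style `#DW stage_in` and `#DW stage_out` directive lines; the function name is hypothetical.

```python
def requests_staging(script_text: str, direction: str) -> bool:
    """Return True if the script has a '#DW <direction>' directive line.

    `direction` must be "stage_in" or "stage_out". This is an
    illustrative check, not code from the burst_buffer/cray plugin.
    """
    if direction not in ("stage_in", "stage_out"):
        raise ValueError("direction must be 'stage_in' or 'stage_out'")
    for line in script_text.splitlines():
        tokens = line.split()
        # A directive line starts with "#DW" followed by the command name.
        if len(tokens) >= 2 and tokens[0] == "#DW" and tokens[1] == direction:
            return True
    return False
```

A caller could skip the stage-in or stage-out step entirely when this returns False, which would avoid triggering the new error for scripts that request no staging.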
  9. 02 Oct, 2017 2 commits
  10. 29 Sep, 2017 2 commits
  11. 27 Sep, 2017 2 commits
  12. 19 Sep, 2017 3 commits
  13. 14 Sep, 2017 1 commit
    • Prevent a second PMI2_Init call from leaving a hung slurmstepd process. · b2aa25d5
      Tim Wickberg authored
      A second PMI2_Init() within the same step is invalid, and cannot succeed.
      
      Return an error code back to the client end, and close the fd to force the
      step to terminate immediately.
      
      Due to a bug in our libpmi code, just returning a cmd=response_to_init with
      an appropriate rc number will not tear down the connection properly, so
      send back something else that will trigger the error path.
      
      Bug 3520.
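The behaviour described above (reject a repeated init, then close the connection so the step ends instead of hanging) can be modelled with a small state machine. This is a toy sketch, not slurmstepd or libpmi code: the class name, the method name, and the "ok"/"error" reply strings are placeholders and do not reflect the actual PMI2 wire format.

```python
class Pmi2Connection:
    """Toy model of one client connection to a PMI2 server (hypothetical)."""

    def __init__(self) -> None:
        self.initialized = False
        self.closed = False

    def handle_init(self) -> str:
        """Process an init request and return the placeholder reply."""
        if self.closed:
            raise RuntimeError("connection already closed")
        if self.initialized:
            # A second init within the same step is invalid: report the
            # failure and close the connection so the client cannot hang
            # waiting on it.
            self.closed = True
            return "error"
        self.initialized = True
        return "ok"
```

The first `handle_init()` call on a connection returns "ok"; a second call returns "error" and marks the connection closed, mirroring the "return an error and close the fd" behaviour the commit describes.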
  14. 13 Sep, 2017 1 commit
  15. 12 Sep, 2017 3 commits
  16. 08 Sep, 2017 2 commits
  17. 07 Sep, 2017 2 commits
  18. 01 Sep, 2017 1 commit