1. 23 Aug, 2018 4 commits
    • Morris Jette's avatar
      Avoid slurmctld comp_job_cnt underflow error · 9b55a09b
      Morris Jette authored
      This error occurs when one job is used to expand the allocation of
      another job. The node record's "run_job_cnt" is decremented when
      the dependent job's epilog completes and the job getting those
      resources never has the "run_job_cnt" updated for it, which later
      results in the "comp_job_cnt" underflow when it ends. This bug
      was discovered in the course of select/cons_tres development,
      but impacts all select plugins.
      9b55a09b
    • Yu Watanabe's avatar
      Fix typo in Version in slurm.spec. · 0d7c71a5
      Yu Watanabe authored
      0d7c71a5
    • Morris Jette's avatar
      tweak gpu job resize logic · 50248e98
      Morris Jette authored
      when a batch job gets resized to 0 nodes, it instantly gets killed,
      leaving resize scripts around. Also on a Cray system, the resize
      can not happen while the job step NHC is still active. This patch
      fixes both issues
      50248e98
    • Morris Jette's avatar
      Add test of adding GPUs to running job · a8039384
      Morris Jette authored
      a8039384
  2. 22 Aug, 2018 9 commits
  3. 21 Aug, 2018 25 commits
  4. 20 Aug, 2018 2 commits