1. 16 Jul, 2016 1 commit
    • Morris Jette's avatar
      Move startup of power save thread · fb8e3558
      Morris Jette authored
      Start power save thread only after the partition information is read
        in order to avoid trying to interpret the SuspendExcParts configuration
        information before the partition information is available, which would
        result in a slurmctld abort.
      fb8e3558
  2. 15 Jul, 2016 2 commits
  3. 14 Jul, 2016 2 commits
    • Morris Jette's avatar
      Fix gang scheduling and license release logic · 111e3b48
      Morris Jette authored
      Fix gang scheduling and license release logic if single node job killed on
          bad node. Notifying gang and releasing licences is normally done when
          the epilog completion happens, but if the node(s) assigned to a job are
          all down, that does not happen. This results in the licenses being
          reserved indefinitely and the gang scheduler being left with a bad
          (old) job pointer that can result in various failure modes
      bug 2867
      111e3b48
    • Danny Auble's avatar
      CRAY - If trying to kill a step and you have NHC_NO_STEPS set run NHC · e956f297
      Danny Auble authored
      anyway to attempt to log the backtraces of the potential
      unkillable processes.
      e956f297
  4. 13 Jul, 2016 1 commit
  5. 12 Jul, 2016 6 commits
  6. 11 Jul, 2016 1 commit
  7. 08 Jul, 2016 7 commits
  8. 07 Jul, 2016 6 commits
  9. 06 Jul, 2016 5 commits
  10. 05 Jul, 2016 2 commits
  11. 04 Jul, 2016 1 commit
  12. 02 Jul, 2016 3 commits
  13. 01 Jul, 2016 2 commits
  14. 30 Jun, 2016 1 commit