1. 17 Oct, 2014 1 commit
    • Morris Jette's avatar
      Correct license count for suspended jobs · 77a0bb65
      Morris Jette authored
      Correct tracking of licenses for suspended jobs on slurmctld reconfigure or
      restart. Previously licenses for suspended jobs were not counted, so
      the license count could be exceeded with those jobs get resumed.
      77a0bb65
  2. 16 Oct, 2014 2 commits
    • Brian Christiansen's avatar
      e1c42895
    • Morris Jette's avatar
      Change Cray mpi_fini failure logic · 5f89223f
      Morris Jette authored
      Treat Cray MPI job calling exit() without mpi_fini() as fatal error for
      that specific task and let srun handle all timeout logic.
      Previous logic would cancel the entire job step and srun options
      for wait time and kill on exit were ignored. The new logic provides
      users with the following type of response:
      
      $ srun -n3 -K0 -N3 --wait=60 ./tmp
      Task:0 Cycle:1
      Task:2 Cycle:1
      Task:1 Cycle:1
      Task:0 Cycle:2
      Task:2 Cycle:2
      slurmstepd: step 14927.0 task 1 exited without calling mpi_fini()
      srun: error: tux2: task 1: Killed
      Task:0 Cycle:3
      Task:2 Cycle:3
      Task:0 Cycle:4
      ...
      
      bug 1171
      5f89223f
  3. 15 Oct, 2014 4 commits
  4. 14 Oct, 2014 2 commits
  5. 10 Oct, 2014 7 commits
  6. 09 Oct, 2014 2 commits
  7. 08 Oct, 2014 3 commits
  8. 07 Oct, 2014 4 commits
  9. 04 Oct, 2014 1 commit
  10. 03 Oct, 2014 4 commits
  11. 02 Oct, 2014 3 commits
  12. 30 Sep, 2014 1 commit
  13. 29 Sep, 2014 5 commits
  14. 26 Sep, 2014 1 commit