1. 23 Oct, 2015 2 commits
  2. 22 Oct, 2015 4 commits
  3. 21 Oct, 2015 2 commits
  4. 20 Oct, 2015 3 commits
  5. 19 Oct, 2015 7 commits
  6. 16 Oct, 2015 1 commit
  7. 15 Oct, 2015 2 commits
  8. 14 Oct, 2015 1 commit
  9. 12 Oct, 2015 1 commit
  10. 09 Oct, 2015 1 commit
  11. 08 Oct, 2015 4 commits
    • Brian Christiansen's avatar
      Fix case where the primary and backup dbds would both be performing rollup. · b2eb504b
      Brian Christiansen authored
      If the backup dbd happened to be doing rollup at the time the primary resumed
      both the primary and the backup would be doing rollups and causing contention on
      the database tables. The backup would wait for the rollup handler to finish
      before giving up control.
      
      The fix is to cancel the rollup_handler and let the backup begin to shutdown so
      that it will close an existing connections and then re-exec itself. The re-exec
      helps because the rollup handler spawns a thread for each cluster to rollup and
      just cancelling the rollup handler doesn't cancel the spawned threads from the
      rollup handler. This cleans up the dbd and locks. The re-exec only happens in
      the backup if the primary resumed and a rollup was happening.
      
      Bug 1988
      b2eb504b
    • Brian Christiansen's avatar
      Fix case where if the backup slurmdbd has existing connections when it gives... · 44bb06bc
      Brian Christiansen authored
      Fix case where if the backup slurmdbd has existing connections when it gives up control that the it would be killed.
      
      If the backup had existing connections when giving up control, it would try to
      signal the existing threads by using pthread_kill to send SIGKILL to the
      threads. The problem is that SIGKILL doesn't go the thread but the main process
      and the backup dbd would be killed.
      44bb06bc
    • Danny Auble's avatar
      Fixed slurmctld not sending cold-start messages correctly to the database · 4ed2f8c6
      Danny Auble authored
      when a cold-start (-c) happens to the slurmctld.
      4ed2f8c6
    • Morris Jette's avatar
      Remove SICP job option · 0f6bf406
      Morris Jette authored
      This was intended as a step toward managing jobs across mutliple
        clusters, but we will be pursuing a very different design.
      0f6bf406
  12. 07 Oct, 2015 7 commits
  13. 06 Oct, 2015 5 commits