1. 17 Sep, 2016 1 commit
    • Morris Jette's avatar
      Restore ability to manually power down nodes · da722a89
      Morris Jette authored
      Restore ability to manually power down nodes, broken in 15.08.12
      in commit b4904661
      
      The patch introduced in commit b4904661 (not powering down dead node) has a bad side effect.  Adding the "(node_ptr->last_idle != 0)" condition prevents from powering down nodes with the following command:
      
      scontrol update nodename=nX state=power_down
      
      because the state update function relies on zeroing the "last_idle" variable when a power_down is requested (see src/slurmctld/node_mgr.c, line 1589).
      
      Reverting this commit should solve the problem...but I let you decide...
      
      Didier GAZEN
      da722a89
  2. 16 Sep, 2016 1 commit
    • Morris Jette's avatar
      Update KNL modes for out-of-band reboot · 3a465f80
      Morris Jette authored
      node_features/knl_cray: If a node is rebooted outside of Slurm's direction,
          update it's active features with current MCDRAM and NUMA mode information.
      bug 3071
      3a465f80
  3. 15 Sep, 2016 2 commits
  4. 14 Sep, 2016 2 commits
  5. 09 Sep, 2016 3 commits
  6. 08 Sep, 2016 1 commit
    • Morris Jette's avatar
      Restructure srun task_exit logic · 6b6d4e1a
      Morris Jette authored
      Restructure srun command locking for task_exit processing logic for improved
        parallelism. This change decreases the amount of time consumed by serial
        logic by 2 orders of magnitude.
      bug 3044
      6b6d4e1a
  7. 07 Sep, 2016 2 commits
    • Morris Jette's avatar
      Preserve node "RESERVATION" state · 5eee1d28
      Morris Jette authored
      Preserve node "RESERVATION" state when one of multiple overlapping
          reservations ends. Previous logic would clear the node's
          RESERVATION state flag when any one of the reservations on the
          node ended rather than keeping the node in RESERVATION state
          until the last reservation ended.
      bug 3057
      5eee1d28
    • Morris Jette's avatar
      Handle slurmctld restart while compute node reboot request in progress · 4517c454
      Morris Jette authored
      Handle case when slurmctld daemon restart while compute node reboot in
          progress. Return node to service rather than setting DOWN.
      bug 3042
      4517c454
  8. 06 Sep, 2016 3 commits
  9. 02 Sep, 2016 1 commit
  10. 01 Sep, 2016 2 commits
  11. 30 Aug, 2016 2 commits
  12. 27 Aug, 2016 2 commits
  13. 26 Aug, 2016 2 commits
  14. 25 Aug, 2016 1 commit
    • Morris Jette's avatar
      Corrections to gres.conf parsing logic · dbfd87e4
      Morris Jette authored
      If all GRES were not defined on all nodes OR if a regular expression was used
         for a GRES file configuration (e.g. in gres.conf
         "Type=gpu Files=/dev/nvidia[0-4]"), then memory corruption was likely.
         The logic has been bad since its inception several years ago.
      dbfd87e4
  15. 24 Aug, 2016 1 commit
  16. 23 Aug, 2016 1 commit
  17. 22 Aug, 2016 2 commits
  18. 20 Aug, 2016 1 commit
  19. 19 Aug, 2016 1 commit
  20. 17 Aug, 2016 1 commit
  21. 16 Aug, 2016 4 commits
  22. 15 Aug, 2016 1 commit
  23. 12 Aug, 2016 2 commits
  24. 11 Aug, 2016 1 commit