1. 25 Oct, 2016 2 commits
    • Morris Jette's avatar
      Document LLN and Weight · 207f4cd2
      Morris Jette authored
      Document that node Weight takes precedence over load with LLN scheduling.
      bug 3204
      207f4cd2
    • Tim Wickberg's avatar
      Print warning that task/cray must be listed before task/cgroup in TaskPlugin · c3266fca
      Tim Wickberg authored
      task/cray's _get_numa_nodes() function needs to run before task/cgroup
      cleans up the cgroup hierarchies, otherwise ALPS memory compaction will
      never run.
      
      Also move task_p_add_pid() outside the #ifdef HAVE_NATIVE_CRAY
      block so that the plugin will load (albeit without any functionality)
      on non-Cray systems for testing purposes.
      
      Revise documentation and provided slurm.conf templates as well.
      
      Bug 3154.
      c3266fca
  2. 20 Oct, 2016 2 commits
  3. 19 Oct, 2016 1 commit
  4. 18 Oct, 2016 2 commits
  5. 17 Oct, 2016 1 commit
  6. 13 Oct, 2016 3 commits
    • Morris Jette's avatar
      added knl_generic plugin · 2f5756f6
      Morris Jette authored
      Added node_features/knl_generic plugin for KNL support on non-Cray systems.
      NOTE: This plugin is still under development.
      2f5756f6
    • Morris Jette's avatar
      Don't set SLURM_UMASK for batch jobs · e25905f0
      Morris Jette authored
      Do not propagate SLURM_UMASK environment variable to batch script.
      bug 2609
      e25905f0
    • Bjørn-Helge Mevik's avatar
      Correct bitmap test function · a04bef5a
      Bjørn-Helge Mevik authored
      Correct a bitmap test function (used only by the select/bluegene plugin).
        The effect of this bug is probably very limited as it will in almost
        all cases revert prematurely to a bit-by-bit test rather than using
        a full-word test.
      bug 3145
      a04bef5a
  7. 12 Oct, 2016 6 commits
  8. 11 Oct, 2016 6 commits
  9. 07 Oct, 2016 1 commit
  10. 06 Oct, 2016 5 commits
  11. 05 Oct, 2016 3 commits
    • Morris Jette's avatar
      node_features/knl_cray plugin: streamline node update · db23662d
      Morris Jette authored
      node_features/knl_cray plugin: Substantially streamline and speed up logic
          to load current node state on reconfigure failure or unexpected node boot.
          Completely eliminate capmc calls and just use cnselect to load current
          node mode information.
      db23662d
    • Morris Jette's avatar
      node_features/knl_cray: Validate nodes at start and reconfig · 59b118bf
      Morris Jette authored
      node_features/knl_cray plugin: drain any node not reported by
          "capmc node_status" on startup or reconfig. Also re-tests
          on failed node restart for job.
      59b118bf
    • Morris Jette's avatar
      Remove KNL features from non-KNL node · 38f072ed
      Morris Jette authored
      node_features/knl_cray plugin: Remove any KNL MCDRAM or NUMA features from
          node's configuration if capmc does NOT report the node as being KNL.
          For example, we don't want a non-KNL node with features="quad,cache".
      38f072ed
  12. 04 Oct, 2016 1 commit
    • Morris Jette's avatar
      add knl.conf parameter CapmcRetries · 5cb90497
      Morris Jette authored
      Add new knl.conf configuration parameter CapmcRetries
      Modify capmc_suspend and capmc_resume to retry operations when
        Cray State Manager is down.
      Add retry logic to node_features/knl_cray to handle Cray State
        manager being down.
      bug 3100
      5cb90497
  13. 03 Oct, 2016 1 commit
  14. 30 Sep, 2016 4 commits
  15. 29 Sep, 2016 2 commits