1. 14 Oct, 2016 1 commit
    • Morris Jette's avatar
      Add repetition count to cpu/memory mask/map options · 432b7b90
      Morris Jette authored
      Modify cpu_bind and mem_bind map and mask options to accept a repetition
          count to better support large task count. For example:
          "mask_mem:0x0f*2,0xf0*2" is equivalent to "mask_mem:0x0f,0x0f,0xf0,0xf0"
      bug 3065
      432b7b90
  2. 13 Oct, 2016 4 commits
    • Morris Jette's avatar
      added knl_generic plugin · 2f5756f6
      Morris Jette authored
      Added node_features/knl_generic plugin for KNL support on non-Cray systems.
      NOTE: This plugin is still under development.
      2f5756f6
    • Morris Jette's avatar
      Don't set SLURM_UMASK for batch jobs · e25905f0
      Morris Jette authored
      Do not propagate SLURM_UMASK environment variable to batch script.
      bug 2609
      e25905f0
    • Morris Jette's avatar
      Fix task binding by core/socket · 0636c055
      Morris Jette authored
      task/affinity plugin: Honor a job's --ntasks-per-socket and
          --ntasks-per-core options in task binding.
      bug 3118
      0636c055
    • Bjørn-Helge Mevik's avatar
      Correct bitmap test function · a04bef5a
      Bjørn-Helge Mevik authored
      Correct a bitmap test function (used only by the select/bluegene plugin).
        The effect of this bug is probably very limited as it will in almost
        all cases revert prematurely to a bit-by-bit test rather than using
        a full-word test.
      bug 3145
      a04bef5a
  3. 12 Oct, 2016 6 commits
  4. 11 Oct, 2016 6 commits
  5. 07 Oct, 2016 2 commits
  6. 06 Oct, 2016 5 commits
  7. 05 Oct, 2016 6 commits
    • Morris Jette's avatar
      node_features/knl_cray: Validate nodes at start and reconfig · 397752b3
      Morris Jette authored
      node_features/knl_cray plugin: drain any node not reported by
          "capmc node_status" on startup or reconfig. Also re-tests
          on failed node restart for job.
      397752b3
    • Morris Jette's avatar
      Remove KNL features from non-KNL node · 75049242
      Morris Jette authored
      node_features/knl_cray plugin: Remove any KNL MCDRAM or NUMA features from
          node's configuration if capmc does NOT report the node as being KNL.
          For example, we don't want a non-KNL node with features="quad,cache".
      75049242
    • Morris Jette's avatar
      add knl.conf parameter CapmcRetries · 9218a40f
      Morris Jette authored
      Add new knl.conf configuration parameter CapmcRetries
      Modify capmc_suspend and capmc_resume to retry operations when
        Cray State Manager is down.
      Add retry logic to node_features/knl_cray to handle Cray State
        manager being down.
      bug 3100
      9218a40f
    • Morris Jette's avatar
      node_features/knl_cray plugin: streamline node update · db23662d
      Morris Jette authored
      node_features/knl_cray plugin: Substantially streamline and speed up logic
          to load current node state on reconfigure failure or unexpected node boot.
          Completely eliminate capmc calls and just use cnselect to load current
          node mode information.
      db23662d
    • Morris Jette's avatar
      node_features/knl_cray: Validate nodes at start and reconfig · 59b118bf
      Morris Jette authored
      node_features/knl_cray plugin: drain any node not reported by
          "capmc node_status" on startup or reconfig. Also re-tests
          on failed node restart for job.
      59b118bf
    • Morris Jette's avatar
      Remove KNL features from non-KNL node · 38f072ed
      Morris Jette authored
      node_features/knl_cray plugin: Remove any KNL MCDRAM or NUMA features from
          node's configuration if capmc does NOT report the node as being KNL.
          For example, we don't want a non-KNL node with features="quad,cache".
      38f072ed
  8. 04 Oct, 2016 1 commit
    • Morris Jette's avatar
      add knl.conf parameter CapmcRetries · 5cb90497
      Morris Jette authored
      Add new knl.conf configuration parameter CapmcRetries
      Modify capmc_suspend and capmc_resume to retry operations when
        Cray State Manager is down.
      Add retry logic to node_features/knl_cray to handle Cray State
        manager being down.
      bug 3100
      5cb90497
  9. 03 Oct, 2016 1 commit
  10. 30 Sep, 2016 8 commits