1. 08 Nov, 2016 3 commits
    • Morris Jette's avatar
      select/linear plugin modified to better support heterogeneous clusters · 243fbb0d
      Morris Jette authored
      select/linear plugin modified to better support heterogeneous clusters when
          topology/none is also configured. Note that use of the select/cons_res
          plugin is strongly recommended for heterogeneous clusters. The use of
          OverSubscribe=exclusive can be used if whole node allocations is
          desired.
      bug 3212
      243fbb0d
    • Alejandro Sanchez's avatar
      9e7e12dc
    • Morris Jette's avatar
      sched/backfill - avoid starting requeued job · 69af50af
      Morris Jette authored
      If a job is started by the main scheduling logic and requeued while
        the backfill scheduler has locks released, that can result in an
        invalid data structure in select/cons_res. Namely, the backfill
        scheduler's attempt to start the job would clear the job resources
        node_bitmap. That leaves a NULL pointer in the select/cons_res
        plugin generating an abort. (That pointer is needed to clean up
        the job allocation records when the Epilog or Cray Node Health
        Check, NHC, are complete and the resources become available for
        another job.
      bug 3230
      69af50af
  2. 07 Nov, 2016 1 commit
  3. 05 Nov, 2016 1 commit
  4. 04 Nov, 2016 2 commits
    • Morris Jette's avatar
      cray/burst_buffer - Preserve job ID · 42a90020
      Morris Jette authored
      cray/burst_buffer - Preserve job ID and don't translate to job array ID
        after slurmctld restart. Prior logic would not set array_task_id to
        NO_VAL, so all job-buffer IDs would be reported in the form
        "JobID=0_0(123)" rather than "JobID=123"
      42a90020
    • Morris Jette's avatar
      Burst_buffer/cray space tracking fix · 1548086f
      Morris Jette authored
      cray/busrt_buffer - Internally track both allocated and unusable space.
          The reported UsedSpace in a pool is now the allocated space (previously was
          unusable space). Base available space on whichever value leaves least free
          space.
      bug 3222
      1548086f
  5. 01 Nov, 2016 3 commits
  6. 28 Oct, 2016 1 commit
    • Danny Auble's avatar
      Fix issue in the priority/multifactor plugin where on a slurmctld restart · be924b88
      Danny Auble authored
      more time than should be allowed would be accounted for.
      
      This only happened on jobs in the completing state when the slurmctld
      was shutdown.
      
      This will also be enhanced in 17.02 as the job's end_time_exp is not
      stored which is needed to determine if the job has already been through
      the decay_thread at end of job.
      
      Bug 3162
      be924b88
  7. 27 Oct, 2016 4 commits
  8. 26 Oct, 2016 4 commits
  9. 25 Oct, 2016 3 commits
  10. 20 Oct, 2016 2 commits
  11. 19 Oct, 2016 1 commit
  12. 18 Oct, 2016 2 commits
  13. 17 Oct, 2016 1 commit
  14. 13 Oct, 2016 3 commits
    • Morris Jette's avatar
      added knl_generic plugin · 2f5756f6
      Morris Jette authored
      Added node_features/knl_generic plugin for KNL support on non-Cray systems.
      NOTE: This plugin is still under development.
      2f5756f6
    • Morris Jette's avatar
      Don't set SLURM_UMASK for batch jobs · e25905f0
      Morris Jette authored
      Do not propagate SLURM_UMASK environment variable to batch script.
      bug 2609
      e25905f0
    • Bjørn-Helge Mevik's avatar
      Correct bitmap test function · a04bef5a
      Bjørn-Helge Mevik authored
      Correct a bitmap test function (used only by the select/bluegene plugin).
        The effect of this bug is probably very limited as it will in almost
        all cases revert prematurely to a bit-by-bit test rather than using
        a full-word test.
      bug 3145
      a04bef5a
  15. 12 Oct, 2016 6 commits
  16. 11 Oct, 2016 3 commits