1. 30 Nov, 2016 6 commits
  2. 29 Nov, 2016 7 commits
  3. 28 Nov, 2016 12 commits
  4. 23 Nov, 2016 5 commits
  5. 22 Nov, 2016 10 commits
    • Morris Jette's avatar
      Remove vestigial/unused variable · 28045e2f
      Morris Jette authored
      28045e2f
    • Morris Jette's avatar
      Added SchedulingParameters option of "bf_job_part_count_reserve" · 209822a8
      Morris Jette authored
      Added SchedulingParameters option of "bf_job_part_count_reserve". Jobs below
          the specified threshold will not have resources reserved for them.
      bug 3275
      209822a8
    • Danny Auble's avatar
      Fix regession in commit 10b9fdc9 where · 36b626af
      Danny Auble authored
      srun -n8 -c1 --spread-job --hint=nomultithread whereami | sort -h
      
      would cause a core dump because the wrong variable was setup.
      36b626af
    • Danny Auble's avatar
      Make it so we don't purge job start messages until after we purge step · 178a929b
      Danny Auble authored
      messages.  Hopefully this will reduce the number of messages lost when
      filling up memory when the database/DBD is down.
      178a929b
    • Morris Jette's avatar
      Merge branch 'slurm-16.05' · 57e47d01
      Morris Jette authored
      57e47d01
    • Morris Jette's avatar
      Correct malloc data type · a12e1a1c
      Morris Jette authored
      sched/backfill plugin: Make malloc match data type (defined as uint32_t and
          allocated as int). No failures observed, if type "int" is smaller than
          "uint32_t", it could result in an invalid memory reference.
      a12e1a1c
    • Sergey Meirovich's avatar
      Fix slurm_job_cpus_allocated_str_on_node_id() API call. · 0ed6488e
      Sergey Meirovich authored
      Fix API call: slurm_job_cpus_allocated_str_on_node_id() and
      in turn slurm_job_cpus_allocated_str_on_node() to return correct
      results for anything but first node. This was caused by missed logic
      to calculate fist bit belongs to particular node. Lookup was always
      starting from bit 0.
      
      Bug 3266.
      0ed6488e
    • Morris Jette's avatar
      Merge branch 'slurm-16.05' · 4c25b993
      Morris Jette authored
      4c25b993
    • Morris Jette's avatar
      backfill algorithm logic · e089b63a
      Morris Jette authored
      After one second of wall time, simulate the termination of all remaining
         running jobs in order to respond in a reasonable time frame.
      bug 3275
      e089b63a
    • Morris Jette's avatar
      Modify backfill algorithm · 6008b021
      Morris Jette authored
      Modify backfill algorithm to improve performance with large numbers of
          running jobs. Group running jobs that end in a "similar" time frame using a
          time window that grows exponentially rather than linearly. The original
          window sizes were (in units of minutes):
          0, 1, 2, 3, 4, 5, 6, 7, ... minutes
          The new window sizes are:
          0.5, 1, 2, 4, 8, 16, 32, ... minutes
          This can dramatically reduce the number of instances where the very time
          consuming "can the pending job run now" operation is executed, especailly
          if there are 1000+ running jobs.
      bug 3275
      6008b021