1. 20 Jan, 2016 4 commits
    • Correct job_cnt_run NULL pointer · 207adf8e
      Morris Jette authored
      This corrects logic from commit e5a61746
      that could result in use of a NULL pointer.
    • Prevent job_cnt_run · f76586bf
      Morris Jette authored
      It was previously triggered by executing "scontrol reconfig" on a
        front-end system while there was a job in completing state.
    • Properly track resources for suspended jobs on reconfig · 21c52d2f
      Morris Jette authored
      Properly account for memory, CPUs and GRES when slurmctld is reconfigured
          while there is a suspended job. Previous logic would add the CPUs, but not
          memory or GPUs. This would result in underflow/overflow errors in select
          cons_res plugin.
      bug 2353
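The accounting mismatch described in this commit can be illustrated with a toy model. This is a minimal Python sketch under stated assumptions, not the select/cons_res plugin's C code: the `Job` class, `rebuild_usage`, `release`, and the field names are all hypothetical. It assumes that a reconfigure rebuilds per-node usage from the job list and that a job releases CPUs, memory and GPUs together when it ends.

```python
from dataclasses import dataclass


@dataclass
class Job:
    cpus: int
    mem_mb: int
    gpus: int
    suspended: bool = False


def rebuild_usage(jobs, full_accounting_for_suspended):
    """Recompute usage from the job list, as a reconfigure is assumed to do."""
    used = {"cpus": 0, "mem_mb": 0, "gpus": 0}
    for job in jobs:
        used["cpus"] += job.cpus
        if job.suspended and not full_accounting_for_suspended:
            # Old behaviour per the commit message: CPUs only for suspended jobs.
            continue
        used["mem_mb"] += job.mem_mb
        used["gpus"] += job.gpus
    return used


def release(used, job):
    """Subtract everything the job held; report any counter that went negative."""
    remaining = {
        "cpus": used["cpus"] - job.cpus,
        "mem_mb": used["mem_mb"] - job.mem_mb,
        "gpus": used["gpus"] - job.gpus,
    }
    underflowed = sorted(name for name, value in remaining.items() if value < 0)
    return remaining, underflowed


job = Job(cpus=4, mem_mb=8000, gpus=1, suspended=True)
_, old_underflow = release(rebuild_usage([job], False), job)
_, new_underflow = release(rebuild_usage([job], True), job)
```

With CPU-only accounting the memory and GPU counters go negative when the job ends; with full accounting nothing underflows.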
    • Correct handling of front-end running job count · e5a61746
      Morris Jette authored
      The counter is really intended to reflect the count of running or
        suspended jobs rather than running jobs alone. Previous logic
        would report an underflow for the "job_cnt_run" variable if
        1. job submitted
        2. job suspended
        3. scontrol reconfig
        4. job cancelled
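The four-step sequence above can be replayed against a toy counter. This is a minimal Python sketch, not Slurm's C code: the `FrontEnd` class and its methods are hypothetical, and only the `job_cnt_run` name comes from the commit message. It assumes that a reconfigure rebuilds the counter from the job table and that cancelling a job decrements it.

```python
RUNNING, SUSPENDED = "RUNNING", "SUSPENDED"


class FrontEnd:
    """Toy model of a front-end node's job counter (hypothetical)."""

    def __init__(self, count_suspended):
        self.count_suspended = count_suspended
        self.jobs = {}
        self.job_cnt_run = 0
        self.underflow = False

    def submit(self, job_id):
        self.jobs[job_id] = RUNNING
        self.job_cnt_run += 1

    def suspend(self, job_id):
        # The counter is left alone: the job still occupies the front end.
        self.jobs[job_id] = SUSPENDED

    def reconfig(self):
        # Assumption: reconfigure rebuilds the counter from the job table.
        counted = {RUNNING, SUSPENDED} if self.count_suspended else {RUNNING}
        self.job_cnt_run = sum(1 for s in self.jobs.values() if s in counted)

    def cancel(self, job_id):
        del self.jobs[job_id]
        if self.job_cnt_run == 0:
            self.underflow = True  # would have gone negative
        else:
            self.job_cnt_run -= 1


def replay(count_suspended):
    """Run: submit, suspend, reconfig, cancel. Return True on underflow."""
    fe = FrontEnd(count_suspended)
    fe.submit(1)
    fe.suspend(1)
    fe.reconfig()
    fe.cancel(1)
    return fe.underflow
```

Counting only running jobs at reconfigure (`replay(False)`) reproduces the underflow; counting running and suspended jobs (`replay(True)`) does not.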
  2. 19 Jan, 2016 3 commits
    • Improve select/cons_res logging · 82f61b0d
      Morris Jette authored
      Log the length of bitmaps in addition to the bits set.
      Also increase the string length used for logging.
    • Fix for socket allocations and specialized cores · a260397a
      Morris Jette authored
      Previous logic would prevent allocation of sockets to a job unless the
      entire socket was available. If there were any specialized cores, the
      socket was treated as being not available and unusable. For example,
      if a node had 2 sockets, then a job requesting 2 specialized cores
      would reserve one core on each of the two sockets and render the job
      not runnable.
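The two-socket example can be sketched as a set computation. This is a minimal Python illustration, not the select plugin's code: `usable_sockets` and its parameters are hypothetical, the four cores per socket are an arbitrary choice, and the "new" branch is only one plausible reading of the fix (specialized cores are ignored when deciding whether a socket is usable).

```python
def usable_sockets(n_sockets, cores_per_socket, busy, spec, ignore_spec):
    """Return the indices of sockets that can be allocated to a job.

    busy: core ids in use by other jobs.
    spec: core ids reserved as specialized cores.
    ignore_spec: False models the old check, True the assumed new one.
    """
    usable = []
    for s in range(n_sockets):
        first = s * cores_per_socket
        cores = set(range(first, first + cores_per_socket))
        if ignore_spec:
            # Assumed new behaviour: only the non-specialized cores must be free.
            ok = (cores - spec).isdisjoint(busy)
        else:
            # Old behaviour: any specialized core makes the socket unusable.
            ok = cores.isdisjoint(busy | spec)
        if ok:
            usable.append(s)
    return usable


# Two sockets, one specialized core on each (core ids 0 and 4).
old = usable_sockets(2, 4, busy=set(), spec={0, 4}, ignore_spec=False)
new = usable_sockets(2, 4, busy=set(), spec={0, 4}, ignore_spec=True)
```

Under the old check no socket qualifies even though the node is otherwise idle; under the assumed new check both sockets remain allocatable.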
    • Remove redundant sinfo logic · 5e08b4d1
      Morris Jette authored
      sinfo's print state function contained logic that determined
      whether the state was MIXED. It duplicated logic in the
      _query_server() function in sinfo.c and has been removed. Note
      that the logic was already gone from the "short state" print
      function (I noticed the discrepancy between the print functions,
      but found that both printed the correct state information).
  3. 17 Jan, 2016 1 commit
  4. 16 Jan, 2016 2 commits
  5. 15 Jan, 2016 4 commits
  6. 14 Jan, 2016 3 commits
  7. 13 Jan, 2016 2 commits
    • backfill scheduling with group limits fix · 3ee1632f
      Morris Jette authored
      Backfill scheduling fix: If a job can't be started due to a "group" resource
          limit, rather than reserve resources for it when the next job ends, don't
          reserve any resources for it. The problem with the original logic is that
          if a lot of resources are reserved for such pending jobs, then jobs further
          down the queue may be deferred when they really can and should be started.
          An ideal solution would track all of the TRES resources through time as
          jobs start and end, but we don't have that logic in the backfill scheduler
          and don't want that extra overhead in the backfill scheduler.
      bugs 2326 and 2282
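The effect of this change on jobs further down the queue can be shown with a deliberately simplified model. This is a Python sketch under strong assumptions, not the backfill plugin's logic: `backfill_pass` and its tuple layout are hypothetical, only CPUs are tracked, and the time dimension is ignored, so a reservation is treated as simply shrinking what later jobs may use.

```python
def backfill_pass(free_cpus, queue, reserve_for_group_limited):
    """Return the names of jobs started in one pass over the queue.

    queue: list of (name, cpus, group_limited) tuples in priority order.
    reserve_for_group_limited: True models the old behaviour, False the new.
    """
    started = []
    reserved = 0
    for name, cpus, group_limited in queue:
        if group_limited:
            # The job cannot start because of a "group" resource limit.
            if reserve_for_group_limited:
                reserved += cpus  # old: hold resources for it anyway
            continue
        if cpus <= free_cpus - reserved:
            started.append(name)
            free_cpus -= cpus
    return started


queue = [("limited", 8, True), ("small", 4, False)]
old = backfill_pass(8, queue, reserve_for_group_limited=True)
new = backfill_pass(8, queue, reserve_for_group_limited=False)
```

With the reservation in place the lower-priority job is deferred although 8 CPUs are idle; without it the job starts.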
    • Add more partition info to "scontrol write config" · f428705b
      Alejandro Sanchez authored
      bug 2303
  8. 12 Jan, 2016 8 commits
  9. 11 Jan, 2016 5 commits
  10. 08 Jan, 2016 2 commits
  11. 07 Jan, 2016 6 commits