1. 25 May, 2017 6 commits
    • Prevent a job tested on multiple partitions from being marked WHOLE_NODE_USER. · 162f6a05
      Dominik Bartkiewicz authored
      If a job is considered on a partition with ExclusiveUser=YES
      then it would be marked as if it was submitted with the
      --exclusive flag, which would lead to delays launching it
      on ExclusiveUser=NO partitions, and cause lower-than-expected
      cluster usage.
      
      As a side effect, the job_ptr->part_ptr->flags need to be
      tested wherever WHOLE_NODE_USER is considered, instead of
      just job_ptr->details->whole_node.
      
      Bug 3771.
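
The side effect described above can be sketched as a single predicate that consults both the job's own request and the flags of the partition it is currently being tested on. This is a minimal illustration under stated assumptions, not the code from the commit: the struct layouts are stand-ins, the constant values are illustrative, and `job_is_user_exclusive` is a hypothetical helper name.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Stand-in constants; the values are illustrative, not copied from Slurm. */
#define WHOLE_NODE_USER          0x02
#define PART_FLAG_EXCLUSIVE_USER 0x0010

/* Simplified stand-ins for the real (much larger) Slurm structures. */
struct part_record { uint16_t flags; };
struct job_details { uint8_t whole_node; };
struct job_record {
    struct job_details *details;
    struct part_record *part_ptr;
};

/*
 * Hypothetical helper: a job is treated as user-exclusive if it asked for
 * that itself, or if the partition currently under test enforces it.
 * Nothing is written back to the job, so testing it on an
 * ExclusiveUser=YES partition leaves no mark for later partitions.
 */
static bool job_is_user_exclusive(const struct job_record *job_ptr)
{
    assert(job_ptr != NULL);
    if (job_ptr->details &&
        job_ptr->details->whole_node == WHOLE_NODE_USER)
        return true;
    if (job_ptr->part_ptr &&
        (job_ptr->part_ptr->flags & PART_FLAG_EXCLUSIVE_USER))
        return true;
    return false;
}
```

Because the partition flag is read at the point of use, the same job can evaluate to user-exclusive on one partition and shared on the next.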
    • Revert "Prevent a job tested on multiple partitions from being marked" · f1a45962
      Tim Wickberg authored
      Wrong author attributed by mistake.
      
      This reverts commit 9128476a.
    • Revert "Prevent a race between completing jobs on a user-exclusive node from" · 82b0f802
      Tim Wickberg authored
      Wrong author attributed by mistake.
      
      This reverts commit a02d04f1.
    • Prevent a race between completing jobs on a user-exclusive node from leaving the node owned. · a02d04f1
      Tim Wickberg authored
      
      Two jobs completing simultaneously can lead to make_node_idle()
      returning before it has a chance to decrement node_ptr->owner_job_cnt,
      which can result in the node being "owned" by that user even
      though no jobs are running on it.
      
      Move the decrement block to the end at a fini label, and make sure
      all return paths pass through it. While moving it, add a guard
      against node_ptr->owner_job_cnt underflowing.
      
      Bug 3771.
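
The single-exit pattern described above might look roughly like the following. It is a sketch under assumptions rather than the actual make_node_idle(): the struct is a stand-in, `already_idle` is a hypothetical placeholder for whatever conditions caused the early returns, and `NO_OWNER` is an assumed sentinel.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define NO_OWNER UINT32_MAX  /* assumed sentinel meaning "node has no owner" */

/* Simplified stand-in for the real node record. */
struct node_record {
    uint32_t owner;          /* uid that currently owns the node */
    uint16_t owner_job_cnt;  /* jobs of that user still on the node */
    bool     already_idle;   /* hypothetical early-exit condition */
};

static void make_node_idle_sketch(struct node_record *node_ptr,
                                  bool job_was_user_exclusive)
{
    assert(node_ptr != NULL);

    if (node_ptr->already_idle)
        goto fini;  /* a bare return here would skip the decrement */

    node_ptr->already_idle = true;  /* placeholder for the state change */

fini:
    if (job_was_user_exclusive) {
        /* Guard against underflow before decrementing. */
        if (node_ptr->owner_job_cnt > 0)
            node_ptr->owner_job_cnt--;
        if (node_ptr->owner_job_cnt == 0)
            node_ptr->owner = NO_OWNER;
    }
}
```

Routing every exit through one label keeps the ownership bookkeeping in a single place, so a new early exit added later cannot silently bypass it.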
    • Prevent a job tested on multiple partitions from being marked WHOLE_NODE_USER. · 9128476a
      Tim Wickberg authored
      
      If a job is considered on a partition with ExclusiveUser=YES
      then it would be marked as if it was submitted with the
      --exclusive flag, which would lead to delays launching it
      on ExclusiveUser=NO partitions, and cause lower-than-expected
      cluster usage.
      
      As a side effect, the job_ptr->part_ptr->flags need to be
      tested wherever WHOLE_NODE_USER is considered, instead of
      just job_ptr->details->whole_node.
      
      Bug 3771.
    • Fix WithSubAccounts option to not include WithDeleted unless requested. · 29ebc4b2
      Alejandro Sanchez authored
      _setup_assoc_cond_limits was using the table 'prefix' passed as an
      argument to build the where clause condition prefix.deleted=something.
      
      It turns out that _setup_assoc_cond_limits is called by these functions:
      as_mysql_modify_assocs
      as_mysql_remove_assocs
      as_mysql_get_assocs
      as_mysql_acct_no_users
      
      which set the prefix to 't2' before the call if a QOS is provided or if
      WithSubAccounts is provided. The 't2' prefix is fine for the other where
      conditions in that case, but the deleted condition must use t1,
      which is the table the records are selected from.
      
      Bug 3835
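
A rough sketch of the idea: the deleted filter is pinned to t1 while the remaining conditions keep using whichever prefix the caller passed. `build_assoc_where` is a hypothetical function and the SQL text and column names are illustrative only; they are not taken from the accounting storage plugin, and the sketch does no escaping of its inputs.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

/*
 * Hypothetical builder. 'prefix' is the table alias chosen by the caller
 * ("t1" normally, "t2" when a QOS or WithSubAccounts is involved).
 * The deleted condition ignores it and always refers to t1.
 */
static int build_assoc_where(char *buf, size_t len, const char *prefix,
                             const char *acct, bool with_deleted)
{
    assert(buf != NULL && prefix != NULL && acct != NULL);
    if (with_deleted)
        return snprintf(buf, len,
                        "where (t1.deleted=0 || t1.deleted=1) && %s.acct='%s'",
                        prefix, acct);
    return snprintf(buf, len,
                    "where t1.deleted=0 && %s.acct='%s'",
                    prefix, acct);
}
```

With prefix "t2" and with_deleted false, the result still filters on t1.deleted=0, so deleted records are excluded unless explicitly requested.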
  2. 24 May, 2017 4 commits
  3. 23 May, 2017 2 commits
  4. 22 May, 2017 1 commit
  5. 19 May, 2017 6 commits
  6. 18 May, 2017 1 commit
  7. 17 May, 2017 3 commits
  8. 16 May, 2017 4 commits
  9. 15 May, 2017 3 commits
  10. 13 May, 2017 2 commits
  11. 12 May, 2017 4 commits
    • knl_cray plugin: Log incomplete capmc output for a node · 80b27490
      Morris Jette authored
      If capmc reports a node name, but not mcdram_cfg for the node, then
      log the missing data rather than assume the value is zero and
      report a value mismatch with cnselect.
    • Prevent scontrol crash when operating on array and no-array jobs at once. · 006f7eeb
      Alejandro Sanchez authored
      When an operation that accepts more than one job in the same request
      is given a job list in which a job array appears before a regular
      (non-array) job, the job_array_resp_msg_t pointer was not reset to NULL
      and was therefore incorrectly accessed when processing the non-array
      job. This fix prevents the crash in the following scontrol operations:
      
      uhold, hold, suspend, requeue, requeuehold, update, release
      
      when the same request has <array_jobid>,<non-array_jobid> in this order in
      the job list to process.
      
      Bug 3759
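
The failure mode described above comes down to a response pointer that outlives one loop iteration. The sketch below shows the reset that avoids it, under assumptions: `struct array_resp` stands in for job_array_resp_msg_t, and `fake_request` and `process_jobs` are hypothetical names rather than scontrol functions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdlib.h>

struct array_resp { int error_cnt; };  /* stand-in for job_array_resp_msg_t */

/* Hypothetical request: only job arrays produce a response message. */
static void fake_request(bool is_array, struct array_resp **resp)
{
    if (is_array)
        *resp = calloc(1, sizeof(**resp));
    /* For a regular job, *resp is left exactly as the caller passed it. */
}

/* Returns how many responses were consumed while walking the job list. */
static int process_jobs(const bool *is_array, size_t n)
{
    struct array_resp *resp = NULL;
    int responses_seen = 0;

    assert(is_array != NULL || n == 0);
    for (size_t i = 0; i < n; i++) {
        fake_request(is_array[i], &resp);
        if (resp) {
            responses_seen++;
            free(resp);
            resp = NULL;  /* without this, the next regular job would
                           * see a dangling pointer from the array job */
        }
    }
    return responses_seen;
}
```

For a list containing an array job followed by a regular job, only one response is consumed and the regular job never touches freed memory.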
    • Enhance job expansion example · 02b790bc
      Morris Jette authored
      Job expansion example in FAQ enhanced to demonstrate operation in
      heterogeneous environments.
      Bug 2979
    • avoid starting scheduler on busy system after power cap change · e29e8511
      Alejandro Sanchez authored
      Do not attempt to schedule jobs after changing the power cap if there are
      already many active threads.
  12. 11 May, 2017 1 commit
  13. 10 May, 2017 1 commit
  14. 09 May, 2017 2 commits