1. 21 Jan, 2019 4 commits
  2. 19 Jan, 2019 2 commits
    • Morris Jette's avatar
      Major refactoring of GRES reconfig/update logic · 15165596
      Morris Jette authored
      Add new logic to set node's GRES string on reconfig.
      Set node GRES sockets based upon real socket/core config info.
      Validate GRES changes before making them.
          Specifically we want to make sure no requests to process changes
          in the count of a GRES associated with File specifications happens.
          For example, if we have 4 gres/gpu associated with /dev/nvidia[0-3]
          and "scontrol update NodeName=... Gres=gpu:2" is executed that
          request will return an error. This is because we have no idea which
          specific gres/gpu records should be removed. We can issue a request
          to keep the count unchanged or set the count to 0, but any other
          count will return an error. This restrictions is not placed on
          GRES without Files (e.g. gres/craynetwork).
      Changed some variable names to better reflect their contents.
      15165596
    • Morris Jette's avatar
      Increase sleep in test · 4fc680bc
      Morris Jette authored
      Give job longer time to be scheduled and started (depends on
      scheduling parameters).
      4fc680bc
  3. 18 Jan, 2019 4 commits
  4. 17 Jan, 2019 20 commits
  5. 16 Jan, 2019 5 commits
  6. 15 Jan, 2019 5 commits
    • Morris Jette's avatar
      Modify tests to work with heterogeneous MPS counts · d7800e9f
      Morris Jette authored
      These tests confirms correct CUDA percentages in an environment
      where the gres/mps counts are not always 100 and can very from
      device to device within a node.
      d7800e9f
    • Morris Jette's avatar
      Fix GRES config parsing logic in slurmd · 66d364bf
      Morris Jette authored
      Slurmd was generating errors of this sort for all GRES:
      error: Ignoring gres.conf record, invalid name: mps
      This was due to some GRES setup logic not being invoked
      on the slurmd. In any case, the relevant checking now only
      happens on the slurmctld.
      66d364bf
    • Morris Jette's avatar
      Expand gres/mps documentation · 16482945
      Morris Jette authored
      16482945
    • Michael Hinton's avatar
      Fix UserCPU, SystemCPU, and TotalCPU with jobacct_gather/cgroup · 8eb11cf7
      Michael Hinton authored
      This patch fixes the regression caused by 07483564 where only the
      linux plugin was handling this scenario correctly.
      
      Bug 6332
      8eb11cf7
    • Morris Jette's avatar
      Insure consistent GRES · 2b4e0950
      Morris Jette authored
      If a gres type was identified in the node's GRES field, but that
      type was not listed in the slurm.conf GresTypes parameter then
      those GRES would appear in the output of "scontrol show node".
      This misleading/wrong information would cause a bunch of tests
      to fail. This patch only moves valid node GRES into the node
      record and deletes the rest.
      2b4e0950