1. 25 Jan, 2019 5 commits
    • clean up some badness in test · 52c466bd
      Morris Jette authored
      put entire description in one line of README file (for logging)
      add missing exp_continue
      remove unused variable
    • Merge branch 'slurm-18.08' · cf8ac47b
      Morris Jette authored
    • Disable a couple of tests with CR_Socket · 6243939b
      Morris Jette authored
      Fix for tests that will fail with SelectTypeParameters=CR_Socket
      (assuming each socket has more than one core).
    • Change job state handling in tests · b92200d3
      Morris Jette authored
      We want to avoid printing "FAILURE" in the function wait_for_job
      as a new use case may result in a job not starting in a timely
      fashion and NOT be an error. So change "FAILURE" in wait_for_job
      to "WARNING" and add checks for function errors in the tests
      as needed (most places already check and log errors).
      There were also many cases where "FAILURE" would be printed by
      wait_for_job, but the job would not have a non-zero exit code,
      and those are now fixed.
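The pattern described in this commit can be sketched as follows. This is a hypothetical Python illustration only: the real wait_for_job lives in the Slurm test suite and is not reproduced here, and the `get_state` callable, the `timeout` and `poll` parameters, and the 0/1 return convention are all assumptions made for the example.

```python
import time
from typing import Callable


def wait_for_job(get_state: Callable[[], str], desired: str,
                 timeout: float = 1.0, poll: float = 0.05) -> int:
    """Poll until the job reaches `desired`; return 0 on success, 1 on timeout.

    Hypothetical stand-in for the test-suite helper. On timeout it logs
    "WARNING" rather than "FAILURE" and leaves the verdict to the caller.
    """
    deadline = time.monotonic() + timeout
    while True:
        if get_state() == desired:
            return 0
        if time.monotonic() >= deadline:
            print(f"WARNING: job did not reach state {desired}")
            return 1
        time.sleep(poll)


# The calling test checks the return value and decides whether this is an error.
exit_code = 0
if wait_for_job(lambda: "PENDING", "RUNNING", timeout=0.2) != 0:
    print("FAILURE: job failed to start")
    exit_code = 1
```

Keeping the verdict in the caller lets a test that expects a job to stay pending reuse the same helper without logging a spurious failure.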
    • Add test of job resource allocation size · 1bc20510
      Morris Jette authored
      Make sure that the count of CPUs allocated to a job is appropriate
      for the task count.
      bug 6274
  2. 24 Jan, 2019 1 commit
  3. 23 Jan, 2019 13 commits
  4. 22 Jan, 2019 1 commit
    • GRES configuration fix · 1dd643de
      Morris Jette authored
      This fix is needed to address the GRES specification in gres.conf
      having a Type option, while the GRES specification in slurm.conf
      does not.
  5. 21 Jan, 2019 5 commits
    • Fix node GRES update string parsing · 41c45f07
      Morris Jette authored
      If scontrol is used to change a node's GRES and the input string
      contains socket binding information (e.g. "gres=gpu:4(S:0),...")
      then ignore the socket binding information input. Use the binding
      as reported by slurmd in the node registration.
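As a rough illustration of what ignoring the socket binding means for an input such as "gpu:4(S:0)", the sketch below strips any "(S:...)" suffix from a GRES string. It is not Slurm's parser (which is written in C); the function name and the regular expression are assumptions made for this example.

```python
import re

# Hypothetical pattern: matches a socket binding suffix such as "(S:0)" or "(S:0-1)".
_SOCKET_BINDING = re.compile(r"\(S:[^)]*\)")


def strip_socket_binding(gres: str) -> str:
    """Return the GRES string with every "(S:...)" socket binding removed."""
    return _SOCKET_BINDING.sub("", gres)


print(strip_socket_binding("gpu:4(S:0),craynetwork:2"))  # prints gpu:4,craynetwork:2
```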
    • Fix zero count use in node GRES string · b285edcb
      Morris Jette authored
      If a node's GRES has a count of zero (say after updating GRES counts)
      then do not print the count. Just don't report anything for that
      GRES type.
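The behavior described here, omitting a GRES entirely when its count is zero, might look like the following minimal sketch. The function name and the dictionary input are hypothetical; the actual string is built in Slurm's C code.

```python
from typing import Dict


def build_gres_string(counts: Dict[str, int]) -> str:
    """Join "name:count" pairs, skipping any GRES whose count is zero."""
    return ",".join(
        f"{name}:{count}" for name, count in counts.items() if count > 0
    )


print(build_gres_string({"gpu": 0, "craynetwork": 4}))  # prints craynetwork:4
```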
    • Fix for gres logic · 96c04d29
      Morris Jette authored
      If a GRES was defined with Type information in gres.conf, but
      lacked Type information in slurm.conf, this sets up the data
      structures in slurmctld for resource allocations.
    • cosmetic change for long line in code · 88ca6ff3
      Morris Jette authored
    • use common node gres string build function · 7b627555
      Morris Jette authored
      Convert node validate logic (on node registration) in gres.c
      to use the same (new) function as is used for the node GRES
      update operation.
  6. 19 Jan, 2019 3 commits
    • Merge branch 'reboot_timeout' into slurm-18.08 · f3e8fe1c
      Brian Christiansen authored
      Bug 5736
    • Major refactoring of GRES reconfig/update logic · 15165596
      Morris Jette authored
      Add new logic to set node's GRES string on reconfig.
      Set node GRES sockets based upon real socket/core config info.
      Validate GRES changes before making them.
          Specifically, we want to reject requests that change the count
          of a GRES associated with File specifications.
          For example, if we have 4 gres/gpu associated with /dev/nvidia[0-3]
          and "scontrol update NodeName=... Gres=gpu:2" is executed, that
          request will return an error. This is because we have no idea which
          specific gres/gpu records should be removed. We can issue a request
          to keep the count unchanged or set the count to 0, but any other
          count will return an error. This restriction is not placed on
          GRES without Files (e.g. gres/craynetwork).
      Changed some variable names to better reflect their contents.
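The validation rule described in this commit can be summarized with a small sketch. This is an illustration of the stated rule only, not the code in gres.c; the function name and its parameters are hypothetical.

```python
def gres_count_change_allowed(current: int, requested: int, has_files: bool) -> bool:
    """Return True if a GRES count update should be accepted.

    For a GRES tied to File specifications, only "keep the count" or
    "set the count to 0" is accepted, because there is no way to tell
    which device records a partial reduction should remove.
    """
    if not has_files:
        # GRES without Files (e.g. gres/craynetwork) is not restricted.
        return True
    return requested == current or requested == 0


# 4 gres/gpu backed by device files, request to change the count to 2: rejected.
print(gres_count_change_allowed(current=4, requested=2, has_files=True))  # prints False
```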
    • Increase sleep in test · 4fc680bc
      Morris Jette authored
      Give job longer time to be scheduled and started (depends on
      scheduling parameters).
  7. 18 Jan, 2019 9 commits
  8. 17 Jan, 2019 3 commits