1. 28 Nov, 2017 9 commits
  2. 27 Nov, 2017 2 commits
  3. 24 Nov, 2017 1 commit
    • Brian Christiansen's avatar
      Show federated jobs over local cluster's copy · 53eb17a8
      Brian Christiansen authored
      If a pending federated job exists on clusters 2 and 3 and squeue is run
      from cluster 1 then the active siblings can come and go depending on
      which cluster returns the job info first and depending if that cluster
      is the origin cluster or not. The origin cluster only knows where the
      active siblings are.
      53eb17a8
  4. 22 Nov, 2017 4 commits
  5. 21 Nov, 2017 5 commits
  6. 20 Nov, 2017 2 commits
  7. 17 Nov, 2017 1 commit
  8. 16 Nov, 2017 4 commits
  9. 15 Nov, 2017 5 commits
  10. 14 Nov, 2017 1 commit
  11. 13 Nov, 2017 1 commit
  12. 10 Nov, 2017 2 commits
  13. 09 Nov, 2017 3 commits
    • Morris Jette's avatar
      launch/slurm plugin fix global memory re-use · a23c1032
      Morris Jette authored
      launch/slurm plugin - Avoid using global variable for heterogeneous job
          steps, which could corrupt memory.
      bug 4333
      a23c1032
    • Morris Jette's avatar
      Retry MPI reserved port logic only for non-pack job steps · d64a5f67
      Morris Jette authored
      Ancient versions of OpenMPI and their derivatives (i.e. Cray MPI) are
      dependent upon communication ports being assigned to them by Slurm. Such MPI
      jobs will experience step launch failure if any component of a
      heterogeneous job step is unable to acquire the allocated ports.
      Non-heterogeneous job steps will retry step launch using a new set of
      communication ports (no change in Slurm behavior).
      
      NOTE: Correcting this would necessitate assigning the same set of ports
      to all components of the heterogeneous job (not possible today) plus changes to
      srun in order to better synchronize the step startup and error handling.
      d64a5f67
    • Dominik Bartkiewicz's avatar
      If any acct_gather_*_init fails fatal instead of error and keep going. · f4bf82c3
      Dominik Bartkiewicz authored
      Same logic as done in commit fb296c70 done for energy.
      
      Bug 4336
      f4bf82c3