1. 13 May, 2015 6 commits
  2. 12 May, 2015 3 commits
  3. 11 May, 2015 2 commits
    • Morris Jette's avatar
      Explain job option of --mem=0 means all memory · c0d6edc9
      Morris Jette authored
      This is a special case. This change documents the way Slurm has
      always worked.
      c0d6edc9
    • Morris Jette's avatar
      Purge old step data on job requeue · beecc7b0
      Morris Jette authored
      Make sure that old step data is purged when a job is requeued.
      Without this logic, if a job terminates abnormally then old step
      data may be left in slurmctld. If the job is then requeued and
      started on a different node, referencing that old job step data
      can result in abnormal events. One specific failure mode is if
      the job is requeued on a node with a different number of cores,
      and the step terminated RPC arrives later, the job and step
      bitmaps of allocated cores can differ in size generating an
      abort.
      bug 1660
      beecc7b0
  4. 08 May, 2015 4 commits
  5. 07 May, 2015 4 commits
  6. 06 May, 2015 2 commits
  7. 05 May, 2015 1 commit
  8. 01 May, 2015 3 commits
  9. 30 Apr, 2015 6 commits
  10. 29 Apr, 2015 7 commits
  11. 28 Apr, 2015 2 commits
    • Morris Jette's avatar
      Change SchedulingParameters sched_min_interval meaning · 5ab69ccb
      Morris Jette authored
      Make this be the minimum time between the end of one scheduling
      cycle and the start of the next cycle (rather than using start
      times for both).
      Set the default value to 1,000,000 microseconds for Cray/ALPS
      systems.
      5ab69ccb
    • Morris Jette's avatar
      scancel logic changes · 225a1dea
      Morris Jette authored
      Refactor scancel so that all pending jobs are cancelled before starting
      cancellation of running jobs. Otherwise they happen in parallel and the
      pending jobs can be scheduled on resources as the running jobs are being
      cancelled.
      225a1dea