1. 17 Mar, 2014 3 commits
  2. 16 Mar, 2014 3 commits
    • Export "SLURM*" env vars if --export=NONE · 9b4f3634
      Morris Jette authored
      Previously, if the sbatch --export=NONE option was used, several
      Slurm environment variables were not propagated from the sbatch
      command (SLURM_SUBMIT_DIR, SLURM_SUBMIT_HOST, SLURM_JOB_NAME, etc.).
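The effect described in this commit can be pictured with a small filter over the submission environment. This is only a sketch of the idea, not the sbatch implementation: the function name `job_environment` and the prefix test are assumptions made for illustration.

```python
def job_environment(submit_env, export_none):
    """Return the environment a job would receive (illustrative only).

    submit_env:  mapping of variable names to values at submission time.
    export_none: True when the user asked for --export=NONE.
    """
    if not export_none:
        # Assumption: without --export=NONE everything is passed through.
        return dict(submit_env)
    # With --export=NONE, user variables are dropped, but variables whose
    # names start with "SLURM" are still propagated.
    return {name: value for name, value in submit_env.items()
            if name.startswith("SLURM")}
```

Under these assumptions, `PATH` would be dropped while `SLURM_SUBMIT_DIR` and `SLURM_JOB_NAME` would still reach the job.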
    • schedule enhancement for reservation · 08f0f57c
      Morris Jette authored
      Scheduler enhancements for reservations: when a job needs to run in
      a reservation but cannot because its resources are busy, do not block
      all jobs in that partition from being scheduled; block only the jobs
      in that reservation.
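The blocking rule described in this commit can be sketched as a toy priority-order loop. This is not the slurmctld scheduler: the `Job` class, the `free_nodes` layout (idle node counts keyed by reservation name, with `None` for unreserved nodes), and the treatment of non-reservation jobs are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Job:
    job_id: int
    partition: str
    reservation: Optional[str]  # None if the job requests no reservation
    nodes_needed: int


def schedule(queue, free_nodes):
    """Return the IDs of the jobs started, in the order they were started.

    queue is assumed to be sorted by priority already.
    """
    free = dict(free_nodes)
    blocked_reservations = set()
    blocked_partitions = set()
    started = []
    for job in queue:
        if job.reservation is not None:
            if job.reservation in blocked_reservations:
                continue  # a higher-priority job in this reservation is waiting
        elif job.partition in blocked_partitions:
            continue  # a higher-priority unreserved job in this partition is waiting
        if free.get(job.reservation, 0) >= job.nodes_needed:
            free[job.reservation] -= job.nodes_needed
            started.append(job.job_id)
        elif job.reservation is not None:
            # Block only this reservation, not the whole partition.
            blocked_reservations.add(job.reservation)
        else:
            blocked_partitions.add(job.partition)
    return started
```

In this sketch, a reservation job that cannot start holds back later jobs in the same reservation only, so an unreserved job behind it in the same partition can still be started.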
    • Reset node's CpuLoad more frequently · fae55cbe
      Morris Jette authored
      Reset a node's CpuLoad value at least once each SlurmdTimeout seconds.
      Previously the value would not be reset unless communications with the
      slurmd did not happen for at least 1/3 of the SlurmdTimeout value.
      That means nodes that were actively running and terminating jobs would
      not get the CpuLoad value reset in a timely fashion. Added a CpuLoad
      reset timer to prevent this.
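The change in when a refresh becomes due can be sketched as two predicates. This is a hedged illustration of the conditions stated in the commit message, not the controller's code: the function names and the timestamp arguments (`last_response`, `last_load_update`, both in seconds) are hypothetical.

```python
def old_refresh_due(now, last_response, slurmd_timeout):
    """Prior behaviour as described: refresh only after the slurmd has been
    silent for at least one third of SlurmdTimeout."""
    return now - last_response >= slurmd_timeout / 3


def new_refresh_due(now, last_response, last_load_update, slurmd_timeout):
    """Sketch of the added timer: also refresh when the CpuLoad value itself
    is at least SlurmdTimeout seconds old, however chatty the node is."""
    return (old_refresh_due(now, last_response, slurmd_timeout)
            or now - last_load_update >= slurmd_timeout)
```

A busy node that responded 10 seconds ago but whose CpuLoad is 400 seconds old, with a timeout of 300, would be skipped by the first predicate and picked up by the second.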
  3. 15 Mar, 2014 3 commits
    • retry slurm.conf file · 42081d87
      Morris Jette authored
      Add logic to sleep and retry if slurm.conf can't be read.
      Without this, the slurmd daemons may die, and when the SlurmdTimeout
      is reached the nodes will be marked DOWN and their jobs will be
      killed.
      In the long term, it would be good to exit only if the read fails
      on program startup, and to have the daemons keep running with the old
      configuration on reconfiguration, but I don't have time to do that work now.
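The sleep-and-retry idea can be sketched generically as follows. The commit message does not state how many attempts are made or how long the daemon sleeps, so `retries` and `delay` are assumed values, and `read_config_with_retry` is a hypothetical name, not a Slurm function.

```python
import time


def read_config_with_retry(path, retries=5, delay=1.0, sleep=time.sleep):
    """Read a configuration file, sleeping and retrying on I/O errors.

    retries and delay are illustrative defaults; sleep is injectable so
    the behaviour can be exercised without real waiting.
    """
    last_error = None
    for attempt in range(retries):
        try:
            with open(path, "r", encoding="utf-8") as handle:
                return handle.read()
        except OSError as err:
            last_error = err
            if attempt < retries - 1:
                sleep(delay)  # give a transient failure time to clear
    # Every attempt failed: surface the last error to the caller.
    raise last_error
```

Retrying covers transient failures, such as a shared file system being briefly unavailable, without the daemon exiting on the first failed read.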
    • job_submit/lua for error memory reference · e2524968
      Morris Jette authored
      Fix an invalid memory reference when the script returns an error
      message for the user. The previous code failed to set a static variable
      to NULL, resulting in an xfree of memory already freed elsewhere.
    • Add support for Torque/PBS job arrays · 11968284
      Morris Jette authored
      Add support for job array options in the qsub command, in #PBS
      options for sbatch scripts and set the appropriate environment
      variables in the spank_pbs plugin (PBS_ARRAY_ID and PBS_ARRAY_INDEX).
      Note that Torque uses the "-t" option and PBS Pro uses the "-J"
      option.
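To illustrate the two spellings mentioned in this commit, the sketch below scans a batch script for a `#PBS` directive carrying either `-t` (Torque) or `-J` (PBS Pro) and returns the array specification string. It is not the parser used by sbatch or qsub; `pbs_array_spec` is a hypothetical helper, and it assumes the option and its value are separated by whitespace.

```python
import shlex


def pbs_array_spec(script_text):
    """Return the first job array spec found in #PBS lines, or None."""
    for line in script_text.splitlines():
        stripped = line.strip()
        if not stripped.startswith("#PBS"):
            continue
        # Tokenise everything after the "#PBS" marker like a shell would.
        tokens = shlex.split(stripped[len("#PBS"):])
        for index, token in enumerate(tokens):
            # Torque spells the array option "-t", PBS Pro spells it "-J".
            if token in ("-t", "-J") and index + 1 < len(tokens):
                return tokens[index + 1]
    return None
```

A wrapper could then hand the returned string, for example `0-9`, to the scheduler's own job array option.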
  4. 14 Mar, 2014 4 commits
  5. 13 Mar, 2014 5 commits
  6. 11 Mar, 2014 3 commits
  7. 10 Mar, 2014 2 commits
  8. 08 Mar, 2014 2 commits
  9. 07 Mar, 2014 6 commits
  10. 06 Mar, 2014 3 commits
  11. 05 Mar, 2014 1 commit
  12. 04 Mar, 2014 5 commits