  1. 21 Mar, 2014 8 commits
  2. 20 Mar, 2014 6 commits
  3. 19 Mar, 2014 2 commits
  4. 18 Mar, 2014 7 commits
  5. 17 Mar, 2014 8 commits
  6. 16 Mar, 2014 4 commits
      Export "SLURM*" env vars if --export=NONE · 9b4f3634
      Morris Jette authored
      Previously, if the sbatch --export=NONE option was used, several
      Slurm environment variables were not propagated from the sbatch
      command (SLURM_SUBMIT_DIR, SLURM_SUBMIT_HOST, SLURM_JOB_NAME, etc.).
      schedule enhancement for reservation · 08f0f57c
      Morris Jette authored
      Scheduler enhancement for reservations: when a job needs to run in a
      reservation but cannot because the resources are busy, do not block all
      jobs in that partition from being scheduled; block only the jobs in that
      reservation.
      Correction for CpuLoad updates · e6dcb284
      Morris Jette authored
      This corrects logic in commit fae55cbe
      to properly support front-end system configurations
      Reset node's CpuLoad more frequently · fae55cbe
      Morris Jette authored
      Reset a node's CpuLoad value at least once each SlurmdTimeout seconds.
      Previously the value would not be reset unless communications with the
      slurmd did not happen for at least 1/3 of the SlurmdTimeout value.
      That means nodes that were actively running and terminating jobs would
      not get the CpuLoad value reset in a timely fashion. Added a CpuLoad
      reset timer to prevent this.
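The CpuLoad change in commit fae55cbe can be illustrated with a small sketch. This is not Slurm's code (Slurm is written in C). The function and parameter names below are hypothetical, and the only rules modelled are the two stated in the commit message: the old "silent for at least 1/3 of SlurmdTimeout" condition and the new "at least once each SlurmdTimeout seconds" timer.

```python
def old_rule_refreshes(now, last_response, slurmd_timeout):
    """Old behaviour as described in the commit message: CpuLoad is only
    refreshed when the slurmd has been silent for >= 1/3 of SlurmdTimeout."""
    return (now - last_response) >= slurmd_timeout / 3


def new_rule_refreshes(now, last_response, last_cpu_load_update, slurmd_timeout):
    """New behaviour (sketch): keep the old condition, and also refresh when
    the stored CpuLoad value is at least SlurmdTimeout seconds old."""
    silent = (now - last_response) >= slurmd_timeout / 3
    stale = (now - last_cpu_load_update) >= slurmd_timeout  # hypothetical reset timer
    return silent or stale


# A busy node: it talked to the controller 10 s ago, but its CpuLoad
# value was last updated 400 s ago (SlurmdTimeout assumed to be 300 s).
busy_old = old_rule_refreshes(now=1000, last_response=990, slurmd_timeout=300)
busy_new = new_rule_refreshes(now=1000, last_response=990,
                              last_cpu_load_update=600, slurmd_timeout=300)
```

With these assumed numbers, the old rule leaves the busy node's CpuLoad untouched and the timer-based rule refreshes it. That is the case the commit message describes for nodes actively running and terminating jobs.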
  7. 15 Mar, 2014 5 commits
      retry slurm.conf file · 42081d87
      Morris Jette authored
      Add logic to sleep and retry if slurm.conf can't be read.
      Without this, the slurmd daemons may die and when the SlurmdTimeout
      is reached, the nodes will be marked DOWN and their jobs will be
      killed.
      In the long term, it would be good to exit only if the read fails
      on program startup, and to have the daemons keep running with the old
      configuration on reconfiguration, but I don't have time to do that work now.
      remove vestigial function · 364a984d
      Morris Jette authored
      The function slurm_api_set_conf_file() is never referenced.
      Remove it. No change in logic.
      remove return code that is always zero · 63faf1f0
      Morris Jette authored
      No change in logic. Just remove redundant function return code.
      job_submit/lua for error memory reference · e2524968
      Morris Jette authored
      Fix an invalid memory reference if the script returns an error message
      for the user. The previous code failed to set a static variable to NULL,
      resulting in an xfree of memory previously freed elsewhere.
      add "scontrol requeuehold" to command's help · 2e2cd2b9
      Morris Jette authored
      Add requeuehold command to information generated by scontrol's
      help command
      bug 642
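The sleep-and-retry behaviour described in commit 42081d87 (retrying when slurm.conf cannot be read) might look roughly like the sketch below. It is an illustration only, not the Slurm implementation, which is in C. The function name, the retry count, the delay, and the injectable `opener` and `sleep` parameters are all assumptions made for the example, since the commit message does not state them.

```python
import time


def read_config_with_retry(path, attempts=5, delay=1.0, opener=open, sleep=time.sleep):
    """Read a configuration file, sleeping and retrying if it cannot be read.

    `attempts` and `delay` are hypothetical values; `opener` and `sleep`
    are injectable so the retry loop can be exercised without real waits.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            with opener(path) as handle:
                return handle.read()
        except OSError as exc:
            last_error = exc
            # Wait before the next attempt, but not after the final one.
            if attempt < attempts:
                sleep(delay)
    # Every attempt failed: surface the last error to the caller.
    raise last_error
```

A caller could use `read_config_with_retry("/path/to/slurm.conf")` at daemon start-up or reconfiguration. A brief outage of the file system holding the file then delays the daemon instead of terminating it, which is the failure mode the commit message is concerned with.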