1. 22 Mar, 2014 1 commit
    • Morris Jette's avatar
      Fix sview abort when adding/removing columns · fbfd0e4d
      Morris Jette authored
      When adding or removing columns to most data types (jobs, partitions,
      nodes, etc.) on some system types an abort is generated. This appears
      to be because when columns displayed change, on some systems that
      changes the address of "model", while on others the address does not
      change (like my laptops). This fix explicitly sets the last_model to
      NULL when the columns are changed rather than relying upon the data
      structure's address to change.
      fbfd0e4d
  2. 21 Mar, 2014 8 commits
  3. 20 Mar, 2014 6 commits
  4. 19 Mar, 2014 2 commits
  5. 18 Mar, 2014 7 commits
  6. 17 Mar, 2014 8 commits
  7. 16 Mar, 2014 4 commits
    • Morris Jette's avatar
      Export "SLURM*" env vars if --export=NONE · 9b4f3634
      Morris Jette authored
      Previously if the sbatch --export=NONE option was used then several
      Slurm environment variables were not propagated from the sbatch
      command (SLURM_SUBMIT_DIR, SLURM_SUBMIT_HOST, SLURM_JOB_NAME, etc.)
      9b4f3634
    • Morris Jette's avatar
      schedule enhancement for reservation · 08f0f57c
      Morris Jette authored
      Scheduler enhancements for reservations: When a job needs to run in
      reservation, but can not due to busy resources, then do not block all jobs
      in that partition from being scheduled, but only the jobs in that
      reservation.
      08f0f57c
    • Morris Jette's avatar
      Correction for CpuLoad updates · e6dcb284
      Morris Jette authored
      This corrects logic in commit fae55cbe
      to properly support front-end system configurations
      e6dcb284
    • Morris Jette's avatar
      Reset node's CpuLoad more frequently · fae55cbe
      Morris Jette authored
      Reset a node's CpuLoad value at least once each SlurmdTimeout seconds.
      Previously the value would not be reset unless communications with the
      slurmd did not happen for at least 1/3 of the SlurmdTimeout value.
      That means nodes that were actively running and terminating jobs would
      not get the CpuLoad value reset in a timely fashion. Added a CpuLoad
      reset timer to prevent this.
      fae55cbe
  8. 15 Mar, 2014 4 commits
    • Morris Jette's avatar
      retry slurm.conf file · 42081d87
      Morris Jette authored
      Add logic to sleep and retry if slurm.conf can't be read.
      Without this, the slurmd daemons may die and when the SlurmdTimeout
      is reached, the nodes will be marked DOWN and their jobs will be
      killed.
      In the long term, it would be good to exit only if the read files
      on program startup, and the daemons keep running with old configuration
      on reconfiguration, but I don't have time to do that work now.
      42081d87
    • Morris Jette's avatar
      remove vestigial function · 364a984d
      Morris Jette authored
      The function slurm_api_set_conf_file() is never referenced.
      Remove it. No change in logic.
      364a984d
    • Morris Jette's avatar
      remove return code that is always zero · 63faf1f0
      Morris Jette authored
      No change in logic. Just remove redundant function return code.
      63faf1f0
    • Morris Jette's avatar
      job_submit/lua for error memory reference · e2524968
      Morris Jette authored
      Fix invalid memory reference if script returns error message
      for user. Previous code failed to set static variable to NULL
      resulting in xfree of memory previously freed elsewhere.
      e2524968