1. 03 Feb, 2012 1 commit
    • Morris Jette's avatar
      Fix for srun with --exclude and --nodes · a4551158
      Morris Jette authored
      Fix for srun allocating running within existing allocation with --exclude
      option and --nnodes count small enough to remove more nodes.
      
          > salloc -N 8
          salloc: Granted job allocation 1000008
          > srun -N 2 -n 2 --exclude=tux3 hostname
          srun: error: Unable to create job step: Requested node configuration is not available
      
      Patch from Phil Eckert, LLNL.
      a4551158
  2. 02 Feb, 2012 1 commit
  3. 01 Feb, 2012 2 commits
    • Morris Jette's avatar
      Fix job requeue bug · c0a7a7a4
      Morris Jette authored
      Fix bug when requeued batch job is scheduled to run on a different node
      zero, but attemts job launch on old node zero causing fatal error
      "Invalid host_index -1 for job #"
      c0a7a7a4
    • Morris Jette's avatar
      Avoid slurmctld abort due to bad pointer · 43936335
      Morris Jette authored
      Avoid slurmctld abort due to bad pointer when setting an advanced
      reservation MAINT flag if it contains no nodes (only licenses).
      43936335
  4. 31 Jan, 2012 3 commits
  5. 27 Jan, 2012 2 commits
  6. 25 Jan, 2012 1 commit
    • Morris Jette's avatar
      Set DEFAULT flag in partition structure · 9f4ef925
      Morris Jette authored
      Set DEFAULT flag in partition structure when slurmctld reads the
      configuration file. Patch from Rémi Palancher. Note the flag is set
      when the information is sent via RPC for sinfo.
      9f4ef925
  7. 24 Jan, 2012 1 commit
  8. 20 Jan, 2012 1 commit
  9. 19 Jan, 2012 1 commit
  10. 18 Jan, 2012 1 commit
  11. 13 Jan, 2012 3 commits
  12. 09 Jan, 2012 2 commits
  13. 28 Dec, 2011 1 commit
  14. 21 Dec, 2011 1 commit
  15. 19 Dec, 2011 1 commit
  16. 17 Dec, 2011 1 commit
  17. 15 Dec, 2011 1 commit
  18. 14 Dec, 2011 1 commit
  19. 09 Dec, 2011 4 commits
  20. 08 Dec, 2011 1 commit
  21. 06 Dec, 2011 1 commit
    • Morris Jette's avatar
      Permit pending job to exeeded partition limit with QOS flag change. · 0e1abeda
      Morris Jette authored
      One of our testers discovered a regression in version 2.3.1.  If a job is
      pending due to PartitionNodeLimit and the limit is relieved with a
      'sacctmgr modify qos name=<qos name> set flags=partitionmaxnodes' new jobs
      exceeding the partition limit (but not the QOS limit) are allowed to run.
      However, the pending job is never allowed to run.  Attached is a patch to
      address this problem.  FYI, this problem doesn't exist in version 2.4.
      Patch from Bill Brophy, Bull.
      0e1abeda
  22. 05 Dec, 2011 2 commits
  23. 02 Dec, 2011 1 commit
  24. 01 Dec, 2011 1 commit
    • jette's avatar
      Fix for "fatal: cons_res: sync loop not progressing" · d70a9ac4
      jette authored
      This was due to a bug in select/cons_res with some configuration
      optiions and job options, especially if there is more than one
      thread per core and the job option includes "--threads-per-core=1".
      Fixes problem reported by CSCS.
      d70a9ac4
  25. 30 Nov, 2011 2 commits
  26. 23 Nov, 2011 1 commit
  27. 22 Nov, 2011 2 commits