1. 21 Mar, 2016 6 commits
  2. 18 Mar, 2016 4 commits
    • Morris Jette's avatar
      Added SchedulingParameters option of "bf_min_prio_reserve" · 45560872
      Morris Jette authored
      Jobs below the specified threshold will not have resources reserved for them.
      bug 2565
      45560872
    • Tim Wickberg's avatar
      Fix typo. · 886df85b
      Tim Wickberg authored
      886df85b
    • Morris Jette's avatar
      Merge branch 'slurm-15.08' · 78d1bcf8
      Morris Jette authored
      78d1bcf8
    • Morris Jette's avatar
      Fix for srun abort on SIGSTOP+SIGCONT · 1ed38f26
      Morris Jette authored
      Avoid possibly aborting srun that gets simultaneous SIGSTOP+SIGCONT while
          creating the job step. The result is that the signal hanlder gets a
          argument (the signal received) of zero.
      
      Here's a log, window 1:
      $ srun hostname
      srun: Job step creation temporarily disabled, retrying
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 18
      srun: I Got signal 0
      srun: Cancelled pending job step
      
      Window 2:
      $  kill -STOP 18696 ; kill -CONT 18696
      $  kill -STOP 18696 ; kill -CONT 18696
      $  kill -STOP 18696 ; kill -CONT 18696
      ....
      
      bug 2494
      1ed38f26
  3. 17 Mar, 2016 5 commits
    • Morris Jette's avatar
      Allocated CPU count fix for select/serial · 2178163f
      Morris Jette authored
      Copy logic from select/cons_res to select/serial that is equivalent
      to commit ec50cb2f
      2178163f
    • Morris Jette's avatar
      Change calculation of node's allocated CPUs · ec50cb2f
      Morris Jette authored
      Change how a node's allocated CPU count is calculated to avoid double
          counting CPUs allocated to multiple jobs at the same time.
          Previous logic would sum the maximum number of CPUs allocated by each
          partition for any time slice, which could double count CPUs allocated
          to multiple jobs. New logic ORs bitmap of allocated CPUs for every
          partition and time slice, then counts the total for a given node.
          This avoids double counting CPUs allocated to multiple jobs, but
          does not remove from the count CPUs which have been allocated to
          jobs which might be suspended by the gang scheduler (either for
          time slicing or preemption).
      ec50cb2f
    • Tim Wickberg's avatar
      Merge branch 'slurm-15.08' · 4770189e
      Tim Wickberg authored
      4770189e
    • Tim Wickberg's avatar
      Merge branch 'slurm-14.11' into slurm-15.08 · dd2324a7
      Tim Wickberg authored
      Update NEWS as well.
      dd2324a7
    • Tim Wickberg's avatar
      Prevent uid update from corrupting assoc_hash table. · 60b58b70
      Tim Wickberg authored
      The uid is used as part of the hash function, must remove old reference
      and recalculate if it may change, otherwise _delete_assoc_hash
      will not find it again when the association is removed, causing
      slurmctld to segfault.
      
      Bug 2560.
      60b58b70
  4. 16 Mar, 2016 15 commits
  5. 15 Mar, 2016 9 commits
  6. 14 Mar, 2016 1 commit