1. 30 May, 2012 2 commits
  2. 29 May, 2012 1 commit
  3. 25 May, 2012 2 commits
    • Change SchedulerParameters option from "bf_res=" to "bf_resolution=" · 0f590296
      Rod Schultz authored
      This change makes the code consistent with the documentation.
      Note that "bf_res=" will continue to be recognized for now.
      Patch from Rod Schultz, Bull.
    • Modify scontrol show job to require -dd option to print batch script · 8ed1b303
      Don Albert authored
      I have implemented the changes as you suggested: a "-dd" option indicates that the script should be displayed, and it sets both the "SHOW_DETAIL" flag and a new "SHOW_DETAIL2" flag.
      
      Since "scontrol" can also be run interactively, I added a new "script" option that displays both the script and the details if the job is a batch job.
      
      Here are the updates to the scontrol man page. For the "-d, --details" option:
      
             -d, --details
                    Causes  the  show command to provide additional details where available.  Repeating the option more than
                    once (e.g., "-dd") will cause the show job command to also list the batch script, if the job was a batch
                    job.
      
      For the interactive "details" option:
      
             details
                    Causes  the  show  command  to provide additional details where available.  Job information will include
                    CPUs and NUMA memory allocated on each node.  Note that on computers  with  hyperthreading  enabled  and
                    SLURM  configured  to allocate cores, each listed CPU represents one physical core.  Each hyperthread on
                    that core can be allocated a separate task, so a job's CPU count and task count  may  differ.   See  the
                    --cpu_bind  and  --mem_bind  option  descriptions  in  srun man pages for more information.  The details
                    option is currently only supported for the show job command. To also list the  batch  script  for  batch
                    jobs, in addition to the details, use the script option described below instead of this option.
      
      And for the new interactive "script" option:
      
             script Causes the show job command to list the batch script for batch jobs in addition to the detail
                    information described under the details option above.
      
      Attached are the patch file for the changes and a text file with the results of the tests I ran to verify them. The patches are against SLURM 2.4.0-rc1.
      
              -Don Albert-
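The -dd behaviour described in the commit above can be illustrated with a small sketch. This is not scontrol's actual C implementation: it is a minimal Python model, assuming hypothetical bit values for the SHOW_DETAIL and SHOW_DETAIL2 flags (only the flag names come from the commit message) and hypothetical helper names show_flags_from_args and should_print_script.

```python
import argparse

# Flag names come from the commit message; the numeric values are assumptions
# made for this sketch and are not taken from the SLURM headers.
SHOW_DETAIL = 0x1
SHOW_DETAIL2 = 0x2


def show_flags_from_args(argv):
    """Translate repeated -d options into a show-flags bitmask."""
    parser = argparse.ArgumentParser(prog="scontrol-sketch")
    # action="count" makes "-d" yield 1 and "-dd" (or "-d -d") yield 2.
    parser.add_argument("-d", "--details", action="count", default=0)
    args = parser.parse_args(argv)

    flags = 0
    if args.details >= 1:
        flags |= SHOW_DETAIL
    if args.details >= 2:
        # Per the commit message, -dd sets both flags.
        flags |= SHOW_DETAIL2
    return flags


def should_print_script(flags, is_batch_job):
    """The batch script is listed only for batch jobs shown with -dd."""
    return is_batch_job and bool(flags & SHOW_DETAIL2)
```

Counting occurrences of one option keeps a plain -d backward compatible while letting -dd opt in to the longer output, which appears to be the intent of the change.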
  4. 24 May, 2012 3 commits
  5. 23 May, 2012 3 commits
  6. 22 May, 2012 1 commit
  7. 16 May, 2012 4 commits
  8. 11 May, 2012 1 commit
  9. 10 May, 2012 1 commit
  10. 09 May, 2012 2 commits
    • Reset priority of system held jobs when dependency is satisfied · 9e9298b1
      Don Lipari authored
      The symptom is that SLURM schedules lower-priority jobs to run ahead of higher-priority dependent jobs whose dependencies have been satisfied. This happens because dependent jobs still have a priority of 1 when the job queue is sorted in the schedule() function. The proposed fix forces jobs to have their priority updated when their dependencies are satisfied.
    • Reset priority of system held jobs when dependency is satisfied · bf9f2452
      Don Lipari authored
      The symptom is that SLURM schedules lower-priority jobs to run ahead of higher-priority dependent jobs whose dependencies have been satisfied. This happens because dependent jobs still have a priority of 1 when the job queue is sorted in the schedule() function. The proposed fix forces jobs to have their priority updated when their dependencies are satisfied.
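The ordering problem described in the two commits above can be reproduced in a few lines. This is a toy model, not SLURM's schedule() code: the Job class, the computed_priority field, and the helper functions are all hypothetical, and the only value taken from the commit message is the priority of 1 that a dependent job still carries when the queue is sorted.

```python
from dataclasses import dataclass

# Assumption for this sketch: a job waiting on a dependency is parked at
# priority 1, the value mentioned in the commit message.
HELD_PRIORITY = 1


@dataclass
class Job:
    name: str
    computed_priority: int  # what the priority logic would assign
    dependency_satisfied: bool = True
    priority: int = HELD_PRIORITY


def refresh_priorities(jobs):
    """Give each job whose dependency is satisfied its computed priority."""
    for job in jobs:
        if job.dependency_satisfied and job.priority == HELD_PRIORITY:
            job.priority = job.computed_priority


def schedule_order(jobs, refresh=True):
    """Return job names in the order the scheduler would consider them."""
    if refresh:
        refresh_priorities(jobs)
    runnable = [j for j in jobs if j.dependency_satisfied]
    ordered = sorted(runnable, key=lambda j: j.priority, reverse=True)
    return [j.name for j in ordered]
```

With refresh=False the model shows the reported symptom: a dependent job whose computed priority is high is still sorted behind a lower-priority job because its stored priority is stale. With refresh=True the stored priority is updated before the sort, so the dependent job is considered first.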
  11. 04 May, 2012 1 commit
  12. 03 May, 2012 1 commit
    • Fix segv in slurmctld for job step with relative option · 9bb178c3
      Matthieu Hautreux authored
      Here is how to reproduce it:
      [root@cuzco27 georgioy]# salloc -n64 -N4 --exclusive
      salloc: Granted job allocation 8
      [root@cuzco27 georgioy]#srun -r 0 -n 30 -N 2 sleep 300&
      [root@cuzco27 georgioy]#srun -r 1 -n 40 -N 3 sleep 300&
      [root@cuzco27 georgioy]# srun: error: slurm_receive_msg: Zero Bytes were transmitted or received
      srun: error: Unable to create job step: Zero Bytes were transmitted or received
  13. 02 May, 2012 1 commit
  14. 27 Apr, 2012 2 commits
  15. 26 Apr, 2012 2 commits
  16. 24 Apr, 2012 1 commit
  17. 23 Apr, 2012 2 commits
  18. 20 Apr, 2012 1 commit
  19. 18 Apr, 2012 1 commit
  20. 17 Apr, 2012 3 commits
  21. 12 Apr, 2012 1 commit
  22. 10 Apr, 2012 4 commits