- 29 Nov, 2013 2 commits
-
-
Morris Jette authored
proctrack/cgroup - Add locking to prevent race condition where one job step is ending for a user or job at the same time another job stepsis starting and the user or job container is deleted from under the starting job step. bug 447
-
David Bigagli authored
Substantial performance improvement for systems with Shared=YES or FORCE and large numbers of running jobs (replace bubble sort with quick sort). Bug 525
-
- 27 Nov, 2013 2 commits
-
-
Morris Jette authored
Original code worked only for Cray systems. For other systems it set gres_alloc to the total number of each GRES allocated on each node to any job
-
Morris Jette authored
Original code worked only for Cray systems. For other systems it set gres_alloc to the total number of each GRES allocated on each node to any job
-
- 26 Nov, 2013 3 commits
-
-
Chris Scheller authored
-
Morris Jette authored
-
David Bigagli authored
-
- 25 Nov, 2013 1 commit
-
-
Danny Auble authored
-
- 22 Nov, 2013 1 commit
-
-
David Bigagli authored
-
- 16 Nov, 2013 2 commits
-
-
Phil Eckert authored
-
Chrysovalantis Paschoulas authored
-
- 15 Nov, 2013 3 commits
-
-
Rod Schultz authored
limits are configured as 0.
-
Morris Jette authored
bug 511
-
Morris Jette authored
Add ability to clear a node's DRAIN flag using scontrol or sview by setting it's state to "UNDRAIN". The node's base state (e.g. "DOWN" or "IDLE") will not be changed. bug 514
-
- 14 Nov, 2013 1 commit
-
-
Morris Jette authored
bug 511
-
- 13 Nov, 2013 3 commits
-
-
Morris Jette authored
This might have worked fine for core reservations or when there are sufficient idle nodes to use, the the select_g_resv_test() function clears the node bitmap for nodes that it can not use and the reservation create logic did not restore that bitmap after a failed resource selection attempt. This logic restores the node bitmap on a failed call to select_g_resv_test() so we can add nodes to the bitmap of available nodes rather than having it repeatedly cleared. The logic also adds some performance enhancements that I will add to in the next commit.
-
David Bigagli authored
-
Morris Jette authored
This fixes a bug where a system is enforcing memory limits and the job already has a step running on some of the nodes then tries to start another step using some of those nodes. For example wwith DefMemPerNode configured and the select plugin enforcing memory limits, try: salloc -N2 bash $ srun -N1 sleep 10& $ srun -N2 hostname Without this patch, the second srun would fail instead of pend.
-
- 09 Nov, 2013 1 commit
-
-
David Bigagli authored
-
- 08 Nov, 2013 1 commit
-
-
Danny Auble authored
-
- 05 Nov, 2013 1 commit
-
-
Morris Jette authored
Correction to hostlist parsing bug introduced in v2.6.4 for hostlists with more than one numeric range in brackets (e.g. rack[0-3]_blade[0-63]"). bug505
-
- 04 Nov, 2013 6 commits
-
-
Morris Jette authored
Just a start for now
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Added -S/--core-spec option to salloc, sbatch and srun commands to reserve specialized cores for system use. Modify sview and scontrol to set/get core_spec struct job_info / slurm_job_info_t: Added core_spec struct job_descriptorjob_desc_msg_t: Added core_spec
-
David Bigagli authored
-
- 03 Nov, 2013 1 commit
-
-
jette authored
The system really can not handle larger job arrays without adding a job array data structure, but this puts some of the infrastructure in place now.
-
- 02 Nov, 2013 1 commit
-
-
Martins Innus authored
to offer a greater priority range based upon utilization.
-
- 01 Nov, 2013 3 commits
-
-
Morris Jette authored
Add argument to priority plugin's priority_p_reconfig function to note when the association and QOS used_cpu_run_secs field has been reset. Without this flag, we remove time on "scontrol setdebug" or "scontrol setdebugflag" that can result in used_cpu_run_secs going negative or otherwise get bad values. Correction to logic added in commit 6d793189 bug 423
-
Morris Jette authored
Add argument to priority plugin's priority_p_reconfig function to note when the association and QOS used_cpu_run_secs field has been reset. Without this flag, we remove time on "scontrol setdebug" or "scontrol setdebugflag" that can result in used_cpu_run_secs going negative or otherwise get bad values. Correction to logic added in commit 6d793189 bug 423
-
Morris Jette authored
Fix to work with change logic introduced in Slurm version 2.6.3 scheduling logic which prevented Maui/Moab from starting jobs.
-
- 31 Oct, 2013 1 commit
-
-
David Bigagli authored
-
- 30 Oct, 2013 3 commits
-
-
Matthias Jurenz authored
-
Morris Jette authored
Add support for -W block=true (wait for job completion) Clear PBS_NODEFILE environment variable Credit to NCSC
-
Morris Jette authored
-
- 29 Oct, 2013 3 commits
-
-
David Bigagli authored
-
Morris Jette authored
Add support for -W block=true (wait for job completion) Clear PBS_NODEFILE environment variable Credit to NCSC
-
Morris Jette authored
-
- 28 Oct, 2013 1 commit
-
-
Morris Jette authored
Add support for dependencies of job array elements (e.g. "sbatch --depend=afterok:123_4 ..."). This does not support depenendencies of ALL job array elements, only individual job array elements.
-