- 14 Jan, 2013 1 commit
-
-
Morris Jette authored
Correction to CPU allocation count logic in for cores without hyperthreading.
-
- 11 Jan, 2013 1 commit
-
-
Morris Jette authored
-
- 10 Jan, 2013 3 commits
-
-
Morris Jette authored
Used to specify the communication protocol to be used for ALPS/BASIL.
-
jette authored
-
Morris Jette authored
-
- 09 Jan, 2013 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
allow the use of accounting features like associations, qos and limits but not keep track of jobs or steps in accounting.
-
- 08 Jan, 2013 3 commits
-
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
Phase 1 of effort. See "man sbatch" option -a/--array option for details. Creates job records using sbatch. Reports job arrays using scontrol or squeue. More work coming soon...
-
- 03 Jan, 2013 5 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 28 Dec, 2012 1 commit
-
-
Morris Jette authored
-
- 22 Dec, 2012 1 commit
-
-
Danny Auble authored
stack.
-
- 21 Dec, 2012 3 commits
-
-
Morris Jette authored
If sched/backfill starts a job with a QOS having NO_RESERVE and not job time limit, start it with the partition time limit (or one year if the partition has no time limit) rather than NO_VAL (140 year time limit); If a standby job, which in this case has the NO_RESERVE flag set, is submitted without a time limit, and is backfilled, it will get an EndTime waaayyyy into the future. JobId=99 Name=cmdll UserId=eckert(1043) GroupId=eckert(1043) Priority=12083 Account=sa QOS=standby JobState=RUNNING Reason=None Dependency=(null) Requeue=1 Restarts=0 BatchFlag=1 ExitCode=0:0 RunTime=00:00:14 TimeLimit=12:00:00 TimeMin=N/A SubmitTime=2012-12-20T11:49:36 EligibleTime=2012-12-20T11:49:36 StartTime=2012-12-20T11:49:44 EndTime=2149-01-26T18:16:00 so I looked at the code in /src/plugins/sched/backfill: if (job_ptr->start_time <= now) { int rc = _start_job(job_ptr, resv_bitmap); if (qos_ptr && (qos_ptr->flags & QOS_FLAG_NO_RESERVE)){ job_ptr->time_limit = orig_time_limit; job_ptr->end_time = job_ptr->start_time + (orig_time_limit * 60); Using the debugger I found that if the job does not have a specified time limit, the job_ptr->time_limit is equal to NO_VAL when it hits this code.
-
Danny Auble authored
2.5.
-
Morris Jette authored
Identify node states on which HealthCheckProgram should be executed.
-
- 20 Dec, 2012 4 commits
-
-
Danny Auble authored
slurm.conf with NodeAddr's signals going to a step could be handled incorrectly.
-
Danny Auble authored
would of also killed the allocation.
-
Morris Jette authored
This is a variation of "slurm_load_nodes", but accepts a node name argument, potentially resultingin substantial performance improvement for "sinfo --nodes=NAME".
-
Morris Jette authored
This is a variation of "slurm_load_jobs", but accepts a user ID argument, potentially resulting in substantial performance improvement.
-
- 19 Dec, 2012 5 commits
-
-
Danny Auble authored
to make one job run.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-N1 -n#.
-
Morris Jette authored
This adds the fields to the data structures and configuration file to be used later for executing a program at the beginning and end of each reservation.
-
- 18 Dec, 2012 1 commit
-
-
Morris Jette authored
-
- 17 Dec, 2012 4 commits
-
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Chris Read authored
-
- 14 Dec, 2012 4 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Chris Reed authored
Without this patch, use of sched/builtin would always result in FIFO scheduling, even if priority/multifactor was configured
-
Danny Auble authored
-
- 13 Dec, 2012 2 commits
-
-
jette authored
-
Danny Auble authored
each block independently.
-