- 27 Dec, 2012 1 commit
-
-
jette authored
For Slurm, we always want to treat a malloc failure as fatal.
-
- 22 Dec, 2012 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
stack.
-
- 21 Dec, 2012 8 commits
-
-
Morris Jette authored
If sched/backfill starts a job with a QOS having NO_RESERVE and not job time limit, start it with the partition time limit (or one year if the partition has no time limit) rather than NO_VAL (140 year time limit); If a standby job, which in this case has the NO_RESERVE flag set, is submitted without a time limit, and is backfilled, it will get an EndTime waaayyyy into the future. JobId=99 Name=cmdll UserId=eckert(1043) GroupId=eckert(1043) Priority=12083 Account=sa QOS=standby JobState=RUNNING Reason=None Dependency=(null) Requeue=1 Restarts=0 BatchFlag=1 ExitCode=0:0 RunTime=00:00:14 TimeLimit=12:00:00 TimeMin=N/A SubmitTime=2012-12-20T11:49:36 EligibleTime=2012-12-20T11:49:36 StartTime=2012-12-20T11:49:44 EndTime=2149-01-26T18:16:00 so I looked at the code in /src/plugins/sched/backfill: if (job_ptr->start_time <= now) { int rc = _start_job(job_ptr, resv_bitmap); if (qos_ptr && (qos_ptr->flags & QOS_FLAG_NO_RESERVE)){ job_ptr->time_limit = orig_time_limit; job_ptr->end_time = job_ptr->start_time + (orig_time_limit * 60); Using the debugger I found that if the job does not have a specified time limit, the job_ptr->time_limit is equal to NO_VAL when it hits this code.
-
Danny Auble authored
-
Danny Auble authored
2.5.
-
https://github.com/SchedMD/slurmjette authored
-
jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Identify node states on which HealthCheckProgram should be executed.
-
- 20 Dec, 2012 9 commits
-
-
Danny Auble authored
-
Danny Auble authored
slurm.conf with NodeAddr's signals going to a step could be handled incorrectly.
-
Danny Auble authored
would of also killed the allocation.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
This is a variation of "slurm_load_nodes", but accepts a node name argument, potentially resultingin substantial performance improvement for "sinfo --nodes=NAME".
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
This is a variation of "slurm_load_jobs", but accepts a user ID argument, potentially resulting in substantial performance improvement.
-
- 19 Dec, 2012 13 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
to make one job run.
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
jette authored
-
Danny Auble authored
-
Danny Auble authored
-N1 -n#.
-
Morris Jette authored
-
Morris Jette authored
This adds the fields to the data structures and configuration file to be used later for executing a program at the beginning and end of each reservation.
-
- 18 Dec, 2012 7 commits
-
-
https://github.com/SchedMD/slurmjette authored
-
jette authored
-
Danny Auble authored
related to it.
-
Danny Auble authored
deprecated.
-
Morris Jette authored
-
Kent Engström authored
This is useful in a submit plugin script that needs to do different things depending on the account, as the the setting of account from default account does not happen until after the script has run.
-
Morris Jette authored
-