- 11 Oct, 2013 3 commits
-
-
Martin Perry authored
-
Morris Jette authored
Previous logic only reported the un-reserved node map. New logging adds information about each job tested and where and when it is scheduled to receive resources.
-
Danny Auble authored
slurm.conf when using the DBD.
-
- 10 Oct, 2013 1 commit
-
-
jette authored
Induced by bf_continue option and deleting a partition.
-
- 09 Oct, 2013 3 commits
-
-
David Bigagli authored
to reflect only the latest supported format.
-
Morris Jette authored
If the bf_continue option is configured and Slurm is reconfigured during one of the sleep cycles, the backfill scheduler will reference an invalid partition pointer.
-
Morris Jette authored
Previous logic would place more tasks on each node than specified by --ntasks-per-node, using fewer nodes than desired. This only happens with exclusive node allocations (e.g. in partition configuration Shared=Exclusive).
-
- 08 Oct, 2013 2 commits
-
-
Morris Jette authored
EpilogSlurmctld pthread is passed required arguments rather than a pointer to the job record, which under some conditions could be purged and result in an invalid memory reference.
-
Morris Jette authored
EpilogSlurmctld pthread is passed required arguments rather than a pointer to the job record, which under some conditions could be purged and result in an invalid memory reference.
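The fix described above (hand the thread its own copy of the arguments instead of a pointer to a record that may be purged) can be sketched roughly as follows. This is a minimal illustration, not the actual slurmctld code: `struct job_rec`, `struct epilog_args`, `copy_string`, `epilog_args_create`, and `epilog_thread` are all hypothetical names, and the set of copied fields is an assumption.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, heavily simplified stand-in for a job record. */
struct job_rec {
    uint32_t job_id;
    char *nodes;
};

/* Hypothetical argument bundle owned entirely by the thread. */
struct epilog_args {
    uint32_t job_id;
    char *nodes;
};

/* Portable string copy (avoids relying on the POSIX-only strdup). */
static char *copy_string(const char *s)
{
    if (!s)
        return NULL;
    size_t len = strlen(s) + 1;
    char *dup = malloc(len);
    if (dup)
        memcpy(dup, s, len);
    return dup;
}

/* Copy every field the thread needs so it never dereferences the job record. */
struct epilog_args *epilog_args_create(const struct job_rec *job)
{
    assert(job != NULL);
    struct epilog_args *args = malloc(sizeof(*args));
    if (!args)
        return NULL;
    args->job_id = job->job_id;
    args->nodes = copy_string(job->nodes);
    if (job->nodes && !args->nodes) {
        free(args);
        return NULL;
    }
    return args;
}

/* Thread body with a pthread-compatible signature: it uses only its
 * private copy and releases that copy when done. */
void *epilog_thread(void *arg)
{
    struct epilog_args *args = arg;
    /* ... run the epilog using args->job_id and args->nodes ... */
    free(args->nodes);
    free(args);
    return NULL;
}
```

In a threaded caller, the result of `epilog_args_create` would be passed as the last argument of `pthread_create` with `epilog_thread` as the start routine. The job record can then be purged at any time without affecting the running thread.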
-
- 07 Oct, 2013 2 commits
-
-
Don Lipari authored
reboot of a block if only jobs in the list are running on it when cnodes go into a failure state.
-
David Bigagli authored
-
- 04 Oct, 2013 3 commits
-
-
Nathan Yee authored
_p_ in the plugin function names _g_ in the global function names
-
Morris Jette authored
-
David Bigagli authored
This functionality has been obsoleted by the LogTimeFormat.
-
- 03 Oct, 2013 9 commits
-
-
Morris Jette authored
Modify slurmctld message retry logic to support Cray cold-standby SDB.
-
Rod Schultz authored
-
Morris Jette authored
-
Morris Jette authored
From "Missing time limit" to "Time limit specification required, but not provided"
-
Morris Jette authored
-
David Bigagli authored
segfault.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 02 Oct, 2013 4 commits
-
-
David Bigagli authored
is not parsed correctly.
-
Morris Jette authored
gres/gpu and gres/mic - Do not treat the existence of an empty gres.conf file as a fatal error. There may be no gres devices on that node so we do not require the file. Assume gres counts of zero if no file.
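The "assume gres counts of zero if no file" behaviour can be illustrated with a small sketch. It is not the plugin's real parser: the function name `gres_count_from_file` is hypothetical, and counting lines that start with `Name=` is a simplifying assumption about the file format.

```c
#include <assert.h>
#include <errno.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: returns 0 on success and -1 on a real I/O error.
 * A missing file is not an error; it simply yields a count of zero. */
int gres_count_from_file(const char *path, uint32_t *count)
{
    assert(path != NULL && count != NULL);
    *count = 0;

    FILE *fp = fopen(path, "r");
    if (!fp)
        return (errno == ENOENT) ? 0 : -1;

    char line[256];
    while (fgets(line, sizeof(line), fp)) {
        /* Assumption: one device per line beginning with "Name=". */
        if (strncmp(line, "Name=", 5) == 0)
            (*count)++;
    }
    fclose(fp);
    return 0;
}
```

An empty file takes the same path as a missing one here: the loop body never runs and the count stays at zero.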
-
Morris Jette authored
bug 436
-
Morris Jette authored
bug 436
-
- 01 Oct, 2013 8 commits
-
-
Morris Jette authored
just print the job ID rather than "Submitted batch job #"
-
Eric Winter authored
job CPU count not loaded correctly. Partition time limit format wrong (minutes rather than hhmmss format).
-
Eric Winter authored
-
Eric Winter authored
-
Morris Jette authored
Convert bitmap functions to use int32_t instead of int in data structures and function arguments. This is to reliably enable use of bitmaps containing up to 4 billion elements. Several data structures containing index values were also changed from data type int to int32_t:
  - Struct job_info / slurm_job_info_t: Changed exc_node_inx, node_inx, and req_node_inx from type int to type int32_t
  - job_step_info_t: Changed node_inx from type int to type int32_t
  - Struct partition_info / partition_info_t: Changed node_inx from type int to type int32_t
  - block_job_info_t: Changed cnode_inx from type int to type int32_t
  - block_info_t: Changed ionode_inx and mp_inx from type int to type int32_t
  - Struct reserve_info / reserve_info_t: Changed node_inx from type int to type int32_t
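For code that consumes these index arrays, the type change mostly means declaring loop inputs as `int32_t`. The sketch below assumes the usual layout of a `node_inx` array (pairs of start and end indexes, terminated by -1). The helper name `node_inx_count` is hypothetical and not part of the Slurm API.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Count the nodes described by an index array of (start, end) pairs
 * terminated by -1. Assumption: ranges are inclusive and well formed. */
int64_t node_inx_count(const int32_t *node_inx)
{
    assert(node_inx != NULL);
    int64_t total = 0;
    for (size_t i = 0; node_inx[i] != -1; i += 2) {
        /* Widen before subtracting so large ranges cannot overflow. */
        total += (int64_t)node_inx[i + 1] - node_inx[i] + 1;
    }
    return total;
}
```

With the array `{0, 3, 10, 10, -1}` the function covers the ranges 0-3 and 10-10, which is five nodes in total.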
-
Danny Auble authored
isn't doing the launching.
-
David Bigagli authored
updated the srun man page.
-
Morris Jette authored
-
- 30 Sep, 2013 3 commits
-
-
Danny Auble authored
-
Morris Jette authored
sched/backfill - Prevent possible memory corruption due to use of bf_continue option and long running scheduling cycle (pending jobs could have been cancelled and purged).
-
Morris Jette authored
Change max message length from 100MB to 1GB before generating "Insane message length" error.
-
- 27 Sep, 2013 2 commits
-
-
Danny Auble authored
can only be a TORUS (1).
-
Danny Auble authored
-