- 27 Jan, 2012 2 commits
-
-
Danny Auble authored
Lucero Palau.
-
Morris Jette authored
This patch was previously applied to SLURM v2.4 and is being back-ported due to problems being reported in SLURM v2.3. The original commit is here: https://github.com/SchedMD/slurm/commit/4c0eea7b8c20ccb1cacad51838a1ea8257cc637d
-
- 25 Jan, 2012 1 commit
-
-
Morris Jette authored
Set DEFAULT flag in partition structure when slurmctld reads the configuration file. Patch from Rémi Palancher. Note the flag is set when the information is sent via RPC for sinfo.
-
- 24 Jan, 2012 1 commit
-
-
Morris Jette authored
-
- 22 Jan, 2012 1 commit
-
-
jette authored
Fix race condition that could generate job_cnt_comp underflow errors on front-end architectures (Cray or IBM BlueGene systems).
-
- 20 Jan, 2012 1 commit
-
-
Morris Jette authored
Fix for possible invalid memory reference in slurmctld in job dependency logic. Patch from Carles Fenoy (Barcelona Supercomputer Center).
-
- 19 Jan, 2012 1 commit
-
-
Danny Auble authored
all jobs would be returned even if the flag was set. Patch from Bill Brophy, Bull.
-
- 18 Jan, 2012 2 commits
-
-
Morris Jette authored
Fix bug in --switch option with topology resulting in bad switch count use. Patch from Alejandro Lucero Palau (Barcelona Supercomputer Center).
-
Morris Jette authored
Avoid calling jobacct_gather_g_getinfo() until there is data to read from the socket.
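
The idea of checking a socket for readable data before calling a reader can be sketched as follows. This is only an illustration in Python, not the SLURM C code: the helper names `data_ready` and `read_if_ready` are hypothetical, and the actual check placed in front of jobacct_gather_g_getinfo() is not shown in this log.

```python
import select
import socket


def data_ready(sock, timeout=0.0):
    """Return True if sock has data to read within timeout seconds."""
    readable, _, _ = select.select([sock], [], [], timeout)
    return bool(readable)


def read_if_ready(sock, nbytes=4096, timeout=0.0):
    """Read from sock only when data is available; otherwise return None.

    Hypothetical stand-in for "call the reader only once there is data".
    """
    if not data_ready(sock, timeout):
        return None
    return sock.recv(nbytes)


if __name__ == "__main__":
    left, right = socket.socketpair()
    print(read_if_ready(left))      # nothing has been sent yet
    right.sendall(b"accounting")
    print(read_if_ready(left, timeout=1.0))
    left.close()
    right.close()
```

With this shape, the caller skips the read entirely instead of blocking on an empty socket.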
-
- 15 Jan, 2012 1 commit
-
-
jette authored
-
- 14 Jan, 2012 1 commit
-
-
Danny Auble authored
-
- 13 Jan, 2012 3 commits
-
-
Danny Auble authored
number.
-
Morris Jette authored
-
Morris Jette authored
Let operators see reservation data even if "PrivateData=reservations" flag is set in slurm.conf. Patch from Don Albert, Bull.
-
- 09 Jan, 2012 2 commits
-
-
Morris Jette authored
Fix bug in srun --multi-prog configuration file to avoid printing a duplicate record error when "*" is used at the end of the file for the task ID. The "*" means all task IDs not otherwise identified.
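
The intended meaning of a trailing "*" can be modeled with a small sketch. It is a simplified model, not srun's parser: the function names, the `(task_spec, program)` entry format, and the restriction to comma-separated IDs and simple ranges are all assumptions made for illustration.

```python
def parse_task_spec(spec, ntasks):
    """Expand a task spec such as "0,2-3" into a set of task IDs below ntasks."""
    ids = set()
    for part in spec.split(","):
        if "-" in part:
            lo, hi = part.split("-")
            ids.update(range(int(lo), int(hi) + 1))
        else:
            ids.add(int(part))
    return {i for i in ids if i < ntasks}


def assign_programs(entries, ntasks):
    """Map each task ID to a program.

    entries is a list of (task_spec, program) pairs in file order
    (hypothetical format). A "*" spec takes every task ID that no
    earlier entry claimed, so it never counts as a duplicate.
    """
    assigned = {}
    for spec, program in entries:
        if spec == "*":
            targets = [i for i in range(ntasks) if i not in assigned]
        else:
            targets = sorted(parse_task_spec(spec, ntasks))
            dup = [i for i in targets if i in assigned]
            if dup:
                raise ValueError(f"duplicate record for task IDs {dup}")
        for i in targets:
            assigned[i] = program
    return assigned


if __name__ == "__main__":
    entries = [("0", "master"), ("1-2", "worker"), ("*", "idle")]
    print(assign_programs(entries, 5))
```

In this model, explicit task IDs that overlap still raise an error, while "*" only fills the gaps.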
-
Morris Jette authored
Fix race condition where sbcast command can result in deadlock of slurmd daemon. Patch by Don Albert, Bull.
-
- 04 Jan, 2012 1 commit
-
-
jette authored
Made squeue -n and -w options more consistent with salloc, sbatch, srun, and scancel. Patch by Don Lipari, LLNL.
-
- 28 Dec, 2011 2 commits
-
-
Danny Auble authored
so you can specify any connection type you would like (TORUS or MESH) as the default in dynamic mode. Previously it always defaulted to TORUS.
-
Morris Jette authored
-
- 27 Dec, 2011 1 commit
-
-
jette authored
Add new command, sdiag, which reports a variety of job scheduling statistics. Based upon work by Alejandro Lucero Palau, BSC.
-
- 21 Dec, 2011 1 commit
-
-
Morris Jette authored
-
- 19 Dec, 2011 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
Behavior of srun --multi-prog modified so that any program arguments specified on the command line will be appended to the program arguments specified in the program configuration file.
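
A minimal sketch of the appending behavior described above, assuming the program and its arguments from the configuration file arrive as one string and the command-line arguments as a list. The function name `build_argv` is hypothetical, and the sketch ignores any substitution expressions the configuration file may contain.

```python
import shlex


def build_argv(config_command, cli_args):
    """Return the argument vector for one task.

    config_command: program plus arguments as written in the
                    configuration file (hypothetical input format).
    cli_args:       extra arguments given on the srun command line.
    The command-line arguments are appended after the file's arguments.
    """
    return shlex.split(config_command) + list(cli_args)


if __name__ == "__main__":
    print(build_argv("./worker --mode fast", ["--verbose", "input.dat"]))
```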
-
- 17 Dec, 2011 1 commit
-
-
Morris Jette authored
-
- 16 Dec, 2011 1 commit
-
-
Danny Auble authored
source dir.
-
- 15 Dec, 2011 1 commit
-
-
Morris Jette authored
Prevent resetting a held job's priority when updating other job parameters. Patch from Alejandro Lucero Palau, BSC.
-
- 14 Dec, 2011 2 commits
-
-
Morris Jette authored
Patch from John Thiltges, University of Nebraska-Lincoln.
-
Danny Auble authored
-
- 13 Dec, 2011 1 commit
-
-
Danny Auble authored
-
- 09 Dec, 2011 8 commits
-
-
Danny Auble authored
-
Danny Auble authored
starts in accounting.
-
Danny Auble authored
-
Morris Jette authored
Add an srun shepard process to cancel a job and/or step if the srun process is killed abnormally (e.g. SIGKILL).
-
Danny Auble authored
-
Danny Auble authored
starts in accounting.
-
Danny Auble authored
-
Morris Jette authored
-
- 08 Dec, 2011 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
- 06 Dec, 2011 1 commit
-
-
Morris Jette authored
One of our testers discovered a regression in version 2.3.1. If a job is pending due to PartitionNodeLimit and the limit is relieved with a 'sacctmgr modify qos name=<qos name> set flags=partitionmaxnodes' command, new jobs exceeding the partition limit (but not the QOS limit) are allowed to run. However, the pending job is never allowed to run. Attached is a patch to address this problem. FYI, this problem doesn't exist in version 2.4. Patch from Bill Brophy, Bull.
-