- 23 May, 2019 3 commits
-
-
Morris Jette authored
An MPI problem caused a test failure for me, and since the test is not running an MPI job, just disable the MPI plugin.
-
Brian Christiansen authored
CID 198639. Continuation of 8a1e5a52. Bug 6950.
-
Morris Jette authored
Mostly in copyright notices, but also in some comments and documents. Bug 7090.
-
- 22 May, 2019 16 commits
-
-
Tim Wickberg authored
-
Ben Roberts authored
Bug 7092.
-
Tim Wickberg authored
Can happen if SrunPortRange has been set too small, especially on shared login nodes launching multiple large-scale srun processes.
-
Ben Roberts authored
Bug 6916
-
Ben Roberts authored
Bug 6995
-
Brian Christiansen authored
Bug 6467
-
Brian Christiansen authored
-
Marshall Garey authored
Job steps that run on cloud nodes and use the alias_list (in other words, SlurmctldParameters=cloud_dns is not in slurm.conf) all talk directly back to the slurmctld. To make that happen, we set the parent rank of each stepd to -1. However, we also set the rank of each stepd to 0. This meant that when each stepd sent a REQUEST_STEP_COMPLETE RPC to the slurmctld, it would tell slurmctld to clean up node 0 in the step allocation. As a result, multi-node step allocations weren't cleaned up after the steps completed, which caused subsequent job steps to hang. The step allocations would only clean up properly at the end of the job. Ensure that each stepd uses the correct rank so that job steps are properly cleaned up after each step completes. Bug 6467.
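The effect described above can be illustrated with a small model. This is only a sketch, not Slurm code: `completed_nodes` and the rank lists are hypothetical names chosen for illustration, and the real logic lives in the slurmctld handling of REQUEST_STEP_COMPLETE.

```python
# Toy model (hypothetical, not Slurm's implementation): the controller marks
# a node of the step allocation as complete based on the rank each stepd reports.

def completed_nodes(reported_ranks, node_count):
    """Return the set of node indices the controller would mark complete."""
    done = set()
    for rank in reported_ranks:
        # Ignore ranks that fall outside the step allocation.
        if 0 <= rank < node_count:
            done.add(rank)
    return done

node_count = 3

# Before the fix: every stepd reports rank 0, so only node 0 is cleaned up.
buggy = completed_nodes([0, 0, 0], node_count)

# After the fix: each stepd reports its own rank, so every node is cleaned up.
fixed = completed_nodes([0, 1, 2], node_count)

print("buggy step fully cleaned up:", len(buggy) == node_count)
print("fixed step fully cleaned up:", len(fixed) == node_count)
```

In the buggy case the step never reaches the "all nodes complete" state, which matches the hang described in the commit message.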
-
Alejandro Sanchez authored
They were associated with these two commits: b4d7de48 and 6871185a. Bug 5562.
-
Alejandro Sanchez authored
-
Alejandro Sanchez authored
They were associated with these two commits: b4d7de48 and 6871185a. Bug 5562.
-
Morris Jette authored
Bug 6998.
-
Morris Jette authored
Modify "scontrol show dwstat" test to work if no active sessions, configurations, etc.
-
Morris Jette authored
Correct some logic in commit 38daa7a9. The variable "file_in" did not exist in some places where the function was called from.
-
Morris Jette authored
Modify test to work on cray/kachina system when allocated a KNL node.
-
Morris Jette authored
Test broke on cray/kachina when allocated a KNL node.
-
- 21 May, 2019 21 commits
-
-
Dominik Bartkiewicz authored
Bug 6822
-
Moe Jette authored
Bug 7061
-
Danny Auble authored
Bug 7061
-
Danny Auble authored
Bug 7061 Co-authored-by: Morris Jette <jette@schedmd.com>
-
Tim Wickberg authored
-
Morris Jette authored
Error reported by CLANG. Cherry pick to 18.08. Bug 6996.
-
Morris Jette authored
Jobs with zero nodes/CPUs are permitted to create and destroy persistent burst buffers. Bug 7034.
-
Morris Jette authored
Bug 7034.
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Dominik Bartkiewicz authored
unlimited could get overwritten with default queue depth preventing the whole queue from being looked at -- especially in a high-throughput environment. Bug 6822 Co-authored-by: Morris Jette <jette@schedmd.com>
-
Danny Auble authored
Bug 5562
-
Danny Auble authored
Bug 5562
-
Danny Auble authored
Bug 5562
-
Danny Auble authored
-
Alejandro Sanchez authored
Node memory overallocation wouldn't be properly detected since we would just be interpreting the available memory as RealMemory - MemSpecLimit, ignoring other jobs' memory usage. Bug 5562.
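A minimal sketch of the difference, assuming the fix amounts to subtracting memory already allocated to other jobs from RealMemory - MemSpecLimit. The function name and the dictionary keys (`real_memory`, `mem_spec_limit`, `alloc_memory`) are hypothetical stand-ins for illustration, not Slurm's data structures.

```python
# Hypothetical illustration of the overallocation check (not Slurm code).
# All memory values are in MB.

def overallocated_nodes(job_mem_mb, nodes, count_other_jobs=True):
    """Return names of nodes where the job's per-node request does not fit."""
    bad = []
    for node in nodes:
        # Old interpretation: RealMemory - MemSpecLimit only.
        avail = node["real_memory"] - node["mem_spec_limit"]
        if count_other_jobs:
            # Assumed fix: also subtract memory held by other jobs.
            avail -= node["alloc_memory"]
        if job_mem_mb > avail:
            bad.append(node["name"])
    return bad

nodes = [
    {"name": "n1", "real_memory": 64000, "mem_spec_limit": 4000, "alloc_memory": 0},
    {"name": "n2", "real_memory": 64000, "mem_spec_limit": 4000, "alloc_memory": 50000},
]

# Ignoring other jobs, a 20000 MB request appears to fit everywhere.
old_result = overallocated_nodes(20000, nodes, count_other_jobs=False)

# Counting other jobs, n2 has only 10000 MB left and is flagged.
new_result = overallocated_nodes(20000, nodes, count_other_jobs=True)

print(old_result, new_result)
```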
-
Alejandro Sanchez authored
This compares a job's memory request against each selected node's available memory, interpreting the latter for now as RealMemory - MemSpecLimit. Bug 5562.
-
Alejandro Sanchez authored
Place all three memory cases (per CPU, per node and all node memory) in a single loop, since all three cases need to traverse all of the selected nodes in job_resources. Preparation for a follow-up commit that contains the real fix. Bug 5562.
-
Morris Jette authored
Move common (or similar) logic to globals and remove it from the individual tests.
-
Tim Wickberg authored
-
Tim Wickberg authored
Add handling for acct_gather_energy/xcc and acct_gather_profile/influxdb. Bug 6829.
-