- 05 Jun, 2019 7 commits
-
-
Morris Jette authored
Previous logic could hang indefinitely if srun did not return It also performed unneeded test setup before confirming that the system size was adequate to execute the test. bug 7050
-
Morris Jette authored
Remove excess tabs to properly align code Make comment formats consistent
-
Nate Rini authored
No functional changes. Bug 7050.
-
Nate Rini authored
Bug 7050.
-
Nate Rini authored
Bug 7050.
-
Nate Rini authored
Handles cases where last line of slurm.conf is not empty. Bug 7050.
-
Marshall Garey authored
One part of this test can fail in a heterogeneous system. Do not run the part of the test that would fail if the system is heterogeneous. Bug 7051.
-
- 03 Jun, 2019 1 commit
-
-
Morris Jette authored
backport test7.20 changes from master to v18.08. bug 7050
-
- 29 May, 2019 1 commit
-
-
Brian Christiansen authored
select/cons_res is recommended. Bug 7138
-
- 27 May, 2019 2 commits
-
-
Michael Hinton authored
Bug 6987.
-
Ross Dickson authored
Bug 6466.
-
- 25 May, 2019 3 commits
-
-
Danny Auble authored
Bug 7101.
-
Felip Moll authored
The name variable hasn't been set yet, so this is always NULL. Print the uid/gid instead. While here, treat uid/gid as uint32_t, and use strtoul() rather than atoi() to avoid issues with high-number uid/gid values. Fixes GCC 9 warning. Bug 7101.
-
Felip Moll authored
Bug 7101.
-
- 24 May, 2019 2 commits
-
-
Nate Rini authored
Use RETRY_DELAY per to mirror job complete delay but without a max retry count for the time being. Bug 6970.
-
Danny Auble authored
Signed-off-by: Brian Christiansen <brian@schedmd.com>
-
- 23 May, 2019 11 commits
-
-
Brian Christiansen authored
Bug 6964
-
Brian Christiansen authored
The reason was being set after the message was sent to the db. Also clear the draing and reboot states before the message is sent so that the event state will show DOWN. Bug 6964
-
Brian Christiansen authored
Bug 6964
-
Brian Christiansen authored
so that new jobs can't get on the node. Bug 6964
-
Dominik Bartkiewicz authored
for completing job. Bug 6927
-
Ben Roberts authored
Bug 6980
-
Ben Roberts authored
Bug 6945
-
Dominik Bartkiewicz authored
Bug 6926
-
Alejandro Sanchez authored
Continuation of 89b791bf. Bug 7045.
-
Alejandro Sanchez authored
To indicate that a job is dependent or has an invalid dependency. Not used for now, just added and removed according to its meaning. Bug 7045.
-
Albert Gil authored
Bug 7080
-
- 22 May, 2019 6 commits
-
-
Ben Roberts authored
Bug 7092.
-
Tim Wickberg authored
Can happen if SrunPortRange has been set too small, especially on shared login nodes launching multiple large-scale srun processes.
-
Ben Roberts authored
Bug 6916
-
Ben Roberts authored
Bug 6995
-
Marshall Garey authored
Job steps that run on cloud nodes and use the alias_list - in other words, SlurmctldParameters=cloud_dns is not in slurm.conf - all talk directly back to the slurmctld. To make that happen, we set the parent tank of each stepd to -1. However, we also set the rank of each stepd to 0. this meant that when each stepd sent a REQUEST_STEP_COMPLETE RPC to the slurmctld, they would tell slurmctld to clean up node 0 in the step allocation. So, multi-node step allocations weren't cleaning up after the steps completed and would cause subsequent job steps to hang. The step allocations would only clean up properly at the end of the job. Ensure that each stepd uses the correct rank so that job steps are properly cleaned up after each step completes. Bug 6467.
-
Alejandro Sanchez authored
They were associated to these two commits: b4d7de48 6871185a Bug 5562.
-
- 21 May, 2019 6 commits
-
-
Morris Jette authored
Error reported by CLANG Cherry pick to 18.08. Bug 6996.
-
Dominik Bartkiewicz authored
unlimited could get overwritten with default queue depth preventing the whole queue from being looked at -- especially in a high-throughput envrionment. Bug 6822 Co-authored-by: Morris Jette <jette@schedmd.com>
-
Alejandro Sanchez authored
Node memory overallocation wouldn't be properly detected since we would just be interpreting the available memory as RealMemory - MemSpecLimit, ignoring other job's memory usage. Bug 5562.
-
Alejandro Sanchez authored
This compares a job memory request against each selected node available memory, interpreting the latter for now as RealMemory - MemSpecLimit. Bug 5562.
-
Alejandro Sanchez authored
Place all three memory cases (per cpu, per node and all node memory) in a single loop, since all three cases need to traverse all job_resources selected nodes. Preparation for a follow-up commit that contains the real fix. Bug 5562.
-
Tim Wickberg authored
Add handling for acct_gather_energy/xcc and acct_gather_profile/influxdb. Bug 6829.
-
- 17 May, 2019 1 commit
-
-
Tim Wickberg authored
This is select/cons_res, not select/cons_tres.
-