- 26 Dec, 2018 4 commits
-
-
Felip Moll authored
Bug 6283
-
Felip Moll authored
SizesByAccount and SizesByAccountAndWckey now default to displaying the root account totals, or the totals of the exact account requested with Account=. To query with the old behavior, a new option 'AcctAsParent' has been added, which still shows the subaccounts of the accounts requested in Account=. Bug 5793
-
Marshall Garey authored
Bug 6289
-
Morris Jette authored
Introduced in d9b9eb23 Bug 6292
-
- 21 Dec, 2018 4 commits
-
-
Danny Auble authored
Bug 5441 Co-authored-by: Felip Moll <felip.moll@schedmd.com> Co-authored-by: Danny Auble <da@schedmd.com>
-
Felip Moll authored
Bug 5441
-
Dominik Bartkiewicz authored
Bug 5971
-
Dominik Bartkiewicz authored
from a node. Bug 5971
-
- 20 Dec, 2018 3 commits
-
-
Brian Christiansen authored
Bug 6229
-
Danny Auble authored
Bug 6239
-
Brian Christiansen authored
Bug 6229
-
- 19 Dec, 2018 2 commits
-
-
Nate Rini authored
Bug 6197
-
Danny Auble authored
Bug 6239
-
- 18 Dec, 2018 5 commits
-
-
Dominik Bartkiewicz authored
Bug 5682 and 4584 Backport of ba07a6e09b6071
-
Dominik Bartkiewicz authored
other than knl. Bug 5603
-
Danny Auble authored
Bug 5603
-
Moe Jette authored
This patch also simplifies the code using bit_cnt instead of _bitstr_bits(). This isn't really necessary, but does look cleaner. Bug 6216
- 17 Dec, 2018 1 commit
-
-
Broderick Gardner authored
It was using the local cluster, sending the query to the wrong table. Bug 6159
-
- 14 Dec, 2018 2 commits
-
-
Brian Christiansen authored
Bug 6237
-
Dominik Bartkiewicz authored
if OverTimeLimit is Unlimited. Bug 6093
-
- 11 Dec, 2018 1 commit
-
-
Tim Wickberg authored
-
- 09 Dec, 2018 1 commit
-
-
Tim Wickberg authored
New X11 forwarding code will only support forwarding back to salloc or an allocating srun command. Using this option within sbatch was always hit-or-miss. If the user submitting was disconnected from the alloc host for any reason their xauth credentials would likely fail even if they managed to get assigned the same local TCP port for forwarding. Bug 3647.
-
- 07 Dec, 2018 4 commits
-
-
Matthias Gerstner authored
In some systems there can be multiple user accounts for uid 0, therefore the check for literal user name "root" might be insufficient. Bug 6184
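A minimal sketch of the distinction described above, not the code from the commit: a name comparison misses other accounts that share uid 0, while a numeric check does not. The function names and the "toor" alias are hypothetical, chosen only for illustration.

```python
import os


def is_superuser_by_name(username: str) -> bool:
    """Name-based check: only matches the literal account name "root"."""
    return username == "root"


def is_superuser_by_uid(uid: int) -> bool:
    """uid-based check: matches every account that maps to uid 0."""
    return uid == 0


if __name__ == "__main__":
    # "toor" stands in for a second uid-0 account (hypothetical example).
    print("name check for toor:", is_superuser_by_name("toor"))
    print("uid check for uid 0:", is_superuser_by_uid(0))
    print("current process is superuser:", is_superuser_by_uid(os.getuid()))
```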
-
Matthias Gerstner authored
This PAM module is tailored towards running in the context of remote ssh logins. When running in a different context, such as a local sudo call, the module could be influenced by, e.g., environment variables like SLURM_CONF. Limiting the module to act only in the sshd context by default avoids this situation. An additional PAM module argument, service=<service>, allows an administrator to control this behavior if different behavior is explicitly desired. Bug 6184
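The decision described above can be sketched as follows. This is a toy model under stated assumptions, not the module's implementation: the real module is written in C against the PAM API, and the helper names here are hypothetical. Only the default of sshd and the service=<service> argument come from the commit message.

```python
DEFAULT_SERVICE = "sshd"  # default context named in the commit message


def configured_service(module_args):
    """Return the value of a service=<service> argument, or the default."""
    prefix = "service="
    for arg in module_args:
        if arg.startswith(prefix):
            return arg[len(prefix):]
    return DEFAULT_SERVICE


def should_act(current_service, module_args):
    """Act only when the calling PAM service matches the configured one."""
    return current_service == configured_service(module_args)


if __name__ == "__main__":
    print(should_act("sshd", []))                # default: sshd context only
    print(should_act("sudo", []))                # other contexts are skipped
    print(should_act("sudo", ["service=sudo"]))  # administrator override
```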
-
Nate Rini authored
Only print a warning for 18.08. If a user has SLURM_MEM_PER_CPU or SLURM_MEM_PER_NODE environment variables set for some reason this situation could be happening by accident, and we don't want to prevent the srun command from launching steps at this point. Bug 6058.
-
Broderick Gardner authored
Bug 5648.
-
- 06 Dec, 2018 5 commits
-
-
Janne Blomqvist authored
The Linux kernel default hard limit of 4096 for the number of file descriptors is quite small. Debian/Ubuntu have for a long time overridden this, increasing it to 1M. Recently systemd also bumped the default to 512k.

https://github.com/systemd/systemd/blob/master/NEWS
https://github.com/systemd/systemd/pull/10244
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org/thread/ZN5TK3D6L7SE46KGXICUKLKPX2LQISVX/
https://github.com/systemd/systemd/commit/09dad04c49cae3ad2b319c9b4e7773fedd34309a

Here the limits are increased as follows:

- slurmd: 128k; some workloads like Hadoop/Spark need a lot of fd's, and recommend that the limit is increased to at least 64k.
- slurmctld: 64k; per the Slurm high throughput and big system guides which recommend a file-max of at least 32k.
- slurmdbd: 64k, matching slurmctld, though slurmdbd shouldn't need that many fd's, bumping the limit shouldn't hurt either.

Bug 6171
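As a rough way to compare a host against the targets listed above, the following sketch reads the current process limits with Python's standard `resource` module. It does not show how the commit raises the limits, which the message does not say. Reading "k" as 1024 and the helper name `nofile_shortfall` are assumptions made for illustration.

```python
import resource

# Targets quoted in the commit message; "k" is read as 1024 (an assumption).
TARGET_NOFILE = {
    "slurmd": 128 * 1024,
    "slurmctld": 64 * 1024,
    "slurmdbd": 64 * 1024,
}


def nofile_shortfall(daemon, hard_limit):
    """Return how far hard_limit falls below the daemon's target (0 if enough)."""
    if hard_limit == resource.RLIM_INFINITY:
        return 0
    return max(0, TARGET_NOFILE[daemon] - hard_limit)


if __name__ == "__main__":
    # Limits of the current process, not of a running Slurm daemon.
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    for daemon in TARGET_NOFILE:
        print(daemon, "soft:", soft, "hard:", hard,
              "short by:", nofile_shortfall(daemon, hard))
```

With the kernel default hard limit of 4096 mentioned above, every daemon in the table would report a shortfall.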
-
Tim Wickberg authored
Bug 5248
-
Mike Nolta authored
Bug 6055
-
Mike Nolta authored
Add the following slurmctld return codes to the lua plugin:

ESLURM_ACCESS_DENIED
ESLURM_ACCOUNTING_POLICY
ESLURM_INVALID_NODE_COUNT
ESLURM_JOB_MISSING_SIZE_SPECIFICATION
ESLURM_MISSING_TIME_LIMIT

Bug 6055
-
Tim Wickberg authored
Rework one timer error message while here. Bug 5861.
-
- 05 Dec, 2018 8 commits
-
-
Felip Moll authored
Backups already run it when dropping to backup. Bug 6098.
-
Felip Moll authored
Backups already run it when dropping to backup. Bug 6098.
-
Marshall Garey authored
Also throw an error message within stepd_available() if the nodename is not set or cannot be inferred correctly. Bug 5399.
-
Tim Wickberg authored
-
Trey Dockendorf authored
Bug 6120
-
Tim Wickberg authored
Bug 6155
-
Tim Wickberg authored
Bug 6155
-
Felip Moll authored
When bf_continue is set and locks are released during a backfill cycle, other operations can make new resources available while backfill is partway through the queue. When backfill continues the cycle and evaluates new jobs, it may allocate some of these newly available resources to lower priority jobs, rather than to higher priority jobs that were already considered in this backfill cycle. This patch introduces bf_ignore_newly_avail_nodes to SchedulerParameters to solve this issue. When the backfill cycle resumes, this option ignores nodes that became available while the backfill scheduler had yielded. Bug 5279.
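The effect of the option can be illustrated with a toy model. This is a sketch under simplifying assumptions, not Slurm's scheduler code: nodes are plain strings, and `usable_nodes` and its parameters are hypothetical names. The idea shown is to snapshot the idle nodes before the scheduler yields and, when the option is on, to consider only nodes that were already in that snapshot.

```python
def usable_nodes(idle_before_yield, idle_after_resume, ignore_newly_avail):
    """Return the nodes a resumed backfill cycle may consider.

    With ignore_newly_avail set, nodes that became idle while the scheduler
    had yielded are left out, so they cannot go to lower priority jobs
    ahead of higher priority jobs already evaluated in this cycle.
    """
    if ignore_newly_avail:
        return set(idle_after_resume) & set(idle_before_yield)
    return set(idle_after_resume)


if __name__ == "__main__":
    before = {"n1", "n2"}   # idle when the cycle released its locks
    after = {"n2", "n3"}    # n1 was taken, n3 was freed during the yield
    print(sorted(usable_nodes(before, after, ignore_newly_avail=False)))
    print(sorted(usable_nodes(before, after, ignore_newly_avail=True)))
```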
-