- 20 Jul, 2018 1 commit
-
-
Tim Wickberg authored
-
- 19 Jul, 2018 12 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
Update slurm.spec and slurm.spec-legacy as well.
-
Tim Wickberg authored
-
Tim Wickberg authored
The lower limit of 1024 may be too short for srun with large-scale jobs, and lead to problems processing task completion messages in a timely fashion. Rather than adjust that, unify the two separate macros into SLURM_DEFAULT_LISTEN_BACKLOG with the higer 4096 value. Bug 5164.
-
Tim Wickberg authored
Without Delegate=yes, systemd will "fix" the cgroup hierarchies whenever 'systemctl daemon-reload' is called, which will then remove any restrictions placed on memory or device access for a given job. This is a problem especially since 'systemctl daemon-reload' may be called automatically by rpm/yum or a variety of config file mangers, leading to jobs escaping from slurmd/slurmstepd's control. This setting should work for systemd versions >= 205. https://www.freedesktop.org/wiki/Software/systemd/ControlGroupInterface/ Bug 5292.
-
Morris Jette authored
-
Morris Jette authored
addresses problem reported by clang
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Morris Jette authored
bug introduced in commit a7d9313d
-
Felip Moll authored
When a job with time_end=0 and TRES null exists from an association that is currently inside a reservation, the hourly rollup segfaults. Bug 5143
-
Tim Wickberg authored
And from underlying slurm_msg_sendto_timeout call as well.
-
- 18 Jul, 2018 12 commits
-
-
Morris Jette authored
Add function to clear total_gres at start of scheduling cycle Modify logic to avoid overflow on gpu counter
-
Dominik Bartkiewicz authored
As reported by Avalon Johnson on slurm-users https://groups.google.com/forum/#!topic/slurm-users/BsMQ7Uk1PLw Bug 5287.
-
Alejandro Sanchez authored
bug 4373, comment #24
-
Felip Moll authored
Removed the sentence which incorrectly stated that when not using the gres flag enforce-binding option, cpus other than the ones defined in gres.conf could be used for a gpu. Bug 5189
-
Brian Christiansen authored
-
Brian Christiansen authored
srun was already fixed in b7053bda (Bug 3294). Bug 5126
-
Brian Christiansen authored
-
Felip Moll authored
bug 5189
-
Morris Jette authored
Add salloc/sbatch/srun option of --gres-flags=disable-binding to disable filtering of CPUs with respect to generic resource locality. This option is currently required to use more CPUs than are bound to a GRES (i.e. if a GPU is bound to the CPUs on one socket, but resources on more than one socket are required to run the job). This option may permit a job to be allocated resources sooner than otherwise possible, but may result in lower job performance. bug 5189
-
Tim Wickberg authored
-
Broderick Gardner authored
'have_innodb' is deprecated. Bug 5317.
-
Broderick Gardner authored
Cleanup printf formaters and ensure they match the types: %zu for size_t %zd for ssize_t Bug 5417.
-
- 17 Jul, 2018 15 commits
-
-
Felip Moll authored
When printing arrays in squeue and setting the SLURM_BITSTR_LEN variable to 0 or to NULL, the length of the output defaulted to 64, when the documentation says it will default to "unlimited". This patch fixes this situation. Bug 5440
-
Marshall Garey authored
Because of a bug in the some versions of the Linux kernel, disable constraining kernel memory space with cgroups by default. Bug 5223.
-
Tim Wickberg authored
-
Morris Jette authored
Coverity CID 186991
-
Marshall Garey authored
Logic was switched around in 17.11, enable_user_top is now the correct option. Bug 5165.
-
Tim Wickberg authored
-
Alejandro Sanchez authored
This is not working reliably even when setting SchedulerParameters=enable_hetero_steps and/or using OpenMPI with Slurm's mpi/pmi2, as it was previously documented. Bug 5309.
-
Tim Wickberg authored
-
Tim Wickberg authored
Bug 5426.
-
Jessica Nettelblad authored
Bug 5426.
-
Tim Wickberg authored
Bug 5426.
-
Jessica Nettelblad authored
Bug 5426.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Still needs some work here. Ideally the length of the script should be read out of the msg struct, rather than recomputed. That would also move us closer to allowing binary data in the script.
-