- 07 Jun, 2018 1 commit
-
-
Alejandro Sanchez authored
Add new page, add to Makefile.am and run autogen.sh. Update additional documentation to point at this new page. Bug 3792.
-
- 06 Jun, 2018 1 commit
-
-
Brian Christiansen authored
which were marked down due to ResumeTimeout. If a cloud node was marked down due to not responding by ResumeTimeout, the code inadvertently added the node back to the avail_node_bitmap -- after being cleared by set_node_down_ptr(). The scheduler would then attempt to allocate the node again, which would cause a loop of hitting ResumeTimeout and allocating the downed node again. Bug 5264
-
- 05 Jun, 2018 2 commits
-
-
Tim Wickberg authored
Bug 5180.
-
Killian authored
Bug 5206.
-
- 04 Jun, 2018 1 commit
-
-
Morris Jette authored
I was seeing rare failures on the test due to timing issues. This increased timeout seems to fix the issue for me.
-
- 02 Jun, 2018 1 commit
-
-
Michael Hinton authored
NONE was not documented. Bug 5161
-
- 01 Jun, 2018 1 commit
-
-
Morris Jette authored
Avoid left-over test input file
-
- 31 May, 2018 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
No functional change. Bug 4999.
-
Alejandro Sanchez authored
There were two code paths building an allocation response by calling its own static _build_alloc_msg() function: 1. src/slurmctld/proc_req.c 2. src/slurmctld/srun_comm.c These two functions diverged and both had members that were not filled in but were filled in the other. This patch makes it so we change the signature of the one in proc_req.c to make it extern and then in srun_comm.c we call this newly common function. Also added cpu_freq_[min|max|gov] members in the common one since these were the only members missing in proc_req.c function (the one in srun_comm.c had more members missing, like all the ntasks_per*, account, qos or resv_name). Bug 4999.
-
- 30 May, 2018 24 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
Update slurm.spec and slurm.spec-legacy as well
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Marshall Garey authored
Only trust MUNGE signed values, unless the RPC was signed by SlurmUser or root. CVE-2018-10995.
-
Tim Wickberg authored
-
Tim Wickberg authored
Already vetted by slurmctld/slurmd, no need to re-check here.
-
Tim Wickberg authored
-
Tim Wickberg authored
Do not defer until later, and do not potentially miss out on proper validation of the user_name field which can lead to improper authentication handling. CVE-2018-10995.
-
Tim Wickberg authored
If the auth value (from MUNGE) does not match the requested value, ensure it is listed as a valid extended gid for that user instead.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Dominik Bartkiewicz authored
Bug 5038.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Value of 2113 is where it fits in with 17.11, so pin it here.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Michael Hinton authored
-
Tim Wickberg authored
Caused by pthread_cancel cleanup by commit e5f03971 in 17.11.6. Bug 5181.
-
Tim Wickberg authored
The race condition was created in a7c8964e in 17.11.6 when removing the (unsafe) pthread_cancel code handling thread termination. Bug 5164
-
- 24 May, 2018 1 commit
-
-
Brian Christiansen authored
Commits f18390e8 and eed76f85 modified the stepd so that if the stepd encountered an unkillable step timeout that the stepd would just exit the stepd. If the stepd is a batch step then it would reply back to the controller with a non-zero exit code which will drain the node. But if an srun allocation/step were to get into the unkillable step code, the steps wouldn't let the waiting srun or controller know about the step going away -- leaving a hanging srun and job. This patch enables the stepd to notify the waiting sruns and the ctld of the stepd being done and drains the node for srun'ed alloction and/or steps. Bug 5164
-
- 21 May, 2018 1 commit
-
-
Dominik Bartkiewicz authored
g_qos_count, g_qos_max_priority, must be call under qos write lock. Bug 5159.
-
- 19 May, 2018 2 commits
-
-
Brian Christiansen authored
Display correct path.
-
Bjørn-Helge Mevik authored
Bug 5151
-
- 18 May, 2018 1 commit
-
-
Brian Christiansen authored
Commits 4454316e and 76706b51 adjusted the updating of priority logic so that when a non-authorized user modifies the priority it will only be temporary -- in most cases the user will never see that change. Bug 5151
-