- 09 Aug, 2017 10 commits
-
-
Morris Jette authored
No change in any logic
-
Morris Jette authored
Coverity CID 174180
-
Morris Jette authored
Coverity CID 174181
-
Morris Jette authored
Otherwise use of libpmi will necessitate users also load libslurm, which has never been the case in past. Bug introduced in commit f9687bbc
-
Morris Jette authored
In all of these cases, the input account name is NULL, so there should never be a failure. In every case, the returned association pointer is checked anyway. Coverity CID 44719, 44720, 44721
-
Morris Jette authored
Coverity, CID 45150
-
Artem Polyakov authored
bug 4061
-
Morris Jette authored
Coverity CID 163084, 163085
-
Morris Jette authored
Coverity CID 44762
-
Morris Jette authored
If SLURM_BITSTR_LEN is set to some unreasonably large value it could result in the slurmctld allocating a huge memory buffer and consuming vast amounts of time. Coverity CID 45264
-
- 08 Aug, 2017 10 commits
-
-
Tim Wickberg authored
CID 174162.
-
Morris Jette authored
-
Morris Jette authored
Coverity CID 45173
-
Isaac Hartung authored
If one or more clusters in a federation are down, then print an appropriate warning and exit rather than cause the test to fail. bug 4033
-
Tim Wickberg authored
Mistake made in f9687bbc; strlcpy returns a count of chars copied instead of a pointer to the string.
-
Tim Wickberg authored
Otherwise the log will be spammed with "Buffer size limit exceeded". Bug 3624.
-
Tim Wickberg authored
Gets rid of one weird strlcpy call.
-
Tim Wickberg authored
-
Tim Wickberg authored
Ensure proper termination in places that were otherwise missing it, and remove some awkward termination handling in other locations.
-
Tim Wickberg authored
-
- 07 Aug, 2017 9 commits
-
-
Tim Wickberg authored
-
Justin Lecher authored
Starting from glibc-2.25 the macros major and minor are only available from sys/sysmacros.h. This patch uses an autoconf macro to detect the location and includes the header accordingly. Bug 3982.
-
Artem Polyakov authored
Unlike MPI, UCX requires each sender to have a unique tag, otherwise messages will get mixed up. Signed-off-by: Artem Polyakov <artpol84@gmail.com>
-
Artem Polyakov authored
Fix direct connection table size to be equal to number of nodes int the job instead of job step. This was triggering assert when, for example, allocation was 16 nodes but srun was only using 2 of them and not the first ones. Signed-off-by: Artem Polyakov <artpol84@gmail.com>
-
Danny Auble authored
middle of the other id_*. I think it is ok to have the _id at the end for this since we want the offset near it.
-
Danny Auble authored
double ',' as we do with QOS in the TRES limits. So I removed the logic to check for it.
-
Danny Auble authored
-
Danny Auble authored
-
Dominik Bartkiewicz authored
Bug 4019
-
- 05 Aug, 2017 1 commit
-
-
Morris Jette authored
Change to slurm_mutex_init Coverity CID 171460
-
- 04 Aug, 2017 10 commits
-
-
Morris Jette authored
-
Artem Polyakov authored
-
Danny Auble authored
just unfreed memory, no real concern.
-
Artem Polyakov authored
-
Danny Auble authored
-
Danny Auble authored
-
Artem Polyakov authored
Replase "bool" with "int" as the return type of `pmixp_io_fd()`. This was causing an interesting hidden bug affecting the performance. Since this function was returning boolean value it was always returning "1" instead of actual fd number. fd=1 is set to /dev/null for slurmstepd, this is a char device that is always read/write ready from the poll() perspective. So poll was continuously interrupting and progress was fine, but CPU usage was ~100%.
-
Artem Polyakov authored
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
-
Artem Polyakov authored
Signed-off-by: Artem Polyakov <artpol84@gmail.com>
-
Artem Polyakov authored
(introduced in prev commit "Fix collective error path (timeout)" Signed-off-by: Artem Polyakov <artpol84@gmail.com>
-