- 29 Dec, 2016 4 commits
-
-
Morris Jette authored
Previous logic was failing if no specialized cores configured on node and job wanted to configure some bug 3349
-
Dominik Bartkiewicz authored
Null terminate before strchr().
-
Morris Jette authored
-
Morris Jette authored
This is a new message when "PrologFlags=contain" or "PrologFlags=alloc" is configured. bug 3351
-
- 28 Dec, 2016 3 commits
-
-
Alejandro Sanchez authored
Cancel interactive job if Prolog failure with "PrologFlags=contain" configured. bug 3351
-
Morris Jette authored
Minor fix to commit 1e6b7522 in the case where user specification of specialized cores is disabled. bug 3349
-
Matthieu Hautreux authored
If a job specifies --core-spec and the node has identified specialized cores using the CpuSpecList option which are not the default cores (e.g. the first cores rather than the last cores on each socket), the the user's cores will be selected from those identified in the configuration. bug 3349
-
- 27 Dec, 2016 1 commit
-
-
Morris Jette authored
Was printing int64 using %ld, which generated a warning on some systems
-
- 21 Dec, 2016 1 commit
-
-
Morris Jette authored
Do not allocate specialized cores to jobs using the --exclusive option. bug 3349
-
- 19 Dec, 2016 2 commits
-
-
Morris Jette authored
This commit should be part of b5f621d0 when the man page was removed
-
Morris Jette authored
Fix memory and file descriptor leaks in slurmd daemon's sbcast logic.
-
- 16 Dec, 2016 4 commits
-
-
Danny Auble authored
The part_ptr is sent into the function, there is no reason to look it up again. Coverity reported this.
-
Tim Wickberg authored
TravisCI uses 2-cpu nodes, so this causes the test to fail as a suitable node cannot be found.
-
Tim Wickberg authored
-
Tim Wickberg authored
Plugins are deprecated and will be removed in 17.02.
-
- 15 Dec, 2016 5 commits
-
-
Danny Auble authored
version is lower than the min version, set it to the min. Bug 3050
-
Alejandro Sanchez authored
-
Morris Jette authored
Address new coverity error report
-
Morris Jette authored
sched/backfill - Fix logic to reserve resources for jobs that require a node reboot (i.e. to change KNL mode) in order to start. bug 3346
-
Danny Auble authored
go into JobAdminHeld. Bug 3201
-
- 14 Dec, 2016 3 commits
-
-
Morris Jette authored
Fix for possible infinite loop in select/cons_res plugin when trying to satisfy a job's ntasks_per_core or socket specification. bug 3329
-
Tim Wickberg authored
Bug 2992.
-
Morris Jette authored
Modify regression test1.89 to avoid leaving vestigial job. Also reduce logging to reduce likelyhood of Expect buffer overflow. bug 3273
-
- 13 Dec, 2016 3 commits
-
-
Morris Jette authored
-
Tim Wickberg authored
Reverts most of commit 84023f27. Searching the PATH in slurmd can fail due to root_squash'd NFS filesystems, leading to the "wrong" program being launched. If you'd like the performance benefit from avoiding this lookup during each separate task launch, set SLURM_TEST_EXEC=1 instead which will perform the lookup once within srun, which then ensures the lookup happens under the users own environment and not that of the slurmd. Bug 2992.
-
Tim Wickberg authored
-
- 12 Dec, 2016 3 commits
-
-
Alejandro Sanchez authored
-
Alejandro Sanchez authored
-
Morris Jette authored
If running burst_buffer/cray plugin on emulated Cray (NOT a real Cray system with Cray APIs) and the burst buffer state information includes information about buffers associated with jobs which are either completed or no longer exist (i.e. the burst buffer state information is old), then ignore that old state information. This might possibly be related to bug 3295 if their configuration is bad.
-
- 09 Dec, 2016 1 commit
-
-
Danny Auble authored
level.
-
- 08 Dec, 2016 9 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Revise commit e6ef1f0c The root cause of the failure is a bug in HWLOC that has since been fixed in HWLOC version 1.11.5.
-
Tim Wickberg authored
If the second call to getgrouplist() found additional groups, ngroups will be overwritten with this new larger value, while the gids list would be truncated. (ngroups is a value-result arg.) This will then lead to _gids_cache_lookup() returning the wrong number of groups including invalid parts of memory, which are likely to include some zeros. Those zeros could then make it to the setgroups() call and thus give the user access to the root group. Especially as setgroups will succeed as long as the array does not contain -1 as a gid. Bug 3320.
-
Tim Wickberg authored
-
Danny Auble authored
-
Morris Jette authored
task/cgroup - Change error message if CPU binding can not take place to better identify the root cause of the problem. Specifically, if the hwloc_get_obj_below_by_type() function call completely fails that is likely due to task/affinity not being configured, so cpusets are not configured. Previous message was "task/cgroup: task[%u] infinite loop broken while trying to provision compute elements using %s (bitmap:%s)" The new message is "task/cgroup: hwloc_get_obj_below_by_type() failing, task/affinity plugin also required"
-
Dominik Bartkiewicz authored
uint32_t needs %u on 32-bit platforms. Noticed by clang/travisci.
-
- 07 Dec, 2016 1 commit
-
-
Danny Auble authored
Bug 3258
-