- 21 Sep, 2017 8 commits
-
-
Tim Wickberg authored
Removed by mistake in d8d41cff.
-
Morris Jette authored
Problem was do to "go to proceed;" before setting "nodename" or "lpeerid".
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Send pack job's task IDs, node count, tasks_per_node and node_list from srun to slurmd as needed by MPI. This information is constructed by srun combining information from each individual job component's step information. Set global SLURM_NODELIST, SLURM_JOB_NODELIST, SLURM_STEP_NUM_NODES, SLURM_TASKS_PER_NODE, SLURM_NNODES, SLURM_NTAKS and SLURM_STEP_NUM_TASKS for pack step. Modify pmi2 plugin to launch a single agent for pack step rather than one per component. Modify pmi2 and pmix plugins to gather global step information for pack steps. Make task distribtion be configurable by step component (i.e. "block" on one and "cyclic" on another).
-
Morris Jette authored
Required for pack step support
-
Morris Jette authored
-
- 20 Sep, 2017 9 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Tim Wickberg authored
Enforce this assumption at configure time; if this is not true there are some serious security risks that would result.
-
Tim Wickberg authored
-
Tim Wickberg authored
Previous commit removed this flag as part of Cygwin/Solaris cleanup. Which then removes them from Makefile.in files after autogen.sh.
-
Tim Wickberg authored
Cygwin removal previously done in ada78c4e, but the configure script was defined. Solaris is a dead platform; commits to remove last traces of it will follow.
-
Tim Wickberg authored
- gid instead of uid used in group_cache_lookup_job(). - needle->username is optional, and will lead to a failure in getgrouplist() when NULL is passed in. Hat tip to Brian for discovering.
-
- 19 Sep, 2017 9 commits
-
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
in salloc, sbatch, srun before process the command line options. Bug 3499
-
Brian Christiansen authored
-
Danny Auble authored
plugin when constraining devices.
-
Danny Auble authored
-
Danny Auble authored
from the string.
-
Danny Auble authored
correctly in sacct.
-
Danny Auble authored
-
- 18 Sep, 2017 3 commits
-
-
Tim Wickberg authored
CID 177191.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
- 17 Sep, 2017 11 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Modify _open_as_other so that both _file_bcast_register_file and _prolog_error can both use it. (That code was originally extracted from _file_bcast_register_file, but tailored to _prolog_error's requirements. Less duplication is good.)
-
Tim Wickberg authored
In slurmctld: - Handle the lookup, and send results across. - Periodically flush the cache. In slurmd: - Load value sent as part of prolog into cache - Have task launch use the new call instead to fetch the jobid-cached values when available - Remove old job-specific records from the cache as part of the prolog. - Ignore the slowly increasing memory footprint from the cache, as this is what the old implementation did anyways. It'll be O(users in cluster) at worst, and flushes on a reconfigure. - Simplify the slurmstepd communication path. group_cache_lookup will always succeed. Left to do: - Convert other lookup locations in slurmd to the new approach, and remove the compatibility shim. Bug 3322.
-
Tim Wickberg authored
Loosely based on src/slurmd/req.c. But with major alterations in preparation for use with PrologFlags=SendGIDs. Convert slurmd to use a compatibility shim in here for now.
-
Tim Wickberg authored
Bug 3322.
-
Tim Wickberg authored
The slurmd is always sending the ngid / gids over the pipe, so the setgroups() call is all that matters here. Move it out to _drop_priviledges instead.
-
Tim Wickberg authored
Bug 3322.
-
Tim Wickberg authored
Patch was broken by a rebase before pushing. Fixing as a clean patch. This reverts commit b0838485.
-
Tim Wickberg authored
-
Tim Wickberg authored
Regression caught my slight oversight from 88d5a801.
-