- 25 Oct, 2016 7 commits
-
-
Morris Jette authored
Replace sjstat, seff and sjobexit RPM packages with a single "contribs" package.
-
Danny Auble authored
-
Morris Jette authored
Remove separate slurm_blcr package. If Slurm is build with BLCR support, the files will now be part of the main Slurm packages. bug 2061
-
Morris Jette authored
Document that node Weight takes precedence over load with LLN scheduling. bug 3204
-
Tim Wickberg authored
Follow on to commit c3266fca for 17.02+.
-
Tim Wickberg authored
task/cray's _get_numa_nodes() function needs to run before task/cgroup cleans up the cgroup hierarchies, otherwise ALPS memory compaction will never run. Also move task_p_add_pid() outside the #ifdef HAVE_NATIVE_CRAY block so that the plugin will load (albeit without any functionality) on non-Cray systems for testing purposes. Revise documentation and provided slurm.conf templates as well. Bug 3154.
-
Morris Jette authored
Do not include SLURM_JOB_DERIVED_EC, SLURM_JOB_EXIT_CODE, or SLURM_JOB_EXIT_CODE in PrologSlurmctld environment (not available yet). bug 1431
-
- 24 Oct, 2016 1 commit
-
-
Morris Jette authored
burst_buffer/cray: Accept new jobs on backup slurmctld daemon without access to dw_wlm_cli command. No burst buffer actions will take place. Newly submitted jobs will be accepted and stay in pending state. Jobs depedent upon stage-in or stage-out will remain in their current state until the action can take place.
-
- 21 Oct, 2016 1 commit
-
-
Morris Jette authored
Do not process SALLOC_HINT, SBATCH_HINT or SLURM_HINT environment variables if any of the following salloc, sbatch or srun command line options are specified: -B, --cpu_bind, --hint, --ntasks-per-core, or --threads-per-core.bug 3118
-
- 20 Oct, 2016 2 commits
-
-
Tim Wickberg authored
_select_nodes_parts() was resetting state_reason to an admin hold without regard to admin vs user hold state. state_reason is the only place that user vs. admin is distinguished, so this prevented users from releasing these jobs. Bug introduced by commit fb46c84b in 16.05.5. Bug 3197.
-
Danny Auble authored
This is an addition to commit cb7ed937
-
- 19 Oct, 2016 3 commits
-
-
Morris Jette authored
-
Ole H Nielsen authored
bug 3191
-
Ole H Nielsen authored
bug 3192
-
- 18 Oct, 2016 5 commits
-
-
Tim Wickberg authored
Continuation of commit 2fd4d7a6. MySQL dropped support for 'ignore', unconditionally remove that to avoid issues with mismatching client + server versions like in bug 3189.
-
Dominik Bartkiewicz authored
Improve reported estimates of start and end times for pending jobs. bug 3184
-
Morris Jette authored
Cray: Prevent abort in backfill scheduling logic for requeued job that has been cancelled while NHC is running. bug 3185
-
Joseph Mingrone authored
Old command line would not work with FreeBSD. New command line should work with all systems bug 3021
-
Tomasz Abramczyk authored
Add mechanism to constrain kernel memory allocation using cgroups. New cgroup.conf parameters added: ConstrainKmemSpace, MaxKmemPercent, and MinKmemSpace. bug 2917
-
- 17 Oct, 2016 1 commit
-
-
Danny Auble authored
new glibc 2.24+ that depricates readdir_r.
-
- 14 Oct, 2016 2 commits
-
-
Morris Jette authored
Add suppoprt for --mem_bind=prefer option to prefer, but not restrict memory use to the identified NUMA node. bug 3126
-
Morris Jette authored
Modify cpu_bind and mem_bind map and mask options to accept a repetition count to better support large task count. For example: "mask_mem:0x0f*2,0xf0*2" is equivalent to "mask_mem:0x0f,0x0f,0xf0,0xf0" bug 3065
-
- 13 Oct, 2016 4 commits
-
-
Morris Jette authored
Added node_features/knl_generic plugin for KNL support on non-Cray systems. NOTE: This plugin is still under development.
-
Morris Jette authored
Do not propagate SLURM_UMASK environment variable to batch script. bug 2609
-
Morris Jette authored
task/affinity plugin: Honor a job's --ntasks-per-socket and --ntasks-per-core options in task binding. bug 3118
-
Bjørn-Helge Mevik authored
Correct a bitmap test function (used only by the select/bluegene plugin). The effect of this bug is probably very limited as it will in almost all cases revert prematurely to a bit-by-bit test rather than using a full-word test. bug 3145
-
- 12 Oct, 2016 6 commits
-
-
Tim Wickberg authored
Cannot use ClusterName without reading a config file that may not exist. Bug 3026.
-
Tim Wickberg authored
This introduced an inadvertent dependency on the config file, which does not exist when setting up a new cluster. Bug 3026. This reverts commit c39f9ac9.
-
Morris Jette authored
task/affinity plugin: Honor a job's --ntasks-per-socket and --ntasks-per-core options in task binding. bug 3118
-
Morris Jette authored
Preserve non-KNL node features when updating the KNL node features for a multi-node job in which the non-KNL node features vary by node.
-
Morris Jette authored
node_features/knl_cray plugin: If the reconfiguration of nodes for an interactive job fails, kill the job (it can't be requeued like a batch job).
-
Morris Jette authored
node_features/knl_cray plugin: Add separate thread to interact with capmc in response to unexpected node reboots. bug 3153
-
- 11 Oct, 2016 6 commits
-
-
Alejandro Sanchez authored
bug 3091
-
Morris Jette authored
Prevent possible divide by zero in select/cons_res if a node's board count is higher than it's socket count. bug 3155
-
Morris Jette authored
If a node's socket or core count are changed at registration time (e.g. a KNL node's NUMA mode is changed), change it's board count to match. bug 3155
-
Morris Jette authored
Cray: The slurmd can manipulate the socket/core/thread values reported based upon the configuration. The logic failed to consider select/cray with SelectTypeParameters=other_cons_res as equivalent to select/cons_res. bug 3155
-
Tim Wickberg authored
abs() should not be used on long long variables as it would truncate if strictly confirming to C99. Use llabs() instead. Fix to commit 2aefc66b.
-
Tim Wickberg authored
-
- 07 Oct, 2016 2 commits
-
-
Morris Jette authored
Previous logic would result in sub-optimal job allocations and under some conditions invalid memory references resulting in the slurmctld daemon crashing or aborting. bug 2732
-
Morris Jette authored
-