- 27 Oct, 2016 4 commits
-
-
Danny Auble authored
issue with gang scheduling. Bug 3211
-
Tim Wickberg authored
-
Brian Christiansen authored
Federated submissions
-
Morris Jette authored
-
- 26 Oct, 2016 6 commits
-
-
Morris Jette authored
Fix bug that was clearing MAINT mode on nodes scheduled for reboot (bug introduced in version 16.05.5 to address bug in overlapping reservations, commit 5eee1d28). Note that a node's MAINT flag is used for both a requested reboot and a maintenance reservation. What I'd like to do is add a new node state flag to differentiate between these two cases, but that involves some significant changes that could introduce instability, so it will be deferred to version 17.02. bug 3210
-
Alejandro Sanchez authored
salloc are requested with -n tasks < hosts from -w hostlist or from -N.
-
Danny Auble authored
-
Danny Auble authored
requested with -n tasks < hosts from -w hostlist.
-
Morris Jette authored
bug 2149
-
Morris Jette authored
Add new SchedulerParameter (max_array_tasks) to limit the maximum number of tasks in a job array independently from the maximum task ID (MaxArraySize). bug 2676
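The difference between the two limits can be sketched as follows. This is a minimal illustration, not Slurm's implementation: the function name and the rejection strings are hypothetical, and the assumption that a task ID must be strictly less than MaxArraySize is labeled in the code.

```python
def array_rejection_reason(task_ids, max_array_size, max_array_tasks):
    """Return a reason string if the job array would be rejected, else None.

    Hypothetical helper for illustration only.
    """
    ids = list(task_ids)
    # Assumption: the highest task ID must be strictly below MaxArraySize.
    if ids and max(ids) >= max_array_size:
        return "task ID exceeds MaxArraySize"
    # max_array_tasks limits how many tasks exist, regardless of their IDs.
    if len(ids) > max_array_tasks:
        return "too many tasks (max_array_tasks)"
    return None


# A sparse array: only 10 tasks, but with task IDs up to 90000.
sparse = range(0, 100000, 10000)
print(array_rejection_reason(sparse, max_array_size=100001, max_array_tasks=5))
print(array_rejection_reason(sparse, max_array_size=100001, max_array_tasks=10))
```

A sparse array like the one above passes a large MaxArraySize while still being bounded by the task count limit.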
-
- 25 Oct, 2016 9 commits
-
-
Dominik Bartkiewicz authored
Bug 3194
-
Morris Jette authored
Add SbcastParameters configuration option to control default file destination directory and compression algorithm. bug 2977
-
Morris Jette authored
Replace sjstat, seff and sjobexit RPM packages with a single "contribs" package.
-
Danny Auble authored
-
Morris Jette authored
Remove separate slurm_blcr package. If Slurm is built with BLCR support, the files will now be part of the main Slurm packages. bug 2061
-
Morris Jette authored
Document that node Weight takes precedence over load with LLN scheduling. bug 3204
-
Tim Wickberg authored
Follow on to commit c3266fca for 17.02+.
-
Tim Wickberg authored
task/cray's _get_numa_nodes() function needs to run before task/cgroup cleans up the cgroup hierarchies, otherwise ALPS memory compaction will never run. Also move task_p_add_pid() outside the #ifdef HAVE_NATIVE_CRAY block so that the plugin will load (albeit without any functionality) on non-Cray systems for testing purposes. Revise documentation and provided slurm.conf templates as well. Bug 3154.
-
Morris Jette authored
Do not include SLURM_JOB_DERIVED_EC, SLURM_JOB_EXIT_CODE, or SLURM_JOB_EXIT_CODE2 in PrologSlurmctld environment (not available yet). bug 1431
-
- 24 Oct, 2016 1 commit
-
-
Morris Jette authored
burst_buffer/cray: Accept new jobs on backup slurmctld daemon without access to dw_wlm_cli command. No burst buffer actions will take place. Newly submitted jobs will be accepted and stay in pending state. Jobs dependent upon stage-in or stage-out will remain in their current state until the action can take place.
-
- 21 Oct, 2016 1 commit
-
-
Morris Jette authored
Do not process SALLOC_HINT, SBATCH_HINT or SLURM_HINT environment variables if any of the following salloc, sbatch or srun command line options are specified: -B, --cpu_bind, --hint, --ntasks-per-core, or --threads-per-core. bug 3118
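The precedence rule above can be sketched as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the argument parsing is deliberately simplified (long options as `--name=value`, short options as a two-character prefix), and the order in which the environment variables are checked is an assumption rather than Slurm's actual behavior.

```python
HINT_ENV_VARS = ("SALLOC_HINT", "SBATCH_HINT", "SLURM_HINT")
OVERRIDING_OPTIONS = ("-B", "--cpu_bind", "--hint",
                      "--ntasks-per-core", "--threads-per-core")


def option_name(arg):
    """Extract the option name from a command line argument (simplified)."""
    if arg.startswith("--"):
        return arg.split("=", 1)[0]
    return arg[:2]  # short option, possibly with an attached value


def hint_from_environment(argv, env):
    """Return the hint taken from the environment, or None if it is ignored."""
    options = [option_name(a) for a in argv if a.startswith("-")]
    # Any of the listed options on the command line disables the env hint.
    if any(name in OVERRIDING_OPTIONS for name in options):
        return None
    # Assumption: the first variable that is set wins.
    for name in HINT_ENV_VARS:
        if name in env:
            return env[name]
    return None


env = {"SLURM_HINT": "nomultithread"}
print(hint_from_environment(["-n", "4"], env))
print(hint_from_environment(["--threads-per-core=1"], env))
```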
-
- 20 Oct, 2016 2 commits
-
-
Tim Wickberg authored
_select_nodes_parts() was resetting state_reason to an admin hold without regard to admin vs user hold state. state_reason is the only place that user vs. admin is distinguished, so this prevented users from releasing these jobs. Bug introduced by commit fb46c84b in 16.05.5. Bug 3197.
-
Danny Auble authored
This is an addition to commit cb7ed937
-
- 19 Oct, 2016 3 commits
-
-
Morris Jette authored
-
Ole H Nielsen authored
bug 3191
-
Ole H Nielsen authored
bug 3192
-
- 18 Oct, 2016 5 commits
-
-
Tim Wickberg authored
Continuation of commit 2fd4d7a6. MySQL dropped support for 'ignore'; remove it unconditionally to avoid issues with mismatched client and server versions, as in bug 3189.
-
Dominik Bartkiewicz authored
Improve reported estimates of start and end times for pending jobs. bug 3184
-
Morris Jette authored
Cray: Prevent abort in backfill scheduling logic for requeued job that has been cancelled while NHC is running. bug 3185
-
Joseph Mingrone authored
Old command line would not work with FreeBSD. New command line should work with all systems. bug 3021
-
Tomasz Abramczyk authored
Add mechanism to constrain kernel memory allocation using cgroups. New cgroup.conf parameters added: ConstrainKmemSpace, MaxKmemPercent, and MinKmemSpace. bug 2917
-
- 17 Oct, 2016 1 commit
-
-
Danny Auble authored
new glibc 2.24+ that deprecates readdir_r.
-
- 14 Oct, 2016 2 commits
-
-
Morris Jette authored
Add support for --mem_bind=prefer option to prefer, but not restrict, memory use to the identified NUMA node. bug 3126
-
Morris Jette authored
Modify cpu_bind and mem_bind map and mask options to accept a repetition count to better support large task counts. For example: "mask_mem:0x0f*2,0xf0*2" is equivalent to "mask_mem:0x0f,0x0f,0xf0,0xf0". bug 3065
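The repetition syntax can be illustrated with a short sketch that expands each `value*count` item into repeated values. The function name is hypothetical and this is not Slurm's parser; it only reproduces the equivalence given in the example above.

```python
def expand_repetitions(spec: str) -> str:
    """Expand 'value*count' items in a map or mask list into repeated values."""
    prefix, sep, body = spec.partition(":")
    if not sep:
        # No "type:" prefix present; treat the whole string as the list.
        prefix, body = "", spec
    expanded = []
    for item in body.split(","):
        value, star, count = item.partition("*")
        repeat = int(count) if star else 1
        expanded.extend([value] * repeat)
    joined = ",".join(expanded)
    return f"{prefix}:{joined}" if sep else joined


print(expand_repetitions("mask_mem:0x0f*2,0xf0*2"))
# prints mask_mem:0x0f,0x0f,0xf0,0xf0
```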
-
- 13 Oct, 2016 4 commits
-
-
Morris Jette authored
Added node_features/knl_generic plugin for KNL support on non-Cray systems. NOTE: This plugin is still under development.
-
Morris Jette authored
Do not propagate SLURM_UMASK environment variable to batch script. bug 2609
-
Morris Jette authored
task/affinity plugin: Honor a job's --ntasks-per-socket and --ntasks-per-core options in task binding. bug 3118
-
Bjørn-Helge Mevik authored
Correct a bitmap test function (used only by the select/bluegene plugin). The effect of this bug is probably very limited as it will in almost all cases revert prematurely to a bit-by-bit test rather than using a full-word test. bug 3145
-
- 12 Oct, 2016 2 commits
-
-
Tim Wickberg authored
Cannot use ClusterName without reading a config file that may not exist. Bug 3026.
-
Tim Wickberg authored
This introduced an inadvertent dependency on the config file, which does not exist when setting up a new cluster. Bug 3026. This reverts commit c39f9ac9.
-