- 15 Dec, 2017 6 commits
-
-
Yair Yarom authored
bug 3582
-
Brian Christiansen authored
when a job requests no tasks and more memory than MaxMemPer{CPU|NODE}. e.g. sbatch --wrap="sleep 10" Bug 4515
-
Brian Christiansen authored
This will give expected results. Found while working on Bug 4515.
-
Danny Auble authored
Bug 4478 comment 25.
-
Danny Auble authored
And print an appropriate fatal error message rather than relying upon random errno value. Bug 4523
-
Alejandro Sanchez authored
Bug 4478.
-
- 13 Dec, 2017 2 commits
-
-
Tim Wickberg authored
-
Marshall Garey authored
Based off of Ryan Cox's original contribs/pam_slurm_adopt/README. Bug 3567.
-
- 12 Dec, 2017 5 commits
-
-
Brian Christiansen authored
-
Brian Christiansen authored
In the federation case, the origin job is completed in the database when a sibling job starts the job. The complete message is then sent again to the database when the job is completed on the sibling cluster but it is updated with the sibling job's exit code. The jobcomp plugin didn't handle the multiple updates to the record. This change allows the existing record to be updated. Bug 4493
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 11 Dec, 2017 8 commits
-
-
Morris Jette authored
-
David Gloe authored
Bug 4500 The pid files in slurm.conf and the systemd service files must match, or systemd will time out looking for the wrong pid file. Currently, the Cray slurm.conf template has different pid files for slurmctld and slurmd than the service files. There's no reason for us to use these nonstandard pid files, and it will save us some headaches to switch over.
-
Danny Auble authored
-
Morris Jette authored
Continuation of commit 4c1c1e40 Bug 4169
-
Morris Jette authored
-
Marcin Stolarek authored
bug 4496
-
Morris Jette authored
bug 4407
-
Morris Jette authored
-
- 09 Dec, 2017 3 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Remove errant '2' from the macro name.
-
- 08 Dec, 2017 9 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
In 1.10+ they changed the hid_t from an int to a long int which messes things up as they use the top 32 bits for stuff right off the bat. This fixes the scenario by handing the number with a int32_t instead of an int. Bug 3795
-
Morris Jette authored
-
Morris Jette authored
Add sbatch "--batch" option to identify features required on batch node. For example "sbatch --batch=haswell ...". Add "BatchFeatures" field to output of "scontrol show job". Export job's batch_features field to job_submit/lua plugin.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Fix potential node reboot timeout problem for "scontrol reboot" command. bug 4203
-
Morris Jette authored
Coverity CID 180614
-
- 07 Dec, 2017 6 commits
-
-
Tim Wickberg authored
The - character is treated as a range if not first or last in the [] brackets. Moving it in between . and / broke the regex subtly. Inadvertently broken by a268b644. Bug 4417.
-
Danny Auble authored
Bug 4169
-
Morris Jette authored
Add node and partition configuration options of "CpuBind" to control default task binding. Modify the scontrol to report and modify these parameters. Add "NumaCpuBind" option to knl.conf file to automatically change a node's CpuBind parameter based upon changes to a node's NUMA mode.
-
Morris Jette authored
-
Morris Jette authored
Found using test38.17
-
Felip Moll authored
Otherwise poll() cannot monitor these ports properly, leading to potential network traffic problems. Bug 4467.
-
- 06 Dec, 2017 1 commit
-
-
Danny Auble authored
until the prolog and extern step are fully ran/launched. Only matters if running with PrologFlags=[contain|alloc]. patch 2 of 2 Bug 4458
-