- 11 Jan, 2018 2 commits
-
-
Morris Jette authored
node_feature/knl_cray - Fix memory leak that can occur during normal operation. This will happen when an update request for a specific node happens.
-
Morris Jette authored
If CnselectPath and/or SyscfgPath defined in knl_cray.conf file and slurmctld reconfigured, the original values of those paraemters would be over-written and their memory leaked.
-
- 05 Jan, 2018 1 commit
-
-
Felip Moll authored
Avoid setting node in COMPLETING state indefinitely if the job initiating the node reboot is cancelled while the reboot in in progress. Bug introduced in commit 7d246784 Bug 4536
-
- 03 Jan, 2018 3 commits
-
-
Danny Auble authored
increment ids for the tres_table. Bug 4553 Turns out auto_increment=# doesn't work in MySQL at the time of writing but does work correctly in MariaDB >=10.2.4+ See bug for more information.
-
Danny Auble authored
This reverts commit 12794fb0.
-
Danny Auble authored
increment ids for the tres_table. Bug 4553 Turns out auto_increment=# doesn't work in MySQL at the time of writing but does work correctly in MariaDB >=10.2.4+ See bug for more information.
-
- 28 Dec, 2017 1 commit
-
-
Tim Wickberg authored
-
- 15 Dec, 2017 1 commit
-
-
Morris Jette authored
This adds support for the --bb option "access_mode" in addition to "access" for better compatability with Cray's DataWarp options. Related to bug 4528
-
- 08 Dec, 2017 1 commit
-
-
Danny Auble authored
In 1.10+ they changed the hid_t from an int to a long int which messes things up as they use the top 32 bits for stuff right off the bat. This fixes the scenario by handing the number with a int32_t instead of an int. Bug 3795
-
- 05 Dec, 2017 1 commit
-
-
Alejandro Sanchez authored
Since NO_VAL = SLURM_BATCH_SCRIPT, the else statement would only compare the job_id and not the step_id, thus when a batch step was removed all the steps from that job would be removed too. Then when attempting to remove the extern step in the next iteration, it was already removed and we were incorrectly erroring out. Bug 4458.
-
- 30 Nov, 2017 1 commit
-
-
Alejandro Sanchez authored
Fix memory leak of MailDomain configuration string when slurmctld daemon is reconfigured. bug 4272 (comment 35)
-
- 29 Nov, 2017 1 commit
-
-
Brian Christiansen authored
slurm_load_job() prior to 17.11 returns the error code in errno and not in rc. With the addition of 47175901 if a job is removed from memory before sbatch checks for the job again, sbatch could get in an loop checking for the job. This only happens if you have a very small MinJobAge (<10) -- which is not recommended.
-
- 28 Nov, 2017 3 commits
-
-
Danny Auble authored
Bug 4323
-
Isaac Hartung authored
Bug 4306.
-
Danny Auble authored
to start. Bug 4434
-
- 10 Nov, 2017 1 commit
-
-
Isaac Hartung authored
This now matches the sinfo documentation. Bug 4306.
-
- 07 Nov, 2017 2 commits
-
-
Alejandro Sanchez authored
Issue could be triggered when updating a partition node(s) with node(s) that were already in the partition, incorrectly increasing the node_record->part_cnt (number of associated partitions) and thus incorrectly extending the array of pointers to partitions associated with this node, leading to an array with repeated associated partitions pointers. Bug 4318.
-
Brian Gilmer authored
On CLE 6.0 mungedir is /usr; a 'module unload' call then removes /usr/bin from PATH which is rather inconvenient. Bug 4334.
-
- 03 Nov, 2017 1 commit
-
-
Isaac Hartung authored
Memory TRES was getting the pn_min_memroy value when updating the job. But the TRES memory value is the total memory of the job (e.g pn_min_memory * cpus || pn_min_memory * nodes). Bug 4177
-
- 01 Nov, 2017 3 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Ryan Day authored
CVE-2017-15566. Bug 4228.
-
- 30 Oct, 2017 2 commits
-
-
Danny Auble authored
Starting in MariaDB 10.2 many of the api commands started setting errno erroneously. Backport of 5b934425
-
Danny Auble authored
This reverts commit 7e5d3d15. Turns out the spank_task_privileged needs to execute inside the child process instead of the slurmstepd. Bug 4298
-
- 28 Oct, 2017 1 commit
-
-
Morris Jette authored
If configured with NodeFeatures=knl_cray and there are non-KNL nodes which include no features the slurmctld will abort without this patch when attempting strtok_r(NULL). bug 4294
-
- 27 Oct, 2017 1 commit
-
-
Isaac Hartung authored
Bug 4279
-
- 25 Oct, 2017 2 commits
-
-
Danny Auble authored
Before it would allow all sorts of things like adding/subtracting/multiplying/etc. It would cause warnings such as /home/bart/slurm-tmp5/src/common/layouts_mgr.c: In function ‘_layouts_load_automerge’: /home/bart/slurm-tmp5/src/common/layouts_mgr.c:363:21: error: ‘*’ in boolean context, suggest ‘&&’ instead [-Werror=int-in-bool-context] *lvalue = *lvalue * *rvalue; \ ~~~~~~~~^~~~ /home/bart/slurm-tmp5/src/common/layouts_mgr.c:1034:4: note: in expansion of macro ‘_entity_update_kv_helper’ _entity_update_kv_helper(type_t, operator); \ ^~~~~~~~~~~~~~~~~~~~~~~~ /home/bart/slurm-tmp5/src/common/layouts_mgr.c:1086:4: note: in expansion of macro ‘_layouts_load_merge’ _layouts_load_merge(bool, s_p_get_boolean); [tag] [reply] [−] Private Comment 16 Bug 4062
-
Felip Moll authored
Setting -Werror tricks the test into failing on the "error: In the GNU C Library, "major" is defined by <sys/sysmacros.h>. For historical compatibility, it is currently defined by <sys/types.h> as well, but we plan to remove this soon. To use "major", include <sys/sysmacros.h> directly. If you did not intend to use a system-defined macro "major", you should undefine it after including <sys/types.h>." error. Since the normal Slurm build uses -Werror, this warning on including both headers will then cause the build itself to fail. Bug 3982.
-
- 24 Oct, 2017 6 commits
-
-
Marshall Garey authored
Bug 4246
-
Brian Christiansen authored
Bug 4246
-
Brian Christiansen authored
Bug 4246
-
Brian Christiansen authored
when a job spans a reservation that was modified (multiple records for the same reservation). Bug 4246
-
Brian Christiansen authored
Bug 4246
-
Alejandro Sanchez authored
Down waking nodes right after ResumeTimeout has been reached if they are not responding. Otherwise we have to wait for ping_nodes() to handle this work, thus SlurmdTimeout comes into play giving the sensation to the end user that nodes got stuck in ALLOCATED# and job in CF state until ping_nodes() decides to mark them DOWN and requeue the job. Bug 4182
-
- 19 Oct, 2017 5 commits
-
-
Morris Jette authored
Update to commit 859f6c82
-
Tim Wickberg authored
-
Dominik Bartkiewicz authored
E.g., gpu:tesla:2 would have parsed as quantity "telsa" of gpu. Change the index value to the end of the array (-1 index value). Bug 4250.
-
Dominik Bartkiewicz authored
Rather than end up with "%.-1s" printed out in the output as snprintf refused to parse the format specifier. Bug 4164.
-
Felip Moll authored
bugzilla #4238 - Added a hint for the window manager to all popups and windows in order to get max, min, close buttons in Gnome, XFCE, and others.
-
- 18 Oct, 2017 1 commit
-
-
Danny Auble authored
Bug 4244
-