- 30 Jun, 2016 4 commits
-
-
Alejandro Sanchez authored
In bug #1499, customer asked to precisely identify which srun options apply to job allocations, which to step allocations and which apply to both. So the srun man page has been updated with this information.
-
Morris Jette authored
Move test for job wait reason value of BurstBufferResources and BurstBufferStageIn later in the scheduling logic. This will result in job reasons related to QOS, account, and partition limits to be displayed instead of burst buffer (if applicable) bug 2835
-
Danny Auble authored
id wasn't able to be made.
-
Morris Jette authored
-
- 29 Jun, 2016 4 commits
-
-
Tim Wickberg authored
-
Danny Auble authored
-
Danny Auble authored
-
Tim Wickberg authored
-
- 24 Jun, 2016 3 commits
-
-
Morris Jette authored
node_features/knl_cray plugin modified to get current node MCDRAM mode using cnselect command rather than capmc command. This gets the current BIOS setting, which could differ from what is configured to be set on boot, at least if something other than Slurm is modifying KNL modes. bug 2850
-
Morris Jette authored
node_features/knl_cray plugin modified to get current node NUMA mode using cnselect command rather than capmc command. This gets the current BIOS setting, which could differ from what is configured to be set on boot, at least if something other than Slurm is modifying KNL modes. bug 2850
-
Morris Jette authored
-
- 23 Jun, 2016 4 commits
-
-
Tim Wickberg authored
-
Morris Jette authored
-
Morris Jette authored
bug 2744
-
Tim Wickberg authored
- Previously long paths/filenames were truncated at 256 characters. Use PATH_MAX instead. - Function previously truncated the executable name at the first space character during own string copy function. Replace with strlcpy. - Refactor to remove extraneous temp variables, use strlcpy instead of snprintf, getenvp instead of local env parser.
-
- 22 Jun, 2016 2 commits
-
-
Tim Wickberg authored
-
Morris Jette authored
Cray/capmc_resume script modify a node's features (as needed) when the reinit (reboot) command is issued rather than wait for the nodes to change to the "on" state. This will insure the correct node features in the event the capmc_resume script times out or otherwise fails before the nodes come back up. bug 2832
-
- 20 Jun, 2016 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
bug 2582
-
- 19 Jun, 2016 1 commit
-
-
Morris Jette authored
The slurm_get_auth_info() function returns an xmalloc'ed string which few places were xfree'ing. The function in the past did not xmalloc a value, but returned a pointer to a cached string. When that was changed, the calls to the function were not all changed, resulting in leaks if AuthInfo was defined in slurm.conf.
-
- 15 Jun, 2016 2 commits
-
-
Tim Wickberg authored
-
Danny Auble authored
it.
-
- 14 Jun, 2016 2 commits
-
-
Tim Wickberg authored
This reverts commit 7843e158. Conflicts: NEWS
-
Morris Jette authored
Recalculate a job's memory allocation after node reboot if job requests all of a node's memory and FastSchedule=0 is configurated. Intel KNL memory size can change on reboot with various MCDRAM modes. bug 2822
-
- 13 Jun, 2016 1 commit
-
-
Morris Jette authored
Prior logic was getting confused with NUMA containing no cores. bug 2745
-
- 10 Jun, 2016 8 commits
-
-
Danny Auble authored
doesn't have a parent.
-
Danny Auble authored
-
Jason Bacon authored
-
Morris Jette authored
This adds support for burst buffer re-issue of pre-load operation when slurmctld restarts with job in configuring state. Also copy NEWS item to v16.05.
-
Morris Jette authored
Prevent job stuck in configuring state if slurmctld daemon restarted while PrologSlurmctld is running. bugs 2789 and 2810
-
Danny Auble authored
can work on a BGAS node. Bug 2806
-
Danny Auble authored
of continuously printing the message over and over as the problem will most likely not resolve itself. Bug 2797
-
Morris Jette authored
Prevent job stuck in configuring state if slurmctld daemon restarted while PrologSlurmctld is running. bugs 2789 and 2810
-
- 09 Jun, 2016 3 commits
-
-
Danny Auble authored
can work on a BGAS node. Bug 2806
-
Danny Auble authored
of continuously printing the message over and over as the problem will most likely not resolve itself. Bug 2797
-
Danny Auble authored
account or wckey.
-
- 08 Jun, 2016 1 commit
-
-
Danny Auble authored
-
- 07 Jun, 2016 3 commits
-
-
Andy Riebs authored
-
Morris Jette authored
Fix for tracking job resource allocation when slurmctld is reconfigured while Cray Node Health Check (NHC) is running. Previous logic would fail to record the job's allocation then perform release operation upon NHC completeion, resulting in underflow error messages. bug 2353
-
Dominik Bartkiewicz authored
While here, mark options const, and add leading underscore to denote this as a static function (only called within hostlist.c). Also change strcmp to xstrcmp. Commit a6ffef22 changed this function and would alter the input hn, which led to subsequent calls to the function having wrong prefix lengths for that hostrange precluding it from matching correctly. Bug 2558.
-