- 05 Dec, 2016 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
cray/burst_buffer - If the slurmctld daemon restarts with a pending job and a burst buffer whose file stage-in status is unknown, tear down the buffer, defer the job, and start stage-in over again. bug 3295
-
Morris Jette authored
Add more detail to log message and change from error to debug2 with an explanation of how this happens
-
- 02 Dec, 2016 8 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
bug 3314
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
No change in logic, just line up columns in an array structure
-
Morris Jette authored
Add support for SALLOC_CONSTRAINT, SBATCH_CONSTRAINT and SLURM_CONSTRAINT environment variables to set default constraints for salloc, sbatch and srun commands respectively. Bug 3317
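The default-from-environment behavior described above can be illustrated with a minimal sketch. It is not the Slurm source: `resolve_constraint` is a hypothetical helper, and the precedence shown (an explicit command-line value wins over the environment variable) is an assumption drawn from the word "default" in the commit message.

```c
#include <stdlib.h>

/*
 * Hypothetical helper, not part of Slurm: choose the constraint string
 * for a command. Assumed precedence: an explicit command-line value
 * wins; otherwise the named environment variable supplies the default;
 * otherwise there is no constraint (NULL).
 */
const char *resolve_constraint(const char *cli_value, const char *env_name)
{
    const char *env_value;

    if (cli_value && cli_value[0] != '\0')
        return cli_value;

    env_value = getenv(env_name);
    if (env_value && env_value[0] != '\0')
        return env_value;

    return NULL;
}
```

Under these assumptions, sbatch would call something like `resolve_constraint(opt_value, "SBATCH_CONSTRAINT")`, with salloc and srun passing `SALLOC_CONSTRAINT` and `SLURM_CONSTRAINT` respectively.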
-
- 01 Dec, 2016 8 commits
-
-
Dominik Bartkiewicz authored
-
Dominik Bartkiewicz authored
limits after the node selection to make sure it doesn't violate those limits and if it does change the reason for waiting so we don't reserve resources on jobs violating accounting limits. Bug 3029
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
The definition for "ConstrainCores" in the cgroup.conf parsing logic appeared twice. Remove the second definition.
-
Nicolas Joly authored
Bug 3301.
-
Morris Jette authored
-
Morris Jette authored
node_features/knl_cray - Fix possible race condition when changing node state that could result in an old KNL mode remaining as an active feature. bug 3235
-
- 30 Nov, 2016 7 commits
-
-
Morris Jette authored
-
Nicolas Joly authored
Fix void argument definitions as well. No functional changes. Bug 3286.
-
Morris Jette authored
-
Morris Jette authored
No change in logic
-
Morris Jette authored
cray/burst_buffer - Increase time to synchronize operations between threads from 5 to 60 seconds ("setup" operation time observed over 17 seconds). This should fix a race condition between a thread performing a buffer creation (setup) and a thread looking for unexpected buffers. If a buffer is found during the time window allowed for creation, its space will be counted twice: first by the status-checking thread and second by the thread doing the creation. The deallocation only happens once, so the used space information can be left with an invalid value. bug 3295
-
Tim Wickberg authored
Never used and is uninitialized making backtraces more confusing. Fix whitespace in bcast_parameters struct while here. No functional change.
-
Tim Wickberg authored
static variable means multiple active decompression streams will corrupt zlib's internal state, which can lead to a segfault. Bug 3299.
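The general hazard named above, function-static state shared by several active streams, can be shown with a small sketch. It does not use zlib and is not the actual fix: `struct decomp_ctx`, `consume_shared` and `consume_per_stream` are hypothetical names, and a byte counter stands in for the decompressor's internal state.

```c
#include <stddef.h>

/* Hypothetical per-stream state; a real decompressor would keep its
 * library stream object here instead of a plain counter. */
struct decomp_ctx {
    size_t bytes_out; /* running total for this stream only */
};

/* Problem pattern: one static variable is shared by every caller, so
 * two interleaved streams overwrite each other's progress. */
size_t consume_shared(size_t n)
{
    static size_t bytes_out = 0; /* shared across all streams */

    bytes_out += n;
    return bytes_out;
}

/* Safer pattern: the caller owns the state, one context per stream. */
size_t consume_per_stream(struct decomp_ctx *ctx, size_t n)
{
    ctx->bytes_out += n;
    return ctx->bytes_out;
}
```

With two contexts, interleaved calls to `consume_per_stream` keep independent totals, while interleaved calls to `consume_shared` accumulate into a single total that belongs to neither stream.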
-
- 29 Nov, 2016 7 commits
-
-
Alejandro Sanchez authored
On a reconfig, the exc_node_bitmap is cleared, but it was not rebuilt afterwards because last_work_scan was declared as a local static variable in _do_power_work(). The fix is to make it global within the plugin and reinitialize it to 0 in _init_power_config(). Bug 3078.
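The shape of this fix can be sketched as follows. This is a simplified illustration, not the plugin code: the function names drop the leading underscore, a boolean stands in for exc_node_bitmap, and the rule that a zero `last_work_scan` triggers the rebuild is an assumption.

```c
#include <stdbool.h>
#include <time.h>

/* File scope rather than a static local inside the work function, so
 * the init routine can reset it on reconfiguration. */
static time_t last_work_scan = 0;

/* Stand-in for exc_node_bitmap: true once the bitmap has been built. */
static bool exc_bitmap_built = false;

/* Simplified stand-in for _init_power_config(). */
void init_power_config(void)
{
    exc_bitmap_built = false; /* reconfig clears the bitmap */
    last_work_scan = 0;       /* force the next scan to rebuild it */
}

/* Simplified stand-in for _do_power_work(). */
void do_power_work(time_t now)
{
    /* Assumed rule: a zero timestamp marks the first scan after
     * (re)initialization, which rebuilds the bitmap. */
    if (last_work_scan == 0)
        exc_bitmap_built = true;
    last_work_scan = now;
}

bool exc_bitmap_is_built(void)
{
    return exc_bitmap_built;
}
```

Had `last_work_scan` stayed a static local of the work function, the init routine could not reset it, and the bitmap would remain cleared after a reconfig.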
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Morris Jette authored
-
Morris Jette authored
For example:
Nodes=nid00001 CPU_IDs=2-3 Mem=1000 GRES_IDX=gpu:alpha(IDX:2)
Nodes=nid00002 CPU_IDs=0-1 Mem=1000 GRES_IDX=gpu:alpha(IDX:0)
-
- 28 Nov, 2016 7 commits
-
-
Alejandro Sanchez authored
-
Tim Wickberg authored
-
Morris Jette authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Add new WHOLE_NODE_REQUIRED/WHOLE_NODE_USER/WHOLE_NODE_MCS macros to help clean up tests rather than rely on magic values. Warning: these are similar to the JOB_SHARED_ macros, but the logic for zero vs one is different. USER/MCS are the same across these. No functional change.
-
Aline Roy authored
-
Aline Roy authored
-