- 13 Nov, 2016 1 commit
-
-
Danny Auble authored
-
- 12 Nov, 2016 1 commit
-
-
Danny Auble authored
-
- 11 Nov, 2016 16 commits
-
-
Morris Jette authored
Move where we set the configuration table bitmaps in order to support the backup slurmctld starting and recovering previously saved KNL mode information (which can necessitate rebuilding the node configuration table). bug 3241
-
Danny Auble authored
-
Tim Wickberg authored
Bug 3255.
-
Tim Wickberg authored
-
Morris Jette authored
-
Alejandro Sanchez authored
No functional change. Bug 3237.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Page needs to display properly even if the CDN gets rid of this, or for machines with no internet connectivity. Fix a few minor style issues, and remove some unused javascript.
-
Tim Wickberg authored
Change header and footer over to new design, switch around the css files, and adjust the build system to match. Design by Grant Zabriskie.
-
Tim Wickberg authored
Stop bundling the file in docs (will be preserved on SchedMD site, and is in git), and unlink from build.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Morris Jette authored
Request will be rejected without type specification
-
David Gloe authored
Bug 3253.
-
- 10 Nov, 2016 3 commits
-
-
Tim Wickberg authored
If the input value mod 512 == 0, the value would be subject to unintended rounding. Rework the function to check against this on each unit promotion. Bug 3252.
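
The idea of checking on each unit promotion can be sketched as below. This is an illustrative Python sketch only, not the Slurm C function the commit touches (the commit text does not show that code). The name `format_units`, the unit suffixes, and the divisor of 1024 are assumptions made for the example. The value is promoted to the next unit only when the division is exact, so nothing is rounded away.

```python
# Hypothetical helper; names, suffixes and divisor are assumptions for illustration.
UNITS = ["", "K", "M", "G", "T", "P"]


def format_units(value: int, divisor: int = 1024) -> str:
    """Promote value to a larger unit only while the division is exact."""
    idx = 0
    # Re-check divisibility on every promotion step, not just once up front.
    while value >= divisor and value % divisor == 0 and idx < len(UNITS) - 1:
        value //= divisor
        idx += 1
    return f"{value}{UNITS[idx]}"


print(format_units(1024))         # exact promotion
print(format_units(1536))         # not a multiple of 1024, stays unpromoted
print(format_units(1024 * 1536))  # promoted once, then stops
```

With this approach a value such as 1536 is reported unchanged rather than being rounded to a larger unit.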
-
Morris Jette authored
It was causing the loss of node available_features on startup with node_features/knl_cray. bug 3241
-
Morris Jette authored
-
- 09 Nov, 2016 3 commits
-
-
Tim Wickberg authored
-
Morris Jette authored
Set per-node HBM availability as a GRES based upon the KNL node's MCDRAM state. bug 3171
-
Alejandro Sanchez authored
Caused by race for local_energy which is dynamically allocated. Bail out of the update if that hasn't been allocated yet. Bug 3237.
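
The guard described here, skipping the update until the dynamically allocated structure exists, can be sketched as follows. This is a minimal Python illustration under stated assumptions, not the plugin's actual C code. The class `EnergyState`, its methods, and the lock are hypothetical names invented for the example; only `local_energy` comes from the commit message.

```python
import threading


class EnergyState:
    """Hypothetical stand-in for plugin state holding local_energy."""

    def __init__(self):
        self._lock = threading.Lock()
        self.local_energy = None  # allocated later, possibly by another thread

    def allocate(self):
        with self._lock:
            self.local_energy = {"consumed_joules": 0}

    def update(self, joules: int) -> bool:
        """Apply an update; return False if nothing was allocated yet."""
        with self._lock:
            if self.local_energy is None:
                return False  # bail out instead of touching unallocated state
            self.local_energy["consumed_joules"] += joules
            return True


state = EnergyState()
print(state.update(5))  # update arrives before allocation and is skipped
state.allocate()
print(state.update(5))  # update is applied once the structure exists
```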
-
- 08 Nov, 2016 5 commits
-
-
Morris Jette authored
bug 3213
-
Morris Jette authored
select/linear plugin modified to better support heterogeneous clusters when topology/none is also configured. Note that use of the select/cons_res plugin is strongly recommended for heterogeneous clusters. OverSubscribe=exclusive can be used if whole node allocations are desired. bug 3212
-
Alejandro Sanchez authored
Bug 3224.
-
Morris Jette authored
-
Morris Jette authored
If a job is started by the main scheduling logic and requeued while the backfill scheduler has locks released, that can result in an invalid data structure in select/cons_res. Namely, the backfill scheduler's attempt to start the job would clear the job resources node_bitmap. That leaves a NULL pointer in the select/cons_res plugin, generating an abort. (That pointer is needed to clean up the job allocation records when the Epilog or Cray Node Health Check, NHC, are complete and the resources become available for another job.) bug 3230
-
- 07 Nov, 2016 1 commit
-
-
Morris Jette authored
Backup slurmctld will now: 1. Not abort due to a NULL pointer (code had to be moved around on restart). 2. Recover KNL MCDRAM and NUMA modes from state save files if capmc and cnselect are not available. bug 3241
-
- 05 Nov, 2016 1 commit
-
-
Morris Jette authored
cray/burst_buffer - Update "instance" parsing to match updated dw_wlm_cli output. bug 3222
-
- 04 Nov, 2016 7 commits
-
-
Morris Jette authored
Expand the dw_wlm_cli script to include persistent and job-specific burst buffers. This script is used by burst_buffer/cray.
-
Morris Jette authored
This is a new field and the fix only applies to an emulated burst buffer configuration (i.e. dw_wlm_cli script made to look like a real DataWarp system)
-
Morris Jette authored
Change error() to verbose(). New logic is needed to address this issue once we know how to determine the KNL MCDRAM size.
-
Morris Jette authored
-
Morris Jette authored
cray/burst_buffer - Preserve job ID and don't translate to job array ID after slurmctld restart. Prior logic would not set array_task_id to NO_VAL, so all job-buffer IDs would be reported in the form "JobID=0_0(123)" rather than "JobID=123"
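
The two report forms quoted above can be reproduced with a small sketch. This is illustrative Python, not the plugin's C code. The function name `format_job_id` and its parameters are hypothetical, and the sentinel value given for `NO_VAL` is an assumption about Slurm's headers.

```python
# Assumed sentinel meaning "no value"; check slurm.h before relying on it.
NO_VAL = 0xFFFFFFFE


def format_job_id(job_id: int, array_job_id: int = 0,
                  array_task_id: int = NO_VAL) -> str:
    """Render a plain job ID unless the record is marked as an array task."""
    if array_task_id == NO_VAL:
        return f"JobID={job_id}"
    return f"JobID={array_job_id}_{array_task_id}({job_id})"


# array_task_id left at zero after a restart: reported as an array task.
print(format_job_id(123, array_job_id=0, array_task_id=0))
# array_task_id set to NO_VAL: reported as a plain job ID.
print(format_job_id(123))
```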
-
Morris Jette authored
cray/burst_buffer - Internally track both allocated and unusable space. The reported UsedSpace in a pool is now the allocated space (previously it was the unusable space). Base available space on whichever value leaves the least free space. bug 3222
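
The accounting rule can be sketched as below. This is a hedged Python illustration, not the plugin's implementation. The `Pool` class and its field names are hypothetical, and the units are arbitrary. Available space is computed from whichever of the two tracked values is larger, since that one leaves the least free space.

```python
from dataclasses import dataclass


@dataclass
class Pool:
    """Hypothetical burst buffer pool record; field names are assumptions."""
    total: int
    allocated: int
    unusable: int

    @property
    def used_space(self) -> int:
        # The reported UsedSpace is the allocated space.
        return self.allocated

    @property
    def available(self) -> int:
        # Be conservative: the larger value leaves the least free space.
        return max(self.total - max(self.allocated, self.unusable), 0)


pool = Pool(total=100, allocated=30, unusable=40)
print(pool.used_space, pool.available)
```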
-
Tim Wickberg authored
Previously disconnected from build system, and most code removed by commit 0b14a3a7 back on 15.08-pre1.
-
- 03 Nov, 2016 2 commits
-
-
Tim Wickberg authored
We don't build on Tru-64, and there are a lot more platform-dependent pieces of code within Slurm than we've indicated here.
-
Tim Wickberg authored
-