- 16 Aug, 2011 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- 15 Aug, 2011 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
The squeue command was printing bad I/O node or c-node information for pending jobs. There was no midplane name, just the I/O node or c-node specification.
-
Morris Jette authored
test8.22 Bluegene/Q only: Stress test of running many job step allocations within the job's allocation
test8.23 Bluegene/Q only: Test that multiple job allocations are properly packed within a midplane
-
- 13 Aug, 2011 3 commits
-
-
Morris Jette authored
These changes more thoroughly test the Bluegene/Q job step placement algorithms and validate several recent bug fixes in the SLURM code.
-
Morris Jette authored
On Bluegene/Q systems, the job step allocation sometimes needs to be larger than requested because of the job allocation geometry (e.g., a 5-cnode allocation needs to be scaled up to at least 6 cnodes). This enhancement fixes that logic for cases where multiple size increases are needed.
-
Danny Auble authored
-
- 12 Aug, 2011 9 commits
-
-
Danny Auble authored
next parallel step is run on a sub block, SLURM won't oversubscribe cnodes.
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
This reverts commit c5d63854 from 8/11/2011. The memory copy is not a leak, but is required to avoid memory corruption.
-
Morris Jette authored
Make sure that a job has a step_list before creating an iterator for it
-
Morris Jette authored
Improve logging messages and readability of some code
-
Morris Jette authored
This prevents bad node index values in a job step completion record from crashing slurmctld, which is possible if srun has bad configuration information about a job step or encounters some other failure.
-
- 11 Aug, 2011 8 commits
-
-
Morris Jette authored
Add a basic test of Bluegene/Q job step allocations within an existing job allocation.
-
Morris Jette authored
On a Bluegene/Q system, when srun's --test-only option is used within an existing allocation, launch the job directly with the slurmd daemon and do not use IBM's "runjob" command. Useful for testing.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
BLUEGENE - Modify "scontrol show step" to show I/O nodes (BGL and BGP) or c-nodes (BGQ) allocated to each step. Change field name from "Nodes=" to "BP_List=".
-
- 10 Aug, 2011 6 commits
-
-
Morris Jette authored
The test is now more generic to support all Bluegene system types
-
Danny Auble authored
cannot fit into the available shape.
-
Morris Jette authored
Modify existing tests so they all run as desired on an emulated Bluegene/Q system
-
Morris Jette authored
Previous code would fail when trying to launch more than 4096 tasks, which is a problem on BGQ systems where SLURM actually launches job steps.
-
Morris Jette authored
The SLURM_JOB_CPUS_PER_NODE and SLURM_TASKS_PER_NODE environment variables were being set improperly for IBM Bluegene systems.
-
Danny Auble authored
or not.
-
- 09 Aug, 2011 7 commits
-
-
Morris Jette authored
This change applies only to Cray systems and only when using the srun wrapper for aprun. Map --exclusive to -F exclusive and --share to -F share. Note that this does not consider the partition's Shared configuration, so it is an imperfect mapping of options.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-