- 09 Aug, 2011 5 commits
-
-
Morris Jette authored
On Cray systems only, the value of avail_node_bitmap was not being properly set for non-responsive nodes.
-
Morris Jette authored
A node DOWN to ALPS will be marked DOWN to SLURM only after reaching SlurmdTimeout. In the interim, the node state will be NO_RESPOND. This change makes SLURM handling of the node DOWN state more consistent with ALPS. This change affects only Cray systems.
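The state handling described in this commit message can be illustrated with a minimal sketch. This is not the slurmctld code: the enum, the function name `demo_resolve_state()` and its parameters are hypothetical, and treating "reaching SlurmdTimeout" as an inclusive comparison is an assumption.

```c
/* Hypothetical, simplified node states for illustration only. */
enum demo_node_state {
    DEMO_NODE_UP,
    DEMO_NODE_NO_RESPOND,
    DEMO_NODE_DOWN
};

/*
 * Decide the state SLURM reports for a node (sketch).
 *   alps_down         - non-zero if ALPS reports the node as DOWN
 *   secs_unresponsive - seconds since the node last responded
 *   slurmd_timeout    - configured SlurmdTimeout, in seconds
 * Assumption: the node becomes DOWN once the unresponsive time is
 * greater than or equal to SlurmdTimeout.
 */
enum demo_node_state demo_resolve_state(int alps_down,
                                        long secs_unresponsive,
                                        long slurmd_timeout)
{
    if (!alps_down)
        return DEMO_NODE_UP;
    if (secs_unresponsive >= slurmd_timeout)
        return DEMO_NODE_DOWN;
    /* DOWN to ALPS, but SlurmdTimeout not yet reached. */
    return DEMO_NODE_NO_RESPOND;
}
```

With a hypothetical SlurmdTimeout of 300 seconds, a node that ALPS reports as DOWN would stay NO_RESPOND for the first 299 seconds in this sketch.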
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Fix the node state accounting to be consistent with the node state set by ALPS.
-
- 08 Aug, 2011 2 commits
-
-
Morris Jette authored
Split set_node_down() into two functions: set_node_down() continues to accept a node name as an argument, while the new set_node_down_ptr() accepts a node pointer as an argument and is faster.
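A minimal sketch of why the pointer variant can be faster: the name variant has to look the node up before it can change its state, while the pointer variant skips that lookup. Only the two function names come from the commit message. The signatures, the return value, the `demo_node` structure, the node table and `demo_find_node()` are hypothetical stand-ins, not SLURM's actual definitions.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical, simplified node record; the real structure is larger. */
enum demo_node_state { DEMO_NODE_IDLE, DEMO_NODE_DOWN };

struct demo_node {
    const char *name;
    enum demo_node_state state;
    const char *reason;
};

/* Hypothetical node table used only by this sketch. */
static struct demo_node demo_nodes[] = {
    { "nid00001", DEMO_NODE_IDLE, NULL },
    { "nid00002", DEMO_NODE_IDLE, NULL },
};
#define DEMO_NODE_CNT (sizeof(demo_nodes) / sizeof(demo_nodes[0]))

/* Name lookup: this is the cost the pointer variant avoids. */
static struct demo_node *demo_find_node(const char *name)
{
    for (size_t i = 0; i < DEMO_NODE_CNT; i++) {
        if (strcmp(demo_nodes[i].name, name) == 0)
            return &demo_nodes[i];
    }
    return NULL;
}

/* Pointer variant: the caller already holds the record. */
void set_node_down_ptr(struct demo_node *node_ptr, const char *reason)
{
    assert(node_ptr != NULL);
    node_ptr->state = DEMO_NODE_DOWN;
    node_ptr->reason = reason;
}

/*
 * Name variant: resolve the name, then delegate to the pointer variant.
 * Returning 0 / -1 is an assumption made for this sketch.
 */
int set_node_down(const char *name, const char *reason)
{
    struct demo_node *node_ptr = demo_find_node(name);

    if (node_ptr == NULL)
        return -1;
    set_node_down_ptr(node_ptr, reason);
    return 0;
}
```

Keeping the name variant as a thin wrapper means existing callers are unaffected, while code that already iterates over node records can call the pointer variant directly.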
-
Morris Jette authored
Test4.5 was failing because it could not parse a node count with a "K" suffix and because the case of the node state name had changed.
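The kind of parsing involved can be sketched as follows. The SLURM test suite itself is written in Expect, so this C function is only an illustration of the idea. The name `demo_parse_node_count()` is hypothetical, and treating "K" as a multiplier of 1024 is an assumption.

```c
#include <errno.h>
#include <limits.h>
#include <stddef.h>
#include <stdlib.h>

/*
 * Parse a node count such as "512" or "2K" (sketch).
 * Returns the count, or -1 if the string cannot be parsed.
 * Assumption: a trailing "K" (or "k") multiplies the value by 1024.
 */
long demo_parse_node_count(const char *str)
{
    char *end = NULL;
    long val;

    if (str == NULL || *str == '\0')
        return -1;

    errno = 0;
    val = strtol(str, &end, 10);
    if (errno != 0 || end == str || val < 0)
        return -1;              /* overflow, no digits, or negative */

    if (*end == 'K' || *end == 'k') {
        if (val > LONG_MAX / 1024)
            return -1;          /* multiplication would overflow */
        val *= 1024;
        end++;
    }

    /* Anything left after the optional suffix is an error. */
    return (*end == '\0') ? val : -1;
}
```

A parser that only accepts digits would reject a value such as "2K" outright, which is the sort of failure the commit message describes.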
-
- 06 Aug, 2011 2 commits
-
-
Danny Auble authored
state of block to Free if need be instead of leaving it in Term
-
Morris Jette authored
Modify salloc, sbatch and srun man pages to clarify how max node count is used.
-
- 05 Aug, 2011 4 commits
-
-
Danny Auble authored
be the same.
-
Danny Auble authored
previously marked down by alps.
-
Danny Auble authored
previously marked down by alps.
-
Danny Auble authored
set.
-
- 04 Aug, 2011 7 commits
-
-
Morris Jette authored
Require SchedulerTimeSlice configuration parameter to be at least 5 seconds to avoid thrashing slurmd daemon. Addresses Cray bug 774692
-
Morris Jette authored
Change in GRES behavior for job steps: A job step's default generic resource allocation will be set to that of the job. If a job step's --gres value is set to "none" then none of the generic resources which have been allocated to the job will be allocated to the job step. Add srun environment value of SLURM_STEP_GRES to set default --gres value for a job step.
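The defaulting rules in this commit message can be sketched as a small pure function. This is not the srun implementation. The name `demo_step_gres()` is hypothetical, the precedence of the command-line value over SLURM_STEP_GRES is an assumption, and so is the case-sensitive match on "none".

```c
#include <stddef.h>
#include <string.h>

/*
 * Resolve the generic resources for a job step (sketch).
 *   cli_gres - value of the step's --gres option, or NULL if not given
 *   env_gres - value of SLURM_STEP_GRES, or NULL if not set
 *   job_gres - generic resources allocated to the job
 * Assumption: the command line overrides the environment variable,
 * which in turn overrides the job's allocation.
 */
const char *demo_step_gres(const char *cli_gres, const char *env_gres,
                           const char *job_gres)
{
    const char *requested = (cli_gres != NULL) ? cli_gres : env_gres;

    if (requested == NULL)
        return job_gres;        /* default: same as the job */
    if (strcmp(requested, "none") == 0)
        return "";              /* step gets none of the job's GRES */
    return requested;
}
```

A caller would typically pass `getenv("SLURM_STEP_GRES")` as the second argument; it is kept as a parameter here so the function has no hidden dependency on the environment.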
-
Morris Jette authored
Major update to src/common/node_select.h to document the API calls. Minor updates to node select plugin web page.
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
different structures one in the bluegene plugin and one in common.
-
- 03 Aug, 2011 14 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Add SLURM_GRES environment variable to srun.
-
Morris Jette authored
The information describing the select plugin APIs did not reflect modifications made in SLURM version 2.3. Documentation for many functions was added and other function descriptions were modified.
-
Morris Jette authored
-
Morris Jette authored
On Bluegene systems, smap's command-line mode would generate an invalid memory reference due to an uninitialized variable.
-
Morris Jette authored
In test8.8, set a time limit and explicitly spawn bash in order to ensure we don't just use SallocDefault
-
Morris Jette authored
Modify test1.45 to work on BlueGene systems. Minimum job allocation size is set equal to the smallest block size.
-
Morris Jette authored
Remove the following select plugin functions: select_p_ba_reset(), select_p_ba_request_apply() and select_p_ba_remove_block(). They are no longer used.
-
Danny Auble authored
will try to register when we are running from the slurmctld.
-
Danny Auble authored
-
Danny Auble authored
a POLLERR the dbd_fail callback is called.
-
- 02 Aug, 2011 6 commits
-
-
Danny Auble authored
-
Danny Auble authored
the DBD where both remained up but were disconnected the slurmctld would get registered again with the DBD.
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Modify the error message if srun's task count exceeds the node count. Log using info() rather than error() and explain how it is handled.
-
Morris Jette authored
This adds another argument to the test using SLURM's internal API assoc_mgr_init(), modified in commit 1b838865 on July 29, 2011
-