Commits · a4bea076a17a569d08f6a083dcdc8aa000ef16dc · Manuel G. Marciani / ces_slurm_simulator

10 Apr, 2012 1 commit
- Add internal links to MPI guide sub-sections · a4bea076
  jette authored Apr 09, 2012
  
  a4bea076
09 Apr, 2012 2 commits
- BGQ - fixed issue where, if a user asked for a specific node count and more tasks than possible without overcommit, the request would be allowed on more nodes than requested. · 19845159
  Danny Auble authored Apr 09, 2012
  
  19845159
- Decrease timer in slurmctld/agent for better performance · 3543c7a3
  Morris Jette authored Apr 09, 2012
  
  3543c7a3
07 Apr, 2012 1 commit

Improve slurmctld performance for typical configuration · c84d1e0b

Morris Jette authored Apr 06, 2012

These changes reduce the overhead of some functions for improved
slurmctld throughput.

Overhead of slurm_preempt_init() goes from 20% of CPU time to NIL
and if preemption is disabled then slurm_job_preempt_check() goes
from 12% of CPU time to NIL.

c84d1e0b

06 Apr, 2012 6 commits
- Avoid uninitialized variable use · 18992a24
  Morris Jette authored Apr 06, 2012
  
  18992a24
- Remove redundant NULL return from xmalloc · 4b30af95
  Morris Jette authored Apr 06, 2012
  
  4b30af95
- BGQ - remove return if previous state == current state of object when an event comes in through the realtime interface. · 37d7cb33
  Danny Auble authored Apr 05, 2012
  
  37d7cb33
- Add new stress test to run 2000 jobs and time it per second · 77e2f0e3
  Danny Auble authored Apr 05, 2012
  
  77e2f0e3
- change test to use new globals function · 4b57557c
  Danny Auble authored Apr 05, 2012
  
  4b57557c
- add new wait_for_all_jobs function to globals · e3b4df9e
  Danny Auble authored Apr 05, 2012
  
  e3b4df9e
05 Apr, 2012 3 commits
- Merge branch 'slurm-2.3' · 88ba26d5
  Morris Jette authored Apr 05, 2012
  
  88ba26d5
- Prevent users from extending the EndTime of running jobs · 62edab22
  Don Lipari authored Apr 04, 2012
```
While safeguards are in place to prevent unauthorized users from extending the
TimeLimit of their running jobs, there were no such restrictions for extending
the EndTime.  This patch adds the same constraints to modifying EndTime that
currently exist for modifying TimeLimit.
```
  62edab22
- Add an authentication credential to the PMI2 spawn job RPC · 1f6957b7
  Morris Jette authored Apr 04, 2012
  
  1f6957b7
04 Apr, 2012 2 commits
- Minor changes to log messages for better clarity · 9c850f4b
  Morris Jette authored Apr 04, 2012
  
  9c850f4b
- Add MPICH2/PMI2 test · a11e703c
  Morris Jette authored Apr 04, 2012
  
  a11e703c
03 Apr, 2012 8 commits
- Disable open file test with mpi/pmi2 plugin · c0c1cd43
  Morris Jette authored Apr 03, 2012
  
  c0c1cd43
- Add new function slurm_mpi_plugin_init() · 1011d8b8
  Morris Jette authored Apr 03, 2012
```
The new function permits API users to specify a non-default MPI plugin
before any other APIs launch the default plugin.
```
  1011d8b8
- More PMI2 cosmetic mods · 76cf8063
  Morris Jette authored Apr 03, 2012
  
  76cf8063
- Minor updates to PMI2 code and documentation · 49e07b2d
  Morris Jette authored Apr 03, 2012
```
Add documentation for the mpi/pmi2 plugin.
Minor changes to code formatting and logic, but old code should work fine.
```
  49e07b2d
- add (void) argument to mpi plugin p_mpi_hook_client_single_task_per_node function · a1ef8b37
  Morris Jette authored Apr 03, 2012
```
No change in logic
```
  a1ef8b37
- Original PMI2 plugin from Hongjia Cao, NUDT · 9976beaa
  Hongjia Cao authored Apr 03, 2012
  
  9976beaa
- Merge branch 'slurm-2.3' · 58189717
  Morris Jette authored Apr 02, 2012
  
  58189717
- Limit depth of circular job dependency check · 0caecbc5
  Morris Jette authored Apr 02, 2012
```
Add support for new SchedulerParameters of max_depend_depth defining the
maximum number of jobs to test for circular dependencies (i.e. job A waits
for job B to start and job B waits for job A to start). Default value is
10 jobs.
```
  0caecbc5
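
The depth limit described in the commit above can be illustrated with a minimal Python sketch. This is not Slurm's C implementation: the `deps` mapping, the function name `has_circular_dependency`, and the breadth-first traversal are assumptions made for illustration. Only the `max_depend_depth` name and its default of 10 jobs come from the commit message.

```python
from collections import deque


def has_circular_dependency(job_id, deps, max_depend_depth=10):
    """Return True if job_id is found to depend on itself.

    deps maps a job to the jobs it waits for (hypothetical structure).
    At most max_depend_depth jobs are tested, so a very long cycle
    can go undetected; that is the trade-off the limit accepts.
    """
    queue = deque(deps.get(job_id, ()))
    seen = set()
    tested = 0
    while queue and tested < max_depend_depth:
        current = queue.popleft()
        if current == job_id:
            return True  # walked back to the starting job
        if current in seen:
            continue  # already expanded, do not count it again
        seen.add(current)
        tested += 1
        queue.extend(deps.get(current, ()))
    return False


# Job A waits for job B and job B waits for job A.
print(has_circular_dependency("A", {"A": ["B"], "B": ["A"]}))
```

Bounding the number of jobs tested keeps the check cheap on long dependency chains, at the cost of missing cycles longer than the limit.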
02 Apr, 2012 9 commits

Merge branch 'slurm-2.3' · 0b7a56ca
Morris Jette authored Apr 02, 2012
```
Conflicts:
	NEWS
```
0b7a56ca
Note gres File option does not support regular expressions. · fce94e9f
Morris Jette authored Apr 02, 2012

fce94e9f
Improve MPI document formatting · c5436151
Morris Jette authored Apr 02, 2012

c5436151
Add UPC documentation · 1dcdfba2
Morris Jette authored Apr 02, 2012

1dcdfba2
Add Hongjia Cao as a primary SLURM developer · 06c92c25
Morris Jette authored Apr 02, 2012

06c92c25
Update another web pointer to mail archive · e262bd02
Morris Jette authored Mar 28, 2012

e262bd02

Fix in select/cons_res+topology+job with node range count · cd84134c

Morris Jette authored Mar 28, 2012

The problem was conflicting logic in the select/cons_res plugin. Some of the code was trying to give the job the maximum node count in the range, while other logic was trying to minimize spreading the job across multiple switches. As you note, this problem only happens when a range of node counts is specified with both the select/cons_res plugin and the topology/tree plugin in use, and even then it is not easy to reproduce (you included all of the details below).

Quoting Martin.Perry@Bull.com:

> Certain combinations of topology configuration and srun -N option produce
> spurious job rejection with "Requested node configuration is not
> available" with select/cons_res. The following example illustrates the
> problem.
>
> [sulu] (slurm) etc> cat slurm.conf
> ...
> TopologyPlugin=topology/tree
> SelectType=select/cons_res
> SelectTypeParameters=CR_Core
> ...
>
> [sulu] (slurm) etc> cat topology.conf
> SwitchName=s1 Nodes=xna[13-26]
> SwitchName=s2 Nodes=xna[41-45]
> SwitchName=s3 Switches=s[1-2]
>
> [sulu] (slurm) etc> sinfo
> PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
> ...
> jkob         up   infinite      4   idle xna[14,19-20,41]
> ...
>
> [sulu] (slurm) etc> srun -N 2-4 -n 4 -p jkob hostname
> srun: Force Terminated job 79
> srun: error: Unable to allocate resources: Requested node configuration is
> not available
>
> The problem does not occur with select/linear, or topology/none, or if -N
> is omitted, or for certain other values for -N (for example, -N 4-4 and -N
> 2-3 work ok). The problem seems to be in function _eval_nodes_topo in
> src/plugins/select/cons_res/job_test.c. The srun man page states that when
> -N is used, "the job will be allocated as many nodes as possible within
> the range specified and without delaying the initiation of the job."
> Consistent with this description, the requested number of nodes in the
> above example is 4 (req_nodes=4).  However, the code that selects the
> best-fit topology switches appears to make the selection based on the
> minimum required number of nodes (min_nodes=2). It therefore selects
> switch s1.  s1 has only 3 nodes from partition jkob. Since this is fewer
> than req_nodes the job is rejected with the "node configuration" error.
>
> I'm not sure where the code is going wrong.  It could be in the
> calculation of the number of needed nodes in function _enough_nodes.  Or
> it could be in the code that initializes/updates req_nodes or rem_nodes. I
> don't feel confident that I understand the logic well enough to propose a
> fix without introducing a regression.
>
> Regards,
> Martin

cd84134c
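
The mismatch reported above can be reproduced with a toy model in Python. This is only a sketch of the reasoning in the report, not the code in `_eval_nodes_topo` and not the actual fix: the helper names `best_fit_switch` and `allocate` and the `select_by` parameter are hypothetical. The node names and switch layout are taken from the quoted `topology.conf` and `sinfo` output.

```python
def best_fit_switch(switch_nodes, needed):
    """Pick the switch with the fewest available nodes that still has `needed`."""
    candidates = [
        (len(nodes), name)
        for name, nodes in switch_nodes.items()
        if len(nodes) >= needed
    ]
    return min(candidates)[1] if candidates else None


# Idle nodes of partition jkob under each switch (from the report).
available = {
    "s1": {"xna14", "xna19", "xna20"},
    "s2": {"xna41"},
    "s3": {"xna14", "xna19", "xna20", "xna41"},  # s3 spans s1 and s2
}


def allocate(switch_nodes, req_nodes, select_by):
    """Choose a switch using `select_by` nodes, then require req_nodes on it."""
    switch = best_fit_switch(switch_nodes, select_by)
    if switch is None or len(switch_nodes[switch]) < req_nodes:
        return None  # "Requested node configuration is not available"
    return switch


# srun -N 2-4: min_nodes=2, req_nodes=4
print(allocate(available, req_nodes=4, select_by=2))  # switch chosen by min_nodes
print(allocate(available, req_nodes=4, select_by=4))  # switch chosen by req_nodes
```

In this model, selecting by the minimum node count settles on s1, which cannot supply the four requested nodes, while selecting by the requested count reaches s3.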

Format change, no change in logic · 92d99010
Morris Jette authored Mar 28, 2012

92d99010

Use site maximum for option switch wait time. · 2581fe62

Morris Jette authored Mar 27, 2012

When the optional max_time is not specified for --switches=count, the site
max (SchedulerParameters=max_switch_wait=seconds) is used for the job.
Based on patch from Rod Schultz.

2581fe62
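
The fallback described above can be sketched in Python as follows. The function name `parse_switches_option` is hypothetical, and the sketch assumes the option value has the form `count[@max_time]` with `max_time` given as plain seconds, which is a simplification for illustration rather than Slurm's own parsing.

```python
def parse_switches_option(value, site_max_switch_wait):
    """Split a --switches value into (count, wait_seconds).

    If no max_time follows the '@', fall back to the site maximum
    (SchedulerParameters=max_switch_wait=seconds).
    """
    count, sep, max_time = value.partition("@")
    wait = int(max_time) if sep else site_max_switch_wait
    return int(count), wait


print(parse_switches_option("1", 300))     # no max_time given: site max applies
print(parse_switches_option("2@60", 300))  # explicit max_time is kept
```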

30 Mar, 2012 3 commits
- Fixed moab_2_slurmdb.pl script to correctly work for end records. · 046a633b
  Danny Auble authored Mar 30, 2012
  
  046a633b
- BLUEGENE - fix a host of memory leaks · 360e4c7c
  Danny Auble authored Mar 29, 2012
  
  360e4c7c
- sview - add norealtime to the set of flags that can be added or removed. · 3c3e468c
  Danny Auble authored Mar 29, 2012
  
  3c3e468c
29 Mar, 2012 3 commits

Added GrpCPUMins to the output of sshare -l for those using hard limit accounting. · d1ae3d81
Mark Nelson authored Mar 28, 2012

Work contributed by Mark Nelson.

d1ae3d81

Fix in select/cons_res+topology+job with node range count · f64b29a2

Morris Jette authored Mar 28, 2012

The problem was conflicting logic in the select/cons_res plugin. Some of the code was trying to give the job the maximum node count in the range, while other logic was trying to minimize spreading the job across multiple switches. As you note, this problem only happens when a range of node counts is specified with both the select/cons_res plugin and the topology/tree plugin in use, and even then it is not easy to reproduce (you included all of the details below).

Quoting Martin.Perry@Bull.com:

> Certain combinations of topology configuration and srun -N option produce
> spurious job rejection with "Requested node configuration is not
> available" with select/cons_res. The following example illustrates the
> problem.
>
> [sulu] (slurm) etc> cat slurm.conf
> ...
> TopologyPlugin=topology/tree
> SelectType=select/cons_res
> SelectTypeParameters=CR_Core
> ...
>
> [sulu] (slurm) etc> cat topology.conf
> SwitchName=s1 Nodes=xna[13-26]
> SwitchName=s2 Nodes=xna[41-45]
> SwitchName=s3 Switches=s[1-2]
>
> [sulu] (slurm) etc> sinfo
> PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
> ...
> jkob         up   infinite      4   idle xna[14,19-20,41]
> ...
>
> [sulu] (slurm) etc> srun -N 2-4 -n 4 -p jkob hostname
> srun: Force Terminated job 79
> srun: error: Unable to allocate resources: Requested node configuration is
> not available
>
> The problem does not occur with select/linear, or topology/none, or if -N
> is omitted, or for certain other values for -N (for example, -N 4-4 and -N
> 2-3 work ok). The problem seems to be in function _eval_nodes_topo in
> src/plugins/select/cons_res/job_test.c. The srun man page states that when
> -N is used, "the job will be allocated as many nodes as possible within
> the range specified and without delaying the initiation of the job."
> Consistent with this description, the requested number of nodes in the
> above example is 4 (req_nodes=4).  However, the code that selects the
> best-fit topology switches appears to make the selection based on the
> minimum required number of nodes (min_nodes=2). It therefore selects
> switch s1.  s1 has only 3 nodes from partition jkob. Since this is fewer
> than req_nodes the job is rejected with the "node configuration" error.
>
> I'm not sure where the code is going wrong.  It could be in the
> calculation of the number of needed nodes in function _enough_nodes.  Or
> it could be in the code that initializes/updates req_nodes or rem_nodes. I
> don't feel confident that I understand the logic well enough to propose a
> fix without introducing a regression.
>
> Regards,
> Martin

f64b29a2

Format change, no change in logic · ebca432e
Morris Jette authored Mar 28, 2012

ebca432e

28 Mar, 2012 2 commits
- BLUEGENE - only call the fini for the plugin if on a bluegene system · 825c8eb7
  Danny Auble authored Mar 28, 2012
  
  825c8eb7
- BGQ - better cleanup for ending the status threads. · 8db1b04f
  Danny Auble authored Mar 28, 2012
  
  8db1b04f