Commits · b9a3b4a4873445d4520a99297a5af097140ed069 · Manuel G. Marciani / ces_slurm_simulator

27 Nov, 2012 14 commits
- Fix display of steps with 'scontrol show step' · b9a3b4a4
  Nathan Yee authored Nov 27, 2012
  
  b9a3b4a4
- Merge remote-tracking branch 'origin/slurm-2.4' · ec2a48d7
  Danny Auble authored Nov 27, 2012
  
  ec2a48d7
- BLUEGENE - clearer debug message · ba74dcd4
  Danny Auble authored Nov 27, 2012
  
  ba74dcd4
- BGQ - handle pending actions on a block better when trying to deallocate it. · e4431036
  Danny Auble authored Nov 27, 2012
  
  e4431036
- BLUEGENE - With Dynamic layout mode - Fix issue where if a larger block · 0dad50ff
  Danny Auble authored Nov 27, 2012
```
was already in error and isn't deallocating and underlying hardware goes
bad one could get overlapping blocks in error making the code assert when
a new job request comes in.
```
  0dad50ff
- Merge branch 'slurm-2.4' · 83fd28f1
  Morris Jette authored Nov 27, 2012
  
  83fd28f1
- Increase sbcast credential cache from 64 to 256 entries · 60143498
  Morris Jette authored Nov 27, 2012
  
  60143498
- Add version number to web page header · 7bca73e7
  Morris Jette authored Nov 26, 2012
  
  7bca73e7
- Correction for missing bracket in previous commit · a6875f55
  Morris Jette authored Nov 26, 2012
  
  a6875f55
- BGQ - Add 64 tasks per node as a valid option for srun when used with · d3435cfc
  Danny Auble authored Nov 26, 2012
```
overcommit.
```
  d3435cfc
- BGQ - Add 64 tasks per node as a valid option for srun when used with · 4f085be3
  Danny Auble authored Nov 26, 2012
```
overcommit.
```
  4f085be3
- If the PrologSlurmctld fails, then requeue the job indefinitely · ea8818bb
  Morris Jette authored Nov 26, 2012
```
Previously only requeued the job once
```
  ea8818bb
- Tweak test for memory limit enforcement, avoid oversubscription and test failing · 581cc216
  Morris Jette authored Nov 26, 2012
  
  581cc216
- Cosmetic changes · cbfacd24
  Morris Jette authored Nov 26, 2012
  
  cbfacd24
26 Nov, 2012 10 commits
- Fix open files open on fork/exec of slurmstepd · 4520730a
  Danny Auble authored Nov 26, 2012
  
  4520730a
- Merge branch 'master' of https://github.com/SchedMD/slurm · cd99545c
  jette authored Nov 26, 2012
  
  cd99545c
- Reset a reservation's nodes when it's partition's node set changes · 6ee4a730
  jette authored Nov 26, 2012
```
Without this change, the nodes associated with a reservation would
only be updated if the partition's nodes were reset using the
"scontrol update partition" command, but not if they were reset
using "scontrol reconfigure"
```
  6ee4a730
- ENERGY - RAPL - make it so the pkg2cpu is set only once instead of over · 75e48ad9
  Danny Auble authored Nov 26, 2012
```
written by other cpus sharing the package
```
  75e48ad9
- Energy RAPL - alter code to close open files (and only open them once · dfe86832
  Danny Auble authored Nov 26, 2012
```
where needed)
```
  dfe86832
- Fix for srun command to log job timeout · a625f282
  jette authored Nov 26, 2012
```
This reverts most of commit
https://github.com/SchedMD/slurm/commit/570941362ffdc57e9e3d4723bc4f728ae04789d8
and adds a call from slurmctld to srun prior to deallocating nodes and
notifying slurmd to cancel the tasks
```
  a625f282
- Merge branch 'master' of https://github.com/SchedMD/slurm · 29fd445d
  jette authored Nov 26, 2012
  
  29fd445d
- Modify srun to abandon I/O 60 seconds after the last task ends · 09d0935f
  jette authored Nov 26, 2012
```
Otherwise an aborted slurmstepd can cause the srun process to hang
indefinitely; a problem reported in trouble ticket 149.
```
  09d0935f
- Add missing header to avoid warning · db611576
  Morris Jette authored Nov 26, 2012
  
  db611576
- Add timeout on srun's I/O connect message to better handle some failure modes · 8405b4eb
  Morris Jette authored Nov 21, 2012
```
If the slurmstepd connects task I/O, but aborts after srun accepts the connect
and before slurmstepd writes data then srun could possibly hand indefinitely.
This probably does not explain failures seen at CEA, but can't hurt matters.
then the sr
```
  8405b4eb
25 Nov, 2012 3 commits
- Merge branch 'slurm-2.4' · c25595ff
  jette authored Nov 25, 2012
  
  c25595ff
- Add sleep to munge call on socket timeout · 669f0260
  jette authored Nov 25, 2012
```
This could happen in srun too and would typically indicate that
the munged is too busy to respond
```
  669f0260
- Removed some unused variables · 3f1552be
  jette authored Nov 25, 2012
  
  3f1552be
22 Nov, 2012 7 commits
- BGQ - fixed minor typos to run on a real system · 94c9c61a
  Danny Auble authored Nov 21, 2012
  
  94c9c61a
- Add an extra spot for allocation · e92176a9
  Danny Auble authored Nov 21, 2012
  
  e92176a9
- Validate we allocated things correctly · 27e56dfa
  Danny Auble authored Nov 21, 2012
  
  27e56dfa
- CRAY - allow steps to be made in slurmctld · cb657188
  Danny Auble authored Nov 21, 2012
  
  cb657188
- Comment out now check on launch plugins since the it makes the function · 57094136
  Danny Auble authored Nov 21, 2012
```
useless in most cases.
```
  57094136
- Cray - Add message thread for handling messages from the slurmctld and · 97e6b2eb
  Danny Auble authored Nov 21, 2012
```
introduce step accounting for a Cray.
```
  97e6b2eb
- remove extra hooks just added (figured out a better way to do it) · a58f4535
  Danny Auble authored Nov 21, 2012
  
  a58f4535
21 Nov, 2012 6 commits

Remove some currently unused variables · 3e8b10b3
Morris Jette authored Nov 21, 2012

3e8b10b3
Very minor format change to conform with Linux kernel coding style · c4eb8b1a
Morris Jette authored Nov 21, 2012

c4eb8b1a

slurmstepd : correct a bug in the IO thread termination monitoring · f297242e

Matthieu Hautreux authored Nov 13, 2012

A dedicated thread (_kill_thr) is launched by slurmstepd at the end of a
step in order to destroy the IO thread if it does not manage to correctly
terminate by itself after 300 seconds.

Two bugs are corrected in this logic by this patch.

First, the performed sleep(300) is not protected against interruptions
and this delay can be reduced to a few seconds in case of signals received
by slurmstepd, thus, reducing the delay and forcing the IO thread to
terminate before the expiration of the grace time. The logic is modified
to ensure that the delay is respected using a loop around the sleep().

Second, to terminate the IO thread, a SIGKILL is delivered to the IO thread
using pthread_kill. However, sending SIGKILL using pthread_kill is a
process-wide operation (see man pthread_kill), thus all the slurmstepd
threads are killed and slurmstepd is terminated. This logic is modified
by using pthread_cancel() instead of pthread_kill() thus letting the
pthread_join() of _wait_for_io() having a chance to act as expected.

Without this patch, when _kill_thr is interrupted, slurmstepd is
terminated, letting the step in a incomplete state, as the node may not
have been able to send the REQUEST_STEP_COMPLETE to the controler.
Thus, consecutive steps can no longer be executed and stay permanently in
the "Job step creation temporarily disabled, retrying" state.

f297242e

Correct a bug with -w in step management resulting in inadequate memory errors returned to srun · ac86cc37

Matthieu Hautreux authored Nov 12, 2012

When requesting a particular nodelist for a step, if at least one of the node is
still used by a former step (no REQUEST_STEP_COMPLETE received from that node),
the current behavior is to return ESLURM_INVALID_TASK_MEMORY and srun aborting
with "Memory required by task is not available".

This can be reproduced by launching consecutive steps with the -w parameter set
to $SLURM_NODELIST and introducing delays in the spank epilog on the execution
nodes.

The behavior is changed to only defer the execution of the step by returning
ESLURM_NODES_BUSY when it is detected that some nodes are blocked because of
already used memory.

ac86cc37

Correct a bug in consecutive steps management due to asynchronous step completions · 4c97337d

Matthieu Hautreux authored Nov 12, 2012

When using consecutive steps, it appears that in some cases, the time required
by the slurmstepd on the execution nodes to inform the controler of the completion
of the step is higher than the time required to request the following step.
In that scenario, the controler can reject the step by returning the error code
ESLURM_REQUESTED_NODE_CONFIG_UNAVAILABLE even if the step could be executed if
all the former steps were correctly finished.

This can be reproduced by launching consecutive steps and introducing dalys in
the spank epilog on the execution nodes.

The behavior is changed to only defer the execution of the step by returning
ESLURM_NODES_BUSY when all the available nodes are not idle considering the
former steps.

4c97337d

Modify poe step timeout logic to log step termination as being due to time limit · 8c026516
Morris Jette authored Nov 21, 2012

8c026516