Commits · ef8cc0a78534237d8a5ab22cbd97807b378d178b · Manuel G. Marciani / ces_slurm_simulator

11 Oct, 2011 8 commits

proctrack/cgroup: no longer rely on release agent to clean step cg · ef8cc0a7

Matthieu Hautreux authored Oct 09, 2011

With release_agent notified at the step cgroup level, the step cgroup
can be removed while slurmstepd as not yet finished its internals
epilog mechanisms. Inhibiting release agent at the step level and
ensuring its proper removal helps to guarantee that the node will only
be eligible for job execution when the resources will be completely
available (no longer used by the job or the epilogs).

ef8cc0a7

xcgroup: no longer treat ESRCH as an error when adding a pid to cgroup · 871b5d33

Matthieu Hautreux authored Oct 09, 2011

A delay occurs between a task creation and its addition to a different
cgroup than the inherited one. In the meantime, the process can disapear
resulting in a ESRCH during the addition in the second cgroup. Now react
to that event as a warning instead of an error.

871b5d33

slurmstepd: Move wait-for-parent code into fork_all_tasks · 591d8934

Mark A. Grondona authored Oct 07, 2011

Move the code that waits for parent signal before exec(2) out of
exec_task() and into fork_all_tasks() directly. This makes all
the code that handles the fork-and-wait into slurmstepd/mgr.c,
and allows the exec_wait_child_wait_for_parent() function to
be used in place of explicit read().

591d8934

slurmstepd: move tty setup into fork_all_tasks · b33cd7c8

Mark A. Grondona authored Oct 07, 2011

tty setup needs to occur before child tasks block waiting from signal
to the parent, so move this code out of exec_task() into fork_all_tasks()
so that the wait-for-signal-from-parent code can also later move out
of exec_task().

b33cd7c8

slurmstepd: Fix race in run_script_as_user · 9d8ae0f7

Mark A. Grondona authored Oct 07, 2011

As reported by Sam Lang on slurm-dev, task_epilog scripts are not
held before exec, and thus there is a race condition between when
the task_epilog is launched and slurmstepd calls slurm_container_add()
during which the task_epilog script could either run to completion, or
launch other processes that escape any job container defined by
configuration.

Use the new "exec_wait" api to have the child wait before exec just
as is done in fork_all_tasks.

Based on an original idea by Sam Lang <samlang@gmail.com>.

9d8ae0f7

slurmstepd: Use exec_wait_info interface in fork_all_tasks · 6e41137a

Mark A. Grondona authored Oct 07, 2011

Remove the explicitly coded fork-and-wait-before-exec code from
slurmstepd fork_all_tasks and replace with the "exec_wait" API.
This change should be functionally identical to the previous
code.

6e41137a

slurmstepd: Add abstraction for fork-and-wait · e124e872

Mark A. Grondona authored Oct 06, 2011

Abstract the code in slurmstepd fork_all_tasks that allows the parent
to signal children before they call exec into an "exec_wait_info"
interface. This will allow the code to be easily reused in other
parts of slurmstepd (e.g. task epilog) without cut-and-paste of code.

e124e872

Fix job hold type problem · 272e3390

jette authored Oct 10, 2011

Prevent job hold by operator or account coordinator of his own job from
being an Administrator Hold rather than User Hold by default.

272e3390

07 Oct, 2011 1 commit

Prevent crash with MaxMemPerCPU=0 · 06eca2de

Morris Jette authored Oct 07, 2011

Prevent slurmctld crashing with divide by zero with a configuration of MaxMemPerCPU=0.

06eca2de

05 Oct, 2011 2 commits
- removed other unneeded variables. · 4f015589
  Danny Auble authored Oct 05, 2011
  
  4f015589
- BLUEGENE - If removing blocks from system that once existed cleanup of old · 51edcafb
  Danny Auble authored Oct 05, 2011
```
block happens correctly now.
```
  51edcafb
04 Oct, 2011 3 commits
- Correct faq.html numbering · 899b2104
  Morris Jette authored Oct 04, 2011
  
  899b2104
- Major re-write of CPU Management web page · f4102fe7
  Morris Jette authored Oct 04, 2011
```
Major re-write of the CPU Management User and Administrator Guide (web
page) by Martin Perry, Bull.
```
  f4102fe7
- Fix for cray/srun wrapper parsing for some perl version · 4c1e65dd
  Morris Jette authored Oct 03, 2011
  
  4c1e65dd
03 Oct, 2011 1 commit
- BGQ - fix to set up corner correctly for sub block jobs. · b836839f
  Danny Auble authored Oct 03, 2011
  
  b836839f
30 Sep, 2011 4 commits

Do not print error on duplicated select plugins · 820c0f39

Mark A. Grondona authored Sep 29, 2011

PluginDir is a path. It shouldn't be an error to have duplicate
plugins in your path. Plus, the error is not helpful because it
doesn't specify which path is not being loaded. Therefore, just
remove the error and load the first plugin in the path as expected.

820c0f39

Fix bugs in sched/backfill, time limits and QOS · 4df8a986

Morris Jette authored Sep 30, 2011

Fix bugs in sched/backfill with respect to QOS reservation support and job
time limits. Patch from Alejandro Lucero Palau (Barcelona Supercomputer Center).

4df8a986

Clarify use of QOS NoReserve flag in documentation · 75186592
Morris Jette authored Sep 30, 2011

75186592

Fix in GRES cpu count availability for each resources · e0890cd9

Morris Jette authored Sep 30, 2011

Fix to GRES allocation logic when resources are associated with specific
CPUs on a node. Patch from Steve Trofinoff, CSCS.

e0890cd9

29 Sep, 2011 6 commits
- Fix for accounting where your cluster isn't numbered in counting order · 183888a0
  Danny Auble authored Sep 29, 2011
```
(i.e. 1-9,0 instead of 0-9).  The bug would cause 'sacct -N nodename' to
not give correct results on these systems.
```
  183888a0
- BLUEGENE - Fix if running in Static/Overlap mode and full system block · 6b7d41b5
  Danny Auble authored Sep 28, 2011
```
is in an error state, won't deny jobs.
```
  6b7d41b5
- BLUEGENE - Fix minor potential memory leak when setting block error reason. · 7c25f668
  Danny Auble authored Sep 28, 2011
  
  7c25f668
- BLUEGENE - handle reason of blocks in error more correctly between · 01d49db4
  Danny Auble authored Sep 28, 2011
```
restarts of the slurmctld.
```
  01d49db4
- Rename node_ptr to be fe_ptr instead to avoid confusion. · a1ebcb8c
  Danny Auble authored Sep 28, 2011
  
  a1ebcb8c
- BLUEGENE - Update correctly the state in the reason of a block if an · 3a507bc2
  Danny Auble authored Sep 28, 2011
```
admin sets the state to error.
```
  3a507bc2
28 Sep, 2011 4 commits
- Advise use of logrotate · 91e543d4
  Morris Jette authored Sep 28, 2011
```
Advise use of the logrotate tool in order to avoid SLURM log files
from growing too large. Patch from Rod Shultz, Bull.
```
  91e543d4
- Do not treat absense of gres.conf file as fatal error. · 38e0af47
  Morris Jette authored Sep 28, 2011
```
Do not treat the absence of a gres.conf file as a fatal error on systems
configured with GRES, but set GRES counts to zero. These counts can be
Counts can be altered by node_config_load() in the gres plugin.
```
  38e0af47
- Fix for handling QOS limits per user on a reconfig of the slurmctld. · 30ef1e98
  Danny Auble authored Sep 27, 2011
  
  30ef1e98
- Added extra talk from SUG 2011 · 885d17f4
  Danny Auble authored Sep 27, 2011
  
  885d17f4
27 Sep, 2011 1 commit

Allow job owner to use scontrol notify · 141d87a4

Mark A. Grondona authored Sep 26, 2011

The slurmctld code that processes job notify messages unecessarily
restricts these messages to be from the slurm user or root. This
patch allows users to send notifications to their own jobs.

141d87a4

26 Sep, 2011 4 commits
- more cosmetic fixes for the new 4.6 compiler · c960af1e
  Danny Auble authored Sep 26, 2011
  
  c960af1e
- Fix for sview reservation tab when finding correct reservation. · 8532ea9d
  Danny Auble authored Sep 26, 2011
  
  8532ea9d
- Added presentations for SUG 2011 · 87f00843
  Danny Auble authored Sep 26, 2011
  
  87f00843
- Cosmetic mods for GCC v4.6 · 413b1c2c
  Morris Jette authored Sep 26, 2011
```
Many cosmetic modifications to eliminate warning message from GCC version
4.6 compiler, mostly due to unused variables.
```
  413b1c2c
19 Sep, 2011 1 commit
- update NEWS from last checkin · 16fd2265
  Danny Auble authored Sep 19, 2011
  
  16fd2265
17 Sep, 2011 1 commit
- BLUEGENE - Fix for if changing the defined blocks in the bluegene.conf and · 50cafcf7
  Danny Auble authored Sep 16, 2011
```
jobs happen to be running on blocks not in the new config.
```
  50cafcf7
16 Sep, 2011 2 commits

Problem using salloc/mpirun with task affinity socket binding · 98b203d4

Morris Jette authored Sep 15, 2011

salloc/mpirun does not play well together with task affinity socket binding.  The following example illustrates the problem.

[sulu] (slurm) mnp> salloc -p bones-only -N1-1 -n3 --cpu_bind=socket mpirun cat /proc/self/status | grep Cpus_allowed_list
salloc: Granted job allocation 387
--------------------------------------------------------------------------
An invalid physical processor id was returned ...

The problem is that with mpirun jobs Slurm launches only a single task, regardless of the value of -n. This confuses the socket binding logic in task affinity.  The result is that task affinity binds the task to only a single cpu, instead of all the allocated cpus on the socket.  When mpi attempts to bind to any of the other allocated cpus on the socket, it gets the "invalid physical processor id" error. Note that the problem may occur even if socket binding is not explicitly requested by the user.  If task/affinity is configured and the allocated CPUs are a whole number of sockets, Slurm will use "implicit auto binding" to sockets, triggering the problem.
Patch from Martin Perry (Bull).

98b203d4

Describe mechanism to reserve CPUs rather than whole nodes · 7e181113

Morris Jette authored Sep 15, 2011

Update reservation web page to describe mechanism to reserve CPUs rather than whole nodes and provide an example.

7e181113

15 Sep, 2011 2 commits

Avoid prematurely clearing a job's user/admin held reason · 37ca1d1a

Morris Jette authored Sep 15, 2011

Avoid clearing a job's reason from JobHeldAdmin or JobHeldUser when it is
otherwise updated using scontrol or sview commands. Patch based upon work
by Phil Eckert (LLNL).

37ca1d1a

Don't delete backup slurmctld pid file early · 98ff4a6a

Morris Jette authored Sep 15, 2011

Do not remove the backup slurmctld's pid file when it assumes control, only
when it actually shuts down. Patch from Andriy Grytsenko (Massive Solutions
Limited).

98ff4a6a