Commits · 66cfa45ad893bf50d35ea8abfbb4539fa28f21f1 · Manuel G. Marciani / ces_slurm_simulator

21 Mar, 2012 23 commits

spank: refactor initialization code · 66cfa45a

Mark A. Grondona authored Jan 13, 2012

Refactor the post_opt handling code embedded in _spank_init() into
a spank_stack_post_opt() function, then call this in remote context
from a new spank_init_remote() function.

66cfa45a

spank: handle missing plugstack.conf · 3344092a

Mark A. Grondona authored Feb 22, 2012

Instead of trying to handle missing plugstack.conf early in the code,
just treat missing plugstack.conf the same as an empty config.

3344092a

spank: abstract spank_stack initialization code · 443aee4d

Mark A. Grondona authored Jan 06, 2012

Move struct spank_stack initialization code into a spank_stack_init()
function so that it can be called from multiple call sites.

443aee4d

spank: consolidate common code in _do_call_stack · e4e3baab

Mark A. Grondona authored Jan 05, 2012

Simplify code in _do_call_stack() by extracting the case statement
that assigns the current callback symbol into its own function. Since
all spank functions have the same prototype, we can then use the same
code to call _all_ callbacks, greatly reducing the number of lines
of code required.
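
The consolidation described above can be sketched roughly as follows. This is a minimal illustration and not the code in plugstack.c: `enum step`, `struct plugin_ops`, `callback_for()`, `do_call_stack()` and the two sample callbacks are hypothetical names chosen for the example.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical callback identifiers; not the real SPANK enum. */
enum step { STEP_INIT, STEP_INIT_POST_OPT, STEP_FINI };

/* Every callback shares one prototype, so one pointer type covers all. */
typedef int (*callback_f)(void *handle, int argc, char **argv);

/* Hypothetical per-plugin table of resolved symbols (NULL if absent). */
struct plugin_ops {
    callback_f init;
    callback_f init_post_opt;
    callback_f fini;
};

/* The switch only selects the symbol; it no longer performs the call. */
static callback_f callback_for(const struct plugin_ops *ops, enum step s)
{
    switch (s) {
    case STEP_INIT:          return ops->init;
    case STEP_INIT_POST_OPT: return ops->init_post_opt;
    case STEP_FINI:          return ops->fini;
    }
    return NULL;
}

/* A single call site serves every callback type; stops at first failure. */
static int do_call_stack(const struct plugin_ops *stack, size_t n, enum step s)
{
    assert(stack != NULL || n == 0);
    for (size_t i = 0; i < n; i++) {
        callback_f fn = callback_for(&stack[i], s);
        if (fn == NULL)
            continue;               /* plugin does not implement this step */
        int rc = fn(NULL, 0, NULL);
        if (rc < 0)
            return rc;
    }
    return 0;
}

/* Sample callbacks used only to exercise the sketch. */
static int ok_cb(void *h, int argc, char **argv)
{
    (void)h; (void)argc; (void)argv;
    return 0;
}

static int fail_cb(void *h, int argc, char **argv)
{
    (void)h; (void)argc; (void)argv;
    return -1;
}
```

Adding a new callback type in this arrangement means adding one enum value and one `case`, rather than another copy of the calling loop.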

e4e3baab

spank: consolidate checks for spank_get/set/unsetenv calls · 61cd1115

Mark A. Grondona authored Jan 06, 2012

Consolidate common code in spank_getenv, spank_setenv, spank_unsetenv
which checks for validity of the current context, spank handle, etc.

61cd1115

spank: consolidate error checks in job control functions · c3227f9a

Mark A. Grondona authored Jan 05, 2012

Consolidate checks for correct spank context in spank_job_control*
functions to avoid code duplication.

c3227f9a

spank: consolidate globals in plugstack.c · 2eb0b999

Mark A. Grondona authored Jan 05, 2012

The use of globals in plugstack.c is cumbersome and prevents the
future expansion of spank plugins, e.g. calling spank plugins from
multiple contexts within the same process or reinitializing the
spank plugin state.

This patch consolidates the current globals (spank_stack, spank_ctx,
spank_optval, and option_cache) into a global "spank stack" structure
and expands many of the functions internal to plugstack.c to operate
on a struct spank_stack instead of globally.

2eb0b999

spank: fix handling of remote spank_init_post_opt · 3a522459

Mark A. Grondona authored Jan 11, 2012

There was likely a typo/thinko/patcho in the handling of the
return code from _do_call_stack(SPANK_INIT_POST_OPT) in _spank_init
in "remote" context. This error caused spank_init() to always
succeed, since the result of the less-than-zero test is always 0 or 1
and therefore never negative.
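
The commit message does not show the faulty expression, so the following is only an assumed reconstruction of this class of bug. `call_stack_stub()`, `init_buggy()` and `init_fixed()` are hypothetical names. The point is that `<` binds more tightly than `=`, so the variable receives the comparison result instead of the return code.

```c
#include <assert.h>

/* Hypothetical stand-in for a call that reports failure with -1. */
static int call_stack_stub(void)
{
    return -1;
}

/* Buggy form: rc receives the result of the comparison (0 or 1),
 * so the later check for a negative value can never fire. */
static int init_buggy(void)
{
    int rc;
    rc = call_stack_stub() < 0;     /* parsed as rc = (call < 0) */
    if (rc < 0)
        return -1;                  /* unreachable */
    return 0;
}

/* Fixed form: parenthesize the assignment, then compare. */
static int init_fixed(void)
{
    int rc;
    if ((rc = call_stack_stub()) < 0)
        return rc;
    return 0;
}

/* Self-check: the buggy variant hides the failure, the fixed one reports it. */
static void check_precedence(void)
{
    assert(init_buggy() == 0);
    assert(init_fixed() == -1);
}
```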

3a522459

spank: refuse to load the same plugin more than once · 7a60bf95

Mark A. Grondona authored Jan 11, 2012

Avoid loading the same plugin more than once in plugstack.c.
Most likely this will be a configuration error, so we should
catch it early. If the same .so appears in the plugin stack
more than once, it is likely to cause very strange errors,
since dlopen() will only map the library a single time.
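
One simple way to refuse a repeated plugin is to compare each new path against those already accepted. The sketch below assumes plain string comparison of paths; the commit message does not say how plugstack.c decides that two entries are the same, and `stack_add()` is a hypothetical helper.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: append `path` to `loaded` unless it is already
 * present or the table is full. Returns 0 on success, -1 if refused. */
static int stack_add(const char **loaded, size_t *n, size_t cap,
                     const char *path)
{
    assert(loaded != NULL && n != NULL && path != NULL);

    for (size_t i = 0; i < *n; i++) {
        if (strcmp(loaded[i], path) == 0)
            return -1;              /* same plugin listed twice */
    }
    if (*n >= cap)
        return -1;                  /* no room left in the table */

    loaded[(*n)++] = path;
    return 0;
}
```

String comparison will not notice two different paths that resolve to the same file (for example through a symlink), so a stricter check would have to compare something else.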

7a60bf95

change owner of slurmctld and slurmdbd log files · 3470c651

Morris Jette authored Mar 21, 2012

Change the owner of slurmctld and slurmdbd log files to the appropriate
user. Without this change the files will be created by and owned by the
user starting the daemons (likely user root).

3470c651

Merge branch 'slurm-2.3' · e78802d3

Morris Jette authored Mar 21, 2012

e78802d3

CRAY: Fix support for SlurmdTimeout=0 · 4dd9e697

Morris Jette authored Mar 21, 2012

CRAY: Fix support for configuration with SlurmdTimeout=0 (never mark
node that is DOWN in ALPS as DOWN in SLURM).

4dd9e697

Add delay to test for job info propagation · 7636f0f2

Morris Jette authored Mar 21, 2012

7636f0f2

Modify the step completion RPC between slurmd and slurmstepd · ed31e6c7

Morris Jette authored Mar 21, 2012

In the tightly coupled functions slurmd:stepd_completion and
slurmstepd:_handle_completion, a jobacct structure is
sent from the main daemon to the step daemon to provide
the statistics of the child slurmstepd processes and do the aggregation.

The structure is sent using
jobacct_gather_g_{setinfo,getinfo} over a pipe (JOBACCT_DATA_PIPE).
Because {setinfo,getinfo} use a common internal lock, and reading
or writing to a pipe is equivalent to holding a lock, slurmd and
slurmstepd must avoid using both setinfo and getinfo over a
pipe or deadlock can occur. For example:
slurmd(lockforread,write)/slurmstepd(write,lockforread).

This patch removes the call to jobacct_gather_g_setinfo in slurmd
and the call to jobacct_gather_g_getinfo in slurmstepd, ensuring
that slurmd only does getinfo operations over a pipe and slurmstepd
only does setinfo over a pipe. Instead, jobacct_gather_g_{pack,unpack}
are used to marshal/unmarshal the data for transmission over the
pipe.

Patch by Matthieu Hautreux, CEA.

The patch committed here is a variation on the work by Matthieu.
Specifically, the logic is added to slurmstepd to read a new format
of RPC including an RPC version number and buffer with the data
structure. The slurmd however will not send the RPC in the new format
until SLURM version 2.5.
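
The idea of a versioned, pre-packed message can be sketched as below. This is not Slurm's RPC format: `struct acct`, `MSG_VERSION`, `acct_pack()`, `acct_unpack()` and `roundtrip()` are hypothetical, and the record holds two made-up counters. What the sketch shows is that the data is marshalled into a flat buffer before any pipe I/O, so no shared lock needs to be held while a read or write is in progress.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical accounting record; not Slurm's jobacct structure. */
struct acct {
    uint32_t max_rss_kb;
    uint32_t cpu_sec;
};

#define MSG_VERSION 1u
#define MSG_LEN (3 * sizeof(uint32_t))

/* Marshal version + fields into a flat buffer. Host byte order is
 * assumed to be fine because both ends run on the same node. */
static void acct_pack(const struct acct *a, unsigned char *buf)
{
    assert(a != NULL && buf != NULL);
    uint32_t words[3] = { MSG_VERSION, a->max_rss_kb, a->cpu_sec };
    memcpy(buf, words, MSG_LEN);
}

/* Unmarshal; returns -1 if the version is not one we understand. */
static int acct_unpack(const unsigned char *buf, struct acct *a)
{
    uint32_t words[3];
    memcpy(words, buf, MSG_LEN);
    if (words[0] != MSG_VERSION)
        return -1;
    a->max_rss_kb = words[1];
    a->cpu_sec = words[2];
    return 0;
}

/* Round trip through a pipe: the sender only writes an already packed
 * buffer, the receiver only reads and then unpacks. */
static int roundtrip(const struct acct *in, struct acct *out)
{
    int fds[2];
    unsigned char tx[MSG_LEN], rx[MSG_LEN];

    if (pipe(fds) < 0)
        return -1;
    acct_pack(in, tx);
    ssize_t w = write(fds[1], tx, MSG_LEN);
    ssize_t r = (w == (ssize_t)MSG_LEN) ? read(fds[0], rx, MSG_LEN) : -1;
    close(fds[0]);
    close(fds[1]);
    if (r != (ssize_t)MSG_LEN)
        return -1;
    return acct_unpack(rx, out);
}
```

Carrying a version number at the front of the buffer lets a reader accept the new format before any writer starts to send it.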

ed31e6c7

Add possible reason for failure to test · 3bdcf40f

Morris Jette authored Mar 21, 2012

3bdcf40f

Merge branch 'slurm-2.3' · 644fc9a7

Morris Jette authored Mar 21, 2012

644fc9a7

Minor test mods for old RedHat distro · 455283c2

Morris Jette authored Mar 21, 2012

455283c2

Merge branch 'slurm-2.3' · f23f6ccc

Morris Jette authored Mar 21, 2012

f23f6ccc

make test work better on different systems · 47aebf2c

Morris Jette authored Mar 21, 2012

47aebf2c

result of autogen.sh · 304cccb6

Morris Jette authored Mar 20, 2012

304cccb6

Merge branch 'slurm-2.3' · 8c905e93

Morris Jette authored Mar 20, 2012

8c905e93

Modify Makefiles to support Hardening flags · a7e89e72

Morris Jette authored Mar 20, 2012

a7e89e72

Cosmetic mods · fb4cabaa

Morris Jette authored Mar 20, 2012

Replace some " \t" with just "\t" (that's a tab)

fb4cabaa

20 Mar, 2012 7 commits
- Improve support for overlapping reservations · 73351553
  Morris Jette authored Mar 20, 2012
```
Improve support for overlapping advanced reservations.
Patch from Bill Brophy, Bull.
```
  73351553
- Minor updates to PriorityFlags logic and documentation · 264c7fbc
  Morris Jette authored Mar 20, 2012
  
  264c7fbc
- Merge pull request #14 from cfenoy/master · dd3d8b56
  Morris Jette authored Mar 20, 2012
```
Added PriorityFlags configuration parameter
```
  dd3d8b56
- Merge pull request #13 from grondo/2.3-step-memcg-fixes · d835060d
  Morris Jette authored Mar 20, 2012
```
task/cgroup: minor job step memcg fixes
```
  d835060d
- Improve task binding logic · f2fab483
  Morris Jette authored Mar 20, 2012
```
Improve task binding logic by making fuller use of HWLOC library,
especially with respect to Opteron 6000 series processors. Work contributed
by Komoto Masahiro.
```
  f2fab483
- added documentation for PriorityFlags · c915d921
  Carles Fenoy authored Mar 20, 2012
  
  c915d921
- added PriorityFlags parameter to allow configuration of multifactor plugin behavior · bd643a4c
  Carles Fenoy authored Mar 20, 2012
  
  bd643a4c
19 Mar, 2012 1 commit
- Minor tweaks to quickstart admin guide · af4ed738
  Morris Jette authored Mar 19, 2012
  
  af4ed738
18 Mar, 2012 3 commits

task/cgroup: delete job step memcg instead of using force_empty · a93afcd1

Mark A. Grondona authored Mar 17, 2012

The current task/cgroup memory code writes to force_empty at job step
completion and then waits for the release agent to be triggered to
remove the memcg. However, force_empty only causes clean cache pages
to be dropped from the memcg and does not actually move charges to
the parent [1].

This has two unfortunate side-effects. First, pages that can't be
dropped by force_empty are in-use and could stay that way indefinitely
(e.g. system library that is in-use until just after force_empty
completes). Thus, the step memcg never becomes 'empty' and the release
agent is not activated. Second, cached pages that can be freed are
likely associated with the job itself, and those files and libraries
will have to be paged in again for subsequent job steps.

In contrast, calling rmdir(2) on a memcg with no active tasks
causes *all* current charges to move to parent, which is really what
we want in this case. This allows cached libraries and binaries to
stay resident and be associated with the job, and also ensures that
the step memcg is removed immediately as the job step ends.

Thus, this patch replaces the write to force_empty with a call
to xcgroup_delete() on the step memcg, which in turn removes
the memcg with rmdir(2).

The functionality of this patch depends on the previous fix that
uses xcgroup_move_process() to move slurmstepd to the root memcg.
Otherwise, there will be leftover slurmstepd threads in the job
step memcg, and the rmdir will fail with EBUSY.

 [1] Sec 4.3: http://www.kernel.org/doc/Documentation/cgroups/memory.txt

a93afcd1

task/cgroup: use xcgroup_move_process to move slurmstepd to root memcg · 2dd13506

Mark A. Grondona authored Mar 17, 2012

In task_cgroup_memory_fini() the implementation attempts to move
the existing slurmstepd task to the root memory cgroup by writing
the result of getpid(2) to the root memory cgroup's 'tasks' file. This
does not work, however, because slurmstepd is multi-threaded and
thus only the main thread is moved.

This patch replaces the explicit write to 'tasks' with a call to
the new xcgroup_move_process() function, which handles moving all
threads in the process.

2dd13506

xcgroup: add xcgroup_move_process helper function · aa912e4a

Mark A. Grondona authored Mar 17, 2012

This patch adds a helper function to common/xcgroup.c to aid
in moving processes between cgroups. If the cgroup.procs file
is writable, the PID is written to that file, as this
method moves all threads in a process atomically.

If cgroup.procs is not writable, then each thread must be moved
individually by walking the /proc/PID/task/ directory and writing
each task ID to the 'tasks' file in the cgroup. The
second method is racy if a process is concurrently creating
threads, but it is better than the current method of moving
only one of the process's threads.
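
The per-thread fallback described above can be sketched as follows (Linux only). This is not the code in common/xcgroup.c: `list_tids()` and `write_ids()` are hypothetical helpers, and error handling is reduced to a return code. Each ID goes out in its own write(2), because a buffered stream could merge several IDs into one write.

```c
#include <assert.h>
#include <dirent.h>
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <unistd.h>

/* Collect the thread IDs of `pid` by walking /proc/PID/task.
 * Returns the number of IDs stored, or -1 if the directory is unreadable. */
static int list_tids(pid_t pid, pid_t *tids, int max)
{
    assert(tids != NULL && max > 0);

    char path[64];
    snprintf(path, sizeof(path), "/proc/%ld/task", (long)pid);

    DIR *d = opendir(path);
    if (d == NULL)
        return -1;

    int n = 0;
    struct dirent *e;
    while (n < max && (e = readdir(d)) != NULL) {
        char *end;
        long tid = strtol(e->d_name, &end, 10);
        if (end == e->d_name || *end != '\0')
            continue;               /* skips "." and ".." */
        tids[n++] = (pid_t)tid;
    }
    closedir(d);
    return n;
}

/* Write each ID with its own write(2) to a control file such as the
 * cgroup's 'tasks' file. Returns 0 on success, -1 on the first error. */
static int write_ids(const char *file, const pid_t *ids, int n)
{
    int fd = open(file, O_WRONLY);
    if (fd < 0)
        return -1;

    int rc = 0;
    for (int i = 0; i < n && rc == 0; i++) {
        char buf[32];
        int len = snprintf(buf, sizeof(buf), "%ld\n", (long)ids[i]);
        if (write(fd, buf, (size_t)len) != len)
            rc = -1;
    }
    close(fd);
    return rc;
}
```

A thread created between the directory walk and the last write is missed by this approach, which is the race the commit message mentions.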

aa912e4a

16 Mar, 2012 6 commits
- Start v2.4.0-pre5 tag · 0aaf4293
  Morris Jette authored Mar 16, 2012
  
  0aaf4293
- Merge branch 'slurm-2.3' · e26d4656
  Morris Jette authored Mar 16, 2012
```
Conflicts:
	NEWS
```
  e26d4656
- Start NEWS for v2.3.5 · b720f7f1
  Morris Jette authored Mar 16, 2012
  
  b720f7f1
- Update META for v2.4.0-pre4 · 4fdf9e66
  Morris Jette authored Mar 16, 2012
  
  4fdf9e66
- Merge branch 'slurm-2.3' · 740cef6f
  Morris Jette authored Mar 16, 2012
```
Conflicts:
	META
```
  740cef6f
- Update META for v2.3.4 tag · 23052ff3
  Morris Jette authored Mar 16, 2012
  
  23052ff3