1. 12 Oct, 2011 9 commits
    • cgroups: Add new config parameter MinRAMSpace · 6ce0e77b
      Mark A. Grondona authored
      Add a new configuration parameter, MinRAMSpace, which sets a lower bound on
      memory.limit_in_bytes and memory.memsw.limit_in_bytes. This is required in
      case an administrator or user sets an absurdly low memory limit, which could
      otherwise cause slurmstepd to be terminated by the OOM killer.
      
      MinRAMSpace is specified in MB of RAM and defaults to 30 (an arbitrarily
      chosen value).
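      A minimal cgroup.conf sketch of how this might be configured (values are
      illustrative, not recommendations):
      
          # cgroup.conf -- illustrative values only
          ConstrainRAMSpace=yes
          MinRAMSpace=30          # lower bound, in MB, on memory.limit_in_bytes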
    • cgroups: Allow percent values in cgroup.conf to be floating point · fa38c431
      Mark A. Grondona authored
      The use of whole percent values for cgroup.conf parameters such as
      AllowedRAMSpace, MaxRAMPercent, AllowedSwapSpace and MaxSwapPercent
      may be too coarse-grained on systems with large amounts of memory
      (e.g. 1% of 64G is over 650MB).
      
      This patch allows these percentage values to be arbitrary floating
      point numbers, allowing finer-grained tuning of these limits.
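      With this change a fractional percentage can be used where a whole
      percentage point is too coarse (1% of 64G is over 650MB, while 0.1% is
      about 65MB). A sketch, with illustrative values only:
      
          # cgroup.conf -- illustrative values only
          AllowedRAMSpace=99.5    # fractional percentages now accepted
          AllowedSwapSpace=0.5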
    • task/cgroup: Don't create memory cgroups with limit of 0 bytes · e1bb1689
      Mark A. Grondona authored
      Treat a 0-byte memory limit from SLURM as unlimited and instead use
      MaxRAMPercent and MaxSwapPercent as the RAM and swap limits for the job or
      job step. This avoids creating a memory cgroup with limit_in_bytes = 0,
      which would cause the cgroup to OOM before slurmstepd could even be
      started.
      
      This also allows systems in which SLURM isn't explicitly allocating
      memory to use the task/cgroup plugin with ConstrainRAMSpace=yes.
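      For example, a configuration along these lines (illustrative values) now
      works even on systems where SLURM does not allocate memory to jobs:
      
          # cgroup.conf -- illustrative values only
          ConstrainRAMSpace=yes
          MaxRAMPercent=95        # used as the limit when the job's memory limit is 0 (unlimited)
          MaxSwapPercent=95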
    • task/cgroup: Apply MaxRamPercent and MaxSwapPercent to memory cgroups · db99233d
      Mark A. Grondona authored
      Calculate the upper bounds on RAM and swap, in bytes, that may be used by
      any one cgroup, and apply these limits in the task/cgroup code.
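      As a rough illustration of the calculation these commits describe
      (numbers are examples only), with a node RealMemory of 64000MB and
      MaxRAMPercent=95:
      
          upper bound on memory.limit_in_bytes
              = RealMemory * MaxRAMPercent / 100
              = 64000MB * 95 / 100
              = 60800MB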
    • cgroups: Add MaxRAMPercent and MaxSwapPercent config parameters · f8afbebc
      Mark A. Grondona authored
      As a failsafe, we may want to put a hard limit on memory.limit_in_bytes
      and memory.memsw.limit_in_bytes when using cgroups. This patch adds
      MaxRAMPercent and MaxSwapPercent, which are taken as percentages of
      available RAM (RealMemory as reported by slurmd) and applied as upper
      bounds when creating memory controller cgroups.
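      A sketch of how these failsafe caps might sit alongside the existing
      per-job percentage parameter (values illustrative only):
      
          # cgroup.conf -- illustrative values only
          AllowedRAMSpace=100     # existing per-job percentage parameter
          MaxRAMPercent=98        # failsafe cap: percent of the node's RealMemory
          MaxSwapPercent=98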
    • Propagate real_memory_size to slurmstepd at job start · 4cf2f340
      Mark A. Grondona authored
      Add conf->real_memory_size to the list of slurmd_conf_t members that are
      propagated to slurmstepd during job step launch. This makes the amount of
      RAM on the system (as determined by slurmd) available to slurmstepd and
      its plugins without having to recalculate it.
    • task/cgroup: Refactor task_cgroup_memory_create · 941262a3
      Mark A. Grondona authored
      There was some duplicated code in task_cgroup_memory_create. In order
      to facilitate extending this code in the future, refactor it into
      a common function memcg_initialize().
    • cgroups: Support configurable cgroup mount dir in release agent · fa6b256e
      Mark A. Grondona authored
      The example cgroup release agent packaged and installed with
      SLURM assumes a base directory of /cgroup for all mounted
      subsystems. Since the mount point is now configurable in SLURM,
      this script needs to be augmented to determine the location
      of the subsystem mount point at runtime.
    • cgroups: Allow cgroup mount point to be configurable · c9ea11b5
      Mark A. Grondona authored
      The cgroups code currently assumes cgroup subsystems will be mounted
      under /cgroup, which is not ideal in many situations. Add a new
      cgroup.conf parameter to redefine the mount point to an arbitrary
      location (for example, some systems may already have cgroupfs mounted
      under /dev/cgroup or /sys/fs/cgroup).
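      The commit message does not name the new parameter, but conceptually the
      setting would look something like this (parameter name and path are
      illustrative):
      
          # cgroup.conf -- parameter name and path are illustrative
          CgroupMountpoint=/sys/fs/cgroup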
  2. 07 Oct, 2011 1 commit
  3. 05 Oct, 2011 2 commits
  4. 04 Oct, 2011 3 commits
  5. 03 Oct, 2011 1 commit
  6. 30 Sep, 2011 4 commits
  7. 29 Sep, 2011 6 commits
  8. 28 Sep, 2011 4 commits
  9. 27 Sep, 2011 1 commit
    • Allow job owner to use scontrol notify · 141d87a4
      Mark A. Grondona authored
      The slurmctld code that processes job notify messages unnecessarily
      restricts these messages to the slurm user or root. This patch allows
      users to send notifications to their own jobs.
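      For example, after this change a job's owner can send a message to their
      own running job (job id and message text are illustrative):
      
          scontrol notify 12345 "job will be requeued for maintenance"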
  10. 26 Sep, 2011 4 commits
  11. 19 Sep, 2011 1 commit
  12. 17 Sep, 2011 1 commit
  13. 16 Sep, 2011 2 commits
    • Problem using salloc/mpirun with task affinity socket binding · 98b203d4
      Morris Jette authored
      The salloc/mpirun combination does not play well with task affinity socket binding. The following example illustrates the problem.
      
      [sulu] (slurm) mnp> salloc -p bones-only -N1-1 -n3 --cpu_bind=socket mpirun cat /proc/self/status | grep Cpus_allowed_list
      salloc: Granted job allocation 387
      --------------------------------------------------------------------------
      An invalid physical processor id was returned ...
      
      The problem is that with mpirun jobs Slurm launches only a single task,
      regardless of the value of -n. This confuses the socket binding logic in
      task affinity: the task is bound to only a single CPU instead of all the
      allocated CPUs on the socket. When MPI then attempts to bind to any of
      the other allocated CPUs on the socket, it gets the "invalid physical
      processor id" error.
      
      Note that the problem may occur even if socket binding is not explicitly
      requested by the user. If task/affinity is configured and the allocated
      CPUs are a whole number of sockets, Slurm will use "implicit auto
      binding" to sockets, triggering the problem.
      Patch from Martin Perry (Bull).
    • Describe mechanism to reserve CPUs rather than whole nodes · 7e181113
      Morris Jette authored
      Update the reservation web page to describe the mechanism for reserving CPUs rather than whole nodes, and provide an example.
  14. 15 Sep, 2011 1 commit