Commits · 536c8451055bfb0fc890a9b11b41f81780a0b7e6 · Manuel G. Marciani / ces_slurm_simulator

10 Mar, 2016 4 commits

Morris Jette authored Mar 09, 2016

Fix Cray NHC spawning on job requeue. Previous logic would leave nodes
allocated to a requeued job as non-usable on job termination.

Specifically, each job has a "cleaning/cleaned" flag. Once a job
terminates, the cleaning flag is set, then after the job node health
check completes, the value gets set to cleaned. If the job is requeued,
on its second (or subsequent) termination, the select/cray plugin
is called to launch the NHC. The plugin sees the "cleaned" flag
already set, it then logs:
error: select_p_job_fini: Cleaned flag already set for job 1283858, this should never happen
and returns, never launching the NHC. Since the termination of the
job NHC triggers releasing job resources (CPUs, memory, and GRES),
those resources are never released for use by other jobs.

Bug 2384

536c8451

Correctly parse nids in slurmconfgen_smw.py · e050806e

David Gloe authored Mar 09, 2016

An error in slurmconfgen_smw.py caused it to parse the nic as the nid.
On some systems those values differ, causing the generated slurm.conf file to
be incorrect.

Bug 2532.

e050806e

Fix route/topology plugin to prevent segfault in sbcast. · 0dfc924c

Bill Brophy authored Mar 08, 2016

route_p_split_hostlist was not thread-safe, and would cause
one of several segfaults depending on where in the initialization
code each thread was.

Bug 2495.

0dfc924c

Fix displayed value for RoutePlugin. · db8491f1
Tim Wickberg authored Mar 08, 2016
```
Was incorrectly displaying "(null)" even when loaded successfully.
```
db8491f1

07 Mar, 2016 1 commit

Added per job array task dependencies · c8dd9790

Dominik Bartkiewicz authored Mar 07, 2016

Added new job dependency type of "aftercorr" which will start a task of a
    job array after the corresponding task of another job array completes.
bug 2460

c8dd9790

05 Mar, 2016 2 commits
- Make it so jobs/steps track ':' named gres/tres, before hand gres/gpu:tesla · 0cd69296
  Danny Auble authored Mar 04, 2016
```
would only track gres/gpu, now it will track both gres/gpu and
gres/gpu:tesla as separate gres if configured like
AccountingStorageTRES=gres/gpu,gres/gpu:tesla
```
  0cd69296
- Fixed double read lock on getting job's gres/tres. · b23a57cf
  Danny Auble authored Mar 04, 2016
  
  b23a57cf
04 Mar, 2016 3 commits
- Fix issue where steps weren't always getting the gres/tres involved. · b294f81b
  Danny Auble authored Mar 04, 2016
  
  b294f81b
- Fix NEWS entry. · d2b913a2
  Brian Christiansen authored Mar 03, 2016
```
Continuation of 31225a82
```
  d2b913a2
- Fix for tasks being packed onto core when --ntasks-per-core=1 and --cpus-per-task > threads. · b11ec103
  Brian Christiansen authored Mar 03, 2016
```
Bug 2430
```
  b11ec103
03 Mar, 2016 5 commits

Defer slurmd registration until NodeHealthCheck · 7fb0c981

Thomas Hamel authored Mar 03, 2016

We want to introduce a new behavior in the way slurmd uses the
HealthCheckProgram. The idea is to avoid a race condition between the
first HealthCheckProgram run and the node accepting jobs. The slurmd
daemon will initialize and then loop on HealthCheckProgram execution
before registering with slurmctld. It will stay in this loop until
the HealthCheckProgram returns successfully (the node is still DOWN).

On our clusters we are using NHC as an HealthCheckProgram. NHC drains
the node if it fails and remove the drain if it is successfull, this
behavior fits well with our purpose. This behavior permits us to start
slurmd at boot without setting up a complex boot sequence in the init
system, slurmd just wait for the node to be ready before registering.

The HealthCheckProgram is not run during slurmd startup if
HealthCheckInteval is 0.

7fb0c981

Fix issue with sbcast not doing a correct fanout. · 72f13426
Danny Auble authored Mar 03, 2016

72f13426
Fix getting reservations to database when database is down. · 5c43d754
Brian Christiansen authored Mar 03, 2016
```
Bug 2507
```
5c43d754

Increase step GRES variable size · 7f0bdc84

Morris Jette authored Mar 03, 2016

Step GRES value changed from type "int" to "int64_t" to support larger
values. Previous logic could fail in step allocation values over 32-bits.
Other GRES values are 64-bit.

7f0bdc84

Force close on exec on first 256 file descriptors when launching a · f502f1e5

Danny Auble authored Mar 02, 2016

slurmstepd to close potential open ones.

It was pointed out the slurmd using acct_gather_energy/ipmi links to
freeipmi which could possibly open /dev/ipmi0 without the close on exec
flag set as root while launching a step leaving it open in the users app.

What this does is sets the flag on the first 256 to mitigate the concern.

Reported by Maksym Planeta.

Bug 2506

f502f1e5

02 Mar, 2016 4 commits
- Backfill scheduler to validate correct job partition · efd9d35e
  Gary B Skouson authored Mar 02, 2016
```
Previous logic tested whatever the job's partition pointer indicated
rather than the partition we are trying to run the job in. This bug
was introduced in Slurm version 15.08.5, Nov 16, 2015, commit
94f0e948
bug 2499
```
  efd9d35e
- Update documentation for change default cgroup mount of /sys/fs/cgroup · 7da25924
  Tim Wickberg authored Mar 02, 2016
  
  7da25924
- Remove a duplicate xmalloc · 2d5066e7
  Thomas Cadeau authored Feb 26, 2016
  
  2d5066e7
- Confirm PowerSave mode with node_features/knl_cray · 3e0734b6
  Morris Jette authored Mar 01, 2016
```
Check that PowerSave mode configured for node_features/knl_cray plugin.
    It is required to reconfigure and reboot nodes. Fatal error if
    not configured.
```
  3e0734b6
01 Mar, 2016 4 commits

Remove BEGIN_C_DECLS and END_C_DECLS macros. · 1434364d

Tim Wickberg authored Mar 01, 2016

src/common/mapping.h was the one place outside of slurm/*h that used this,
just remove it from there.

Replace macro with #ifdef __cplusplus in slurm/*h in case anyone is linking
C++ against libslurm.

1434364d

Remove PARAMS macro from function definitions. · 6ad00816

Tim Wickberg authored Feb 29, 2016

Macro hasn't been used consistently for three+ years, and is protecting against
compilation by non-ANSI C compilers which has not been a concern for quite some
time. Cleanup formatting of function declarations while here.

No change to logic.

6ad00816

Update NEWS as well. · a058ff4a
Tim Wickberg authored Mar 01, 2016

a058ff4a

Defer suspend until launch completes · 52fe3de1

Morris Jette authored Feb 29, 2016

Insure that a job is completely launched before trying to suspend it.
Previous logic would start suspend logic early in the life of the
slurmstepd process, after it's listening socket was open but before
the tasks were launched. This defers the suspend logic until after
all prologs and setup completes and the tasks are launched. This is
important in the case of gang scheduling, in which newly launched
jobs can be immediately suspended.
bug 2494

52fe3de1

29 Feb, 2016 1 commit
- Avoid superfluous Weight=1 lines in generated config. · fc619ba0
  Tim Wickberg authored Feb 27, 2016
```
Default value is 1. Weight is uint32_t so this check was always succeeding.
```
  fc619ba0
27 Feb, 2016 1 commit
- Add node_features_p_user_update() function to node_features plugin · b258d39a
  Morris Jette authored Feb 26, 2016
  
  b258d39a
26 Feb, 2016 5 commits
- Set correct reason when a QOS' MaxTresMins is violated. · 745568f2
  Danny Auble authored Feb 26, 2016
  
  745568f2
- knl_cray.conf - Added ALlowUserBoot option · 0ed49348
  Morris Jette authored Feb 26, 2016
  
  0ed49348
- Add AllowMCDRAM and AllowNUMA to knl_cray.conf · 4eff6331
  Morris Jette authored Feb 26, 2016
  
  4eff6331
- Add not to slurm.conf man page about SallocDefaultCommand and TaskPlugins. · b5b349b0
  Tim Wickberg authored Feb 25, 2016
```
Add note to slurm.conf man page about setting "--cpu_bind=no" as part
of SallocDefaultCommand if a TaskPlugin is in use.
```
  b5b349b0
- Revert call to getaddrinfo · a232cda3
  Morris Jette authored Feb 25, 2016
```
Revert call to getaddrinfo, restoring gethostbyaddr (introduced in Slurm
    16.05.0pre1) which was failing on some systems. Specifically test7.2
    was failing on some systems with getaddrinfo() returning an error of
    "System error: Resource temporarily unavailable". Partial reversion
    of commit 89621f65
```
  a232cda3
25 Feb, 2016 2 commits

Fix issue where SocketsPerBoard didn't translate to Sockets when CPUS= · fcae2193
Danny Auble authored Feb 24, 2016
```
was also given.
```
fcae2193

Split partition's "Priority" field · 7844563c

Morris Jette authored Feb 24, 2016

Split partition's "Priority" field into "PriorityTier" (used to order
    partitions for scheduling and preemption) plus "PriorityJobFactor" (used by
    priority/multifactor plugin in calculating job priority, which is used to
    order jobs within a partition for scheduling).
bug 2479

7844563c

24 Feb, 2016 5 commits
- Make it so scontrol update part qos= will take away a partition QOS from · 3a7470ae
  Danny Auble authored Feb 24, 2016
```
a partition.
```
  3a7470ae
- Make it possible to change CPUsPerTask with scontrol. · de28c13a
  Danny Auble authored Feb 24, 2016
```
This also reverts most of commit fa331e30 as well as commit bd9fa830
which would try to set the pn_min_cpus every time a job was updated.
If a job didn't request node counts then they were hosed.

This commit takes away the magic which was screwing things up.  Now the
person gets what they asked for without magic changing things.

Bug 2302
Bug 2742
Bug 2478
```
  de28c13a
- Fix issue where when updating a job the pn_min_cpus was updated · bd9fa830
  Danny Auble authored Feb 24, 2016
```
erroneously.
```
  bd9fa830
- BGQ - Tighter locks around structures when nodes/cables change state. · c5925f41
  Danny Auble authored Feb 23, 2016
  
  c5925f41
- BGQ - Remove redeclaration of job_read_lock. · fd3dedda
  Danny Auble authored Feb 23, 2016
  
  fd3dedda
23 Feb, 2016 1 commit

Fix issue with resizing jobs and limits not be kept track of correctly. · 92ac0dcd

Danny Auble authored Feb 22, 2016

This whole process could probably be done better by keeping track of
old values and new values and only calling one function instead of a
pre and post function, but that can probably wait for future generations
of the code as it works now and is probably adequate for the time being.

Bug 2352

92ac0dcd

22 Feb, 2016 1 commit
- Start NEWS for v 16.05.0-pre2 · 4a6acaf6
  Morris Jette authored Feb 22, 2016
  
  4a6acaf6
19 Feb, 2016 1 commit

BurstBuffer/cray pre-run race condtition fix · e8959ae9

Morris Jette authored Feb 19, 2016

BurstBuffer/cray - Defer job cancellation or time limit while "pre-run"
    operation in progress to avoid inconsistent state due to multiple calls
    to job termination functions.
bug 2454

e8959ae9