Commits · ea3d501e7c26b16fe509d4d8294a577035d6057b · Manuel G. Marciani / ces_slurm_simulator

25 Jan, 2017 13 commits
- Add numeric values to enum comments · ea3d501e
  Morris Jette authored Jan 25, 2017
```
No change in logic, just added values to comments in a long enum
  to more easily identify their values
```
  ea3d501e
- Remove rpc_version checks, these must always be true at this point. · 061c9d41
  Tim Wickberg authored Jan 25, 2017
```
First noticed by Brian.
```
  061c9d41
- Merge branch 'slurm-16.05' into slurm-17.02 · db3e3491
  Tim Wickberg authored Jan 25, 2017
  
  db3e3491
- Fix agent retry thread to be detached · e7c58578
  Morris Jette authored Jan 24, 2017
```
It was leaking memory otherwise
```
  e7c58578
- Remove --enable-simulator configure option. · 7a10a6b2
  Tim Wickberg authored Jan 25, 2017
```
Not used here.
```
  7a10a6b2
- Merge branch 'slurm-16.05' into slurm-17.02 · 1f9061dd
  Tim Wickberg authored Jan 24, 2017
  
  1f9061dd
- testsuite - refactor test17.39 and test28.7 to avoid memory enforcement. · 46fa6b0f
  Tim Wickberg authored Jan 24, 2017
```
Commit 63b7e3a8 changed the --mem limit to 1MB for the job if
not using a memory SelectType, but this can cause the job to fail
if the JobAcctGatherFrequency is frequent enough to notice that the
"sleep" command is using more than 1MB of resources.

Refactor test to avoid specifying job memory. Use --wrap to avoid
creating a temporary job script as well.
```
  46fa6b0f
- testsuite - refactor test3.15 to avoid memory enforcement. · 0339efe8
  Tim Wickberg authored Jan 24, 2017
```
Commit 63b7e3a8 changed the --mem limit to 1MB for the step if
not using a memory SelectType, but this can cause the job to fail
if the JobAcctGatherFrequency is frequent enough to notice that the
"sleep" command is using more than 1MB of resources.

Refactor test to avoid specifying job memory. Use --wrap to avoid
creating a temporary job script.
```
  0339efe8
- Merge branch 'slurm-16.05' into slurm-17.02 · 218343b5
  Tim Wickberg authored Jan 24, 2017
  
  218343b5
- testsuite - refactor test2.8 to avoid memory enforcement. · 25918419
  Tim Wickberg authored Jan 24, 2017
```
Commit 63b7e3a8 changed the --mem limit to 1MB for the step if
not using a memory SelectType, but this can cause the job to fail
if the JobAcctGatherFrequency is frequent enough to notice that the
"sleep" command is using more than 1MB of resources.

Refactor test to avoid specifying memory; and since only one
step is checked for, only run a single step in the job. Use --wrap
to avoid creating a temporary job script.
```
  25918419
- Disable test10.5 and test10.13 on non-BlueGene systems. · 0daa9ac1
  Tim Wickberg authored Jan 24, 2017
  
  0daa9ac1
- Merge branch 'slurm-15.08' into slurm-16.05 · 01c18d79
  Tim Wickberg authored Jan 24, 2017
  
  01c18d79
- Disable test10.5 and test10.13 on non-BlueGene systems. · bb2a5aae
  Tim Wickberg authored Jan 24, 2017
  
  bb2a5aae
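
Commit e7c58578 above makes the agent retry thread detached because it otherwise leaked memory. With POSIX threads, a joinable thread that is never joined keeps its resources until `pthread_join()` is called, while a detached thread releases them when it exits. A minimal sketch of creating a detached thread follows; `retry_worker` and `spawn_detached_retry_thread` are hypothetical names used for illustration, not Slurm's actual code.

```c
#include <pthread.h>
#include <stddef.h>

/* Hypothetical worker; stands in for whatever the retry thread does. */
static void *retry_worker(void *arg)
{
	(void) arg;
	return NULL;
}

/*
 * Start retry_worker() as a detached thread.
 * Returns 0 on success or a pthread error number on failure.
 */
int spawn_detached_retry_thread(void)
{
	pthread_attr_t attr;
	pthread_t tid;
	int rc;

	rc = pthread_attr_init(&attr);
	if (rc != 0)
		return rc;

	/* Detached: resources are reclaimed at thread exit, no join needed. */
	rc = pthread_attr_setdetachstate(&attr, PTHREAD_CREATE_DETACHED);
	if (rc == 0)
		rc = pthread_create(&tid, &attr, retry_worker, NULL);

	/* The attribute object is no longer needed once the thread exists. */
	pthread_attr_destroy(&attr);
	return rc;
}
```

Calling `pthread_detach()` on an already running thread would be an alternative; setting the attribute up front simply avoids a window in which the thread is still joinable.
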
24 Jan, 2017 18 commits
- Merge branch 'slurm-17.02' of github.com:schedmd/slurm into slurm-17.02 · 1c5d399b
  Morris Jette authored Jan 24, 2017
  
  1c5d399b
- proctrack/linuxproc bug fix · 9ad6275f
  Morris Jette authored Jan 24, 2017
```
This bug was introduced in commit 93adc329
Symptom is failure of test 2.7
```
  9ad6275f
- Merge branch 'slurm-16.05' into slurm-17.02 · 134952bd
  Tim Wickberg authored Jan 24, 2017
  
  134952bd
- Merge branch 'slurm-15.08' into slurm-16.05 · 877aa679
  Tim Wickberg authored Jan 24, 2017
  
  877aa679
- Fix tests for some configurations · e8bb2944
  Morris Jette authored Apr 21, 2016
```
Some portions of tests 21.30 and 21.34 failed with accounting and
priority basic. These changes disable portions of those tests as
needed based upon configuration.
```
  e8bb2944
- Add missing limits.h include wherever PATH_MAX is used. · f05de490
  Tim Wickberg authored Jan 24, 2017
```
FreeBSD requires this to build; overlooked in 2d9e999f.
```
  f05de490
- Cosmetic changes. No change to logic · 0bd05a3f
  Morris Jette authored Jan 24, 2017
  
  0bd05a3f
- Initialize a variable before use · 4f862588
  Morris Jette authored Jan 24, 2017
  
  4f862588
- Update RELEASE_NOTES · d720be5e
  Morris Jette authored Jan 24, 2017
```
Update information about modified functions and data structures.
```
  d720be5e
- Fix agent retry thread to be detached · 591c72ce
  Morris Jette authored Jan 24, 2017
```
It was leaking memory otherwise
```
  591c72ce
- Improve _allocate_sc performance. · d04b32bd
  Dominik Bartkiewicz authored Jan 24, 2017
```
_allocate_sc() is heavily used within the scheduler, change to
stack allocation from heap to avoid constant churn on xmalloc()/xfree().

Bug 3420.
```
  d04b32bd
- Testsuite: created global proc get_my_id and made tests consistent. · 7dfb7249
  Isaac Hartung authored Jan 20, 2017
  
  7dfb7249
- Merge branch 'slurm-16.05' into slurm-17.02 · c50953fd
  Morris Jette authored Jan 23, 2017
  
  c50953fd
- Fix race condition in a test · ad455b7d
  Morris Jette authored Jan 23, 2017
```
test1.63 was failing periodically due to a race condition. A signal
  was being sent to srun before the signal handler thread was spawned.
```
  ad455b7d
- Rename variable name · 6d5b9a32
  Brian Christiansen authored Jan 23, 2017
```
Too closely related to working_cluster_rec.
```
  6d5b9a32
- Fix coverity memory leaks · d2ef4ee1
  Brian Christiansen authored Jan 23, 2017
```
CID 16088[1-4]
```
  d2ef4ee1
- Add configure tests for builtin clz and ctz functions. · 36453f97
  Tim Wickberg authored Jan 11, 2017
```
Could be used in bit_ffs and bit_fls functions rather than
existing for loops.
```
  36453f97
- Add in auxdir/ax_gcc_builtin.m4 and add check for __builtin_popcountll. · 618a367b
  Tim Wickberg authored Jan 11, 2017
  
  618a367b
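
Commit 36453f97 above notes that the builtin clz and ctz functions could replace the existing for loops in the bit_ffs and bit_fls functions. A rough sketch of that idea on a single word follows, assuming a GCC- or Clang-compatible compiler. The `word_*` helpers are hypothetical and do not reflect Slurm's bitstring API.

```c
#include <limits.h>

/* Index (0-based) of the lowest set bit in word, or -1 if no bit is set. */
int word_ffs(unsigned long long word)
{
	if (word == 0)
		return -1;	/* __builtin_ctzll(0) is undefined */
	return __builtin_ctzll(word);
}

/* Index (0-based) of the highest set bit in word, or -1 if no bit is set. */
int word_fls(unsigned long long word)
{
	if (word == 0)
		return -1;	/* __builtin_clzll(0) is undefined */
	return (int) (sizeof(word) * CHAR_BIT) - 1 - __builtin_clzll(word);
}

/* Loop-based equivalent of word_ffs(), for comparison. */
int word_ffs_loop(unsigned long long word)
{
	for (int bit = 0; bit < (int) (sizeof(word) * CHAR_BIT); bit++) {
		if (word & (1ULL << bit))
			return bit;
	}
	return -1;
}
```

The explicit zero checks matter because both builtins leave the result undefined for a zero argument, which is one reason a configure test for their availability is useful before relying on them.
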
23 Jan, 2017 9 commits
- Replace redefinition of free() with include of stdlib.h. · cab4bd3c
  Tim Wickberg authored Jan 11, 2017
  
  cab4bd3c
- Add the ability to purge rolled up usage from the database. · 122a07bb
  Danny Auble authored Jan 23, 2017
```
Bug 1599
```
  122a07bb
- Make it so the archive files have the table we used instead of just a canned string. · 4bbb8ac5
  Danny Auble authored Jan 23, 2017
  
  4bbb8ac5
- Add new knl.conf parameters to capmc drivers · 0eea2c3d
  Morris Jette authored Jan 23, 2017
```
Add new knl.conf parameter to the capmc_suspend and capmc_resume
  programs. They are not used by those programs, but we need to
  prevent an error if those new parameters are used.
```
  0eea2c3d
- Merge branch 'slurm-16.05' · a692d9c7
  Morris Jette authored Jan 23, 2017
  
  a692d9c7
- For batch step, reset job memory after node boot · 0277629b
  Morris Jette authored Jan 23, 2017
```
Reset a job's memory limit based upon what's available after node
  reboot, which can change on a KNL if the MCDRAM mode is changed
  on reboot
```
  0277629b
- Fix for backfill launch job with reboot · d72b13f2
  Morris Jette authored Jan 23, 2017
```
This bug was likely the root cause of bug 3366. If the backfill scheduler
  allocates resources for a batch job and a node reboot is required, the
  batch launch RPC would be sent to the agent. At that point, there is a
  race condition between the agent and the job_time_limit() function
  testing for boot completion. If the job_time_limit() function ran
  first, it would trigger a second launch RPC request getting sent to
  the agent.
bug 3366
```
  d72b13f2
- Cleaner job configuring logic · f9804256
  Morris Jette authored Jan 23, 2017
```
Clean up logic to test if job is configuring
bug 3366
```
  f9804256
- Avoid launching batch step while configuring · e3a7bdcc
  Morris Jette authored Jan 23, 2017
```
Do not launch a batch step while the job is configuring. Previous
  logic checked for the PrologSlurmctld running, but not nodes
  booting. Checking the job's CONFIGURING state flag will validate
  both.
bug 3366
```
  e3a7bdcc
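
Commit e3a7bdcc above replaces a check that only covered the PrologSlurmctld with a check of the job's CONFIGURING state flag, which covers both the prolog and booting nodes. The idea can be sketched as a single flag that stays set while either condition holds. Everything named here (`JOB_FLAG_CONFIGURING`, `struct job`, both functions) is a hypothetical stand-in for illustration, not Slurm's actual definitions.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical flag and job record; not Slurm's real data structures. */
#define JOB_FLAG_CONFIGURING 0x1u

struct job {
	uint32_t state_flags;
};

/* Keep the flag set while the prolog runs or nodes are still booting. */
void job_set_configuring(struct job *job, bool prolog_running,
			 bool nodes_booting)
{
	if (prolog_running || nodes_booting)
		job->state_flags |= JOB_FLAG_CONFIGURING;
	else
		job->state_flags &= ~JOB_FLAG_CONFIGURING;
}

/* The batch step may launch only once the job is no longer configuring. */
bool can_launch_batch_step(const struct job *job)
{
	return (job->state_flags & JOB_FLAG_CONFIGURING) == 0;
}
```

Testing one flag at launch time means a newly added reason for "still configuring" only has to be handled where the flag is set, not at every place that decides whether to launch.
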