Commits · cc6f7e4ca81b439cf28ba1e3c0a75662295d8296 · Manuel G. Marciani / ces_slurm_simulator

01 Feb, 2017 3 commits
- Fix typo · cc6f7e4c
  pamxl authored Feb 01, 2017
  
  cc6f7e4c
- Fix typo in burst_buffer.conf man page · 9ceec561
  Morris Jette authored Feb 01, 2017
  
  9ceec561
- job_submit/lua - Make "immediate" parameter available. · 838702db
  Chansup Byun authored Feb 01, 2017
  
  838702db
31 Jan, 2017 12 commits
- Update NEWS for next 17.02 version · b8734dfa
  Danny Auble authored Jan 31, 2017
  
  b8734dfa
- Merge remote-tracking branch 'origin/slurm-16.05' into slurm-17.02 · a9190591
  Danny Auble authored Jan 31, 2017
```
# Conflicts:
#	META
#	NEWS
```
  a9190591
- NEWS update for next 16.05 version · b8aecfd0
  Danny Auble authored Jan 31, 2017
  
  b8aecfd0
- Update META for 17.02.0-0rc1 tag · 4e3b81e9
  Danny Auble authored Jan 31, 2017
  
  4e3b81e9
- Merge remote-tracking branch 'origin/slurm-16.05' into slurm-17.02 · 2a662e06
  Danny Auble authored Jan 31, 2017
  
  2a662e06
- Update META for v16.05.9 tag · abfd9995
  Danny Auble authored Jan 31, 2017
  
  abfd9995
- Fix minor memory leak in jobcomp · 528bfcd6
  Danny Auble authored Jan 31, 2017
  
  528bfcd6
- Add new bit_and_not() function to bitstring.c. · 28e1c4d9
  Dominik Bartkiewicz authored Jan 30, 2017
```
Use instead of the repeated construction of:
bit_not(b); bit_and(a, b); bit_not(b);
to avoid two addition iterations.
```
  28e1c4d9
- Make sure acct policy limits imposed on a job are correct after requeue. · 5d941217
  Alejandro Sanchez authored Jan 31, 2017
  
  5d941217
- Revert part of 63698ae5 · 83dd027f
  Brian Christiansen authored Jan 30, 2017
```
Can't set the job_id of a will_run will_run job prior to creating the
job. Invalidates the jobs for non-slurmuser jobs. Also not burning a
jobid could make the will_run job not be distinguishable in the logs.

Will handle the federation aspect of this in 17.11.

Bug 3436
```
  83dd027f
- If dealing with a lot of jobs and many need to be timed out after 3 seconds drop out of the · 059275f6
  Dominik Bartkiewicz authored Jan 30, 2017
```
job_write_lock and sleep a second before grabbing it again.  This should probably be made
configurable later on.
```
  059275f6
- Make duplicate code an extern function. · 87b86eef
  Danny Auble authored Jan 30, 2017
  
  87b86eef
30 Jan, 2017 9 commits

Merge remote-tracking branch 'origin/slurm-16.05' into slurm-17.02 · 97c485bf
Danny Auble authored Jan 30, 2017
```
# Conflicts:
#	src/slurmctld/proc_req.c
```
97c485bf
Fixed another minor memory leak. · 63051cab
Danny Auble authored Jan 30, 2017

63051cab
Note about gethostbyname and a minor (unavoidable) memory leak. · 37bae248
Danny Auble authored Jan 30, 2017

37bae248

Danny Auble authored Jan 30, 2017

e3a7bdcc
f9804256
d72b13f2

Reference bug 3366

If you are running on a Bluegene system we rely on the prolog to take us out of configuring
state.  These commits work good for system rebooting the nodes where the prolog is running,
but in the case of Bluegene this is the opposite desire :).   These commits on a Bluegene
pretty much make it so a batch job never gets launched.

a4c51165

Set SLURM_JOB_GPUS env var for Prolog · 96d29749
Morris Jette authored Jan 30, 2017
```
Properly set SLURM_JOB_GPUS environment variable for Prolog.
bug 3437
```
96d29749
Merge branch 'slurm-16.05' into slurm-17.02 · 57ab5343
Morris Jette authored Jan 30, 2017

57ab5343

Clear job BeginTime reason · 0abbf727

Morris Jette authored Jan 30, 2017

Clear job's reason of "BeginTime" in a more timely fashion and/or prevents
    them from being stuck in a PENDING state. There are multiple ways of
    clearing the reason, especially on a lightly loaded system, but the
    state can persist indefinitely on a heavily loaded system.
bug 3368

0abbf727

Fix minor memory leak in cray bb. · 4b0e96d7
Danny Auble authored Jan 30, 2017

4b0e96d7

will_run fix for job with begin time in past · f75abc9c

Morris Jette authored Jan 30, 2017

Fix to logic for getting expected start time of existing job ID with
explicit begin time that is in the past. Previous logic would
compare that (past) begin time with advanced reservations that
would compete with it rather than the current time.

f75abc9c

29 Jan, 2017 9 commits

Merge branch 'slurm-15.08' into slurm-16.05 · 76ca2ce7
Morris Jette authored Jan 29, 2017

76ca2ce7

Avoid test failure on Cray · c75c6d71

Morris Jette authored Jan 29, 2017

On cray systems with step NHC, the step launches are delayed and
  produce a pair of messages (below) that caused the test to fail:
  srun: Job step creation temporarily disabled, retrying
  srun: Job step created

c75c6d71

Merge branch 'slurm-16.05' into slurm-17.02 · c1dbb786
Morris Jette authored Jan 29, 2017

c1dbb786
Add delay for job burst buffer purge · 9834b29b
Morris Jette authored Jan 29, 2017

9834b29b

Backport of RELEASE_NOTES file · 8208793a

Morris Jette authored Jan 29, 2017

The v17.02 updates appear to have only gone into master rather than
  the slurm-17.02 branch

8208793a

Merge branch 'slurm-16.05' into slurm-17.02 · 0c1f3cf5
Morris Jette authored Jan 29, 2017

0c1f3cf5
Fix some tests for Cray · a6eb2d43
Morris Jette authored Jan 29, 2017

a6eb2d43

task/cray configuration ordering bug · 0545f523

Morris Jette authored Jan 29, 2017

CRAY systems only: TaskPlugins must list task/cgroup before task/cray in
order for the cgroup files to be created before task/cray runs. Without
this change, the task/cray plugin frequently produces errors about the
"mems" file being missing. The errors don't seem consistent, so this
probably involves a race condition. Note that NERSC uses this order
today and I changed read_config.c to produce a fatal error if the order
is reversed.

0545f523

Log error if task/cray not preceeded by task/cgroup · 34b0f61d
Morris Jette authored Jan 29, 2017
```
Failure to do so results in a bunch of task/cray errors about
  not finding the cgroup set up.
```
34b0f61d

28 Jan, 2017 7 commits
- Merge branch 'slurm-16.05' into slurm-17.02 · 541957bd
  Morris Jette authored Jan 28, 2017
  
  541957bd
- Merge branch 'slurm-15.08' into slurm-16.05 · 24bca89c
  Morris Jette authored Jan 28, 2017
  
  24bca89c
- Fix test for down nodes · 3d050969
  Morris Jette authored Jan 28, 2017
```
Avoid a test failing of all nodes in a partition are not usable
  (down, drained, reserved, or otherwise unusable).
```
  3d050969
- Merge branch 'slurm-16.05' into slurm-17.02 · 63cadb03
  Morris Jette authored Jan 28, 2017
  
  63cadb03
- Merge branch 'slurm-15.08' into slurm-16.05 · 5453fb87
  Morris Jette authored Jan 28, 2017
  
  5453fb87
- Fix tests to work properly on native Cray · f0814910
  Morris Jette authored Jan 28, 2017
```
Disable test if underlying select/linear use
```
  f0814910
- Avoid stray error files in test · 05b404d8
  Morris Jette authored Jan 28, 2017
```
Modify qsub test to explicitly create and destroy the error files
  to avoid leaving around a bunch of error files (even if they are
  normally empty).
```
  05b404d8