- 29 Dec, 2014 2 commits
-
Danny Auble authored
-
David Bigagli authored
-
- 26 Dec, 2014 1 commit
-
Jason Bacon authored
-
- 24 Dec, 2014 3 commits
-
Morris Jette authored
Enable per-partition gang scheduling resource resolution (e.g. the partition can have SelectTypeParameters=CR_CORE, while the global value is CR_SOCKET). bug 1299
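A hedged slurm.conf sketch of the per-partition override described above (node and partition names are hypothetical):

    SelectType=select/cons_res
    SelectTypeParameters=CR_Socket
    # This partition resolves resources at core granularity and gang schedules
    PartitionName=gang Nodes=n[1-8] SelectTypeParameters=CR_Core PreemptMode=GANG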
-
Morris Jette authored
Properly enforce partition Shared=YES option. Previously oversubscribing resources required gang scheduling to also be configured.
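A minimal hedged example of the option in question (names hypothetical); Shared=YES:4 allows up to four jobs to oversubscribe each resource, now without gang scheduling having to be configured:

    PartitionName=shared Nodes=n[1-8] Shared=YES:4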
-
Morris Jette authored
Prevent invalid job array task ID value if a task is started using gang scheduling (i.e. the task starts in a SUSPENDED state). Previously the task ID was set to NO_VAL and the task string was also cleared.
-
- 23 Dec, 2014 3 commits
-
Morris Jette authored
Prevent invalid job array task ID value if a task is started using gang scheduling (i.e. the task starts in a SUSPENDED state). Previously the task ID was set to NO_VAL and the task string was also cleared.
-
Morris Jette authored
Prevent a manually suspended job from being resumed by the gang scheduler once free resources become available. bug 1335
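For context, manual suspension is driven through scontrol (job id hypothetical); the fix keeps the gang scheduler from undoing the first command:

    scontrol suspend 1234
    scontrol resume 1234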
-
Dorian Krause authored
We have hit the following problem, which seems to be present in Slurm slurm-14-11-2-1 and previous versions. When a node is reserved and an overlapping maint reservation is created and later deleted, the scontrol output will report the node as IDLE rather than RESERVED:

+ scontrol show node node1
+ grep State
State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1
+ scontrol create reservation starttime=now duration=120 user=usr01000 nodes=node1 ReservationName=X
Reservation created: X
+ sleep 10
+ scontrol show nodes node1
+ grep State
State=RESERVED ThreadsPerCore=1 TmpDisk=0 Weight=1
+ scontrol create reservation starttime=now duration=120 user=usr01000 nodes=ALL flags=maint,ignore_jobs ReservationName=Y
Reservation created: Y
+ sleep 10
+ grep State
+ scontrol show nodes node1
State=MAINT ThreadsPerCore=1 TmpDisk=0 Weight=1
+ scontrol delete ReservationName=Y
+ sleep 10
+ scontrol show nodes node1
+ grep State
*State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1*
+ scontrol delete ReservationName=X
+ sleep 10
+ scontrol show nodes node1
+ grep State
State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1

Note that after the deletion of reservation "Y" the node is State=IDLE instead of State=RESERVED. I think that the delete_resv() function in slurmctld/reservation.c should call set_node_maint_mode(true) like update_resv() does. With the patch pasted at the end of this e-mail I get the following output, which matches my expectation:

+ scontrol show node node1
+ grep State
State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1
+ scontrol create reservation starttime=now duration=120 user=usr01000 nodes=node1 ReservationName=X
Reservation created: X
+ sleep 10
+ scontrol show nodes node1
+ grep State
State=RESERVED ThreadsPerCore=1 TmpDisk=0 Weight=1
+ scontrol create reservation starttime=now duration=120 user=usr01000 nodes=ALL flags=maint,ignore_jobs ReservationName=Y
Reservation created: Y
+ sleep 10
+ scontrol show nodes node1
+ grep State
State=MAINT ThreadsPerCore=1 TmpDisk=0 Weight=1
+ scontrol delete ReservationName=Y
+ sleep 10
+ scontrol show nodes node1
+ grep State
*State=RESERVED ThreadsPerCore=1 TmpDisk=0 Weight=1*
+ scontrol delete ReservationName=X
+ sleep 10
+ scontrol show nodes node1
+ grep State
State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1

Thanks, Dorian
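A minimal self-contained sketch of the proposed fix, with names taken from the message; the helper and the exact signatures are assumptions, since the actual patch is not reproduced here:

    /* Hypothetical sketch of the fix described above. The real functions
     * live in slurmctld/reservation.c; everything here is illustrative. */
    #include <stdbool.h>

    /* Stand-in for the slurmctld internal named in the message: recompute
     * each node's MAINT/RESERVED flags from the reservations that remain. */
    static void set_node_maint_mode(bool reset_all)
    {
    	(void) reset_all;
    }

    /* Hypothetical helper: remove the reservation's record. */
    static int remove_reservation_record(const char *resv_name)
    {
    	(void) resv_name;
    	return 0;
    }

    int delete_resv(const char *resv_name)
    {
    	int rc = remove_reservation_record(resv_name);

    	/* The proposed fix: refresh node state flags after deletion,
    	 * exactly as update_resv() already does, so a node still covered
    	 * by another reservation goes back to RESERVED, not IDLE. */
    	set_node_maint_mode(true);
    	return rc;
    }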
-
- 22 Dec, 2014 2 commits
-
Daniel Ahlin authored
Correct parsing of AccountingStoragePass when specified in the old format (just a path name)
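A hedged illustration of the old format being fixed here (the path, naming a MUNGE socket, is hypothetical):

    AccountingStoragePass=/var/run/munge/global.socket.2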
-
Rémi Palancher authored
On MPI job initialization through PMI, Intel MPI calls PMI_KVS_Put() many, many times from the task at rank 0, and each of these calls is followed by PMI_KVS_Commit(). Slurm's implementation of PMI_KVS_Commit() imposes a delay to avoid a DDoS against the originating srun. This delay is proportional to the total number of tasks and can reach about 3 seconds for large jobs, e.g. with 7168 tasks. Therefore, when Intel MPI calls PMI_KVS_Commit() 475 times from the task at rank 0 (measured on a test case), 28 minutes are spent in the delay function while all other tasks in the job wait on a PMI_Barrier. Since only this single task 0 is committing, there is no DDoS risk from it. The patch alters the delay calculation to make sure the task at rank 0 is never delayed; all other tasks are spread globally across the same time range as before.
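A self-contained sketch of the adjusted delay policy; the names and the exact formula are assumptions for illustration, not the actual Slurm patch:

    #include <stdio.h>

    #define MAX_DELAY_USEC 3000000	/* ~3 s window for very large jobs */

    /* Per-task commit delay in microseconds: rank 0 is never delayed,
     * since its serial commits cannot flood srun; the remaining tasks
     * are staggered across the same window as before. */
    static unsigned int commit_delay_usec(int rank, int ntasks)
    {
    	if (rank == 0)
    		return 0;
    	return (unsigned int)(((long long)rank * MAX_DELAY_USEC) / ntasks);
    }

    int main(void)
    {
    	printf("rank 0: %u us, rank 3584: %u us\n",
    	       commit_delay_usec(0, 7168), commit_delay_usec(3584, 7168));
    	return 0;
    }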
-
- 20 Dec, 2014 3 commits
-
Nathan Yee authored
-
Danny Auble authored
of Slurm daemons. The slurmstepd still needs to be fixed, which most likely can't happen until 15.08.
-
David Bigagli authored
-
- 19 Dec, 2014 4 commits
-
Danny Auble authored
of Slurm daemons.
-
David Bigagli authored
and SLURM_JOB_RESERVATION in the batch job.
-
Danny Auble authored
but then sets CPUs to only represent the number of cores on the node.
-
Danny Auble authored
-
- 17 Dec, 2014 2 commits
-
Brian Christiansen authored
Bug 1327
-
Danny Auble authored
doesn't request a number of tasks.
-
- 16 Dec, 2014 2 commits
-
Morris Jette authored
Fix job array hash table bug that could result in a slurmctld infinite loop or an invalid memory reference. bug 1309
-
David Bigagli authored
-
- 12 Dec, 2014 4 commits
-
Morris Jette authored
If a master job array record is complete, then consider all of its pending tasks complete as well. This problem happened when a master job array record was pending (i.e. had pending tasks) and was cancelled. The result previously was a job record that was not visible to squeue/scontrol but still occupied memory. The same type of problem occurred with a dependency on a job array that was cancelled.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- 11 Dec, 2014 8 commits
-
Danny Auble authored
If a QOS was added for a job and then removed, and that QOS happened to have the largest QOS id, then restarting the slurmctld before the job was flushed out could mess things up.
-
David Bigagli authored
-
Morris Jette authored
Log how many nodes are removed from consideration for jobs due to advanced reservations. Change the user error message to indicate that required nodes might be down, drained or (this bit is new) reserved.
-
Morris Jette authored
In proctrack/linuxproc and proctrack/pgid, check the result of strtol() for an error condition rather than checking errno, which might hold a vestigial error code.
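A short sketch of the checking pattern implied here, assuming nothing about the actual plugin code: validate strtol() through its end pointer and a freshly cleared errno, rather than trusting whatever errno held before the call.

    #include <errno.h>
    #include <stdlib.h>

    /* Parse a PID from a string, returning -1 on error. */
    long parse_pid(const char *str)
    {
    	char *end = NULL;
    	long val;

    	errno = 0;			/* clear any vestigial error code */
    	val = strtol(str, &end, 10);
    	if ((end == str) || (*end != '\0'))
    		return -1;		/* no digits, or trailing junk */
    	if (errno == ERANGE)
    		return -1;		/* value out of range */
    	return val;
    }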
-
Danny Auble authored
correctly.
-
Danny Auble authored
accounting_storage/filetxt.
-
Morris Jette authored
The task_dist_states variable has been split into "flags" and "base" components. Added SLURM_DIST_PACK_NODES and SLURM_DIST_NO_PACK_NODES values to give users greater control over task distribution. The srun --dist option has been modified to accept "Pack" and "NoPack" options. These options can be used to override the CR_PACK_NODE configuration option.
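A hedged usage sketch of the new options (option spelling as given in this message; the program name is hypothetical):

    srun --ntasks=16 --dist=block,Pack ./a.out     # pack tasks onto as few nodes as possible
    srun --ntasks=16 --dist=block,NoPack ./a.out   # spread tasks across all allocated nodes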
-
Danny Auble authored
to have all the limits a QOS has. If a limit is set in both QOSes, the partition QOS will override the job's QOS unless the job's QOS has the "PartitionQOS" flag set.
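A hedged sketch of attaching a QOS to a partition in slurm.conf (names hypothetical; the QOS itself and its limits would be defined with sacctmgr):

    PartitionName=debug Nodes=n[1-4] QOS=part_debug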
-
- 09 Dec, 2014 2 commits
-
Morris Jette authored
-
Danny Auble authored
when running from cache.
-
- 08 Dec, 2014 4 commits
-
David Bigagli authored
-
Brian Christiansen authored
Bug 1305
-
Morris Jette authored
Fix bug with GRES having multiple types that could cause a slurmctld abort. This can be reproduced with select/cons_res and one GRES configured like this: Name=gpu Type=kepler File=/dev/tty0. A bad index was being used, which triggered an assert.
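A hedged gres.conf sketch of a multiple-type GRES setup like the one that triggers the bug (device paths hypothetical):

    Name=gpu Type=kepler File=/dev/nvidia0
    Name=gpu Type=tesla File=/dev/nvidia1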
-
David Bigagli authored
-