Commits · 02d6f6a51d6bb791d8049092a1f2358af691291b · Manuel G. Marciani / ces_slurm_simulator

31 Jan, 2015 1 commit
- Fix database resources so they can add new clusters to them after they have · da8409c0
  Danny Auble authored Jan 30, 2015
```
initially been added.
```
  da8409c0
30 Jan, 2015 2 commits
- Use the slurm_getpwuid_r wrapper. · 7644f970
  David Bigagli authored Jan 30, 2015
  
  7644f970
- Remove minor warning when compiling slurmstepd. · 1bbce7d8
  Danny Auble authored Jan 30, 2015
  
  1bbce7d8
28 Jan, 2015 3 commits
- Fix file name substitution for job's stderr. · fda051cb
  David Bigagli authored Jan 28, 2015
  
  fda051cb
- Update NEWS · 60ce95ce
  Brian Christiansen authored Jan 28, 2015
  
  60ce95ce
- Don't log error message about removing nonexistent cpuset dir. · b37abc7c
  Brian Christiansen authored Jan 27, 2015
```
Bug 1397
```
  b37abc7c
27 Jan, 2015 1 commit
- Fix Slurmdb::clusters_get() in perl api from not returning information. · 72de52fd
  Brian Christiansen authored Jan 26, 2015
```
Bug 1384
```
  72de52fd
26 Jan, 2015 1 commit
- allow --ignore-pbs as #SBATCH argument · 39db24b1
  Aaron Knister authored Jan 26, 2015
  
  39db24b1
23 Jan, 2015 1 commit
- Remove misleading non-consumable GRES logging · b46179f6
  Dorian Krause authored Jan 23, 2015
  
  b46179f6
22 Jan, 2015 1 commit
- Fix perl api tests for libslurmdb to work correctly. · 0bc3c68b
  Danny Auble authored Jan 22, 2015
  
  0bc3c68b
21 Jan, 2015 2 commits

fix job array scheduling anomaly · 3787c01f

Morris Jette authored Jan 21, 2015

If some tasks of a job array are runnable and the meta-job array
record is not runable (e.g. held), the old logic could start a
runable task then try to start the non-runable meta-job, discover
it can not run, and set its reason to "BadConstraints".

Test case:
Make it so no jobs can start (partition stopped, slurmd down, etc.)
submit a job array
hold the job array
release the first two tasks of the job array
Make it so jobs can start

3787c01f

fix squeue merging of job arrays · 261580be

Morris Jette authored Jan 21, 2015

Squeue modified to not merge tasks of a job array if their wait reasons
differ.
bug 1388

261580be

20 Jan, 2015 2 commits
- If users specify ALL together with other variables using the · 9e1b2d9b
  David Bigagli authored Jan 20, 2015
```
--export sbatch/srun command line option, propagate the users'
environ to the execution side. #1367
```
  9e1b2d9b
- Fix memory leak in mysql accounting when usage rollup happens. · 5a1cb777
  Danny Auble authored Jan 20, 2015
  
  5a1cb777
19 Jan, 2015 1 commit
- job_submit/lua - Add "alloc_node" job info · 85b3cc2d
  jette authored Jan 19, 2015
```
bug 1379
```
  85b3cc2d
17 Jan, 2015 1 commit
- job_submit/pbs - Fix possible deadlock. · d0dd1c53
  jette authored Jan 17, 2015
```
bug 1375
```
  d0dd1c53
15 Jan, 2015 3 commits

Make CR_ONE_TASK_PER_CORE work correctly with task/affinity. · db926ab7

Danny Auble authored Jan 15, 2015

What this does is use the core level binding after each task is laid out
to skip all the extra threads in the core so it doesn't give them to
another task.

It probably isn't perfect, but does solve all the scenarios I found.

db926ab7

GRES scheduling fix · 72cefd54

Morris Jette authored Jan 15, 2015

Fix for GRES scheduling in which there is CPU topology defined or
GRES types defined and there is more than 1 GPU per topology record
in slurmctld. Without this fix, only one GRES could be allocated
from each defined topology.
bug 1369

72cefd54

Fix for slurmctld abort on gres error · ce1d99f5

Morris Jette authored Jan 14, 2015

The slurmctld could abort with a gres configuration having
Type= configured, but no CPU binding configured.

ce1d99f5

14 Jan, 2015 2 commits
- Update NEWS file. · 20ce1e1c
  David Bigagli authored Jan 14, 2015
  
  20ce1e1c
- Make sacctmgr print out classification correctly for clusters · b72616a6
  Danny Auble authored Jan 14, 2015
  
  b72616a6
13 Jan, 2015 2 commits
- Correct check of enforcement when filling in an association. · 0f5c3cd8
  Danny Auble authored Jan 12, 2015
  
  0f5c3cd8
- Make sure assoc_mgr locks are initialized correctly. · 319be552
  Danny Auble authored Jan 12, 2015
```
Most of these don't matter as they are all NO_LOCK

Fallout from commit f1ebdef1 when the resources were added.
```
  319be552
09 Jan, 2015 2 commits
- Fix minor typo · a7cb5b12
  Danny Auble authored Jan 08, 2015
  
  a7cb5b12
- Edit news for next tag · 35e1ece8
  Danny Auble authored Jan 08, 2015
  
  35e1ece8
07 Jan, 2015 4 commits

update NEWS from Merge · 8e7a62d1
Danny Auble authored Jan 07, 2015

8e7a62d1
Add pbs parser fix to NEWS · 729a58ac
Aaron Knister authored Jan 06, 2015

729a58ac

avoid delay on commit for PMI task at rank 0 · bb6656dc

Rémi Palancher authored Dec 22, 2014

Intel MPI, on MPI jobs initialisation through PMI, uses to call PMI_KVS_Put()
many many times from task at rank 0, and each on these call is followed by
PMI_KVS_Commit(). Slurm implementation of PMI_KVS_Commit() imposes a delay
to avoid DDOS on original srun. This delay is proportional to the total number.
It could be up to 3 secs for large jobs for ex. with 7168 tasks. Therefore,
when Intel MPI calls PMI_KVS_Commit() 475 times (mesured on a test case) from
task at rank 0, 28 minutes are spent in delay function.
All other tasks in the job are waiting for a PMI_Barrier. Therefore, there is
no risk for a DDOS from this single task 0. The patch alters the delaying time
calculation to make sure task at rank 0 will does not be delayed. All other
tasks are globally spreaded in the same time range as before.

bb6656dc

Add PMI2 fix to NEWS · 84d61f94
Aaron Knister authored Jan 06, 2015

84d61f94

06 Jan, 2015 4 commits
- job array depdendency fix · 744f114b
  Morris Jette authored Jan 05, 2015
```
Fix race condition that could start a job that is dependent upon a job array
before all tasks of that job array complete.
bug 1324
```
  744f114b
- Fix segfault in slurmstepd when job exceeded memory limit. · a4b05bad
  Brian Christiansen authored Jan 05, 2015
```
Bug 1350
```
  a4b05bad
- BLUEGENE - Remove check that would erroneously remove the CONFIGURING · 77787999
  Danny Auble authored Jan 05, 2015
```
flag from a job while the job is waiting for a block to boot.
```
  77787999
- BGQ - Put print statement under a DebugFlag. This was just an oversight. · 8681a0f5
  Danny Auble authored Jan 05, 2015
  
  8681a0f5
05 Jan, 2015 1 commit
- Correct the pbs parser. · e35c6c4b
  David Bigagli authored Jan 05, 2015
  
  e35c6c4b
02 Jan, 2015 2 commits
- Fix segfault with job arrays. · db98d624
  Brian Christiansen authored Jan 02, 2015
```
Bug 1346
```
  db98d624
- Fix cosmetic info statements when dealing with a job array task instead of · 70837b3f
  Danny Auble authored Jan 02, 2015
```
a normal job.
```
  70837b3f
01 Jan, 2015 1 commit
- Fix sacct when searching by nodelist. · 99440d95
  Brian Christiansen authored Dec 31, 2014
  
  99440d95
30 Dec, 2014 2 commits
- Update openmpi documentation. · 10577d87
  David Bigagli authored Dec 30, 2014
  
  10577d87
- Restore the SLURM_STEP_RESV_PORTS env variable. · 5170be55
  David Bigagli authored Dec 30, 2014
  
  5170be55
29 Dec, 2014 1 commit
- Fix documentation issues in slurm.conf. · 4fcc08e2
  David Bigagli authored Dec 29, 2014
  
  4fcc08e2