  1. 02 May, 2013 2 commits
  2. 01 May, 2013 6 commits
  3. 30 Apr, 2013 3 commits
    • Change maximum delay for state save from 2 secs to 5 secs. · 5a2a76ff
      Morris Jette authored
      Make timeout configurable at build time by defining SAVE_MAX_WAIT.
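      A minimal sketch of the build-time pattern described above, assuming
      SAVE_MAX_WAIT is a guarded preprocessor default in the state-save code
      (the surrounding logic is not shown):

      ----------------
      /* Default maximum delay (in seconds) before state save; override at
       * build time, e.g. CFLAGS="-DSAVE_MAX_WAIT=10". */
      #ifndef SAVE_MAX_WAIT
      #define SAVE_MAX_WAIT 5
      #endif
      ----------------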
    • added script to help manage native and symmetric MPI runs within SLURM · fdf56162
      Olli-Pekka Lehto authored
      Dear all,
      
      As a quick fix, I have put together this script to help manage native and symmetric MPI runs within SLURM. It's a bit bare-bones currently, but I needed to get it working quickly :)
      
      It does not provide tight integration between the scheduler and the MPI daemons, and it requires a slot on the host even when running fully on the MIC, so it is far from an optimal solution, but it could serve as a stopgap.
      
      It's inspired by the TACC Stampede documentation. They seem to have a similar script in place.
      
      It's fairly simple: you provide the name of the MIC binary (with -m) and of the host binary (with -c). The host MPI/OpenMP parameters are given as usual, and the Xeon Phi side parameters are given as environment variables (MIC_PPN, MIC_OMP_NUM_THREADS). Currently it supports only one card per host, but extending it should be simple enough.
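
      As a rough illustration of the usage described above (the script name,
      binary names, and values below are placeholders, not the actual ones):

      ----------------
      # Xeon Phi side settings go in environment variables
      export MIC_PPN=4                # MPI ranks per MIC card
      export MIC_OMP_NUM_THREADS=30   # OpenMP threads per MIC rank
      # Host side MPI/OpenMP parameters are given as usual; -c names the
      # host binary and -m the MIC binary
      ./mpirun-mic -c ./hello.host -m ./hello.mic
      ----------------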
      
      Here are a couple of links to documentation:
      
      Our prototype cluster documentation:
      https://confluence.csc.fi/display/HPCproto/HPC+Prototypes#HPCPrototypes-XeonPhiDevelopment
      Presentation at the PRACE Spring School in Umeå earlier this week:
      https://www.hpc2n.umu.se/sites/default/files/1.03%20CSC%20Cluster%20Introduction.pdf
      
      Feel free to include this in the contribs directory. It might need a bit of cleanup, though, and I don't know when I will have time to do that.
      
      I have also added support for the TotalView debugger (provided it is installed and configured properly for Xeon Phi usage).
      
      Future ideas:
      
      For the native MIC client, I've been testing it out a bit and looking at ways to minimize the changes needed for support. The two major challenges seem to be in scheduling and affinity:
      
      I think it might be necessary to put it into a specific topology plugin, like the one for BG/Q, but it looks like a lot of work to do that.
      
      Best regards,
      Olli-Pekka
    • Accounting - make average by task not cpu. · 81ccec93
      Danny Auble authored
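      For illustration of the averaging distinction (numbers made up): a 4-task
      step running on 8 CPUs that used 4000 MB of memory in total would report
      an average of 1000 MB when averaging by task, versus 500 MB when
      averaging by CPU.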
  4. 29 Apr, 2013 3 commits
  5. 26 Apr, 2013 3 commits
  6. 25 Apr, 2013 2 commits
  7. 24 Apr, 2013 1 commit
  8. 23 Apr, 2013 3 commits
  9. 19 Apr, 2013 3 commits
  10. 18 Apr, 2013 1 commit
  11. 17 Apr, 2013 3 commits
  12. 16 Apr, 2013 2 commits
  13. 12 Apr, 2013 3 commits
    • Danny Auble authored · ca3c2fa1
    • Replaced ipmi.conf with generic acct_gather.conf file for all acct_gather plugins. · c1793844
      Danny Auble authored
      For those doing development: to use this, follow the model set forth in
      the acct_gather_energy_ipmi plugin.
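      A minimal sketch of what an acct_gather.conf might contain for the IPMI
      energy plugin, assuming the sampling-frequency option carried over from
      the old ipmi.conf (the authoritative keyword list is in the
      acct_gather.conf documentation):

      ----------------
      # Hypothetical example; each acct_gather plugin defines its own keywords.
      EnergyIPMIFrequency=10
      ----------------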
    • gres/gpu - Fix for gres.conf file with multiple files on a single line · ee6a7066
      Morris Jette authored
      We're in the process of setting up a few GPU nodes in our cluster, and
      want to use Gres to control access to them.
      
      Currently, we have activated one node with 2 GPUs.  The gres.conf file
      on that node reads
      
      ----------------
      
      Name=gpu Count=2 File=/dev/nvidia[0-1]
      Name=localtmp Count=1800
      ----------------
      
      (The localtmp gres just counts access to local tmp disk.)  Nodes without
      GPUs have gres.conf files like this:
      
      ----------------
      
      Name=gpu Count=0
      Name=localtmp Count=90
      ----------------
      
      slurm.conf contains the following:
      
      GresTypes=gpu,localtmp
      Nodename=DEFAULT Sockets=2 CoresPerSocket=8 ThreadsPerCore=1 RealMemory=62976 Gres=localtmp:90 State=unknown
      [...]
      Nodename=c19-[1-16] NodeHostname=compute-19-[1-16] Weight=15848 CoresPerSocket=4 Gres=localtmp:1800,gpu:2 Feature=rack19,intel,ib
      
      Submitting a job with sbatch --gres=gpu:1 ... sets the CUDA_VISIBLE_DEVICES
      environment variable for the job.  However, the values seem a bit strange:
      
      - If we submit one job with --gres=gpu:1, CUDA_VISIBLE_DEVICES gets the value 0.
      
      - If we submit two jobs with --gres=gpu:1 at the same time,
        CUDA_VISIBLE_DEVICES gets the value 0 for one job, and 1633906540 for
        the other.
      
      - If we submit one job with --gres=gpu:2, CUDA_VISIBLE_DEVICES gets the
        value 0,1633906540
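      For reference, the bracketed File= expression above expands to one device
      file per GPU; writing one file per line is a presumably equivalent
      spelling (sketch only, assuming per-line counts accumulate) and should be
      unaffected by the single-line bug addressed here:

      ----------------
      Name=gpu File=/dev/nvidia0
      Name=gpu File=/dev/nvidia1
      Name=localtmp Count=1800
      ----------------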
  14. 11 Apr, 2013 3 commits
  15. 10 Apr, 2013 2 commits