- 01 May, 2013 15 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
Make sure the lock array in assoc_mgr.h matches that of locks.h. See commit 2e99b99a.
-
-
David Bigagli authored
cluster.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
disk IO.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Bill Brophy authored
-
Morris Jette authored
Modify slurmctld data structure locking to interleave read and write locks rather than always favoring write locks over read locks.
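The difference between the two policies can be sketched as a pure decision function. This is only an illustration: the type `lock_queue_t`, the enum `grant_t`, and both function names are hypothetical and are not taken from the slurmctld sources.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical types for illustration; not slurmctld's own structures. */
typedef enum { GRANT_NONE, GRANT_READ, GRANT_WRITE } grant_t;

typedef struct {
	int  waiting_readers;       /* threads waiting for a read lock */
	int  waiting_writers;       /* threads waiting for a write lock */
	bool last_grant_was_write;  /* kind of lock granted most recently */
} lock_queue_t;

/* Write-preferring policy: readers wait as long as any writer is queued. */
grant_t next_grant_write_preferred(const lock_queue_t *q)
{
	if (q->waiting_writers > 0)
		return GRANT_WRITE;
	if (q->waiting_readers > 0)
		return GRANT_READ;
	return GRANT_NONE;
}

/* Interleaving policy: when both kinds are queued, alternate between them. */
grant_t next_grant_interleaved(const lock_queue_t *q)
{
	assert(q->waiting_readers >= 0 && q->waiting_writers >= 0);
	if (q->waiting_readers > 0 && q->waiting_writers > 0)
		return q->last_grant_was_write ? GRANT_READ : GRANT_WRITE;
	if (q->waiting_writers > 0)
		return GRANT_WRITE;
	if (q->waiting_readers > 0)
		return GRANT_READ;
	return GRANT_NONE;
}
```

Under the write-preferring policy a steady stream of writers can starve readers indefinitely. In this sketch the interleaved policy bounds a reader's wait to one write grant.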
-
- 30 Apr, 2013 10 commits
-
-
Morris Jette authored
Make timeout configurable at build time by defining SAVE_MAX_WAIT.
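A build-time override of this kind is commonly written with an `#ifndef` guard. The sketch below assumes a default of 5 seconds and uses a hypothetical helper, `save_max_wait_seconds()`. Neither the default nor the helper is confirmed by this entry, which also does not say what the timeout governs.

```c
#include <assert.h>

/* Assumed default (in seconds); only used when the build does not define it. */
#ifndef SAVE_MAX_WAIT
#define SAVE_MAX_WAIT 5
#endif

/* Hypothetical helper: returns the wait limit selected at build time. */
int save_max_wait_seconds(void)
{
	assert(SAVE_MAX_WAIT > 0);  /* a non-positive timeout would be a build error */
	return SAVE_MAX_WAIT;
}
```

With this pattern, a different value can be chosen without editing the source, for example by adding `-DSAVE_MAX_WAIT=10` to the compiler flags.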
-
Danny Auble authored
-
Danny Auble authored
-
Thomas Cadeau authored
-
Thomas Cadeau authored
-
Olli-Pekka Lehto authored
Dear all,

As a quick fix, I have put together this script to help manage native and symmetric MPI runs within SLURM. It's a bit bare-bones currently but I needed to get it working quickly :) It does not provide tight integration between the scheduler and MPI daemons and requires a slot on the host, even when running fully on the MIC, so it's really far from an optimal solution but could be a stopgap.

It's inspired by the TACC Stampede documentation. They seem to have a similar script in place.

It's fairly simple: you provide the names of the MIC binary (with -m) and host binary (with -c). The host MPI/OpenMP parameters are given as usual and the Xeon Phi side parameters as environment variables (MIC_PPN, MIC_OMP_NUM_THREADS). Currently it supports only 1 card per host but extending it should be simple enough.

Here are a couple of links to documentation:

Our prototype cluster documentation: https://confluence.csc.fi/display/HPCproto/HPC+Prototypes#HPCPrototypes-XeonPhiDevelopment

Presentation at the PRACE Spring School in Umeå earlier this week: https://www.hpc2n.umu.se/sites/default/files/1.03%20CSC%20Cluster%20Introduction.pdf

Feel free to include this in the contribs directory. It might need a bit of cleanup though and I don't know when I will have the time to do this.

I have also added support for the TotalView debugger (provided it's installed and configured properly for Xeon Phi usage).

Future ideas: For the native MIC client, I've been testing it out a bit and looking at ways to minimize the changes needed for support. The two major challenges seem to be in scheduling and affinity: I think it might be necessary to put it into a specific topology plugin, like the one for BG/Q, but it looks like a lot of work to do that.

Best regards,
Olli-Pekka
-
Morris Jette authored
Conflicts: doc/html/slurm_ug_registration.shtml
-
Danny Auble authored
-
Hongjia Cao authored
-
Morris Jette authored
-
- 29 Apr, 2013 14 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Ryan Cox authored
-style output file. Signed-off-by: Danny Auble <da@schedmd.com>
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Avoid placing pending jobs in AdminHold state due to backfill scheduler interactions with advanced reservations. Specifically, if the backfill scheduler tested whether a pending job could be scheduled after its advanced reservation ends, the job was assigned a priority of zero (AdminHold).
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
undefined variable.
-
Morris Jette authored
Previously nothing was printed to the stderr of a foreground program on OOM. Eliminate the need for the log_fp() function.
-
Morris Jette authored
Partial revert of commit 3a6bd336
-
- 28 Apr, 2013 1 commit
-
-
jette authored
-