1. 03 May, 2013 2 commits
    • Morris Jette's avatar
      Merge branch 'slurm-2.5' · 08b9faec
      Morris Jette authored
      Conflicts:
      	testsuite/expect/test1.90
      08b9faec
    • jette's avatar
      Make test more robust · 2592eb5e
      jette authored
      Make test work if current working directory not in the search path
      Check for appropriate task rank on POE based systems
      Disable the entire test on POE systems
      2592eb5e
  2. 02 May, 2013 8 commits
  3. 01 May, 2013 24 commits
  4. 30 Apr, 2013 6 commits
    • Morris Jette's avatar
      Change maximum delay for state save from 2 secs to 5 secs. · 5a2a76ff
      Morris Jette authored
      Make timeout configurable at build time by defining SAVE_MAX_WAIT.
      5a2a76ff
    • Danny Auble's avatar
      Remove unneeded flag · d813d3ac
      Danny Auble authored
      d813d3ac
    • Danny Auble's avatar
      Restore tbl logic · 29d31750
      Danny Auble authored
      29d31750
    • Thomas Cadeau's avatar
      Fix fat fingers (IMPI -> IPMI) · 2e0f87a3
      Thomas Cadeau authored
      2e0f87a3
    • Thomas Cadeau's avatar
      protect launch of tread in IPMI plugin · 89842945
      Thomas Cadeau authored
      89842945
    • Olli-Pekka Lehto's avatar
      added script to help manage native and symmetric MPI runs within SLURM · fdf56162
      Olli-Pekka Lehto authored
      Dear all,
      
      As quick fix, I have put together this script to help manage native and symmetric MPI runs within SLURM. It's a bit bare-bones currently but I needed to get it working quickly :)
      
      It does not provide tight integration between the scheduler and MPI daemons and requires a slot on the host, even when running fully on the MIC, so it's really far from an optimal solution but could be a stopgap.
      
      It's inspired by the TACC Stampede documentation. They seem to have a similar script in place.
      
      It's fairly simple, you provide the names of the MIC binary (with -m) and host binary (with -c). The host MPI/OpenMP parameters are given as usual and the Xeon Phi side parameters as environment variables (MIC_PPN, MIC_OMP_NUM_THREADS). Currently it supports only 1 card per host but extending it should be simple enough.
      
      Here are a couple of links to documentation:
      
      Our prototype cluster documentation:
      https://confluence.csc.fi/display/HPCproto/HPC+Prototypes#HPCPrototypes-XeonPhiDevelopment
      Presentation at the PRACE Spring School in Umeå earlier this week:
      https://www.hpc2n.umu.se/sites/default/files/1.03%20CSC%20Cluster%20Introduction.pdf
      
      Feel free to include this in the contribs -directory. It might need a bit of cleanup though and I don't know when I have the time to do this.
      
      I have also added support for TotalView debugger (provided it's installed and configured properly for Xeon Phi usage).
      
      Future ideas:
      
      For the native MIC client, I've been testing it out a bit and looking at ways to minimize the changes needed for support. The two major challenges seem to be in scheduling and affinity:
      
      I think it might be necessary to put it into a specific topology plugin, like the one for BG/Q, but it looks like a lot of work to do that.
      
      Best regards,
      Olli-Pekka
      fdf56162