1. 18 May, 2011 9 commits
    • Danny Auble's avatar
      Merge github.com:chaos/slurm · 88b0686c
      Danny Auble authored
      88b0686c
    • Morris Jette's avatar
      select/cray: increase robustness of initialisation code · dc8d97eb
      Morris Jette authored
      This improves the initial configuration code:
       a) Better handling of DownNodes lines
          The previous basil_geometry() would set the node Reason field on failure,
          irrespective of whether that node has been marked using a DownNode line.
      
       b) Check all cases of nodes being invisible to ALPS
          Up until now basil_geometry() had to be fixed each time a new source of
          discrepancy between ALPS and SDB state had been discovered (most recent
          case was NULL coordinates when taking out a blade). Depending on ALPS
          interface changes, there may be other possibilities. Instead of fixing the
          SLURM code for each new case, it is better to check whether SLURM and ALPS
          agree. The price is some tiny delay at SLURM initialisation time (since each
          node is first looked up in the ALPS inventory), but it pays well off as it
          eases system administration by pointing to the source of error.
          Any node that has suddenly disappeared from ALPS horizon will now show up in
          the logs, and also in marked down in sinfo.
      
       c) At initialisation time, give a summary as to how many ALPS nodes are online.
      
       d) Turn ALPS-node-invisibility error into warning message, since such nodes may
          already have been covered in a DownNodes statement.
      
      By merging basil_get_initial_state() into basil_geometry(), the previously separate
      knowledge about system state (database state, ALPS inventory) is combined, allowing
      to more easily identify sources of failure.
      Patch from Gerrit Renker, CSCS.
      dc8d97eb
    • Moe Jette's avatar
      Update NEWS for start of slurm v2.3.0-pre6 · 9ad7a366
      Moe Jette authored
      9ad7a366
    • Moe Jette's avatar
      Update META for v2.3.0-pre5 tag · dd739219
      Moe Jette authored
      dd739219
    • Moe Jette's avatar
      Update build notes for use with GIT, not SVN · a7fce8c9
      Moe Jette authored
      a7fce8c9
    • Danny Auble's avatar
      removed debug · 66f39529
      Danny Auble authored
      66f39529
    • Danny Auble's avatar
      f27f3c78
    • Moe Jette's avatar
      Another job step layout bug · b7cfe963
      Moe Jette authored
      Logging message was misleading and incorrect pointer used in another.
      b7cfe963
    • Moe Jette's avatar
      Fix bugs in step allocations in heterogeneous env · 851b8f71
      Moe Jette authored
      Former logic failed to properly allocate resources to a job step
      when specifying both a task count and a node count range on a
      heterogeneous cluster.
      851b8f71
  2. 17 May, 2011 15 commits
  3. 16 May, 2011 6 commits
  4. 14 May, 2011 1 commit
  5. 13 May, 2011 9 commits