1. 07 Dec, 2013 1 commit
  2. 06 Dec, 2013 5 commits
    • Improve hwloc support for various processors · ac5d734b
      Jason Bacon authored
      Using CPU: Intel(R) Pentium(R) 4 CPU 2.40GHz (2392.04-MHz 686-class CPU)
        Origin = "GenuineIntel"  Id = 0xf27  Family = f  Model = 2  Stepping = 7
      Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
      
      It's also using an older version of hwloc (1.3.1), and I have not yet tested it with a newer one, but since 0 and -1 are legitimate return values for hwloc_get_nbobjs_by_type(), I think they should be handled in any case.
      
      From the hwloc_get_nbobjs_by_type() man page:
      
      static inline int hwloc_get_nbobjs_by_type (hwloc_topology_t topology,
             hwloc_obj_type_t type) [static]
             Returns the width of level type type. If no object for that type
             exists, 0 is returned. If there are several levels with objects of that
             type, -1 is returned.
      
      I'm attaching a smarter patch that handles both 0 and -1 return values for both CORE and SOCKET.  It logs a warning if it has to fudge a 0 return code and bails out with a helpful error message for -1, which I have no idea how to handle.  At least people won't have to waste time tracking down the problem this way.
      
      Happy Friday,
      
          Jason
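The handling described in this message can be sketched roughly as follows. This is a minimal illustration, not the code from commit ac5d734b: `usable_obj_count` is a hypothetical helper, and `n` stands in for whatever hwloc_get_nbobjs_by_type() returned for CORE or SOCKET. It assumes that a count of 0 can safely be treated as 1, which is what "fudging" the value seems to imply.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical helper (not part of hwloc or Slurm).
 * n is the value returned by hwloc_get_nbobjs_by_type();
 * type_name is only used in the messages, e.g. "core" or "socket". */
static int usable_obj_count(int n, const char *type_name)
{
    assert(type_name != NULL);

    if (n > 0)
        return n;   /* normal case: a single level with n objects */

    if (n == 0) {
        /* No object of this type was reported: assume one and warn. */
        fprintf(stderr, "warning: hwloc reports no %s objects, assuming 1\n",
                type_name);
        return 1;
    }

    /* Negative: objects of this type exist at several levels.
     * There is no obvious way to pick one, so report it and give up. */
    fprintf(stderr, "error: hwloc reports %s objects at multiple levels\n",
            type_name);
    return -1;
}
```

A caller would pass the raw return value through this helper once for CORE and once for SOCKET, and abort initialization if either call returns -1.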
    • Added ApbasilTimeout parameter to the cray.conf · 270f696e
      Trofinoff Stephen authored
      This adds a mechanism to kill a hung apbasil command
    • Fix bad print · 1712d619
      Morris Jette authored
      error introduced in commit ec4df3bf
    • Fix for hwloc returning zero core count · ec4df3bf
      Jason Bacon authored
    • Fix for gres count change · 4e56260f
      Morris Jette authored
      An abort has been reported if the node's gres count differs from
      the size of its bitmap. This has been induced by changing the count
      manually (e.g. "scontrol update nodename=tux123 gres=gpu:4"). I have
      not been able to reproduce this problem, but this will resize the
      bitmap in order to avoid the assert failure.
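The idea of resizing the bitmap when the count changes can be illustrated with a small sketch. Everything here is an assumption for illustration: `struct gres_state`, `bytes_for`, and `gres_sync_bitmap` are hypothetical names, and the byte-packed bitmap is a stand-in for Slurm's own bitstring type, which this sketch does not use.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a node's gres state; not Slurm's real structure. */
struct gres_state {
    size_t count;         /* configured gres count, e.g. 4 for gpu:4 */
    size_t bit_cnt;       /* number of bits the bitmap currently holds */
    unsigned char *bits;  /* one bit per gres, packed 8 per byte */
};

static size_t bytes_for(size_t nbits)
{
    return (nbits + 7) / 8;
}

/* Make the bitmap size match the count, keeping the bits that still fit.
 * Returns 0 on success, -1 if memory allocation fails. */
static int gres_sync_bitmap(struct gres_state *g)
{
    assert(g != NULL);
    if (g->bit_cnt == g->count)
        return 0;   /* sizes already agree, nothing to do */

    size_t old_bytes = bytes_for(g->bit_cnt);
    size_t new_bytes = bytes_for(g->count);
    unsigned char *nb = calloc(new_bytes ? new_bytes : 1, 1);
    if (nb == NULL)
        return -1;

    if (g->bits != NULL)
        memcpy(nb, g->bits, old_bytes < new_bytes ? old_bytes : new_bytes);

    /* When shrinking, clear any bits at or beyond the new count. */
    if (g->count % 8 != 0)
        nb[new_bytes - 1] &= (unsigned char)((1u << (g->count % 8)) - 1);

    free(g->bits);
    g->bits = nb;
    g->bit_cnt = g->count;
    return 0;
}
```

Calling such a function before any code that asserts on the bitmap size avoids the mismatch after a manual count change, at the cost of dropping allocation bits that no longer fit when the count shrinks.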
  3. 05 Dec, 2013 2 commits
  4. 04 Dec, 2013 2 commits
  5. 03 Dec, 2013 4 commits
  6. 02 Dec, 2013 3 commits
  7. 29 Nov, 2013 6 commits
  8. 27 Nov, 2013 5 commits
  9. 26 Nov, 2013 5 commits
  10. 25 Nov, 2013 5 commits
  11. 24 Nov, 2013 2 commits