- 07 Dec, 2013 8 commits
-
-
David Bigagli authored
-
Morris Jette authored
Correction to commit 5a4b9e0c
-
David Bigagli authored
conflict resolution.
-
Danny Auble authored
-
Danny Auble authored
-
Philip D. Eckert authored
-
David Bigagli authored
This reverts commit 58c12f7e.
-
David Bigagli authored
the slurmctld throws a fatal error.
-
- 06 Dec, 2013 9 commits
-
-
Jason Bacon authored
Using CPU: Intel(R) Pentium(R) 4 CPU 2.40GHz (2392.04-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf27 Family = f Model = 2 Stepping = 7 Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> It's also using an older version of hwloc (1.3.1) and I have not yet tested it with a newer one, but since 0 and -1 are legitimate returns values for hwloc_get_nbobjs_by_type(), I think they should be handled in any case. From the hwloc_get_nbobjs_by_type() man page: static inline int hwloc_get_nbobjs_by_type (hwloc_topology_ttopology, hwloc_obj_type_ttype) [static] Returns the width of level type type. If no object for that type exists, 0 is returned. If there are several levels with objects of that type, -1 is returned. I'm attaching a smarter patch that handles both 0 and -1 return values for both CORE and SOCKET. It logs a warning if it has to fudge a 0 return code and bails out with a helpful error message for -1, which I have no idea how to handle. At least people won't have to waste time tracking down the problem this way. Happy Friday, Jason
-
Trofinoff Stephen authored
This adds a mechanism to kill a hung apbasil command
-
Morris Jette authored
-
Morris Jette authored
error introduced in commit ec4df3bf
-
Morris Jette authored
-
Jason Bacon authored
-
Danny Auble authored
Fix for python 3 encoding
-
Morris Jette authored
A abort has been reported if the node's gres count differs from it's bitmap. This has been induced by changing the count manually (e.g. scontrol update nodename=tux123 gres=gpu:4"). I have not been able to reproduce this problem, but this will resize the bitmap in order to avoid the assert failure.
-
Danny Auble authored
-
- 05 Dec, 2013 10 commits
-
-
Danny Auble authored
-
Danny Auble authored
news.html.
-
Teun Docter authored
-
Danny Auble authored
-
Taras Shapovalov authored
instead of when running on the node for the first time.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
not global macros.
-
Danny Auble authored
-
Morris Jette authored
Add SLURM_CLUSTER_NAME to environment variables passed to PrologSlurmctld, Prolog, EpilogSlurmctld, and Epilog.
-
- 04 Dec, 2013 9 commits
-
-
Morris Jette authored
PrologSlurmctld, EpilogSlurmctld, MailProg, etc.
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
container that never had a pid added to it. (The job ended before it began)
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
Previous logic never reopened the file, preventing proper functioning of logrotate.
-
- 03 Dec, 2013 4 commits
-
-
Morris Jette authored
Conflicts: src/slurmctld/job_mgr.c
-
Morris Jette authored
Use hash function to locate job records for improved performance.
-
Danny Auble authored
-
Danny Auble authored
-