- 07 Jun, 2011 10 commits
-
-
Vermeulen Peter, nMCT Howest authored
-
Jimmy Tang authored
-
Jimmy Tang authored
-
Don Lipari authored
-
Danny Auble authored
Did the actual fix for getting the correct information on clusters of any dimension size from accounting when querying against a hostlist
-
Danny Auble authored
-
Danny Auble authored
-
Moe Jette authored
Added scontrol ability to increment or decrement a job or step time limit.
-
Moe Jette authored
-
Moe Jette authored
Modify the man2html calling sequence for a clean build of SLURM from the source directory. The old code would copy the header.txt and footer.txt files from "srcdir" to "." and generate an error if they were the same files. The new code just passes the pathnames of the files into the man2html python script
-
- 06 Jun, 2011 3 commits
-
-
Danny Auble authored
-
Danny Auble authored
would not be set correctly in the added child association.
-
Danny Auble authored
-
- 04 Jun, 2011 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- 03 Jun, 2011 1 commit
-
-
Morris Jette authored
Add an configure option of --enable-salloc-kill-cmd which will cause the salloc command to signal its child processes when the job ends. Job signalling will be the default for Cray systems. Jobs will not be signalled by default on other systems. SIGHUP will be used for interactive jobs and SIGTERM will be used for other jobs.
-
- 02 Jun, 2011 8 commits
-
-
Moe Jette authored
Change the reason that a node is marked DOWN and the log message from node "silent reboot" to "unexpected reboot"
-
Moe Jette authored
-
Moe Jette authored
Patch from Don Albert, Bull
-
Moe Jette authored
-
Moe Jette authored
With default configuration on non-Cray systems, enable salloc to be spawned as a background process. Based upon work by Don Albert (Bull) and Gerrit Renker (CSCS).
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- 01 Jun, 2011 6 commits
-
-
Moe Jette authored
-
Moe Jette authored
Add support to salloc for a new environment variable SALLOC_KILL_CMD, which is equivalent to the -K/--kill-command option.
-
Moe Jette authored
-
Moe Jette authored
This fixes a bug which is thanks to a report by Don Albert. The problem is that whenever salloc exits with a child process in stopped state (suspended or stopped on terminal input/output), a zombie process is generated, since this case is not caught by the code evaluating the child status. This patch adds the missing case. It uses SIGKILL, which is the only signal that changes the state of a stopped process. It was decided not to try and re-awken the process using SIGCONT, since (a) this happens during session clean-up and (b) if the condition is due to SIGTTIN, the process immediately becomes stopped again. Patch from Gerrit Renker, CSCS.
-
Danny Auble authored
-
Moe Jette authored
Treat the specification of multiple cluster names as a fatal error.
-
- 31 May, 2011 4 commits
- 30 May, 2011 1 commit
-
-
Morris Jette authored
-
- 29 May, 2011 3 commits
-
-
Morris Jette authored
Fix a couple of problems in alps emulation mode caused by recent changes in the select/cray plugin: node cordinates and signal return code
-
Morris Jette authored
select/cray: whitespace fixes and removal of unused code Patch 10_Cray_COSMETICS-whitespace.diff from Gerrit Renker, CSCS
-
Morris Jette authored
On the slurmd, the function build_all_frontend_info() is called before logging is fully initialized. This causes the frontend debug messages (which also get redundantly printed in the slurmctld log file) to be sent to stderr. On our system (where all slurmds get started remotely, via pdsh) the particular implementation caused the startup to hang. The patch uses a solution similar to build_all_node_line_info(), where a boolean flag is used to avoid repeating the slurmctld message in slurmd context. Patch 08_Multiple-Frontend_suppress_initial_debug_message.diff from Gerrit Renker, CSCS
-