- 07 Aug, 2003 5 commits
-
-
Moe Jette authored
By default, logging is to syslog. Expanded usage description to be more complete.
-
Moe Jette authored
-
Moe Jette authored
without duplicates. For example, we want to convert a users request for nodes "lx1,lx2,lx1,lx3" to "lx[1-3]".
-
Moe Jette authored
entry counting function. As a job is completing the bitmap gets cleared as nodes complete, resulting in incorrect values.
-
Moe Jette authored
-
- 06 Aug, 2003 1 commit
-
-
Moe Jette authored
-
- 05 Aug, 2003 4 commits
-
-
Moe Jette authored
job is scheduled.
-
Moe Jette authored
required by a pending job. NOTE: On heterogeneous systems or if the requested number of nodes is a range then the actual number of nodes allocated may differ from the value displayed.
-
Moe Jette authored
required by a pending job.
-
Moe Jette authored
-
- 04 Aug, 2003 1 commit
-
-
Moe Jette authored
batch_job_launch RPC, then deallocate those resources and requeue the job. If a node registers and fails to show a batch job that should have a script running there (node zero of allocation), then consider the job complete.
-
- 02 Aug, 2003 5 commits
-
-
Moe Jette authored
"Can't connect to node" with every ping failure.
-
Moe Jette authored
-
Moe Jette authored
Changed the logging level of a few other message.
-
Moe Jette authored
until the previous one completes. This avoids having too many cycles active (and a bunch of threads too). Ping_nodes control functions moved to a new module.
-
Moe Jette authored
reportted in log (via #define PRINT_TIMES). Resolution of time stamps changed from 10msec to 1usec by changing to gettimeofday instead of clock.
-
- 31 Jul, 2003 20 commits
-
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
print_exit_status
-
Moe Jette authored
-
Moe Jette authored
allocated/idle.
-
Moe Jette authored
-
Mark Grondona authored
o Call waitpid with WUNTRACED to report on stopped processes.
-
Moe Jette authored
instead of REQUEST_KILL_TASKS RPC to slurmd.
-
Moe Jette authored
memory. Minor code restructuring (moved setrlimit into _init_config() function from main).
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
Set "reason" field when node set down for slurmd error.
-
Moe Jette authored
-
Moe Jette authored
request.
-
Moe Jette authored
-
Mark Grondona authored
-
Mark Grondona authored
-
Moe Jette authored
the backup controller and proc_req.c is the code to process incomming RPCs. No changes in controller logic were made for this. job_mgr.c was also modified to better handle bad job records on controller restart's data recovery.
-
Mark Grondona authored
- Enhanced xsignal.[ch] to more easily block and unblock sets of signals [ added: xsignal_unblock xsignal_sigset_create, changed: xsignal_unblock ] - Take advantage of above changes in srun and slurmd code - Only unblock signals that were specifically blocked in slurmd (instead of *all* signals) when exec'ing user tasks.
-
Mark Grondona authored
o allow error() to return SLURM_ERROR so callers can do: return error (...);
-
- 30 Jul, 2003 1 commit
-
-
Moe Jette authored
-
- 29 Jul, 2003 3 commits