- 20 Sep, 2003 4 commits
-
-
Mark Grondona authored
-
Moe Jette authored
-
Mark Grondona authored
on nodes relative to the current allocation. o srun no longer sends SIGKILL to the job if one task is killed, unless --no-allocate is used (the job will otherwise be killed by the controller anyway).
-
Mark Grondona authored
-
- 19 Sep, 2003 11 commits
-
-
Mark Grondona authored
function.
-
Mark Grondona authored
memory appears to be full.
-
Mark Grondona authored
write() call.
-
Mark Grondona authored
- Instead of attempting to kill pending threads, exit wait_for_procs immediately if a thread is already waiting for the job. - If wait_for_procs fails (a thread is already waiting), exit without sending the epilog complete RPC.
-
Moe Jette authored
type debug.
-
Moe Jette authored
descriptors. This was needed in several grouped functions (e.g. slurm_send_recv_rc_msg and slurm_send_only_node_msg, which combine open, send, receive, and close functions for simplicity).
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
RPC server threads to that number - 2. This should slightly reduce the incoming RPC load.
-
Moe Jette authored
and may have been exhausting virtual memory, resulting in the death of slurmctld.
-
Mark Grondona authored
-
- 18 Sep, 2003 4 commits
-
-
Moe Jette authored
the left-over slurmctld process on abnormal termination.
-
Moe Jette authored
clarity. No change in functionality or logic.
-
Moe Jette authored
If the "-c" option is not specified, then only the jobs and some node state information will be preserved: specifically, the state of DOWN, DRAINED, or DRAINING nodes and the associated reason field for those nodes.
-
Moe Jette authored
-
- 17 Sep, 2003 10 commits
-
-
Moe Jette authored
-
jwindley authored
-
Moe Jette authored
-
Moe Jette authored
1=required nodes DOWN/DRAINED.
-
Moe Jette authored
unavailable states).
-
Moe Jette authored
right away (before running scheduling function).
-
Moe Jette authored
returned to service. The priority is changed from 1 to the value that would be set for the job if it were submitted at that time. (gnats:279)
-
Moe Jette authored
nodes which are not available (DOWN or DRAIN). This will prevent them from blocking other jobs from using the nodes which are available (i.e. override FIFO scheduling). (gnats:279)
-
Moe Jette authored
Without doing so, its internal record of jobs from its last period of activity is resurrected.
-
Moe Jette authored
-
- 16 Sep, 2003 7 commits
-
-
Moe Jette authored
MAX_SERVER_THREADS is exceeded. Thread counter, mutex, and cond logic all moved into new allocate/deallocate server thread functions.
-
Moe Jette authored
-
Mark Grondona authored
-
Mark Grondona authored
-
Moe Jette authored
-
Moe Jette authored
assumes control. It previously captured state only when the backup controller daemon was initiated.
-
Moe Jette authored
This was not happening for the backup slurmctld.
-
- 15 Sep, 2003 4 commits
-
-
Moe Jette authored
-
Moe Jette authored
-
Mark Grondona authored
-
Mark Grondona authored
-