- 07 Sep, 2016 5 commits
-
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Morris Jette authored
This is required in order to make sure that a compute node reboots as requested; even if the slurmctld daemon restarts. Previously if the slurmctld daemon restarted, we couldn't tell if the compute node rebooted before or after the request. This commit expands upon the work of commit 4517c454 by adding the reboot request time to the node record, which is saved and restored by slurmctld. bug 3042
-
Morris Jette authored
-
Morris Jette authored
Handle case when slurmctld daemon restart while compute node reboot in progress. Return node to service rather than setting DOWN. bug 3042
-
- 06 Sep, 2016 6 commits
-
-
Morris Jette authored
-
Morris Jette authored
Add salloc_wait_nodes option to the SchedulerParameters parameter in the slurm.conf file controlling when the salloc command returns in relation to when nodes are ready for use (i.e. booted). bug 3043
-
Morris Jette authored
-
E Kawashima authored
-
Gennaro Oliva authored
bug 3055
-
Gennaro Oliva authored
bug 3054
-
- 02 Sep, 2016 4 commits
-
-
Danny Auble authored
before all was moved into a common location in common.c.
-
Danny Auble authored
reservations.
-
Brian Christiansen authored
-
Brian Christiansen authored
-
- 01 Sep, 2016 9 commits
-
-
Morris Jette authored
-
Morris Jette authored
sched/backfill - Check that a user's QOS is allowed to use a partition before trying to schedule resources on that partition for the job. bug 3039
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
bug 3037
-
Morris Jette authored
Conflicts: src/plugins/burst_buffer/common/burst_buffer_common.h
-
Morris Jette authored
bug 3035 and 3009
-
Tim Wickberg authored
-
David Gloe authored
This reverts commit 933d4fba.
-
- 31 Aug, 2016 2 commits
-
-
Brian Christiansen authored
-
Tim Wickberg authored
-
- 30 Aug, 2016 4 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Conflicts: src/plugins/select/cray/select_cray.c
-
Tim Wickberg authored
Otherwise blade_cnt is potentially greater than bit_size(jobinfo->blade_map) which leads to an assertion failure. Bug 3033.
-
- 27 Aug, 2016 6 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
instead of 2+. Continuation of previous commit.
-
Artem Polyakov authored
with hwloc.
-
Morris Jette authored
This patch has two parts: 1. When a job is intially submitted, the Slurm was failing to set an initial reason for the job not starting. 2. After a job was submitted, it was sometimes failing to reset the job's reason. It was also failing to reset the "last_job_update" time, so something like "squeue -i1" would not get the new reason. bug 3025
-
- 26 Aug, 2016 3 commits
-
-
Alejandro Sanchez authored
Fix multipart srun submission with EnforcePartLimits=NO and job violating the partition limits. bug 3025
-
Morris Jette authored
-
Alejandro Sanchez authored
bug 3011
-
- 25 Aug, 2016 1 commit
-
-
Danny Auble authored
-