- 08 Sep, 2016 2 commits
-
-
Morris Jette authored
Restructure srun command locking for task_exit processing logic for improved parallelism. This change decreases the amount of time consumed by serial logic by 2 orders of magnitude. bug 3044
-
Morris Jette authored
-
- 07 Sep, 2016 3 commits
-
-
Morris Jette authored
Preserve node "RESERVATION" state when one of multiple overlapping reservations ends. Previous logic would clear the node's RESERVATION state flag when any one of the reservations on the node ended rather than keeping the node in RESERVATION state until the last reservation ended. bug 3057
-
Morris Jette authored
The logic is now heavier weight, so increase interval between tests from 2 to 5 seconds
-
Morris Jette authored
Handle case when slurmctld daemon restart while compute node reboot in progress. Return node to service rather than setting DOWN. bug 3042
-
- 06 Sep, 2016 4 commits
-
-
Morris Jette authored
Add salloc_wait_nodes option to the SchedulerParameters parameter in the slurm.conf file controlling when the salloc command returns in relation to when nodes are ready for use (i.e. booted). bug 3043
-
E Kawashima authored
-
Gennaro Oliva authored
bug 3055
-
Gennaro Oliva authored
bug 3054
-
- 02 Sep, 2016 3 commits
-
-
Danny Auble authored
before all was moved into a common location in common.c.
-
Danny Auble authored
reservations.
-
Brian Christiansen authored
-
- 01 Sep, 2016 4 commits
-
-
Morris Jette authored
sched/backfill - Check that a user's QOS is allowed to use a partition before trying to schedule resources on that partition for the job. bug 3039
-
Morris Jette authored
-
Morris Jette authored
bug 3035 and 3009
-
David Gloe authored
This reverts commit 933d4fba.
-
- 30 Aug, 2016 3 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
Conflicts: src/plugins/select/cray/select_cray.c
-
Tim Wickberg authored
Otherwise blade_cnt is potentially greater than bit_size(jobinfo->blade_map) which leads to an assertion failure. Bug 3033.
-
- 27 Aug, 2016 5 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
instead of 2+. Continuation of previous commit.
-
Artem Polyakov authored
with hwloc.
-
Morris Jette authored
This patch has two parts: 1. When a job is intially submitted, the Slurm was failing to set an initial reason for the job not starting. 2. After a job was submitted, it was sometimes failing to reset the job's reason. It was also failing to reset the "last_job_update" time, so something like "squeue -i1" would not get the new reason. bug 3025
-
- 26 Aug, 2016 2 commits
-
-
Alejandro Sanchez authored
Fix multipart srun submission with EnforcePartLimits=NO and job violating the partition limits. bug 3025
-
Alejandro Sanchez authored
bug 3011
-
- 25 Aug, 2016 2 commits
-
-
Danny Auble authored
both) Continuation of commit 95fa8e8e.
-
Morris Jette authored
If all GRES were not defined on all nodes OR if a regular expression was used for a GRES file configuration (e.g. in gres.conf "Type=gpu Files=/dev/nvidia[0-4]"), then memory corruption was likely. The logic has been bad since its inception several years ago.
-
- 24 Aug, 2016 1 commit
-
-
Joseph Mingrone authored
POLLRDHUP does not exist on BSD, define to POLLHUP as done elsewhere.
-
- 23 Aug, 2016 1 commit
-
-
David Gloe authored
The attached patch switches to a more reliable method of detecting service nodes, using xtcli status. In addition, it switches to the print function to be better compatible with python 3.
-
- 22 Aug, 2016 10 commits
-
-
Boris Karasev authored
-
Danny Auble authored
-
Danny Auble authored
multiple versions.
-
Boris Karasev authored
on the most recent pmix_vX.so file. One is free to directly use new "pmix_vX" plugin names as well. NOTE: there is one small artifact introduced by this fix - there will be no "pmix" name in the output of "srun --mpi=list" but the most recent plugin will be duplicated: $ ./srun --mpi=list srun: MPI types are... srun: mpi/none srun: mpi/mvapich srun: mpi/mpich1_shmem srun: mpi/mpichgm srun: mpi/pmix_v1 <---- srun: mpi/mpichmx srun: mpi/lam srun: mpi/pmi2 srun: mpi/openmpi srun: mpi/mpich1_p4 srun: mpi/pmix_v1 <----
-
Danny Auble authored
conflicting pmix versions.
-
Danny Auble authored
-
Danny Auble authored
-
Boris Karasev authored
-
Danny Auble authored
linking against.
-
Danny Auble authored
commit 20a73c508e1.
-