- 02 Sep, 2016 3 commits
-
-
Danny Auble authored
before all was moved into a common location in common.c.
-
Danny Auble authored
reservations.
-
Brian Christiansen authored
-
- 01 Sep, 2016 4 commits
-
-
Morris Jette authored
sched/backfill - Check that a user's QOS is allowed to use a partition before trying to schedule resources on that partition for the job. bug 3039
-
Morris Jette authored
-
Morris Jette authored
bug 3035 and 3009
-
David Gloe authored
This reverts commit 933d4fba.
-
- 30 Aug, 2016 3 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
Conflicts: src/plugins/select/cray/select_cray.c
-
Tim Wickberg authored
Otherwise blade_cnt is potentially greater than bit_size(jobinfo->blade_map) which leads to an assertion failure. Bug 3033.
-
- 27 Aug, 2016 5 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
instead of 2+. Continuation of previous commit.
-
Artem Polyakov authored
with hwloc.
-
Morris Jette authored
This patch has two parts: 1. When a job is initially submitted, Slurm was failing to set an initial reason for the job not starting. 2. After a job was submitted, it was sometimes failing to reset the job's reason. It was also failing to reset the "last_job_update" time, so something like "squeue -i1" would not get the new reason. bug 3025
-
- 26 Aug, 2016 2 commits
-
-
Alejandro Sanchez authored
Fix multipart srun submission with EnforcePartLimits=NO and job violating the partition limits. bug 3025
-
Alejandro Sanchez authored
bug 3011
-
- 25 Aug, 2016 2 commits
-
-
Danny Auble authored
both) Continuation of commit 95fa8e8e.
-
Morris Jette authored
If all GRES were not defined on all nodes OR if a regular expression was used for a GRES file configuration (e.g. in gres.conf "Type=gpu Files=/dev/nvidia[0-4]"), then memory corruption was likely. The logic has been bad since its inception several years ago.
-
- 24 Aug, 2016 1 commit
-
-
Joseph Mingrone authored
POLLRDHUP does not exist on BSD, define to POLLHUP as done elsewhere.
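A minimal sketch of the fallback described above, assuming a guard of this general shape; the helper `peer_hung_up` is hypothetical and only illustrates that code testing for `POLLRDHUP` keeps compiling on platforms that lack it:

```c
#include <poll.h>

/* POLLRDHUP is a Linux extension. Where it is missing (e.g. on BSD),
 * alias it to POLLHUP so the same event checks still compile. */
#ifndef POLLRDHUP
#define POLLRDHUP POLLHUP
#endif

/* Hypothetical helper (not from the Slurm source): returns 1 if the
 * revents mask reported by poll() indicates the peer hung up. */
static int peer_hung_up(short revents)
{
	return (revents & (POLLHUP | POLLRDHUP)) != 0;
}
```

With the alias in place, `POLLHUP | POLLRDHUP` collapses to plain `POLLHUP` on platforms without the extension, so the check behaves the same either way.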
-
- 23 Aug, 2016 1 commit
-
-
David Gloe authored
The attached patch switches to a more reliable method of detecting service nodes, using xtcli status. In addition, it switches to the print function for better compatibility with Python 3.
-
- 22 Aug, 2016 13 commits
-
-
Boris Karasev authored
-
Danny Auble authored
-
Danny Auble authored
multiple versions.
-
Boris Karasev authored
on the most recent pmix_vX.so file. One is free to directly use new "pmix_vX" plugin names as well. NOTE: there is one small artifact introduced by this fix: there will be no "pmix" name in the output of "srun --mpi=list", but the most recent plugin will be duplicated:
    $ ./srun --mpi=list
    srun: MPI types are...
    srun: mpi/none
    srun: mpi/mvapich
    srun: mpi/mpich1_shmem
    srun: mpi/mpichgm
    srun: mpi/pmix_v1 <----
    srun: mpi/mpichmx
    srun: mpi/lam
    srun: mpi/pmi2
    srun: mpi/openmpi
    srun: mpi/mpich1_p4
    srun: mpi/pmix_v1 <----
-
Danny Auble authored
conflicting pmix versions.
-
Danny Auble authored
-
Danny Auble authored
-
Boris Karasev authored
-
Danny Auble authored
linking against.
-
Danny Auble authored
commit 20a73c508e1.
-
Danny Auble authored
of what is in the header file of pmix.
-
Danny Auble authored
-
Boris Karasev authored
To ease the distribution process, plugin names will be automatically adjusted to identify the version of the API that each plugin supports, i.e. pmix_v1 and pmix_v2. This allows distros to create separate, non-conflicting packages for each API generation. Bug 2986
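The naming scheme above could be sketched roughly as follows. This is only an illustration of deriving a `pmix_vN` name from an API major version; the function `pmix_plugin_name` and its parameters are hypothetical and are not taken from the Slurm build system or source:

```c
#include <stdio.h>
#include <stddef.h>

/* Hypothetical helper (an assumption, not Slurm code): write the plugin
 * name for a given PMIx API major version into buf, e.g. "pmix_v2".
 * Returns 0 on success, -1 if the name did not fit or formatting failed. */
static int pmix_plugin_name(int api_major, char *buf, size_t len)
{
	int n = snprintf(buf, len, "pmix_v%d", api_major);

	return (n > 0 && (size_t)n < len) ? 0 : -1;
}
```

Because each API generation maps to a distinct name, packages built against different PMIx versions would install differently named plugin files and not overwrite one another.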
-
- 20 Aug, 2016 1 commit
-
-
Morris Jette authored
Ensure reported expected job start time is not in the past for pending jobs. bug 3002
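One way to picture this kind of fix is a simple clamp applied before the time is reported. The sketch below is an assumption about the general idea, not the actual change; `clamp_start_time` is a hypothetical name, and treating 0 as "no estimate" is likewise assumed for the example:

```c
#include <time.h>

/* Hypothetical helper (not the actual Slurm change): never report an
 * expected start time earlier than the current time for a pending job.
 * A value of 0 is assumed here to mean "no estimate" and is left alone. */
static time_t clamp_start_time(time_t expected_start, time_t now)
{
	if (expected_start && (expected_start < now))
		return now;
	return expected_start;
}
```

A stale estimate computed during an earlier scheduling pass is pulled forward to the current time, while future estimates pass through unchanged.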
-
- 19 Aug, 2016 2 commits
-
-
Tim Wickberg authored
-
Morris Jette authored
burst_buffer/cray: Requeue, but do not hold a job which fails the pre_run operation. bug 3009
-
- 18 Aug, 2016 1 commit
-
-
Alejandro Sanchez authored
Bug 2984.
-
- 17 Aug, 2016 1 commit
-
-
Morris Jette authored
-
- 16 Aug, 2016 1 commit
-
-
Alejandro Sanchez authored
Only mark job_id as zero for the batch step (when all job steps would be cleared), not for individual steps, which prevented successive steps from being cancelled. Bug 2984.
-