- 15 Nov, 2017 5 commits
-
-
Alejandro Sanchez authored
From within slurm_job_submit(): job_desc.pack_job_offset From within slurm_job_modify(): job_rec.pack_job_id job_rec.pack_job_id_set job_rec.pack_job_offset Bug 4372.
-
Felip Moll authored
bug 4339
-
Felip Moll authored
added some additional checks to prevent segfaults in some basic situations. Bug 4247
-
Felip Moll authored
bug 4368
-
Dominik Bartkiewicz authored
Add SLURM_PACK_JOB_ID and SLURM_PACK_JOB_OFFSET to PrologSlurmctld and EpilogSlurmctld environment bug 4379
-
- 14 Nov, 2017 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
Avoid srun abort trying to run on heterogeneous job component that has ended. bug 4366
-
- 13 Nov, 2017 4 commits
-
-
Morris Jette authored
bug 4374
-
Morris Jette authored
Do so even if pack-group 0 is completed, so long as not all components are completed bug 4374
-
Morris Jette authored
-
Tim Wickberg authored
In a prior incarnation of the patch that introduced it, it was MaxQueryTimeLimit, and that was not updated with the code base when changed. Bug 4365.
-
- 10 Nov, 2017 6 commits
-
-
Tim Wickberg authored
-
Tim Wickberg authored
Update slurm.spec and slurm.spec-legacy as well
-
Tim Wickberg authored
-
Felip Moll authored
Bug 4323.
-
Isaac Hartung authored
This now matches the sinfo documentation. Bug 4306.
-
Tim Wickberg authored
The race condition this is avoiding has been fixed elsewhere. This reverts commit 6c21c8bd.
-
- 09 Nov, 2017 21 commits
-
-
Danny Auble authored
Coverity 178912
-
Danny Auble authored
Coverity CID 178913
-
Danny Auble authored
-
Danny Auble authored
-
Felip Moll authored
Removed references to versions <=16.05 and adapted to new 17.11
-
Morris Jette authored
launch/slurm plugin - Avoid using global variable for heterogeneous job steps, which could corrupt memory. bug 4333
-
Morris Jette authored
Ancient versions of OpenMPI and their derivatives (i.e. Cray MPI) are dependent upon communication ports being assigned to them by Slurm. Such MPI jobs will experience step launch failure if any component of a heterogeneous job step is unable to acquire the allocated ports. Non-heterogeneous job steps will retry step launch using a new set of communication ports (no change in Slurm behavior). NOTE: Correcting this would necessitate assigning the same set of ports to all components of the heterogeneous job (not possible today) plus changes to srun in order to better synchronize the step startup and error handling.
-
Dominik Bartkiewicz authored
Same logic as done in commit fb296c70 done for energy. Bug 4336
-
Morris Jette authored
If heterogeneous job step is unable to acquire MPI reserved ports then avoid referencing NULL pointer. bug 4333
-
Danny Auble authored
Force tres change on a job to send data to the database. This should be happening already, but this just makes it always happen.
-
Danny Auble authored
This fixes the possibility of going into this loop when we hadn't setup the tres_req_cnt. The simple case Coverity reported is if the job is already finished it goes here and we never set up tres_req_cnt. Coverity CID 178897
-
Danny Auble authored
This fixes the possibility of referencing a NULL pointer if the reservation doesn't exist anymore when testing. Coverity CID 178898
-
Tim Wickberg authored
Bug 3647.
-
Tim Wickberg authored
Bug 4353.
-
Doug Jacobsen authored
Also collapse a nested %{with cray} block leftover from earlier work. Bug 4332.
-
Doug Jacobsen authored
Logic was inverted from the correct behavior. Bug 4332.
-
Doug Jacobsen authored
Slurm package should not try to manage configs; leave this to the admin to setup as they wish. This avoids an issue on RPM install if /etc/slurm is a symlink to somewhere else. Bug 4332.
-
Doug Jacobsen authored
Bug 4332.
-
Tim Wickberg authored
Bug 4332.
-
Brian Christiansen authored
-
Brian Christiansen authored
-
- 08 Nov, 2017 2 commits
-
-
Brian Christiansen authored
-
Brian Christiansen authored
-