- 15 Mar, 2014 3 commits
-
-
Morris Jette authored
Add logic to sleep and retry if slurm.conf can't be read. Without this, the slurmd daemons may die and when the SlurmdTimeout is reached, the nodes will be marked DOWN and their jobs will be killed. In the long term, it would be good to exit only if the read files on program startup, and the daemons keep running with old configuration on reconfiguration, but I don't have time to do that work now.
-
Morris Jette authored
Fix invalid memory reference if script returns error message for user. Previous code failed to set static variable to NULL resulting in xfree of memory previously freed elsewhere.
-
Morris Jette authored
Add support for job array options in the qsub command, in #PBS options for sbatch scripts and set the appropriate environment variables in the spank_pbs plugin (PBS_ARRAY_ID and PBS_ARRAY_INDEX). Note that Torque uses the "-t" option and PBS Pro uses the "-J" option.
-
- 14 Mar, 2014 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
slurm.conf. Rebooting daemons after adding nodes to the slurm.conf is highly recommended.
-
- 13 Mar, 2014 5 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Add a job flag to indicate when the EpilogSlurmctld us running and don't purge the job record until it completes. This lets the EpilogSlurmctld requeue the job and otherwise manage it. bugs 635 and 636
-
Morris Jette authored
-
Morris Jette authored
bug 640
-
- 11 Mar, 2014 3 commits
-
-
David Bigagli authored
application environment the PMI2 library will connect the the PMI2 server (slurmstepd) instead of using the provided PMI_FD socket.
-
Danny Auble authored
-
Danny Auble authored
-
- 10 Mar, 2014 2 commits
-
-
Morris Jette authored
Cache results for major performance improvement. bug 636
-
David Bigagli authored
-
- 08 Mar, 2014 2 commits
-
-
Morris Jette authored
If a job request explicitly requests a GRES count of zero and that is not the last GRES in the slurm internal data structures, the job request will be rejected. bug 633
-
Danny Auble authored
Perhaps should also look into doing this for nodeinfo and libstate
-
- 07 Mar, 2014 6 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
this would cause pmd's to hang.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- 06 Mar, 2014 3 commits
-
-
Danny Auble authored
-
Danny Auble authored
code.
-
Morris Jette authored
-
- 05 Mar, 2014 1 commit
-
-
Morris Jette authored
-
- 04 Mar, 2014 6 commits
-
-
Bill Brophy authored
interval.
-
Morris Jette authored
-
Morris Jette authored
-
Magnus Jonsson authored
-
Carlos Bederián authored
-
Don Lipari authored
and SLURM_STEP_ID env vars are set.
-
- 03 Mar, 2014 4 commits
-
-
David Bigagli authored
merged together.
-
Danny Auble authored
4TB of memory on them.
-
Josko Plazonic authored
sacct.
-
Morris Jette authored
Add SchedulerParameters options of preempt_reorder_count and preempt_strict_order.
-
- 02 Mar, 2014 1 commit
-
-
jette authored
This is for consistency. All of the other backfill parameters start with "bf_".
-