- 28 Oct, 2011 1 commit
-
-
Danny Auble authored
-
- 27 Oct, 2011 1 commit
-
-
Morris Jette authored
Add configure option of "--without-rpath" which builds SLURM tools without the rpath option, which will work if Munge and BlueGene libraries are in the default library search path and make system updates easier.
-
- 26 Oct, 2011 1 commit
-
-
Morris Jette authored
Add support for job allocations with multiple job constraint counts. For example: salloc -C "[rack1*2&rack2*4]" ... will allocate the job 2 nodes from rack1 and 4 nodes from rack2. Support for only a single constraint name been added to job step support.
-
- 25 Oct, 2011 1 commit
-
-
Morris Jette authored
Add support for GPU memory allocation on Cray systems using SLURM GRES (Generic RESource) support. Work by Steve Trofinoff, CSCS.
-
- 24 Oct, 2011 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Do not attempt to run HeathCheckProgram on powered down nodes. Patch from Ramiro Alba, Centre Tecnològic de Tranferència de Calor, Spain.
-
- 21 Oct, 2011 5 commits
-
-
Morris Jette authored
If job time limit exceeds partition maximum, but job's minimum time limit does not, set job's time limit to partition maximum at allocation time.
-
Danny Auble authored
ESLURMD_GID_NOT_FOUND where slurm would be a little over zealous in treating missing a GID or UID as a fatal error.
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
- 20 Oct, 2011 3 commits
-
-
Danny Auble authored
-
Danny Auble authored
block correctly.
-
Danny Auble authored
-
- 19 Oct, 2011 7 commits
-
-
Morris Jette authored
Report correct job "Reason" if needed nodes are DOWN, DRAINED, or NOT_RESPONDING, "Resources" rather than "PartitionNodeLimit".
-
Danny Auble authored
-
Morris Jette authored
-
Danny Auble authored
plugins in the slurmd.
-
Danny Auble authored
-
Danny Auble authored
value for jobs.
-
Danny Auble authored
-
- 18 Oct, 2011 1 commit
-
-
Morris Jette authored
-
- 14 Oct, 2011 2 commits
-
-
Danny Auble authored
-
Morris Jette authored
Cray - Fix for srun.pl parsing to avoid adding spaces between option and argument (e.g. "-N2" parsed properly without changing to "-N 2").
-
- 11 Oct, 2011 2 commits
-
-
Morris Jette authored
Cray: Add support for job reservations with node IDs that are not in numeric order. Fix for Bugzilla #5.
-
jette authored
Prevent job hold by operator or account coordinator of his own job from being an Administrator Hold rather than User Hold by default.
-
- 07 Oct, 2011 1 commit
-
-
Morris Jette authored
Prevent slurmctld crashing with divide by zero with a configuration of MaxMemPerCPU=0.
-
- 06 Oct, 2011 1 commit
-
-
Morris Jette authored
Add a node state flag of CLOUD and save/restore NodeAddr and NodeHostName information for nodes with a flag of CLOUD. Major update to elastic computing document.
-
- 05 Oct, 2011 3 commits
-
-
Morris Jette authored
-
Danny Auble authored
block happens correctly now.
-
Morris Jette authored
Add the ability to update a node's NodeAddr and NodeHostName with scontrol. Also enable setting a node's state to "future" using scontrol.
-
- 04 Oct, 2011 3 commits
-
-
Morris Jette authored
Major re-write of the CPU Management User and Administrator Guide (web page) by Martin Perry, Bull.
-
Morris Jette authored
If a job can not run due to QOS or association limits, then do not cancel the job, but leave it pending in a system held state (priority = 1). The job will run when its limits or the QOS/association limits change. Based upon a patch by Phil Ekcert (LLNL).
-
Danny Auble authored
booted block. -- pass 1, more work needs to be done.
-
- 03 Oct, 2011 2 commits
-
-
Danny Auble authored
-
Morris Jette authored
Prevent associations from being delete if it has any jobs in running, pending or suspended state. Previous code prevented this only for running jobs.
-
- 30 Sep, 2011 2 commits
-
-
Morris Jette authored
Fix bugs in sched/backfill with respect to QOS reservation support and job time limits. Patch from Alejandro Lucero Palau (Barcelona Supercomputer Center).
-
Morris Jette authored
Fix to GRES allocation logic when resources are associated with specific CPUs on a node. Patch from Steve Trofinoff, CSCS.
-
- 29 Sep, 2011 1 commit
-
-
Danny Auble authored
(i.e. 1-9,0 instead of 0-9). The bug would cause 'sacct -N nodename' to not give correct results on these systems.
-