- 04 Mar, 2014 3 commits
-
-
Morris Jette authored
-
Magnus Jonsson authored
-
Carlos Bederián authored
-
- 03 Mar, 2014 2 commits
-
-
David Bigagli authored
merged together.
-
Josko Plazonic authored
sacct.
-
- 02 Mar, 2014 1 commit
-
-
jette authored
Add support for SchedulerParameters value of bf_max_job_start that limits the total number of jobs that can be started in a single iteration of the backfill scheduler. bug 607
-
- 27 Feb, 2014 3 commits
-
-
Morris Jette authored
bug 607
-
Danny Auble authored
-
Danny Auble authored
-
- 26 Feb, 2014 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
- 25 Feb, 2014 2 commits
-
-
Danny Auble authored
placed.
-
David Bigagli authored
-
- 21 Feb, 2014 3 commits
-
-
Danny Auble authored
count.
-
Danny Auble authored
nodes.
-
Danny Auble authored
-
- 20 Feb, 2014 2 commits
-
-
Morris Jette authored
If a job requires specific nodes and can not run due to those nodes being busy, the main scheduling loop will block those specific nodes rather than the entire queue/partition. bug 595
-
Morris Jette authored
-
- 19 Feb, 2014 2 commits
-
-
David Bigagli authored
is not a corresponding association.
-
David Bigagli authored
is not a corresponding association.
-
- 14 Feb, 2014 2 commits
-
-
David Bigagli authored
-
Danny Auble authored
needed to forward a message the slurmd would core dump.
-
- 13 Feb, 2014 1 commit
-
-
David Bigagli authored
describing that jobs must be drained from cluster before deploying any checkpoint plugin.
-
- 12 Feb, 2014 1 commit
-
-
Morris Jette authored
Properly enforce a job's cpus-per-task option when a job's allocation is constrained on some nodes by the mem-per-cpu option. bug 590
-
- 10 Feb, 2014 1 commit
-
-
Morris Jette authored
-
- 09 Feb, 2014 1 commit
-
-
Moe Jette authored
-
- 08 Feb, 2014 2 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
- 07 Feb, 2014 1 commit
-
-
Morris Jette authored
bug 586
-
- 05 Feb, 2014 3 commits
-
-
Danny Auble authored
-
Dominik Bartkiewicz authored
Set GPU_DEVICE_ORDINAL environment variable.
-
Danny Auble authored
-
- 04 Feb, 2014 2 commits
-
-
Morris Jette authored
Previous logic would try to pick a specific node count and on a heterogeneous system, this would cause a problem. This change largely reverts commit a270417b
-
Danny Auble authored
-
- 03 Feb, 2014 1 commit
-
-
Danny Auble authored
-
- 31 Jan, 2014 3 commits
-
-
David Bigagli authored
-
Danny Auble authored
i.e. salloc -n32 doesn't request the number of nodes and with the previous code if this request used 4 nodes and only 1 was left in GrpNodes it would just run with no issue since we were checking things before we selected how many nodes it ran on. Now we check this afterwards so we always check the limits on how many nodes, cpus and how much memory is to be used.
-
Morris Jette authored
Fix step allocation when some CPUs are not available due to memory limits. This happens when one step is active and using memory that blocks the scheduling of another step on a portion of the CPUs needed. The new step is now delayed rather than aborting with "Requested node configuration is not available". bug 577
-
- 28 Jan, 2014 1 commit
-
-
Danny Auble authored
based on ionode count correctly on slurmctld restart.
-
- 23 Jan, 2014 1 commit
-
-
Danny Auble authored
connect in a loop instead of producing a fatal.
-