- 04 Feb, 2014 1 commit
-
-
Morris Jette authored
Added whole_node field to job_resources structure Enable gang scheduling for jobs with core specialization and other jobs allocated whole nodes.
-
- 31 Jan, 2014 3 commits
-
-
David Bigagli authored
-
Danny Auble authored
i.e. salloc -n32 doesn't request the number of nodes and with the previous code if this request used 4 nodes and only 1 was left in GrpNodes it would just run with no issue since we were checking things before we selected how many nodes it ran on. Now we check this afterwards so we always check the limits on how many nodes, cpus and how much memory is to be used.
-
Morris Jette authored
Fix step allocation when some CPUs are not available due to memory limits. This happens when one step is active and using memory that blocks the scheduling of another step on a portion of the CPUs needed. The new step is now delayed rather than aborting with "Requested node configuration is not available". bug 577
-
- 29 Jan, 2014 1 commit
-
-
David Bigagli authored
incorrectly when using the hostlist_push_host function and input surrounded by [].
-
- 28 Jan, 2014 1 commit
-
-
Danny Auble authored
based on ionode count correctly on slurmctld restart.
-
- 25 Jan, 2014 2 commits
-
-
jette authored
-
Morris Jette authored
Split a slurmctld's job record "shared" field into "share_res" (share resource) and "whole_node" fields. Needed to better manage allocation of whole nodes for core specialization without disabling gang scheduling of such jobs.
-
- 23 Jan, 2014 5 commits
-
-
David Bigagli authored
to suspend/resume array elements.
-
Danny Auble authored
This reverts commit 34fd501c.
-
Danny Auble authored
connect in a loop instead of producing a fatal.
-
Danny Auble authored
-
Stephane Vaillant authored
-
- 22 Jan, 2014 1 commit
-
-
Danny Auble authored
-
- 21 Jan, 2014 2 commits
-
-
David Bigagli authored
-
David Bigagli authored
This reverts commit 2fa28eb6. Conflicts: NEWS
-
- 20 Jan, 2014 1 commit
-
-
jette authored
-
- 18 Jan, 2014 1 commit
-
-
David Bigagli authored
data correctly accumulating differences between sampling intervals. Fix the data structure mismatch between acct_gather_filesystem_lustre.c and slurm_jobacct_gather.h which caused the hdf5 plugin to log incorrect data.
-
- 17 Jan, 2014 1 commit
-
-
Morris Jette authored
New configuration parameter of CoreSpecPlugin. Added core_spec/none and core_spec/cray plugins (just stubs for now).
-
- 16 Jan, 2014 5 commits
-
-
David Bigagli authored
the srun help.
-
Danny Auble authored
not "idle" when in a reservation.
-
Morris Jette authored
Add version number to node and front-end configuration information visible using the scontrol tool. Sview and sinfo still need to be changed.
-
Morris Jette authored
Add specialized core count field to job credential data. NOTE: This changes the communications protocol from other pre-releases of version 14.03. All programs must be cancelled and daemons upgraded from previous pre-releases of version 14.03. Upgrades from version 2.6 or earlier can take place without loss of jobs
-
David Bigagli authored
network traffic accounting plugin.
-
- 15 Jan, 2014 2 commits
-
-
David Bigagli authored
to print more information when debug and when io error occur.
-
Danny Auble authored
add/remove columns. caused by commit 68f0f5db
-
- 13 Jan, 2014 2 commits
-
-
Morris Jette authored
Do not reset a job's priority when the slurmctld restarts if previously set to some specific value. bug 561
-
John Morrissey authored
groups.
-
- 11 Jan, 2014 1 commit
-
-
David Bigagli authored
hostlist_push_host().
-
- 10 Jan, 2014 1 commit
-
-
David Bigagli authored
-
- 09 Jan, 2014 2 commits
-
-
David Bigagli authored
are not longer set DOWN, they are set to DRAIN instead.
-
Morris Jette authored
Core specialization is now fully supported.
-
- 08 Jan, 2014 4 commits
-
-
David Bigagli authored
-
David Bigagli authored
This reverts commit 3464295e.
-
David Bigagli authored
-
Morris Jette authored
Make sure that licenses are not oversubscribed in overlapping reservations.
-
- 07 Jan, 2014 3 commits
-
-
Danny Auble authored
-
Morris Jette authored
Do not mark the node DOWN if its memory or tmp disk space is lower than configured, just log it using debug message type
-
David Bigagli authored
parameter in slurm.conf.
-
- 06 Jan, 2014 1 commit
-
-
Morris Jette authored
If a job is explicitly suspended, its priority is set to zero. This resets the priority when requeued and also documents that if the job is requeued (e.g. due to a node failure), then it is placed in a held state.
-