- 18 Jan, 2016 2 commits
- 17 Jan, 2016 1 commit
-
-
jette authored
Fix backfill scheduling bug which could postpone the scheduling of jobs due to avoidance of nodes in COMPLETING state. bug 2350
-
- 16 Jan, 2016 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
No need to look up the Reason string for a job, we just set the value.
-
- 15 Jan, 2016 13 commits
-
-
Morris Jette authored
-
Brian Christiansen authored
Conflicts: NEWS
-
Brian Christiansen authored
-
Morris Jette authored
-
Brian Christiansen authored
Bug 2255
-
Morris Jette authored
This should provide more diagnostic information
-
Morris Jette authored
-
Janne Blomqvist authored
-
Morris Jette authored
-
Brian Christiansen authored
Bug 2343
-
Morris Jette authored
Fix for configuration of "AuthType=munge" and "AuthInfo=socket=..." with alternate munge socket path. bug 2348
-
Morris Jette authored
-
Brian Christiansen authored
Bug 2343
-
- 14 Jan, 2016 8 commits
-
-
Morris Jette authored
-
Morris Jette authored
Fix for configuration of "AuthType=munge" and "AuthInfo=socket=..." with alternate munge socket path. bug 2348
-
Morris Jette authored
Previously if partition limits enforcement was not configured, then a job submitted to a partition it could not access (say due to AllowGroups, AllowUsers, etc.) would not be rejected, but would be allocated resources and run. This bug was introduced in commit edf3880c
-
Morris Jette authored
-
Morris Jette authored
-
Janne Blomqvist authored
The initgroups()/getgrouplist() caching in slurmd is changed to not require enumeration, instead individual entries are cached when first needed. This cache is always enabled, thus the CacheGroups configuration setting has been removed. The time that each cache entry is considered valid is determined by the GroupUpdateTime configuration parameter. scontrol reconfig will purge the cache. The default value for the GroupUpdateForce configuration parameter has changed, as systems where /etc/group contains all the groups instead of some external system like NIS, LDAP are nowadays probably the exception rather than the rule. For slurmctld, the group cache still uses enumeration, but this is needed only to take care of special situations like multiple groups with the same GID. With enumeration disabled, group caching still works otherwise. validate_groups() does a little more optional work in order to handle the case where the user p...
-
Morris Jette authored
If a node is out of memory, then the malloc performed by slurmstepd periodically may fail, killing the slurmstepd and orphaning it's processes. bug 2341
-
Morris Jette authored
If a node is out of memory, then the malloc performed by slurmstepd periodically may fail, killing the slurmstepd and orphaning it's processes. bug 2341
-
- 13 Jan, 2016 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
Backfill scheduling fix: If a job can't be started due to a "group" resource limit, rather than reserve resources for it when the next job ends, don't reserve any resources for it. The problem with the original logic is that if a lot of resources are reserved for such pending jobs, then jobs futher down the queue may defered when they really can and should be started. An ideal solution would track all of the TRES resources through time as jobs start and end, but we don't have that logic in the backfill scheduler and don't want that extra overhead in the backfill scheduler. bugs 2326 and 2282
-
Alejandro Sanchez authored
bug 2303
-
Morris Jette authored
-
- 12 Jan, 2016 10 commits
-
-
Tim Wickberg authored
Conflicts: src/api/partition_info.c
-
Tim Wickberg authored
Handle unexpectedly large lines for hostlists. (Bug 2333.) While here rework to avoid extraneous xstrcat calls by using xstrfmtcat instead of snprintf + xstrcat. Collapse line end into own string for readability. No performance or functional change, aside from removing possible line truncation (which will silence additional Coverity warnings). Removes a double xfree() in slurm_sprint_reservation_info().
-
Morris Jette authored
When a reservation is created or updated, compress user provided node names using hostlist functions (e.g. translate user input of "Nodes=tux1,tux2" into "Nodes=tux[1-2]"). bug 2333
-
Brian Christiansen authored
-
Brian Christiansen authored
Reported by CLANG Continuation of 7eff526c
-
Brian Christiansen authored
Reported by CLANG Continuation of 7eff526c
-
Tim Wickberg authored
Match behavior of other PBS-like resource managers. Bug 2330.
-
Danny Auble authored
-
Danny Auble authored
using TRES as a key word.
-
Danny Auble authored
statements.
-