1. 17 Jan, 2016 1 commit
  2. 15 Jan, 2016 6 commits
  3. 14 Jan, 2016 4 commits
    • Morris Jette's avatar
      fix AuthInfo with alternate munge socket location · f3d54f99
      Morris Jette authored
      Fix for configuration of "AuthType=munge" and "AuthInfo=socket=..." with
          alternate munge socket path.
      bug 2348
      f3d54f99
    • Janne Blomqvist's avatar
      Rework group caching to work better in environments with enumeration disabled. · 48a4cdf8
      Janne Blomqvist authored
      The initgroups()/getgrouplist() caching in slurmd is changed to not require enumeration, instead individual entries are cached when first needed. This cache is always enabled, thus the CacheGroups configuration setting has been removed. The time that each cache entry is considered valid is determined by the GroupUpdateTime configuration parameter. scontrol reconfig will purge the cache. The default value for the GroupUpdateForce configuration parameter has changed, as systems where /etc/group contains all the groups instead of some external system like NIS, LDAP are nowadays probably the exception rather than the rule.
      
      For slurmctld, the group cache still uses enumeration, but this is needed only to take care of special situations like multiple groups with the same GID. With enumeration disabled, group caching still works otherwise. validate_groups() does a little more optional work in order to handle the case where the user primary group is in the AllowGroups list, but getgrnam_r() does not return that user as a group member.
      
      bug 1629
      48a4cdf8
    • Morris Jette's avatar
      Avoid slurmstepd abort if malloc fails for accounting · 360fb080
      Morris Jette authored
      If a node is out of memory, then the malloc performed by slurmstepd
        periodically may fail, killing the slurmstepd and orphaning it's
        processes.
      bug 2341
      360fb080
    • Morris Jette's avatar
      Avoid slurmstepd abort if malloc fails for accounting · d5400aa5
      Morris Jette authored
      If a node is out of memory, then the malloc performed by slurmstepd
        periodically may fail, killing the slurmstepd and orphaning it's
        processes.
      bug 2341
      d5400aa5
  4. 13 Jan, 2016 2 commits
    • Morris Jette's avatar
      backfill scheduling with group limits fix · 3ee1632f
      Morris Jette authored
      Backfill scheduling fix: If a job can't be started due to a "group" resource
          limit, rather than reserve resources for it when the next job ends, don't
          reserve any resources for it. The problem with the original logic is that
          if a lot of resources are reserved for such pending jobs, then jobs futher
          down the queue may defered when they really can and should be started. An
          ideal solution would track all of the TRES resources through time as jobs
          start and end, but we don't have that logic in the backfill scheduler and
          don't want that extra overhead in the backfill scheduler.
      bugs 2326 and 2282
      3ee1632f
    • Alejandro Sanchez's avatar
      Add more partition info to "scontrol write config" · f428705b
      Alejandro Sanchez authored
      bug 2303
      f428705b
  5. 12 Jan, 2016 5 commits
  6. 11 Jan, 2016 6 commits
  7. 08 Jan, 2016 2 commits
    • Tim Wickberg's avatar
      Remove Sun Constellation. · bf04fa4d
      Tim Wickberg authored
      Update NEWS file for final removal of Sun Constellation, Elan, and
      IBM Federation (switch/nrt plugin replaces). Clean up documentation
      and few outstanding ifdef blocks. Unless you were defining
      HAVE_SUN_CONST there are no functional changes.
      bf04fa4d
    • Tim Wickberg's avatar
      Change slurmstepd to initialize authentication before task launch. · 870273ca
      Tim Wickberg authored
      Otherwise upgrading slurm on a compute node while tasks are running
      will cause plugin mismatch, as slurmstepd would not load the library
      until task completion before. Bug 2319.
      870273ca
  8. 07 Jan, 2016 7 commits
  9. 06 Jan, 2016 7 commits