1. 10 May, 2019 10 commits
    • Nate Rini's avatar
      Remove extra whitespace in clus_jobs declaration. · 7bbb6220
      Nate Rini authored
      No functional change.
      
      Bug 6952.
      7bbb6220
    • Nate Rini's avatar
      Cleanup runaway jobs list to avoid leaking memory. · 6f60c6ca
      Nate Rini authored
      Call _purge_known_jobs() from _get_runaway_jobs() to purge
      known jobs (to slurmctld) from the list.
      
      Removed secondary list runaway_jobs as it was no longer needed.
      This also avoids leaking all the runaway_jobs.
      
      Bug 6952.
      6f60c6ca
    • Alejandro Sanchez's avatar
      62dc419e
    • Marshall Garey's avatar
      Document behavior of duplicate archive file names. · 7e7fd1bc
      Marshall Garey authored
      Bug 6033.
      7e7fd1bc
    • Marshall Garey's avatar
      Prevent infinite loop if 0 records are archived. · df5f748d
      Marshall Garey authored
      If _get_oldest_record() finds a record to archive/purge, then archive
      should always archive at least one record. If for whatever reason it
      fails to archive any records (_archive_table() returns a 0), then we
      don't want call continue, but want to return an error. Calling continue
      to go back to the beginning of the while loop would result in an
      infinite loop.
      
      Bug 6033.
      df5f748d
    • Marshall Garey's avatar
      Make archive job sql query consistent with purge. · 90471db8
      Marshall Garey authored
      Bug 6033.
      90471db8
    • Marshall Garey's avatar
      Only archive 50k records at a time. · ddd49896
      Marshall Garey authored
      Trying to archive too many records at once can result in archive files
      that are too big to read or even too big to be written. Only archive 50k
      records at a time, like we only purge 50k records at a time.
      
      Bug 6033.
      ddd49896
    • Marshall Garey's avatar
      Handle duplicate archive file names. · 1e234c3d
      Marshall Garey authored
      The time period of the archive file currently depends on submit or start
      time and whether the purge period is in hours, days, or months.
      Previously, if the archive file name already exists, we would overwrite
      the old archive file with the assumption that these are duplicate
      records being archived after an archive load. However, that could result
      in lost records in a couple of ways:
      
        * If there were runaway jobs that were part of an old archive file's
        time period and are later fixed and then purged, the old file would
        be overwritten.
        * If jobs or steps are purged but there are still jobs or steps in
        that time period that are pending or running, the pending or running
        jobs and steps won't be purged. When they finish and are purged, the
        old file would be overwritten.
      
      Instead of overwriting the old file, we append a number to the file name
      to create a new file. This will also be important in an upcoming commit.
      
      Bug 6033.
      1e234c3d
    • Marshall Garey's avatar
      Remove unused static variable high_buffer_size. · 3ffb4b4c
      Marshall Garey authored
      It was set but never read.
      
      Bug 6033.
      3ffb4b4c
    • Marshall Garey's avatar
      Use correct signed/unsiged types. · 4a26e486
      Marshall Garey authored
      Change a few variables in archiving to use the correct signed or
      unsigned type to avoid implicit casting.
      
      Bug 6033.
      4a26e486
  2. 09 May, 2019 6 commits
  3. 08 May, 2019 5 commits
  4. 07 May, 2019 5 commits
  5. 06 May, 2019 1 commit
    • Felip Moll's avatar
      Fix seff memory display overflow · bab13dfd
      Felip Moll authored
      When tres_usage_in_max field is empty it is recorded as '' in the database
      which leads find_tres_count_in_string() to return an INFINITE64. Seff treats
      INIFINITE64 as a valid value. This patch fixes this issue.
      
      Bug 6817
      bab13dfd
  6. 03 May, 2019 4 commits
  7. 02 May, 2019 7 commits
  8. 01 May, 2019 2 commits