1. 15 Mar, 2018 1 commit
  2. 14 Mar, 2018 1 commit
  3. 24 Jan, 2018 1 commit
  4. 19 Jan, 2018 1 commit
  5. 12 Jan, 2018 1 commit
    • Dominik Bartkiewicz's avatar
      Fix to aftercorr job array dependency logic · 7b5a3674
      Dominik Bartkiewicz authored
      Fix job array dependency with "aftercorr" option and some task arrays in
          the first job fail. This fix lets all task array elements that can run
          proceed rather than stopping all subsequent task array elements.
      Bug 4590
      7b5a3674
  6. 11 Jan, 2018 2 commits
    • Morris Jette's avatar
      node_feature/knl_cray - Fix memory leak · deaacad2
      Morris Jette authored
      node_feature/knl_cray - Fix memory leak that can occur during normal
          operation. This will happen when an update request for a specific
          node happens.
      deaacad2
    • Morris Jette's avatar
      node_feature/knl_cray - Fix memory leaks · 32c93fce
      Morris Jette authored
      If CnselectPath and/or SyscfgPath defined in knl_cray.conf file and
        slurmctld reconfigured, the original values of those paraemters
        would be over-written and their memory leaked.
      32c93fce
  7. 05 Jan, 2018 1 commit
    • Felip Moll's avatar
      Avoid node hang in COMPLETING state · c43df3a7
      Felip Moll authored
      Avoid setting node in COMPLETING state indefinitely if the job initiating
      the node reboot is cancelled while the reboot in in progress. Bug
      introduced in commit 7d246784
      
      Bug 4536
      c43df3a7
  8. 03 Jan, 2018 3 commits
  9. 28 Dec, 2017 2 commits
  10. 18 Dec, 2017 1 commit
  11. 15 Dec, 2017 1 commit
    • Morris Jette's avatar
      Add --bb support for "access_mode" · 43bd77e4
      Morris Jette authored
      This adds support for the --bb option "access_mode" in addition to
        "access" for better compatability with Cray's DataWarp options.
      Related to bug 4528
      43bd77e4
  12. 12 Dec, 2017 1 commit
  13. 09 Dec, 2017 1 commit
  14. 08 Dec, 2017 1 commit
    • Danny Auble's avatar
      Fix Slurm to work correctly with HDF5 1.10+. · 006d172a
      Danny Auble authored
      In 1.10+ they changed the hid_t from an int to a long int which
      messes things up as they use the top 32 bits for stuff right off
      the bat.  This fixes the scenario by handing the number with a int32_t
      instead of an int.
      
      Bug 3795
      006d172a
  15. 05 Dec, 2017 1 commit
    • Alejandro Sanchez's avatar
      Fix to properly remove extern steps from the starting_steps list. · 99b3796b
      Alejandro Sanchez authored
      Since NO_VAL = SLURM_BATCH_SCRIPT, the else statement would only compare
      the job_id and not the step_id, thus when a batch step was removed all
      the steps from that job would be removed too. Then when attempting to
      remove the extern step in the next iteration, it was already removed
      and we were incorrectly erroring out.
      
      Bug 4458.
      99b3796b
  16. 30 Nov, 2017 1 commit
    • Alejandro Sanchez's avatar
      Fix memory leak · 0bb71ce2
      Alejandro Sanchez authored
      Fix memory leak of MailDomain configuration string when slurmctld daemon
         is reconfigured.
      bug 4272 (comment 35)
      0bb71ce2
  17. 29 Nov, 2017 1 commit
    • Brian Christiansen's avatar
      Fix sbatch --wait to stop after job is gone · f9977ee5
      Brian Christiansen authored
      slurm_load_job() prior to 17.11 returns the error code in errno and not
      in rc.  With the addition of 47175901 if a job is removed from memory
      before sbatch checks for the job again, sbatch could get in an loop
      checking for the job. This only happens if you have a very small
      MinJobAge (<10) -- which is not recommended.
      f9977ee5
  18. 28 Nov, 2017 12 commits
  19. 27 Nov, 2017 1 commit
  20. 22 Nov, 2017 1 commit
  21. 10 Nov, 2017 2 commits
  22. 07 Nov, 2017 2 commits
    • Alejandro Sanchez's avatar
      Fix issue when resetting the partition pointers on nodes. · e6b2bd2d
      Alejandro Sanchez authored
      Issue could be triggered when updating a partition node(s) with node(s)
      that were already in the partition, incorrectly increasing the
      node_record->part_cnt (number of associated partitions) and thus
      incorrectly extending the array of pointers to partitions associated
      with this node, leading to an array with repeated associated
      partitions pointers.
      
      Bug 4318.
      e6b2bd2d
    • Brian Gilmer's avatar
      Cray module file - remove munge support. · 8b71b9fc
      Brian Gilmer authored
      On CLE 6.0 mungedir is /usr; a 'module unload' call then removes /usr/bin
      from PATH which is rather inconvenient.
      
      Bug 4334.
      8b71b9fc
  23. 05 Nov, 2017 1 commit