1. 11 Feb, 2015 1 commit
  2. 10 Feb, 2015 2 commits
  3. 09 Feb, 2015 3 commits
  4. 05 Feb, 2015 1 commit
  5. 04 Feb, 2015 3 commits
    • Morris Jette's avatar
      Report correct job "shared" field value · 3de14946
      Morris Jette authored
      Previously it was not possible to distinguish between a job needing
      exclusive nodes and the default job/partition configuration.
      3de14946
    • Morris Jette's avatar
      job array slurmctld abort fix · 0ff342b5
      Morris Jette authored
      Fix job array logic that can cause slurmctld to abort.
      bug 1426
      0ff342b5
    • Morris Jette's avatar
      Fix for CUDA v7.0+ · da2fba48
      Morris Jette authored
      Enable CUDA v7.0+ use with a Slurm configuration of TaskPlugin=task/cgroup
      ConstrainDevices=yes (in cgroup.conf). With that configuration
      CUDA_VISIBLE_DEVICES will start at 0 rather than the device number.
      bug 1421
      da2fba48
  6. 03 Feb, 2015 6 commits
  7. 02 Feb, 2015 3 commits
  8. 31 Jan, 2015 2 commits
  9. 30 Jan, 2015 2 commits
  10. 28 Jan, 2015 3 commits
  11. 27 Jan, 2015 1 commit
  12. 26 Jan, 2015 1 commit
  13. 23 Jan, 2015 1 commit
  14. 22 Jan, 2015 1 commit
  15. 21 Jan, 2015 2 commits
    • Morris Jette's avatar
      fix job array scheduling anomaly · 3787c01f
      Morris Jette authored
      If some tasks of a job array are runnable and the meta-job array
      record is not runable (e.g. held), the old logic could start a
      runable task then try to start the non-runable meta-job, discover
      it can not run, and set its reason to "BadConstraints".
      
      Test case:
      Make it so no jobs can start (partition stopped, slurmd down, etc.)
      submit a job array
      hold the job array
      release the first two tasks of the job array
      Make it so jobs can start
      3787c01f
    • Morris Jette's avatar
      fix squeue merging of job arrays · 261580be
      Morris Jette authored
      Squeue modified to not merge tasks of a job array if their wait reasons
      differ.
      bug 1388
      261580be
  16. 20 Jan, 2015 2 commits
  17. 19 Jan, 2015 1 commit
  18. 17 Jan, 2015 1 commit
  19. 15 Jan, 2015 3 commits
    • Danny Auble's avatar
      Make CR_ONE_TASK_PER_CORE work correctly with task/affinity. · db926ab7
      Danny Auble authored
      What this does is use the core level binding after each task is laid out
      to skip all the extra threads in the core so it doesn't give them to
      another task.
      
      It probably isn't perfect, but does solve all the scenarios I found.
      db926ab7
    • Morris Jette's avatar
      GRES scheduling fix · 72cefd54
      Morris Jette authored
      Fix for GRES scheduling in which there is CPU topology defined or
      GRES types defined and there is more than 1 GPU per topology record
      in slurmctld. Without this fix, only one GRES could be allocated
      from each defined topology.
      bug 1369
      72cefd54
    • Morris Jette's avatar
      Fix for slurmctld abort on gres error · ce1d99f5
      Morris Jette authored
      The slurmctld could abort with a gres configuration having
      Type= configured, but no CPU binding configured.
      ce1d99f5
  20. 14 Jan, 2015 1 commit