1. 24 Oct, 2016 5 commits
    • Morris Jette's avatar
      8e461b60
    • Morris Jette's avatar
      burst_buffer/cray: accept jobs without dw_wlm_cli · 5acd7e76
      Morris Jette authored
      burst_buffer/cray: Accept new jobs on backup slurmctld daemon without access
          to dw_wlm_cli command. No burst buffer actions will take place. Newly
          submitted jobs will be accepted and stay in pending state. Jobs depedent
          upon stage-in or stage-out will remain in their current state until the
          action can take place.
      5acd7e76
    • Brian Christiansen's avatar
      Only free the msg once. · 10c9b0d2
      Brian Christiansen authored
      10c9b0d2
    • Morris Jette's avatar
      Merge branch 'slurm-16.05' · 6c05c82e
      Morris Jette authored
      6c05c82e
    • Dorian Krause's avatar
      Fux use-after-free in srun · 2c7c5459
      Dorian Krause authored
      This commit fixes a bug in the multi-prog handling. When running
      salloc -N 2 srun -O --multi-prog mp.conf where mp.conf reads
      
      0-192 true
      
      srun crashes can be observed. valgrind reports:
      
      ==6857== Invalid read of size 4
      ==6857==    at 0x45938D: bit_realloc (bitstring.c:189)
      ==6857==    by 0x5977A9: _update_task_mask (multi_prog.c:335)
      ==6857==    by 0x597A5E: _validate_ranks (multi_prog.c:403)
      ==6857==    by 0x597D1E: verify_multi_name (multi_prog.c:469)
      ==6857==    by 0x6E7B4BE: launch_p_handle_multi_prog_verify (launch_slurm.c:453)
      ==6857==    by 0x58A25D: launch_g_handle_multi_prog_verify (launch.c:493)
      ==6857==    by 0x58E556: _opt_args (opt.c:1927)
      ==6857==    by 0x58A3B9: initialize_and_process_args (opt.c:270)
      ==6857==    by 0x591F82: init_srun (srun_job.c:459)
      ==6857==    by 0x427E70: srun (srun.c:193)
      ==6857==    by 0x428E23: main (srun.wrapper.c:17)
      ==6857==  Address 0x5ace440 is 16 bytes inside a block of size 28 free'd
      ==6857==    at 0x4C2BB4A: realloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
      ==6857==    by 0x446886: slurm_xrealloc (xmalloc.c:139)
      ==6857==    by 0x45944C: bit_realloc (bitstring.c:191)
      ==6857==    by 0x5977A9: _update_task_mask (multi_prog.c:335)
      ==6857==    by 0x597A5E: _validate_ranks (multi_prog.c:403)
      ==6857==    by 0x597D1E: verify_multi_name (multi_prog.c:469)
      ==6857==    by 0x6E7B4BE: launch_p_handle_multi_prog_verify (launch_slurm.c:453)
      ==6857==    by 0x58A25D: launch_g_handle_multi_prog_verify (launch.c:493)
      ==6857==    by 0x58E556: _opt_args (opt.c:1927)
      ==6857==    by 0x58A3B9: initialize_and_process_args (opt.c:270)
      ==6857==    by 0x591F82: init_srun (srun_job.c:459)
      ==6857==    by 0x427E70: srun (srun.c:193)
      2c7c5459
  2. 21 Oct, 2016 5 commits
  3. 20 Oct, 2016 5 commits
  4. 19 Oct, 2016 8 commits
  5. 18 Oct, 2016 6 commits
  6. 17 Oct, 2016 9 commits
  7. 15 Oct, 2016 2 commits