1. 24 Aug, 2017 5 commits
    • Alejandro Sanchez's avatar
      Fix Coverity CID 174746: Control flow issues (DEADCODE). · 1a603a7b
      Alejandro Sanchez authored
      Testing if curl_handle != NULL or rc != SLURM_SUCCESS was already done
      in the right above if/else statements, jumping to the consequent goto
      cleanup label if needed. Thus the removed test was never going to be
      evaluated to true, and Coverity properly warned about this.
      
      Regression introduced in commit 5f5e6472 (code cleanup).
      1a603a7b
    • Alejandro Sanchez's avatar
      Merge branch 'slurm-17.02' · 75777f44
      Alejandro Sanchez authored
      75777f44
    • Alejandro Sanchez's avatar
      Prevent slurmstepd ABRT when parsing gres.conf CPUs. · 3e1fffb6
      Alejandro Sanchez authored
      Calling bit_unfmt() with a zero bit_size() bitmap leads to a later
      call to bit_nclear() with start=0 and stop=-1, leading to the ABRT.
      
      This scenario happened when cgroup.conf has ConstrainDevices=yes and
      task_cgroup_devices_create() tries to collect the GRES devices
      but gres_cpu_cnt=0, thus creating a p->cpus_bitmap = bit_alloc(gres_cpu_cnt);
      of zero size which is passed by argument to bit_unfmt().
      
      gres_cpu_cnt is 0 because we have defined a gres.conf like this:
      
      Name=gpu Type=tesla File=/tmp/gres/tesla0 CPUs=0,1
      Name=gpu Type=tesla File=/tmp/gres/tesla1 CPUs=0,1
      Name=gpu Type=kepler File=/tmp/gres/kepler0 CPUs=2,3
      Name=gpu Type=kepler File=/tmp/gres/kepler1 CPUs=2,3
      
      but have no GresTypes nor GRES option in the slurm.conf / node config def.
      
      Bug 3974
      3e1fffb6
    • Alejandro Sanchez's avatar
    • Danny Auble's avatar
      Simplify common code in sview. · 90ca37b0
      Danny Auble authored
      90ca37b0
  2. 23 Aug, 2017 5 commits
    • Alejandro Sanchez's avatar
    • Alejandro Sanchez's avatar
      Merge branch 'slurm-17.02' · d61c0d3e
      Alejandro Sanchez authored
      d61c0d3e
    • Alejandro Sanchez's avatar
      jobcomp/elasticsearch - fix memory leak when transferring generated buffer. · 8172b7df
      Alejandro Sanchez authored
      Running slurmctld under valgrind while operating with jobcomp/elasticsearch
      reported the following bytes definitely lost:
      
      ==27403== 658 bytes in 1 blocks are definitely lost in loss record 301 of 342
      ==27403==    at 0x4C2FD4F: realloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so)
      ==27403==    by 0x2281B3: slurm_xrealloc (xmalloc.c:137)
      ==27403==    by 0x22856A: makespace (xstring.c:114)
      ==27403==    by 0x2285D0: _xstrcat (xstring.c:132)
      ==27403==    by 0x228CE0: _xstrfmtcat (xstring.c:291)
      ==27403==    by 0x83C5BCD: ???
      ==27403==    by 0x30A913: g_slurm_jobcomp_write (slurm_jobcomp.c:172)
      ==27403==    by 0x18D8FC: job_completion_logger (job_mgr.c:13652)
      
      It turns out the generated buffer in slurm_jobcomp_log_record was xstrdup'ed to
      the corresponding job_node->serialized_job, but the originally generated buffer
      wasn't freed afterwards. The fix consists in change the transfer so that instead
      of xstrdup'ing the char * we just assign the pointer and NULL the buffer.
      
      The job_node->serialized_job was already xfree'd properly later when the job
      was indexed.
      
      Discovered while working on Bug 4065.
      8172b7df
    • Tim Wickberg's avatar
      Print a warning if no results list is available. · 6d15591f
      Tim Wickberg authored
      This should only happen due to ESLURM_RESULT_TOO_LARGE,
      which leads to no list being packed.
      
      Follow on to 390da8cf / 8cf1835c.
      
      Bug 3624.
      6d15591f
    • Danny Auble's avatar
      Fix from change in ffe2c5dc that removed last parameter to the · d24525d4
      Danny Auble authored
      launch_g_step_wait() function.
      d24525d4
  3. 22 Aug, 2017 20 commits
  4. 21 Aug, 2017 6 commits
  5. 19 Aug, 2017 3 commits
  6. 18 Aug, 2017 1 commit