1. 02 Feb, 2016 7 commits
    • Tim Wickberg's avatar
      Merge branch 'slurm-15.08' · e7915fdf
      Tim Wickberg authored
      e7915fdf
    • Tim Wickberg's avatar
      replace bash-isms in slurm.m4, run autogen.sh · 7c5fedbc
      Tim Wickberg authored
      $((10#$SLURM_API_MAJOR)) is bash-specific. replace with portable
      ${SLURM_API_MAJOR#0} which accomplishes the same thing.
      
      The first forces bash to treat the value as base-10 even with a
      leading zero, the second portable format strips a leading zero off.
      7c5fedbc
    • Tim Wickberg's avatar
      remove unused sys/termios.h include · 3f597bae
      Tim Wickberg authored
      Also remove checks for sys/termios.h from build system.
      Slurm directly includes the POSIX-required <termios.h> already,
      and the one use of this conditional is being removed here.
      
      Fixes one of several build errors on FreeBSD.
      3f597bae
    • Morris Jette's avatar
      Merge branch 'slurm-15.08' · e84e3ea3
      Morris Jette authored
      e84e3ea3
    • Morris Jette's avatar
      reservation relocation fix · 31ac01ce
      Morris Jette authored
      This fixes a bug introduced in commit a801d264
      Whole node resource allocations with REPLACE option were not working.
      Detected by test3.14 failure.
      31ac01ce
    • Didier GAZEN's avatar
      Fix support for AuthInfo in slurmdbd.conf · fa4222ec
      Didier GAZEN authored
      Support AuthInfo in slurmdbd.conf that is different from the value in
          slurm.conf.
      There is a possible bug in the slurm_get_auth_info function (src/common/slurm_protocol_api.c) that can cause the slurmdbd daemon to look for the AuthInfo parameter in slurm.conf instead of slurmdbd.conf when the auth/munge authentication method is used (AuthType=auth/munge).
      
      Here is the slurmdbd log revealing the problem (debug5() printing were added in the sources) :
      
      slurmdbd: slurmdbd version 15.08.7 started
      slurmdbd: debug2: running rollup at Tue Feb 02 14:20:14 2016
      slurmdbd: debug5: in ../../../src/slurmdbd/slurmdbd.c, _send_slurmctld_register_req (line 690)
      slurmdbd: debug5: in ../../../src/common/slurm_protocol_api.c, slurm_send_node_msg (line 3601)
      slurmdbd: debug5: in ../../../../../src/plugins/auth/munge/auth_munge.c, slurm_auth_create (line 217)
      slurmdbd: debug5: in ../../../src/common/slurm_protocol_api.c, slurm_get_auth_ttl (line 1732)
      slurmdbd: debug5: Entering ../../../src/common/slurm_protocol_api.c, slurm_get_auth_info
      slurmdbd: debug:  Reading slurm.conf file: /usr/local/slurm-15-08-7-1/etc/slurm.conf
      slurmdbd: error: s_p_parse_file: unable to status file /usr/local/slurm-15-08-7-1/etc/slurm.conf: No such file or directory, retrying in 1sec up to 60sec
      ...
      
      Then 60 seconds later, the auth_info value returned by slurm_get_auth_info is NULL:
      
      slurmdbd: debug5: Leaving ../../../src/common/slurm_protocol_api.c, slurm_get_auth_info, auth_info=(null)
      
      and slurmdbd continues without crashing, but I am not sure it is in a safe state.
      
      When applying this patch :
      
      diff --git a/src/common/slurm_protocol_api.c b/src/common/slurm_protocol_api.c
      index c5db879..be1dab6 100644
      --- a/src/common/slurm_protocol_api.c
      +++ b/src/common/slurm_protocol_api.c
      @@ -1703,9 +1703,13 @@ extern char *slurm_get_auth_info(void)
              char *auth_info;
              slurm_ctl_conf_t *conf;
      
      -       conf = slurm_conf_lock();
      -       auth_info = xstrdup(conf->authinfo);
      -       slurm_conf_unlock();
      +       if (slurmdbd_conf) {
      +                auth_info = xstrdup(slurmdbd_conf->auth_info);
      +        } else {
      +               conf = slurm_conf_lock();
      +               auth_info = xstrdup(conf->authinfo);
      +               slurm_conf_unlock();
      +       }
      
              return auth_info;
       }
      
      the auth_info value is now valid and consistent with the slurmdbd.conf setting:
      
      slurmdbd: slurmdbd version 15.08.7 started
      slurmdbd: debug2: running rollup at Tue Feb 02 14:47:37 2016
      slurmdbd: debug5: in ../../../src/slurmdbd/slurmdbd.c, _send_slurmctld_register_req (line 690)
      slurmdbd: debug5: in ../../../src/common/slurm_protocol_api.c, slurm_send_node_msg (line 3600)
      slurmdbd: debug5: in ../../../../../src/plugins/auth/munge/auth_munge.c, slurm_auth_create (line 217)
      slurmdbd: debug5: in ../../../src/common/slurm_protocol_api.c, slurm_get_auth_ttl (line 1731)
      slurmdbd: debug5: Entering ../../../src/common/slurm_protocol_api.c, slurm_get_auth_info
      slurmdbd: debug5: Leaving ../../../src/common/slurm_protocol_api.c, slurm_get_auth_info, auth_info=socket=/var/run/munge/munge_dbd.socket.2
      fa4222ec
    • Morris Jette's avatar
      Reboot logic · 6f157dbe
      Morris Jette authored
      Reserve node weight value of INIFINITE for nodes which require reboot
      Avoid scheduling on nodes requiring reboot that are not IDLE
        (More work needed for backfill and will_run RPC).
      6f157dbe
  2. 01 Feb, 2016 8 commits
  3. 30 Jan, 2016 1 commit
  4. 29 Jan, 2016 4 commits
  5. 28 Jan, 2016 10 commits
  6. 27 Jan, 2016 10 commits