  1. 25 Jun, 2014 9 commits
  2. 24 Jun, 2014 2 commits
    • core reservation distribution fix · aeecd03c
      Morris Jette authored
      Fix for core-based advanced reservations where the distribution of cores
      across nodes is not even. Failing test case:
        The system has 10 nodes, 1 of which is fully occupied.
        Creating a reservation with 9 nodes and 10 cores
        would always fail with a "busy nodes" error.
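The failing case above can be modeled in a few lines. This is a minimal sketch, not Slurm's selection code: the function name `distribute_cores`, the 4-cores-per-node figure, and the rule that every chosen node receives at least one core are all assumptions made for illustration. It shows that 10 cores over 9 usable nodes is satisfiable only when the per-node counts are allowed to differ.

```python
def distribute_cores(free_cores, node_cnt, core_cnt):
    """Pick node_cnt nodes and spread core_cnt cores over them.

    free_cores maps node name -> number of idle cores (hypothetical model).
    Returns {node: cores_reserved}, or None if the request cannot be met.
    """
    # Fully occupied nodes (0 idle cores) are never candidates.
    candidates = sorted(
        (n for n, c in free_cores.items() if c > 0),
        key=lambda n: (-free_cores[n], n),
    )
    # Assumption: each reserved node must contribute at least one core.
    if len(candidates) < node_cnt or core_cnt < node_cnt:
        return None
    chosen = candidates[:node_cnt]

    # One core per node first, then hand out the remainder round-robin,
    # never exceeding a node's idle-core count. The result may be uneven.
    alloc = {n: 1 for n in chosen}
    remaining = core_cnt - node_cnt
    while remaining > 0:
        progressed = False
        for n in chosen:
            if remaining == 0:
                break
            if alloc[n] < free_cores[n]:
                alloc[n] += 1
                remaining -= 1
                progressed = True
        if not progressed:
            return None  # every chosen node is already full
    return alloc


# The test case from the commit message: 10 nodes, one fully occupied.
free = {f"node{i}": 4 for i in range(9)}
free["node9"] = 0
alloc = distribute_cores(free, node_cnt=9, core_cnt=10)
```

Here `alloc` covers the nine idle nodes, with one node holding two cores and the rest one each. An allocator that insisted on an even split, or that counted the busy node among its candidates, would reject the same request.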
    • Purely cosmetic changes · 339753a1
      Morris Jette authored
  3. 20 Jun, 2014 2 commits
    • Fix in HDF5 for NULL hostname · 5520ec6a
      Morris Jette authored
      The hostname was being written as an HDF5 attribute before the field
      was set in slurmstepd, resulting in a SEGV. This change sets the
      hostname before the HDF5 call and also tests for NULL before trying
      to set the value. Backtrace of the failure:
      (gdb) bt
      0  strlen () at ../sysdeps/x86_64/strlen.S:106
      1  0x00007f4af5bb756d in put_string_attribute (parent=33554432,
         name=0x7f4af5bb8591 "Node Name", value=0x0)
         at src/plugins/acct_gather_profile/hdf5/hdf5_api.c:1711
      2  0x00007f4af5bad224 in acct_gather_profile_p_node_step_start (job=0x194cc80)
         at src/plugins/acct_gather_profile/hdf5/acct_gather_profile_hdf5.c:372
      3  0x000000000052ca86 in acct_gather_profile_g_conf_set (tbl=0x194cc80)
         at src/common/slurm_acct_gather_profile.c:490
      4  0x000000000042e028 in batch_stepd_step_rec_create (msg=0x194d280)
         at src/slurmd/slurmstepd/slurmstepd_job.c:496
      5  0x0000000000426ae5 in mgr_launch_batch_job_setup (msg=0x194d280, cli=0x194bec0)
         at src/slurmd/slurmstepd/mgr.c:422
      6  0x00000000004263da in _step_setup (cli=0x194bec0, self=0x0, msg=0x194bd90)
         at slurmd/slurmstepd/slurmstepd.c:516
      7  0x0000000000424302 in main (argc=1, argv=0x7fff1f7c6c98)
         at src/slurmd/slurmstepd/slurmstepd.c:127
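The two parts of the fix (populate the field before the call that reads it, and guard against a missing value) can be sketched as a small model. This is a Python analogy, not the Slurm C code: `Step`, `set_string_attr`, `start_node_profile`, and `create_step` are hypothetical names, and a dict stands in for the HDF5 group. In C the unguarded path passes NULL to `strlen()`, as frame 0 of the backtrace shows.

```python
class Step:
    """Hypothetical stand-in for the step record built by slurmstepd."""

    def __init__(self):
        self.node_name = None  # not known until explicitly set
        self.attrs = {}        # stands in for the HDF5 attribute store


def set_string_attr(attrs, name, value):
    """Store a string attribute; refuse a missing value instead of crashing."""
    if value is None:
        return False
    attrs[name] = value
    return True


def start_node_profile(step):
    """Record the node name for the step, if one is available."""
    return set_string_attr(step.attrs, "Node Name", step.node_name)


def create_step(hostname):
    """Order matters: assign the field first, then call the code reading it."""
    step = Step()
    step.node_name = hostname
    start_node_profile(step)
    return step


ok_step = create_step("node01")  # attribute is written
bare_step = Step()               # node_name was never set
wrote = start_node_profile(bare_step)
```

Either measure alone would avoid the crash in this model. Applying both means a later reordering of the setup code degrades to a skipped attribute rather than a fault.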
    • 992ec094
      Matthieu Hautreux authored
  4. 19 Jun, 2014 5 commits
  5. 18 Jun, 2014 2 commits
  6. 17 Jun, 2014 15 commits
  7. 16 Jun, 2014 2 commits
  8. 14 Jun, 2014 2 commits
    • Don't reject job on fast-schedule · a8c0b701
      jette authored
      If FastSchedule=0 is configured and some nodes have not registered
      for service (so we do not know their actual resource counts), then
      leave the job pending rather than rejecting it without knowing if
      it can run later (when the node registers and we know its specs).
      bug 872
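The decision described above can be sketched as a three-way outcome. This is a simplified model, not the slurmctld logic: `Decision`, `evaluate_job`, and the single CPU-count requirement are assumptions for illustration, and an unregistered node is represented by `None`.

```python
from enum import Enum


class Decision(Enum):
    ACCEPT = "accept"
    PENDING = "pending"
    REJECT = "reject"


def evaluate_job(required_cpus, node_cpus, fast_schedule=0):
    """Decide what to do with a job that needs required_cpus on one node.

    node_cpus maps node name -> registered CPU count, or None when the
    node has not registered yet and its real resources are unknown.
    """
    known = [c for c in node_cpus.values() if c is not None]
    if any(c >= required_cpus for c in known):
        return Decision.ACCEPT

    has_unregistered = len(known) < len(node_cpus)
    if fast_schedule == 0 and has_unregistered:
        # No registered node fits, but an unregistered one still might:
        # keep the job queued instead of rejecting it outright.
        return Decision.PENDING

    # Every node is known and none fits. Nonzero FastSchedule settings,
    # which trust configured values, are not modeled here.
    return Decision.REJECT


pending = evaluate_job(16, {"n1": 8, "n2": None})
rejected = evaluate_job(16, {"n1": 8, "n2": 8})
accepted = evaluate_job(4, {"n1": 8, "n2": None})
```

The key distinction is between "no node can ever satisfy this" and "no node is known to satisfy this yet". Only the first justifies rejecting the job.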
    • Replace RPC number with name in log · 24bec0bf
      jette authored
  9. 13 Jun, 2014 1 commit