1. 25 Jun, 2014 1 commit
  2. 24 Jun, 2014 2 commits
    • Morris Jette's avatar
      core reservation distribution fix · aeecd03c
      Morris Jette authored
      Fix for core-based advanced reservations where the distribution of cores
      across nodes is not even. Failing test case:
      system has 10 nodes, 1 of which is fully occupied
      create reservation with 9 nodes and 10 cores
      always would fail with "busy nodes" error
      aeecd03c
    • Morris Jette's avatar
      Purely cosmetic changes · 339753a1
      Morris Jette authored
      339753a1
  3. 20 Jun, 2014 2 commits
    • Morris Jette's avatar
      Fix in HDF5 for NULL hostname · 5520ec6a
      Morris Jette authored
      The hostname was being set as an HDF5 value before the field was
      set in slurmstepd, resulting in SEGV. This change sets hostname
      before the HDF5 call and also tests for NULL before trying to set
      the value. Backtrace of failure:
      (gdb) bt
      0  strlen () at ../sysdeps/x86_64/strlen.S:106
      1  0x00007f4af5bb756d in put_string_attribute (parent=33554432,
         name=0x7f4af5bb8591 "Node Name", value=0x0)
         at src/plugins/acct_gather_profile/hdf5/hdf5_api.c:1711
      2  0x00007f4af5bad224 in acct_gather_profile_p_node_step_start (job=0x194cc80)
         at src/plugins/acct_gather_profile/hdf5/acct_gather_profile_hdf5.c:372
      3  0x000000000052ca86 in acct_gather_profile_g_conf_set (tbl=0x194cc80)
         at src/common/slurm_acct_gather_profile.c:490
      4  0x000000000042e028 in batch_stepd_step_rec_create (msg=0x194d280)
         at src/slurmd/slurmstepd/slurmstepd_job.c:496
      5  0x0000000000426ae5 in mgr_launch_batch_job_setup (msg=0x194d280, cli=0x194bec0)
         at src/slurmd/slurmstepd/mgr.c:422
      6  0x00000000004263da in _step_setup (cli=0x194bec0, self=0x0, msg=0x194bd90)
         at slurmd/slurmstepd/slurmstepd.c:516
      7  0x0000000000424302 in main (argc=1, argv=0x7fff1f7c6c98)
          at src/slurmd/slurmstepd/slurmstepd.c:127
      5520ec6a
    • Matthieu Hautreux's avatar
      992ec094
  4. 19 Jun, 2014 5 commits
  5. 18 Jun, 2014 2 commits
  6. 17 Jun, 2014 15 commits
  7. 16 Jun, 2014 2 commits
  8. 14 Jun, 2014 2 commits
    • jette's avatar
      Don't reject job on fast-schedule · a8c0b701
      jette authored
      If FastSchedule=0 is configured and some nodes have not registered
      for service (so we do not know their actual resource counts), then
      leave the job pending rather than rejecting it without knowing if
      it can run later (when the node registers and we know its specs).
      bug 872
      a8c0b701
    • jette's avatar
      Replace RPC number with name in log · 24bec0bf
      jette authored
      24bec0bf
  9. 13 Jun, 2014 2 commits
  10. 12 Jun, 2014 7 commits
    • Morris Jette's avatar
      scontrol show job report correct CPU_IDs · 04921aa2
      Morris Jette authored
      For "scontrol --details show job" report the correct CPU_IDs when thre are
      multiple threads per core (we are translating a core bitmap to CPU IDs).
      This is an enhancement of commit 83d626ca
      so the node table is only loaded once for the entire job table.
      bug 850
      04921aa2
    • Martin Perry's avatar
      Correct ID of CPUs allocated to job · 83d626ca
      Martin Perry authored
      Correct the record of CPU_IDs allocated to a job if there is more
      than one CPU per core.
      83d626ca
    • Morris Jette's avatar
      Fix job --exclusive option enforcement · f07f19eb
      Morris Jette authored
      If job requests --exclusive then do not use nodes which have any cores in an
      advanced reservation. Also prevents case where nodes can be shared by other
      jobs.
      f07f19eb
    • Morris Jette's avatar
      select/cons_res log change · 9ed92aa2
      Morris Jette authored
      Disable some logging that would be very slow unless
      the _DEBUG flag is set in the plugin
      9ed92aa2
    • Morris Jette's avatar
      Honor job exclusive with reserved cores · f5d6bda0
      Morris Jette authored
      If job requests --exclusive then do not use nodes which have any cores in an
      advanced reservation. Previously the job would be allocated all of the cores
      outside of the advanced reservation.
      f5d6bda0
    • Morris Jette's avatar
      Fix shared=yes support · c773b750
      Morris Jette authored
      Correct support for partition with Shared=YES configuration.
      Previous logic would share resources for jobs by default
      (i.e. if user did not explicitly request --exclusive).
      bug 758
      c773b750
    • Jens Dreger's avatar
      Correct PTY use FAQ · e4798fcf
      Jens Dreger authored
      e4798fcf