1. 26 Jul, 2011 5 commits
    • Morris Jette's avatar
      Fix a task affinity bug for misconfigured system · d3c9706c
      Morris Jette authored
      If a node's configuration differs from the actual hardware configureation,
      an internal bitmap may be referenced with an invalid index causing slurmd
      to abort without this patch.
      d3c9706c
    • Morris Jette's avatar
      fix regression tests when mem limit present · 7edfe1ba
      Morris Jette authored
      If system is configured with a default memory limit, that applies to both the job
      and job step which can prevent the expected number of job steps from being started.
      Two tests were modified to explicitly set a job step memory limit to start more steps.
      7edfe1ba
    • Morris Jette's avatar
      Fix bug in node info parsing with multiple DEFAULT values · 8245a307
      Morris Jette authored
      This fixes a bug in parsing slurm.conf for node information if there
      are more than one NodeName=DEFAULT value. This adds to existing default
      values rather than clearing old default values that are not explicitly
      set to new values on that configuration line.
      8245a307
    • Morris Jette's avatar
      Prevent invalid bitmap offset with bad node configuration · a3ae51cd
      Morris Jette authored
      If a node has fewer CPUs than configured and task/affinity is configured a
      reference off of the end of a bitmap may result without this patch.
      a3ae51cd
    • Morris Jette's avatar
      Test suite fix for get_suffix() · 18179d44
      Morris Jette authored
      If hostname input had suffix with leading zeros then the suffix returned
      had leading zeros, which performed octal arithmetic causing at least
      test1.83 to fail in some cases.
      18179d44
  2. 25 Jul, 2011 8 commits
    • Morris Jette's avatar
      Fix for setting HostAddr and HostNodeName for BlueGene · 0d11e9a3
      Morris Jette authored
      Fix logic in how each node's  HostAddr and HostNodeName fields are
      set on a BlueGene system. When values are set on the NodeName line
      of slurm.conf, only the first node's  HostAddr and HostNodeName fields
      were being set, others were NULL.
      0d11e9a3
    • Morris Jette's avatar
      BGQ test suite updates · 3d1b1e1d
      Morris Jette authored
      Update test suite for BGQ emulation. Some command output has changed
      slightly. srun command only launches one task (runjob) now.
      3d1b1e1d
    • Morris Jette's avatar
      Fix regression test for current bluegene emulation logic · 7964661c
      Morris Jette authored
      We only launch one user task on an emulated bluegene system, so modify test
      to only check for the currently expected file(s) for one task.
      7964661c
    • Morris Jette's avatar
      Fix misleading error message · 98d67658
      Morris Jette authored
      Due to a race condition, a job may be cancelled before the launch completes.
      In that case, an error message may be logged by slurmctld. This change makes
      that condition be logged using info() rather than error().
      98d67658
    • Morris Jette's avatar
      Corrections to recent cgroup patches · 25b1c058
      Morris Jette authored
      One new file was not packaged in the RPM and compiler reported that one
      variable could be used without being initialized and another variable
      was never used.
      25b1c058
    • Morris Jette's avatar
      tches for cgroup devices support · 3ff4eb9b
      Morris Jette authored
      third patch adds the man page and an example.
      0003_bull_cgroup_devices_doc_add_allowed_devices_support-2.3.0-0.pre7.patch
      Patch from Yiannis Georgiou, Bull.
      3ff4eb9b
    • Morris Jette's avatar
      tches for cgroup devices support · 35c28774
      Morris Jette authored
      adds the support of a file to declare the default allowed devices for
      all the jobs.
      0002_bull_cgroup_devices_add_allowed_devices_support-2.3.0-0.pre7.patch
      Patch from Yiannis.Georgiou, Bull.
      35c28774
    • Morris Jette's avatar
      patches for cgroup devices support · 61bd6a43
      Morris Jette authored
      bug correction that I found when using sbatch,
      0001_bull_cgroup_devices_correct_memory_leak_with_sbatch-2.3.0-0.pre7.patch
      Patch from Yiannis.Georgiou, Bull.
      61bd6a43
  3. 22 Jul, 2011 23 commits
  4. 21 Jul, 2011 2 commits
    • Morris Jette's avatar
      man2html patch corrects links · 94e41119
      Morris Jette authored
      I've found a minor problem in the script that converts man pages into
      html. The current script produces two incorrect links on every html man
      page. Patch from Rod Schultz, Bull.
      94e41119
    • Morris Jette's avatar
      Restore node configuration information on slurmctld restart · f729d72b
      Morris Jette authored
      Restore node configuration information (CPUs, memory, etc.) for powered
      down when slurmctld daemon restarts rather than waiting for the node to be
      restored to service and getting the information from the node (NOTE: Only
      relevent if FastSchedule=0).
      f729d72b
  5. 20 Jul, 2011 2 commits
    • Morris Jette's avatar
      Add clarifying comment · 2ccd9456
      Morris Jette authored
      2ccd9456
    • Morris Jette's avatar
      Fix select/cons_res task distribution bug · b70cc235
      Morris Jette authored
      Fix bug in select/cons_res task distribution logic when tasks-per-node=0.
      Eliminates misleading slurmctld message
      "error:  cons_res: _compute_c_b_task_dist oversubscribe."
      This problem was introduced in SLURM version 2.2.5 in order to fix
      a task distribution problem when cpus_per_task=0. Patch from Rod Schultz, Bull.
      b70cc235