- 26 Jul, 2011 5 commits
-
-
Morris Jette authored
If a node's configuration differs from the actual hardware configuration, an internal bitmap may be referenced with an invalid index, causing slurmd to abort. This patch prevents that.
-
Morris Jette authored
If the system is configured with a default memory limit, that limit applies to both the job and the job step, which can prevent the expected number of job steps from being started. Two tests were modified to explicitly set a job step memory limit so that more steps can start.
-
Morris Jette authored
This fixes a bug in parsing slurm.conf for node information if there is more than one NodeName=DEFAULT value. Each new DEFAULT line now adds to the existing default values rather than clearing old default values that are not explicitly set to new values on that configuration line.
-
Morris Jette authored
If a node has fewer CPUs than configured and task/affinity is configured, a reference off the end of a bitmap may result without this patch.
-
Morris Jette authored
If the hostname input had a suffix with leading zeros, then the suffix returned also had leading zeros, which caused it to be treated as octal in arithmetic and made at least test1.83 fail in some cases.
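The leading-zero pitfall in the last commit above can be illustrated in C. This is only a sketch of the general problem, not the code touched by the commit: the log does not say where the octal arithmetic took place, and the helper names `suffix_auto_base` and `suffix_decimal` are hypothetical. It relies on the standard behaviour of `strtol()`, which infers octal from a leading zero when the base argument is 0.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical helper: base 0 lets strtol() infer the base from the
 * text, so a leading zero selects octal and "010" parses as 8. */
long suffix_auto_base(const char *suffix)
{
    assert(suffix != NULL);
    return strtol(suffix, NULL, 0);
}

/* Hypothetical helper: forcing base 10 ignores the leading zeros,
 * so "010" parses as 10. */
long suffix_decimal(const char *suffix)
{
    assert(suffix != NULL);
    return strtol(suffix, NULL, 10);
}
```

With a suffix such as "09", the inferred-base variant stops at the digit 9 (not a valid octal digit) and yields 0, while the decimal variant yields 9. Stripping leading zeros, or forcing base 10, before doing arithmetic on a host suffix avoids this kind of mismatch.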
-
- 25 Jul, 2011 8 commits
-
-
Morris Jette authored
Fix logic in how each node's HostAddr and HostNodeName fields are set on a BlueGene system. When values were set on the NodeName line of slurm.conf, only the first node's HostAddr and HostNodeName fields were being set; the others were NULL.
-
Morris Jette authored
Update test suite for BGQ emulation. Some command output has changed slightly. srun command only launches one task (runjob) now.
-
Morris Jette authored
We only launch one user task on an emulated bluegene system, so modify test to only check for the currently expected file(s) for one task.
-
Morris Jette authored
Due to a race condition, a job may be cancelled before the launch completes. In that case, an error message may be logged by slurmctld. This change logs that condition using info() rather than error().
-
Morris Jette authored
One new file was not packaged in the RPM, and the compiler reported that one variable could be used without being initialized and that another variable was never used.
-
Morris Jette authored
Third patch adds the man page and an example. 0003_bull_cgroup_devices_doc_add_allowed_devices_support-2.3.0-0.pre7.patch Patch from Yiannis Georgiou, Bull.
-
Morris Jette authored
Adds support for a file that declares the default allowed devices for all jobs. 0002_bull_cgroup_devices_add_allowed_devices_support-2.3.0-0.pre7.patch Patch from Yiannis Georgiou, Bull.
-
Morris Jette authored
Bug correction found when using sbatch. 0001_bull_cgroup_devices_correct_memory_leak_with_sbatch-2.3.0-0.pre7.patch Patch from Yiannis Georgiou, Bull.
-
- 22 Jul, 2011 23 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
code base by setting the SLURM_LOCAL_GLOBALS_FILE env var. This way if you are running off a git repo and want to test if your changes work on multiple architectures you only have to set the env var to point at a different install/build instead of having to check things out again.
-
Danny Auble authored
code base by setting the SLURM_LOCAL_GLOBALS_FILE env var. This way if you are running off a git repo and want to test if your changes work on multiple architectures you only have to set the env var to point at a different install/build instead of having to check things out again.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
BlueGene: Permit users to specify a separate connection type for each dimension (e.g. "--conn-type=torus,mesh,torus").
-
Morris Jette authored
-
Morris Jette authored
If the user does not set the job name for srun, then set its name to the executable file's name rather than having it always be "runjob".
-
Morris Jette authored
When reading an old state file, if the old coordinates would be out of bounds in the new array, return a NULL pointer rather than going off the end of the array and getting a segv.
-
Morris Jette authored
If unpacking BGQ state information and a bad coordinate is read, then return a NULL pointer rather than going off the end of an array and causing an invalid memory error.
-
Morris Jette authored
The latest version of man2html used to make web pages from man pages produces slightly different output. This patch from Rod Schultz (Bull) will work with the latest version of man2html plus older versions.
-
Morris Jette authored
This reverts commit 48f468ea7136d27bee8c598eb58c0390be7f9ca8.
-
Morris Jette authored
Different versions of man2html generated slightly different output. This change supports both known varieties of output.
-
Morris Jette authored
On Cray systems with the srun2aprun wrapper, build an srun man page that describes which options are available with the wrapper.
-
- 21 Jul, 2011 2 commits
-
-
Morris Jette authored
I've found a minor problem in the script that converts man pages into html. The current script produces two incorrect links on every html man page. Patch from Rod Schultz, Bull.
-
Morris Jette authored
Restore node configuration information (CPUs, memory, etc.) for powered-down nodes when the slurmctld daemon restarts, rather than waiting for the node to be restored to service and getting the information from the node (NOTE: Only relevant if FastSchedule=0).
-
- 20 Jul, 2011 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
Fix bug in select/cons_res task distribution logic when tasks-per-node=0. Eliminates misleading slurmctld message "error: cons_res: _compute_c_b_task_dist oversubscribe." This problem was introduced in SLURM version 2.2.5 in order to fix a task distribution problem when cpus_per_task=0. Patch from Rod Schultz, Bull.
-