- 12 Jul, 2012 4 commits
-
-
Danny Auble authored
than 1 midplane but not the entire allocation.
-
Danny Auble authored
multi midplane block allocation.
-
Danny Auble authored
-
Danny Auble authored
where other blocks on an overlapping midplane are running jobs.
-
- 11 Jul, 2012 3 commits
-
-
Danny Auble authored
hardware is marked bad, remove the larger block and create a block over just the bad hardware, making the other hardware available to run on.
-
Danny Auble authored
allocation.
-
Danny Auble authored
for a job to finish on it, the number of unused CPUs wasn't updated correctly.
-
- 09 Jul, 2012 1 commit
-
-
Martin Perry authored
See Bugzilla #73 for a more complete description of the problem. Patch by Martin Perry, Bull.
-
- 06 Jul, 2012 1 commit
-
-
Carles Fenoy authored
If a job is submitted to more than one partition, its partition pointer can be set to an invalid value. This can result in the count of CPUs allocated on a node being bad, resulting in over- or under-allocation of its CPUs. Patch by Carles Fenoy, BSC.

Hi all,

After a tough day I've finally found the problem and a solution for 2.4.1. I was able to reproduce the described behavior by submitting jobs to 2 partitions. This causes the job to be allocated in one partition, but in the schedule function the partition of the job is changed to the NON-allocated one. As a result, the resources cannot be freed at the end of the job.

I've solved this by changing the IS_PENDING test some lines above in the schedule function (job_scheduler.c). This is the code from the git HEAD (line 801). As this file has changed a lot from 2.4.x I have not made a patch, but I'm describing the solution here. I've moved the if (!IS_JOB_PENDING) check to after the 2nd line (part_ptr ...). This prevents the job's partition from being changed if it is already starting in another partition.

    job_ptr = job_queue_rec->job_ptr;
    part_ptr = job_queue_rec->part_ptr;
    job_ptr->part_ptr = part_ptr;
    xfree(job_queue_rec);
    if (!IS_JOB_PENDING(job_ptr))
        continue;  /* started in other partition */

Hope this is enough information to solve it. I've just realized (while writing this mail) that my solution has a memory leak, as job_queue_rec is not freed.

Regards,
Carles Fenoy
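For clarity, here is a minimal sketch in C of the reordering described above, assuming the surrounding scheduling loop in job_scheduler.c; it also frees job_queue_rec before the early continue to address the leak mentioned in the message. This is only an illustration of the described fix, not the exact patch that was committed.

    /* Inside the scheduling loop: check whether the job is still pending
     * BEFORE overwriting its partition pointer, and free the queue record
     * on every path so the early continue does not leak it. */
    job_ptr  = job_queue_rec->job_ptr;
    part_ptr = job_queue_rec->part_ptr;
    if (!IS_JOB_PENDING(job_ptr)) {     /* already started in another partition */
        xfree(job_queue_rec);           /* avoid the leak noted above */
        continue;                       /* leave job_ptr->part_ptr untouched */
    }
    job_ptr->part_ptr = part_ptr;
    xfree(job_queue_rec);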
-
- 03 Jul, 2012 1 commit
-
-
Danny Auble authored
there are jobs running on that hardware.
-
- 02 Jul, 2012 1 commit
-
-
Carles Fenoy authored
correctly when transitioning. This also applies for 2.4.0 -> 2.4.1; no state will be lost. (Thanks to Carles Fenoy)
-
- 28 Jun, 2012 1 commit
-
-
Danny Auble authored
-
- 26 Jun, 2012 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
bg.properties in order for the runjob_mux to run correctly. Signed-off-by: Danny Auble <da@schedmd.com>
-
Danny Auble authored
but the job is going to be canceled because it is interactive or for some other reason, it now receives the grace time.
-
Morris Jette authored
-
- 25 Jun, 2012 3 commits
-
-
Danny Auble authored
check if a block is still makable if the cable wasn't in error.
-
Danny Auble authored
removal of the job on the block failed.
-
Danny Auble authored
-
- 22 Jun, 2012 3 commits
-
-
Danny Auble authored
-
Danny Auble authored
same time a block is destroyed and that block just happens to be the smallest overlapping block over the bad hardware.
-
Danny Auble authored
-
- 20 Jun, 2012 2 commits
-
-
Danny Auble authored
but not a node count, the node count is correctly figured out.
-
Morris Jette authored
Without this fix, gang scheduling mode could start without creating a list, resulting in an assert when jobs are submitted.
-
- 18 Jun, 2012 2 commits
-
-
Danny Auble authored
packing the step layout structure.
-
Danny Auble authored
we must use a small block instead of a shared midplane block.
-
- 13 Jun, 2012 2 commits
-
-
Danny Auble authored
still messages we find when we poll but haven't given them back to the real time yet.
-
Danny Auble authored
-
- 12 Jun, 2012 1 commit
-
-
Danny Auble authored
-
- 05 Jun, 2012 1 commit
-
-
Danny Auble authored
a job kill timeout aren't always reported to the system. This is now handled by the runjob_mux plugin.
-
- 01 Jun, 2012 2 commits
-
-
Danny Auble authored
sub-blocks.
-
Danny Auble authored
to make a larger small block and are running with sub-blocks.
-
- 31 May, 2012 1 commit
-
-
Danny Auble authored
function didn't always work correctly.
-
- 30 May, 2012 3 commits
-
-
Danny Auble authored
the next step in the allocation only uses part of the allocation, it gets the correct cnodes.
-
Morris Jette authored
-
Andy Wettstein authored
In etc/init.d/slurm, move the check for scontrol to after sourcing /etc/sysconfig/slurm. Patch from Andy Wettstein, University of Chicago.
-
- 29 May, 2012 1 commit
-
-
Don Lipari authored
-
- 25 May, 2012 2 commits
-
-
Rod Schultz authored
This change makes the code consistent with the documentation. Note that "bf_res=" will continue to be recognized for now. Patch from Rod Schultz, Bull.
-
Don Albert authored
I have implemented the changes as you suggested: using a "-dd" option to indicate that the display of the script is wanted, and setting both the "SHOW_DETAIL" and a new "SHOW_DETAIL2" flag. Since "scontrol" can be run interactively as well, I added a new "script" option to indicate that display of both the script and the details is wanted if the job is a batch job.

Here are the man page updates for "man scontrol".

For the "-d, --details" option:

    -d, --details
        Causes the show command to provide additional details where available.
        Repeating the option more than once (e.g., "-dd") will cause the show
        job command to also list the batch script, if the job was a batch job.

For the interactive "details" option:

    details
        Causes the show command to provide additional details where available.
        Job information will include CPUs and NUMA memory allocated on each
        node. Note that on computers with hyperthreading enabled and SLURM
        configured to allocate cores, each listed CPU represents one physical
        core. Each hyperthread on that core can be allocated a separate task,
        so a job's CPU count and task count may differ. See the --cpu_bind and
        --mem_bind option descriptions in srun man pages for more information.
        The details option is currently only supported for the show job
        command. To also list the batch script for batch jobs, in addition to
        the details, use the script option described below instead of this
        option.

And for the new interactive "script" option:

    script
        Causes the show job command to list the batch script for batch jobs in
        addition to the detail information described under the details option
        above.

Attached are the patch file for the changes and a text file with the results of the tests I did to check out the changes. The patches are against SLURM 2.4.0-rc1.

-Don Albert-
-
- 24 May, 2012 1 commit
-
-
Danny Auble authored
compiling with --enable-debug
-