Commits · c301e59990d3840acf0d6be2571c89447a00304b · Manuel G. Marciani / ces_slurm_simulator

31 Jan, 2013 4 commits
- BLUEGENE - Fix in reservation logic that could cause abort. · c301e599
  Morris Jette authored Jan 31, 2013
  
  c301e599
- Update META for v2.5.2 tag · ac6909cc
  Morris Jette authored Jan 31, 2013
  
  ac6909cc
- Expanded BGQ allocation test, needs more work · 51425724
  Nathan Yee authored Jan 31, 2013
  
  51425724
- Switch/nrt - Dynamically load libnrt.so from within the plugin as needed. · 7c6c6fb4
  Morris Jette authored Jan 31, 2013
```
This eliminates the need for libnrt.so on the head node.
```
  7c6c6fb4
30 Jan, 2013 3 commits
- Note that slurmctld restart is needed to add or remove nodes · 799869bd
  Morris Jette authored Jan 30, 2013
  
  799869bd
- Update maximum line length in slurm configuration file documentation · 745d8f5c
  David Bigagli authored Jan 30, 2013
  
  745d8f5c
- Describe how to add nodes to slurm.conf in FAQ · 9bd2b996
  Morris Jette authored Jan 30, 2013
  
  9bd2b996
29 Jan, 2013 9 commits
- BLUEGENE - If we made a block that isn't runnable because of a overlapping · 73d46476
  Danny Auble authored Jan 29, 2013
```
block, destroy it correctly.
```
  73d46476
- Avoid apparent kernel bug in 2.6.32 which apparently is solved in · 43ffa23a
  Danny Auble authored Jan 29, 2013
```
at least 3.5.0.  This avoids a stack overflow when running jobs on
more than 120k nodes.
```
  43ffa23a
- Flush out perl callbacks for step launching · 82f97a86
  Danny Auble authored Jan 29, 2013
  
  82f97a86
- Fix typo · 453b5608
  Danny Auble authored Jan 29, 2013
  
  453b5608
- Add missing perl callbacks for step_ctx.c (task launch callbacks) · 0ffa571a
  Morris Jette authored Jan 29, 2013
```
The new callbacks are not fleshed out, but eliminates a build error
```
  0ffa571a
- Avoid invalid memory when GRES has no available plugin · 8a63416d
  David Bigagli authored Jan 29, 2013
  
  8a63416d
- Fix for write off end of allocated memory · e60849a5
  David Bigagli authored Jan 29, 2013
  
  e60849a5
- Change variable info to be io_info to avoid confusion with the info · 23bf863f
  Danny Auble authored Jan 28, 2013
```
function.
```
  23bf863f
- Fix typo in log message · 3164de16
  Morris Jette authored Jan 28, 2013
  
  3164de16
28 Jan, 2013 1 commit
- Fix typo in squeue man page · 282b964c
  David Bigagli authored Jan 28, 2013
  
  282b964c
26 Jan, 2013 1 commit
- reset errno to 0 if able to coomunicate with a socket · eedbceda
  Danny Auble authored Jan 26, 2013
  
  eedbceda
23 Jan, 2013 2 commits

In select/cons_res, correct logic when job removed from only some nodes. · eb3c1046

jette authored Jan 23, 2013

I run into a problem with slurm-2.5.1 that IDLE nodes can not be
allocated to jobs. This can be reproduced as follows:

First, submit a job with --no-kill option (I have SLURM_EXCLUSIVE set to
allocate nodes exclusively by default). Then set one of the nodes
allocated to the job(cn2) to state DOWN:

srun: error: Node failure on cn2
srun: error: Node failure on cn2
srun: error: cn2: task 0: Killed
^Csrun: interrupt (one more within 1 sec to abort)
srun: task 1: running
srun: task 0: exited abnormally
^Csrun: sending Ctrl-C to job 22605.0
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
srun: Force Terminated job step 22605.0

Then change state of the node to IDLE again. But it can not be allocated
to jobs:

srun: job 22606 queued and waiting for resources

  JOBID PARTITION     NAME     USER  ST       TIME  NODES
NODELIST(REASON)
  22606      work hostname     root  PD       0:00      1 (Resources)
  22604      work   sbatch     root   R       3:06      1 cn1

NodeName=cn2 Arch=x86_64 CoresPerSocket=8
   CPUAlloc=16 CPUErr=0 CPUTot=16 CPULoad=0.05 Features=abc
   Gres=(null)
   NodeAddr=cn2 NodeHostName=cn2
   OS=Linux RealMemory=30000 Sockets=2 Boards=1
   State=IDLE ThreadsPerCore=1 TmpDisk=0 Weight=1
   BootTime=2012-12-24T15:22:34 SlurmdStartTime=2013-01-14T11:06:32
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0

I traced and located the problem in select/cons_res. The call sequence
is:

slurmctld/node_mgr.c: update_node() =>
slurmctld/job_mgr.c: kill_running_job_by_node_name() =>
excise_node_from_job() =>
plugins/select/cons_res/select_cons_res.c: select_p_job_resized() =>
_rm_job_from_one_node() => _build_row_bitmaps() =>
common/job_resources: remove_job_from_cores()

If there are other jobs running in the partition, the partition row
bitmap will not be set correctly. In the example above, before
_build_row_bitmaps(), output of _dump_part() is:

[2013-01-19T13:24:56+08:00] part:work rows:1 pri:1
[2013-01-19T13:24:56+08:00]   row0: num_jobs 2: bitmap: 16,32-63

after setting the node down, output of _dump_part() is

[2013-01-19T13:24:56+08:00] part:work rows:1 pri:1
[2013-01-19T13:24:56+08:00]   row0: num_jobs 2: bitmap: 16,32-47

Cores of cn2 are not marked as available. Instead, cores of other nodes
are released. When another job requires the node cn2, the following log
message appears:

[2013-01-19T13:25:03+08:00] debug3: cons_res: _vns: node cn2 busy

I do not understand the design of select/cons_res well and I do not know
how to fix this. But it seems that _build_row_bitmaps() should not be
called, since the job is not removed totally, but only one of the nodes
released.

eb3c1046

Correction to comment in spank.h · 8e0ee95a
Morris Jette authored Jan 22, 2013

8e0ee95a

22 Jan, 2013 4 commits
- Minor fix to doc · cf28bf32
  Danny Auble authored Jan 22, 2013
  
  cf28bf32
- In select/cons_res, correct logic to allocate whole sockets to jobs · 47a39777
  Magnus Jonsson authored Jan 21, 2013
  
  47a39777
- select/cons_res plugin: CPU allocation logic fix · a86d2a7a
  jette authored Jan 21, 2013
```
Correction to CPU allocation logic for cores without hyperthreading
Backport of
https://github.com/SchedMD/slurm/commit/1ef41ac9590e018e631eaefb31254622984b7d2d
```
  a86d2a7a
- Remove rosetta.pdf from doc/html/Makefiles · 4b6da14a
  jette authored Jan 21, 2013
  
  4b6da14a
19 Jan, 2013 1 commit
- Fix formatting problem in salloc,sbatch and srun man pages · 17e06ae1
  jette authored Jan 18, 2013
  
  17e06ae1
18 Jan, 2013 7 commits

Fix topology/tree logic when nodes defined in slurm.conf get re-ordered · 29df4c83

Morris Jette authored Jan 18, 2013

From Chris Holmes, HP:
After several days of brainstorming and debugging, I have identified
a bug in SLURM 2.5.0rc2, related to the 'tree' topology. It was so
early in the execution of the whole SLURM machinery that it took me
some time to figure it out (say, 100 or 200 jobs showing the issue,
with more or less debugging levels increased and extra
instrumentation, with sometimes an uncertain reliability)...

For every “switch” a bitmap of nodes (seen down by the switch) is
built as the topology is discovered through 'topology.conf'.

There is code in read_config.c, executed when the SLURM control
daemon starts, that reorders the nodes (according to their hostname
by default), while the switches table (ie the bitmaps) has already
being built. To reorder the nodes means that the bitmaps of the switches become wrong.

29df4c83

Remove rosetta stone pdf, manage file outside of GIT · 58cb666b
Morris Jette authored Jan 18, 2013

58cb666b
Add link to EMC tutorial · 4ac02b99
Morris Jette authored Jan 18, 2013

4ac02b99
Make more variables available to job_submit/lua plugin · 28740196
Morris Jette authored Jan 18, 2013
```
slurm.MEM_PER_CPU, slurm.NO_VAL, etc.
```
28740196
Update rosetta stone · 3cec511a
Morris Jette authored Jan 17, 2013

3cec511a
Add Rosetta Stone table of workload managers · a4417570
Morris Jette authored Jan 17, 2013

a4417570

Permit job with invalid QOS to run if QOS set by administrator · 7aef4f80

Phil Eckert authored Jan 17, 2013

About a year ago I submitted a modification that you incorporated
into SLURM 2.4, which was to allow an admin to modify a job to use
a QOS even though the user did not have access to the QOS.

However, I must have tested it without having the Accounting set
to enforce QOS's. So, if an admin modifies a job to a QOS they
don't have access to, it will be modified, but the job will result
in a state of InvalidQOS, which is reasonable, since this would
handle the case where a user has their QOS removed. A problem,
however, is that even though the scheduler won't schedule the job,
backfill still will.

One approach would be to fix backfill to be consistent with
the scheduler (which should probably occur regardless), but
my thought would be to modify the scheduler to allow the QOS
as long as it was set by an admin, since that was the intent
of the modification to begin with.

I believe it  would only take a single line to change, just
adding a check on the job_ptr->limit_set_qos, to make sure
it was set by an admin:

                if (job_ptr->qos_id) {
                        slurmdb_association_rec_t *assoc_ptr;
                        assoc_ptr = (slurmdb_association_rec_t *)job_ptr->assoc_ptr;
                        if (assoc_ptr &&
                            !bit_test(assoc_ptr->usage->valid_qos,
                                      job_ptr->qos_id) &&
                            !job_ptr->limit_set_qos) {
                                info("sched: JobId=%u has invalid QOS",
                                        job_ptr->job_id);
                                xfree(job_ptr->state_desc);
                                job_ptr->state_reason = FAIL_QOS;
                                continue;
                        } else if (job_ptr->state_reason == FAIL_QOS) {
                                xfree(job_ptr->state_desc);
                                job_ptr->state_reason = WAIT_NO_REASON;
                        }
                }

Phil

7aef4f80

17 Jan, 2013 3 commits
- Terminate sreport on EOF · 892b14aa
  David Bigagli authored Jan 17, 2013
  
  892b14aa
- Fix typo in comment · 56a821a7
  Morris Jette authored Jan 17, 2013
  
  56a821a7
- sacctmgr: terminate sacctmgr on EOF if readline function missing · 4277f365
  David Bigagli authored Jan 17, 2013
  
  4277f365
16 Jan, 2013 5 commits
- Clarify use of job_submit/lua plugin · e7a7d483
  Morris Jette authored Jan 16, 2013
  
  e7a7d483
- Terminate sacctmgr on stdin EOF · fe2f22c1
  David Bigagli authored Jan 16, 2013
  
  fe2f22c1
- Improve example of SallocDefaultCommand in slurm.conf man page · f624d8d0
  Morris Jette authored Jan 16, 2013
  
  f624d8d0
- Add links to LBL Node Health Check program in FAQ and Download web pages · dd7fd98a
  Morris Jette authored Jan 16, 2013
  
  dd7fd98a
- Fix for scheduling batch jobs in multiple partitions · 04fbf26a
  Morris Jette authored Jan 16, 2013
```
Without this change a high priority batch job may not start at submit
time. In addtion, a pending job with mutltiple partitions be cancelled
when the scheduler runs if any of it's partitions can not be used by
the job.
```
  04fbf26a