  1. 05 Feb, 2014 3 commits
  2. 04 Feb, 2014 2 commits
  3. 03 Feb, 2014 1 commit
  4. 31 Jan, 2014 3 commits
    • 31d409b7
      David Bigagli authored
    • Make sure node limits get assessed if no node count was given in request. · 5b0f9c39
      Danny Auble authored
      For example, "salloc -n32" does not specify a node count. With the previous
      code, if such a request ended up using 4 nodes while only 1 node was left in
      GrpNodes, it would run without issue, because the limits were checked before
      the number of nodes was selected.
      
      The check is now done afterwards, so the limits on the number of nodes, CPUs
      and the amount of memory to be used are always enforced.
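      A minimal sketch of the reordered check, with hypothetical names
      (not the actual slurmctld code); the group limits are only compared
      once the selected node, CPU and memory totals are known:

      #include <stdbool.h>
      #include <stdint.h>

      /* Hypothetical record of what is still available to the
       * association (GrpNodes, GrpCPUs, memory). */
      struct grp_remaining {
              uint32_t nodes;
              uint32_t cpus;
              uint64_t mem_mb;
      };

      /* Called only after node selection, so a "salloc -n32" that lands
       * on 4 nodes with only 1 node left in GrpNodes is caught here
       * instead of slipping through a pre-selection check. */
      static bool within_grp_limits(uint32_t sel_nodes, uint32_t sel_cpus,
                                    uint64_t sel_mem_mb,
                                    const struct grp_remaining *rem)
      {
              return (sel_nodes  <= rem->nodes) &&
                     (sel_cpus   <= rem->cpus)  &&
                     (sel_mem_mb <= rem->mem_mb);
      }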
    • Fix step allocation failure due to memory use · 8b76b93c
      Morris Jette authored
      Fix step allocation when some CPUs are not available due to memory limits.
      This happens when one step is active and using memory that blocks the
      scheduling of another step on a portion of the CPUs needed. The new step
      is now delayed rather than aborting with "Requested node configuration is
      not available".
      bug 577
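      A rough sketch of the behavioral change (illustrative only, not the
      actual step-scheduling code): a shortage caused by another step's
      memory use is treated as temporary and retried later, rather than
      failing outright.

      #include <stdbool.h>

      enum step_rc { STEP_RUN, STEP_DEFER, STEP_FAIL };

      /* If enough CPUs are free, run the step.  If the CPUs exist but
       * are unusable only because a running step holds their memory,
       * defer and retry later.  Only a genuine configuration mismatch
       * produces the fatal "Requested node configuration is not
       * available" error. */
      static enum step_rc schedule_step(bool cpus_available,
                                        bool blocked_by_step_memory)
      {
              if (cpus_available)
                      return STEP_RUN;
              if (blocked_by_step_memory)
                      return STEP_DEFER;
              return STEP_FAIL;
      }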
  5. 28 Jan, 2014 1 commit
  6. 23 Jan, 2014 2 commits
  7. 21 Jan, 2014 2 commits
  8. 18 Jan, 2014 1 commit
  9. 16 Jan, 2014 2 commits
  10. 15 Jan, 2014 1 commit
  11. 13 Jan, 2014 2 commits
  12. 08 Jan, 2014 3 commits
  13. 07 Jan, 2014 2 commits
  14. 06 Jan, 2014 2 commits
    • Reset job priority on manual resume · 65d9196c
      Morris Jette authored
      If a job is explicitly suspended, its priority is set to zero.
      This patch resets the priority when the job is requeued, and also
      documents that if the job is requeued (e.g. due to a node failure),
      it is placed in a held state.
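      A minimal sketch of the priority handling described above (the
      field and function names are illustrative, not the actual job
      record): an explicit suspend zeroes the priority, a requeue resets
      it, and a requeue caused by a node failure leaves the job held.

      #include <stdbool.h>
      #include <stdint.h>

      struct job_sketch {
              uint32_t priority;        /* 0 means the job is held */
              uint32_t saved_priority;  /* value before the suspend */
      };

      static void on_explicit_suspend(struct job_sketch *job)
      {
              job->saved_priority = job->priority;
              job->priority = 0;        /* job no longer competes */
      }

      static void on_requeue(struct job_sketch *job, bool node_failed)
      {
              if (node_failed)
                      job->priority = 0;   /* held until released */
              else
                      job->priority = job->saved_priority;
      }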
    • Correct job RunTime if requeued from suspend state · bc3d8828
      Morris Jette authored
      Without this patch, a requeued job's RunTime includes the RunTime
      accumulated before its prior suspension (i.e. the job's full RunTime
      rather than just the RunTime of the requeued run).
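      A small illustration of the accounting issue (hypothetical field
      names, not the actual job record): RunTime is the wall time since
      the start plus any time accumulated before a suspend, so both
      pieces must be cleared on requeue or the old RunTime carries over.

      #include <time.h>

      struct job_time_sketch {
              time_t start_time;    /* when the current run began */
              time_t pre_sus_time;  /* run time banked before a suspend */
      };

      static time_t job_run_time(const struct job_time_sketch *j, time_t now)
      {
              return (now - j->start_time) + j->pre_sus_time;
      }

      /* Without the fix, the banked time survived the requeue, so the
       * new run started with the full prior RunTime already counted. */
      static void reset_time_on_requeue(struct job_time_sketch *j, time_t now)
      {
              j->start_time   = now;
              j->pre_sus_time = 0;
      }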
  15. 27 Dec, 2013 1 commit
    • Fix sched/backfill bug that could starve jobs · 2bae8bd6
      Filip Skalski authored
      Hello,
      
      I think I found another bug in the code (I'm using 2.6.3 but I checked the 2.6.5 and 14.03 versions and it's the same there).
      
      In file sched/backfill/backfill.c:
      
      1)
      In the _add_reservation function, starting at line 1172:
      
      if (placed == true) {
              j = node_space[j].next;
              if (j && (end_reserve < node_space[j].end_time)) {
                      /* insert end entry record */
                      i = *node_space_recs;
                      node_space[i].begin_time = end_reserve;
                      node_space[i].end_time = node_space[j].end_time;
                      node_space[j].end_time = end_reserve;
                      node_space[i].avail_bitmap =
                              bit_copy(node_space[j].avail_bitmap);
                      node_space[i].next = node_space[j].next;
                      node_space[j].next = i;
                      (*node_space_recs)++;
              }
              break;
      }
      I drew a picture of the `node_space` state after 2 iterations (see attachment).
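      Since the attachment is not included here, the shape of those
      records can be inferred from the snippet above; roughly (the
      bitstr_t typedef is only an opaque stand-in to keep the sketch
      self-contained):

      #include <time.h>

      typedef struct bitstr bitstr_t;   /* SLURM's node bitmap type */

      typedef struct node_space_map {
              time_t    begin_time;    /* start of the time window */
              time_t    end_time;      /* end of the time window */
              bitstr_t *avail_bitmap;  /* nodes still idle in the window */
              int       next;          /* index of the next record, by
                                        * time; 0 terminates the list */
      } node_space_map_t;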
      
      In the case where the new reservation is fully inside another reservation,
      everything is OK.
      But if the new reservation spans multiple existing reservations, the `end entry record` is not created.
      This is because only the newly created `start entry record` is checked, not any of the records after it.
      
      An easy fix would be to change the if into a loop, for example:
      
      if (placed == true) {
              while ((j = node_space[j].next) > 0) {
                      if (end_reserve < node_space[j].end_time) {
                              /* insert end entry record, as above */
                              i = *node_space_recs;
                              node_space[i].begin_time = end_reserve;
                              node_space[i].end_time = node_space[j].end_time;
                              node_space[j].end_time = end_reserve;
                              node_space[i].avail_bitmap =
                                      bit_copy(node_space[j].avail_bitmap);
                              node_space[i].next = node_space[j].next;
                              node_space[j].next = i;
                              (*node_space_recs)++;
                              break;
                      }
              }
              break;
      }
      
      2)
      You could also change line 612:
              node_space = xmalloc(sizeof(node_space_map_t) *
                                   (max_backfill_job_cnt + 3));
      to use `(max_backfill_job_cnt * 2 + 1)`, since each reservation can add at most two entries (the check at line 982 should never execute). At the moment, in the worst case this only accommodates about half of max_backfill_job_cnt.
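      With that change, line 612 would read:

              node_space = xmalloc(sizeof(node_space_map_t) *
                                   (max_backfill_job_cnt * 2 + 1));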
      
      NOTE: However, this all assumes that the current behavior is not intentional, i.e. not a deliberate trade of some accuracy for faster calculations (especially point 2).
      
      Best regards,
      Filip Skalski
  16. 23 Dec, 2013 2 commits
  17. 20 Dec, 2013 2 commits
  18. 19 Dec, 2013 1 commit
    • scontrol show job - Correct NumNodes value · b31e2176
      Morris Jette authored
      The NumNodes value now uses an improved calculation for pending
      jobs and the actual node count for jobs that have started
      (including suspended, completed, etc.).
      bug 549
  19. 18 Dec, 2013 1 commit
  20. 17 Dec, 2013 2 commits
  21. 16 Dec, 2013 1 commit
  22. 14 Dec, 2013 1 commit
  23. 13 Dec, 2013 2 commits