Commits · db36b0f0adabe232c0e68649ac8c97ca161effd4 · Manuel G. Marciani / ces_slurm_simulator

03 Dec, 2013 13 commits
- Make sure batch complete RPC has node name · db36b0f0
  Morris Jette authored Dec 03, 2013
```
This is a correction in the logic of commit
3f4b2d51 on launch failures.
```
  db36b0f0
- Make rpmbuild test more general · efdb7cc9
  jette authored Dec 02, 2013
```
Make it work for more system types
Do not delete files if the build fails
```
  efdb7cc9
- remove unneeded files. If someone could figure out how to make rsync work · ad00cb13
  Danny Auble authored Dec 02, 2013
```
here instead of cp please change this :).
```
  ad00cb13
- Remove unneeded files · aa5d9b15
  Danny Auble authored Dec 02, 2013
  
  aa5d9b15
- Make it so we don't have to change the source .spec file · a2ffa7d2
  Danny Auble authored Dec 02, 2013
  
  a2ffa7d2
- CRAY - More mods to the spec file · d7e40b4d
  David Gloe authored Dec 02, 2013
  
  d7e40b4d
- Fix typo · fe8be3ba
  David Gloe authored Dec 02, 2013
  
  fe8be3ba
- Merge branch 'slurm-2.6' · 7a327d96
  Morris Jette authored Dec 02, 2013
```
Conflicts:
	src/slurmctld/job_scheduler.c
```
  7a327d96
- Correct job dependency string · 08265c03
  Morris Jette authored Dec 02, 2013
```
Correct logic returning remaining job dependencies in job information
reported by scontrol and squeue. Eliminates vestigial descriptors with
no job ID values (e.g. "afterany"). As depdencies are removed, the
job ID values were removed from the strings, but not the descriptors.
This eliminates both. It also checks the full job ID to make sure we do
not remove "afterany:1234" when job "123" completes.
```
  08265c03
- Fix missing file in configure.ac · 047c9d35
  Danny Auble authored Dec 02, 2013
  
  047c9d35
- BLUEGENE - now that select_p_job_fini has a lock to block_state_mutex · 01d237ed
  Danny Auble authored Dec 02, 2013
```
handle any job_fail calls after the fact since it will result in deadlock
otherwise.
```
  01d237ed
- BGQ - pass job failure state to bg_status_process_kill_job_list · 662efa17
  Danny Auble authored Dec 02, 2013
  
  662efa17
- BGQ - remove unneeded reference · d285ee34
  Danny Auble authored Dec 02, 2013
  
  d285ee34
02 Dec, 2013 22 commits
- fix bad merge · aeb9f7ae
  Morris Jette authored Dec 02, 2013
  
  aeb9f7ae
- Merge branch 'slurm-2.6' · 77367138
  Morris Jette authored Dec 02, 2013
```
Conflicts:
	NEWS
	doc/man/man5/cgroup.conf.5
```
  77367138
- Check node name on batch job completion · 3f4b2d51
  Morris Jette authored Dec 02, 2013
```
Add a check to make sure that the job completion RPC from a
slurmstepd match that node that the batch job is running on.
This would not be the case of for a job started on a node
if that node's slurmd fails, but the slurmstepd keeps running.
The job could then be requeued and generate a completion RPC
from both slurmstepd daemons (one per node). This logic will
ignore the job complete RPC from the node NOT currently
running the batch job.
```
  3f4b2d51
- Remove trailing spaces from docs · 18237e97
  Morris Jette authored Dec 02, 2013
  
  18237e97
- Fixed sh5util loop when there are no node-step files. · cbb02378
  David Bigagli authored Dec 02, 2013
  
  cbb02378
- fix race condition in batch exit code · 6d1d932b
  Morris Jette authored Dec 02, 2013
```
Fix race condition on batch job termination that could result in a job exit
code of 0xfffffffe if the slurmd on node zero registers its active jobs at
the same time that slurmstepd is recording the job's exit code.
but 535
```
  6d1d932b
- Handle if numa isn't install on the system · 4bdbebcf
  Danny Auble authored Dec 02, 2013
  
  4bdbebcf
- Fix minor typo · bad2056b
  Danny Auble authored Dec 02, 2013
  
  bad2056b
- Remove trailing spaces from docs · 1526643c
  Morris Jette authored Dec 02, 2013
  
  1526643c
- Fixed sh5util loop when there are no node-step files. · fff922e9
  David Bigagli authored Dec 02, 2013
  
  fff922e9
- Fixed sh5util loop when there are no node-step files. · abefd4ee
  David Bigagli authored Dec 02, 2013
  
  abefd4ee
- run autogen.sh · 230213f7
  Danny Auble authored Dec 02, 2013
  
  230213f7
- Fix issues with uninitialized variables · 704801f3
  Danny Auble authored Dec 02, 2013
  
  704801f3
- Fix some compile issues on a non-cray · 5a75d567
  Danny Auble authored Dec 02, 2013
  
  5a75d567
- Remove unneeded #ifdef · 9119175d
  Danny Auble authored Dec 02, 2013
  
  9119175d
- Fix minor formatting and couple of long lines. · 8ee2477e
  Danny Auble authored Dec 02, 2013
  
  8ee2477e
- CRAY - initial patch from cray to support the switch and task · d0cc8dba
  Jason Sollom authored Dec 02, 2013
  
  d0cc8dba
- CRAY - added missing header · cb3e1839
  Danny Auble authored Dec 02, 2013
  
  cb3e1839
- Fix issue if freq is set to 1 · 8014fef1
  Danny Auble authored Dec 02, 2013
  
  8014fef1
- Add new condition to test · fce8b570
  Danny Auble authored Dec 02, 2013
  
  fce8b570
- Attempt to move all UID/GID getting away from the slurmd/slurmstepd. · bedb9226
  Danny Auble authored Nov 27, 2013
```
This is needed primarily for native Cray systems to avoid hitting their
LDAP or NIS server from every node in their system.  This can also provide
a more scalable model for other systems as well.
```
  bedb9226
- Upgrade test to match pbsnodes output enhancements · 178ecfc3
  Morris Jette authored Dec 02, 2013
  
  178ecfc3
29 Nov, 2013 5 commits

Merge branch 'slurm-2.6' · 529f4324
Morris Jette authored Nov 29, 2013

529f4324
Improve test failure clean up · b1699285
Morris Jette authored Nov 29, 2013

b1699285
Merge branch 'slurm-2.6' · 92e4149f
Morris Jette authored Nov 29, 2013
```
Conflicts:
	src/plugins/proctrack/cgroup/proctrack_cgroup.c
```
92e4149f

Morris Jette authored Nov 29, 2013

There was already cgroup locking in the version 14.03 code base
using different variable names and slighly different logic from
that in commit 3f6d9e36.
This commit is a variant of that commit in order to make the logic
in version 2.6 match that of our next release (logic which is
already pretty well tested).
bug 447

13aa9184

proctrack/cgroup - Add lock preventing race condition · 3f6d9e36

Morris Jette authored Nov 29, 2013

proctrack/cgroup - Add locking to prevent race condition where one job step
is ending for a user or job at the same time another job stepsis starting
and the user or job container is deleted from under the starting job step.
bug 447

3f6d9e36