Commits · c6828f2da9dc3a68506ea317bebf268beee1a334 · Manuel G. Marciani / ces_slurm_simulator

02 May, 2014 14 commits
- Merge remote-tracking branch 'origin/slurm-14.03' · c6828f2d
  Danny Auble authored May 02, 2014
```
Conflicts:
	META
	NEWS
```
  c6828f2d
- Update NEWS for next version · 87080f15
  Danny Auble authored May 02, 2014
  
  87080f15
- META for tag 14.03.2 · 36b69754
  Danny Auble authored May 02, 2014
  
  36b69754
- update test to handle change in commit · 42722cff
  Danny Auble authored May 02, 2014
```
cbcea672
```
  42722cff
- Addendum to commit c6833796 to handle · 3de7a0aa
  Danny Auble authored May 02, 2014
```
situations where the association tree has multiple grp limits, only honor
the first one.
```
  3de7a0aa
- Merge remote-tracking branch 'origin/slurm-14.03' · 2eaee689
  Danny Auble authored May 02, 2014
  
  2eaee689
- Merge remote-tracking branch 'origin/slurm-2.6' into slurm-14.03 · 1c1f8b80
  Danny Auble authored May 02, 2014
  
  1c1f8b80
- Handle node ranges better when dealing with accounting max node limits. · c6833796
  Danny Auble authored May 02, 2014
```
This is for bug 775
```
  c6833796
- BGQ - Temp fix issue where job could be left on job_list after it finished. · e4f1a099
  Danny Auble authored May 02, 2014
  
  e4f1a099
- Merge remote-tracking branch 'origin/slurm-2.6' into slurm-14.03 · 845e8ab9
  Danny Auble authored May 02, 2014
```
Conflicts:
	src/slurmd/slurmstepd/slurmstepd_job.c
```
  845e8ab9
- Fix issue where user is requesting --acctg-freq=0 and no memory limits. · 17e4e2ac
  Danny Auble authored May 02, 2014
  
  17e4e2ac
- Merge remote-tracking branch 'origin/slurm-14.03' · 4c7e8765
  Danny Auble authored May 01, 2014
  
  4c7e8765
- minor formatting · 4b37fb39
  Danny Auble authored May 01, 2014
  
  4b37fb39
- Merge remote-tracking branch 'origin/slurm-14.03' · ebd0cb50
  Danny Auble authored May 01, 2014
  
  ebd0cb50
01 May, 2014 8 commits
- Fix allowgroup on bad group seg fault with the controller. · 76846134
  Danny Auble authored May 01, 2014
```
regression from 2a674aee
```
  76846134
- Merge remote-tracking branch 'origin/slurm-14.03' · 7c61f9f7
  Danny Auble authored May 01, 2014
```
Conflicts:
	NEWS
```
  7c61f9f7
- Fix test to work correctly · 2eb3a544
  Danny Auble authored Apr 30, 2014
  
  2eb3a544
- Fix issue with GrpCPURunMins if a job's timelimit is altered while the job · 8553f674
  Danny Auble authored Apr 30, 2014
```
is running.
```
  8553f674
- Fix issue where user is requesting --acctg-freq=0 and no memory limits. · 903161c8
  Danny Auble authored Apr 30, 2014
  
  903161c8
- Temporary fix for handling our typemap for the perl api with newer perl. · bffdc7e2
  Danny Auble authored May 01, 2014
  
  bffdc7e2
- Fix issue with GrpCPURunMins if a job's timelimit is altered while the job · 98de72e4
  Danny Auble authored Apr 30, 2014
```
is running.
```
  98de72e4
- Fix issue where user is requesting --acctg-freq=0 and no memory limits. · 0018cdf4
  Danny Auble authored Apr 30, 2014
  
  0018cdf4
30 Apr, 2014 11 commits
- Merge branch 'slurm-14.03' · 541aa2b9
  Morris Jette authored Apr 30, 2014
  
  541aa2b9
- Mostly cosmetic changes to FAQ web page · 90a2ab79
  Morris Jette authored Apr 30, 2014
  
  90a2ab79
- Correct squeue to not merge jobs with state pending and completing · 8ddadea5
  David Bigagli authored Apr 30, 2014
```
together.
```
  8ddadea5
- Merge branch 'slurm-14.03' · d7d055cc
  Morris Jette authored Apr 30, 2014
  
  d7d055cc
- Merge branch 'slurm-2.6' into slurm-14.03 · dc66a71b
  Morris Jette authored Apr 30, 2014
  
  dc66a71b
- switch/nrt - CAU and RMDA tracking correction · 6f66fdef
  Morris Jette authored Apr 30, 2014
```
Switch/nrt - Properly track usage of CAU and RDMA resources with multiple
tasks per compute node. Previous logic would allocate resources once per
task and then deallocate once per node, leaking CMA and RDMA resources
and preventing their use by future jobs.
```
  6f66fdef
- Merge branch 'slurm-14.03' · 337e7cd1
  Morris Jette authored Apr 30, 2014
  
  337e7cd1
- ignore prio reset on held jobs · cbcea672
  Morris Jette authored Apr 30, 2014
```
If a job is held, then only release it with the "scontrol release <jobid>"
command rather than a simple reset of the job's priority. This is needed to
support job arrays better. Otherwise a priority reset of a job array
would free all requeued/held jobs from that job array rather than
leaving them held.
```
  cbcea672
- Update hyperlink to Munge · c93704c3
  Morris Jette authored Apr 30, 2014
  
  c93704c3
- Merge branch 'slurm-14.03' · c51acd9f
  Morris Jette authored Apr 29, 2014
  
  c51acd9f
- Always clear JOB_SPECIAL_EXIT on job prio set · 66f332ed
  Morris Jette authored Apr 29, 2014
```
If a job's priority is set non-zero then always clear the
JOB_SPECIAL_EXIT job state flag, not only when the prior
state is HELD_USER or HELD.
I'm not sure how the job could have cleared the HELD state
and changed to NO_REASON, but this would fix the problem.
bug 760
```
  66f332ed
29 Apr, 2014 7 commits

slurmd to cache launched job IDs · 653b247b

Morris Jette authored Apr 29, 2014

Modify slurmd to keep track of which jobs have already been launched.
It the launch is complete, then process suspend requests immediately.
Previously the suspend request was always delayed by 1 second, which
adversely impacts gang scheduling performance. If the job can't be
found (say after a slurmd restart), then delay the suspend by up
to 3 seconds, but only once.

653b247b

Affinity tests to support larger systems · fefb52c9

Morris Jette authored Apr 29, 2014

Change the integer to hex function to support 32-bit unsigned
integers and exit on systems with more than 32 cpus per node
since Expect can not work with numbers so large.

fefb52c9

Update slurm.conf man page. · 5218f5e6
David Bigagli authored Apr 29, 2014

5218f5e6

Affinity tests to support larger systems · d8a2e926

Morris Jette authored Apr 29, 2014

Change the integer to hex function to support 32-bit unsigned
integers and exit on systems with more than 32 cpus per node
since Expect can not work with numbers so large.

d8a2e926

Update slurm.conf man page. · 91e8fc8d
David Bigagli authored Apr 29, 2014

91e8fc8d
Have scontrol show config to display MemLimitEnforce. · 20875a4a
David Bigagli authored Apr 28, 2014

20875a4a
Introduce automatic job requeue policy based on exit value. · 60e18f34
David Bigagli authored Apr 28, 2014

60e18f34