Commits · 05e0dabee474f90b14168adfdfedd6b47a3902ed · Manuel G. Marciani / ces_slurm_simulator

21 Oct, 2015 4 commits
- sbatch --ntasks precedence fix · 05e0dabe
  Morris Jette authored Oct 21, 2015
```
sbatch --ntasks option to take precedence over --ntasks-per-node plus node
    count, as documented. Set SLURM_NTASKS/SLURM_NPROCS environment variables
    accordingly.
bug 2015
```
  05e0dabe
- Fix the pty window manager in slurmstepd. · 9350c830
  David Bigagli authored Oct 21, 2015
  
  9350c830
- Since PreemptMode=Gang isn't valid make it PreemptMode=Suspend,Gang · 6f21c4c6
  Danny Auble authored Oct 20, 2015
  
  6f21c4c6
- Remove defunct PreemptMode=kill · 587bed51
  Danny Auble authored Oct 20, 2015
  
  587bed51
20 Oct, 2015 13 commits
- Correct Slurm's website · 988d169c
  Danny Auble authored Oct 20, 2015
  
  988d169c
- Remove defunct SchedulerType=sched/gang documentation · 76c1ccce
  Danny Auble authored Oct 20, 2015
  
  76c1ccce
- Permit configuration of ResumeRate=0 & SuspendRate=0 · a5f0663c
  Deric Sullivan authored Oct 20, 2015
```
Previous logic would report an error and stop the power_save thread.
bug 2042
```
  a5f0663c
- Restore advanced reservation web page link · f7e36b53
  Morris Jette authored Oct 20, 2015
```
Not sure when that went away...
```
  f7e36b53
- Clarify PreemptMode=suspend operation in documents · a348ae18
  Morris Jette authored Oct 20, 2015
  
  a348ae18
- Manual job resume bookkeeping fix · d2d92060
  Morris Jette authored Oct 20, 2015
```
If a suspended job is manually resumed and gang scheduling is
configured, but no time slices are available for the job being
resumed, then just resume it without adding it to a time slice.
The jobs previously running on those nodes will be replaced with
new jobs as resources become available and the resumed job will
basically be treated like a stray job.
bug 2031
```
  d2d92060
- Don't report CPU oversubscription · c82abd9c
  Morris Jette authored Oct 20, 2015
```
Avoid reporting more allocated CPUs than exist on a node. This can be
    triggered by resuming a previosly suspended job, resulting in
    oversubscription of CPUs.
bug 2021
```
  c82abd9c
- Warn of risks in job suspend/resume · 2929a289
  Morris Jette authored Oct 20, 2015
```
bug 2031
```
  2929a289
- Fix salloc -I to accept an argument · d133f16a
  Danny Auble authored Oct 19, 2015
  
  d133f16a
- Add test for scancel --full option · f8b38b9d
  Morris Jette authored Oct 19, 2015
  
  f8b38b9d
- Add scancel -f/--full option · 26944ca0
  Morris Jette authored Oct 19, 2015
```
Add scancel -f/--full option to signal all steps including batch script and
    all of its child processes.
bug 2031
```
  26944ca0
- Improve logging for signals · e4e1ca38
  Morris Jette authored Oct 19, 2015
  
  e4e1ca38
- Clarify scancel man page · 198008c3
  Morris Jette authored Oct 19, 2015
  
  198008c3
19 Oct, 2015 10 commits
- Set SLURM_HINT environment variable when --hint is used with sbatch or salloc. · 62d1e1aa
  Brian Christiansen authored Oct 19, 2015
```
Bug 1888
```
  62d1e1aa
- Fix issue on a scontrol reconfig all available GRES/TRES would be zeroed · 63e59dcd
  Danny Auble authored Oct 19, 2015
```
out.

Remove unneeded code that commit 8274ea54 fixed.

This code would 0 out all GRES/TRES on a reconfig which isn't what we want.

8274ea54 does the right thing by itself.
```
  63e59dcd
- Correct backfill logic for job with INFINITE time limit · 1886ac8b
  Hongjia Cao authored Oct 19, 2015
```
bug 2032
```
  1886ac8b
- Update to spank.h comment, wrong function name · c49ea55d
  Deric Sullivan authored Oct 19, 2015
```
bug 2039
```
  c49ea55d
- Corrections to spank man page · cbdd56b6
  Deric Sullivan authored Oct 19, 2015
```
bug 2037
```
  cbdd56b6
- Add new contributor · 60d88f2e
  Morris Jette authored Oct 19, 2015
  
  60d88f2e
- Fix spank man page · d681d696
  Deric Sullivan authored Oct 19, 2015
```
backport of commit 4f2e2801 from v16.05
```
  d681d696
- Fix burst_buffer/cray for interactive allocs >4GB · 3c066cbc
  Morris Jette authored Oct 19, 2015
```
Needed to change a couple of variables from 32- to 64-bit.
```
  3c066cbc
- Add new burst_buffer.conf parameters · 25fcc9db
  Morris Jette authored Oct 19, 2015
```
Add new burst_buffer.conf parameters: ValidateTimeout and OtherTimeout.
See man page for details.
```
  25fcc9db
- Rename a test file for better clarity · 897450ab
  Morris Jette authored Oct 08, 2015
  
  897450ab
16 Oct, 2015 2 commits
- Update NEWS. · 55c7cd17
  David Bigagli authored Oct 16, 2015
  
  55c7cd17
- Add hv_to_qos_cond to Perl interface · 67c23eb6
  Josko Plazonic authored Oct 16, 2015
  
  67c23eb6
15 Oct, 2015 1 commit
- MYSQL - Fix minor issue after an index was added to the database it would · 90e2e552
  Danny Auble authored Oct 14, 2015
```
previously take 2 restarts of the slurmdbd to make it stick correctly.
```
  90e2e552
14 Oct, 2015 1 commit

Fix task/cgroup affinity to work correctly with multi-socket · 31f91bd9

Danny Auble authored Oct 14, 2015

single-threaded cores.  A regression caused only 1 socket to be used on
this kind of node instead of all that were available.

31f91bd9

09 Oct, 2015 2 commits
- Add link to ib2slurm in topology web page · f217a861
  Morris Jette authored Oct 09, 2015
  
  f217a861
- Avoid srun segv on job allocate failure · 4b0e3c75
  Morris Jette authored Oct 09, 2015
```
If a job allocation returns some invalid contents, the pointer
  to the job structure may be NULL. This change preserves the error
  message and avoids a segv.
```
  4b0e3c75
08 Oct, 2015 2 commits

Fix case where if the backup slurmdbd has existing connections when it gives... · 44bb06bc

Brian Christiansen authored Oct 07, 2015

Fix case where if the backup slurmdbd has existing connections when it gives up control that the it would be killed.

If the backup had existing connections when giving up control, it would try to
signal the existing threads by using pthread_kill to send SIGKILL to the
threads. The problem is that SIGKILL doesn't go the thread but the main process
and the backup dbd would be killed.

44bb06bc

Fixed slurmctld not sending cold-start messages correctly to the database · 4ed2f8c6
Danny Auble authored Oct 07, 2015
```
when a cold-start (-c) happens to the slurmctld.
```
4ed2f8c6

07 Oct, 2015 5 commits
- Merge remote-tracking branch 'origin/slurm-14.11' into slurm-15.08 · 2dcc2732
  Danny Auble authored Oct 07, 2015
```
Conflicts:
	src/sacct/options.c
```
  2dcc2732
- Fix sacct -j, (nothing but a comma) to not return all jobs. · d5979ef6
  Danny Auble authored Oct 07, 2015
  
  d5979ef6
- sacctmgr - Don't allow default account associations to be removed · 9f602cba
  Danny Auble authored Oct 07, 2015
```
from a user.

This would cause the slurmctld to cache the old default which wasn't valid
and cause the user to have to request the association always.
```
  9f602cba
- Merge remote-tracking branch 'origin/slurm-14.11' into slurm-15.08 · f5d6b175
  Danny Auble authored Oct 07, 2015
```
Conflicts:
	NEWS
	src/plugins/accounting_storage/mysql/as_mysql_job.c
```
  f5d6b175
- Document sbatch cpu/mem binding env vars · 8e949f72
  Morris Jette authored Oct 07, 2015
```
bug 2009
```
  8e949f72