- 18 Jun, 2015 3 commits
-
David Bigagli authored
-
-
Morris Jette authored
-
- 17 Jun, 2015 3 commits
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Morris Jette authored
-
- 15 Jun, 2015 2 commits
-
Brian Christiansen authored
-
Morris Jette authored
The logic assumed the reservation had a node bitmap, which was used to check for overlapping jobs. If there is no node bitmap (e.g. a licenses-only reservation), an abort would result.
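A minimal sketch of the guard in C, with invented names loosely modeled on slurmctld's reservation structures (bit_overlap() is assumed to return the count of overlapping bits); this is an illustration, not the actual Slurm patch:

typedef struct bitstr bitstr_t;                /* opaque node bitmap */
extern int bit_overlap(bitstr_t *a, bitstr_t *b);

struct resv_rec {
    bitstr_t *node_bitmap;   /* NULL for a licenses-only reservation */
};

/* Guard the overlap test: a licenses-only reservation has no node
 * bitmap, so treat it as non-overlapping rather than dereferencing
 * NULL and aborting. */
static int job_overlaps_resv(struct resv_rec *resv, bitstr_t *job_nodes)
{
    if (!resv->node_bitmap || !job_nodes)
        return 0;
    return bit_overlap(resv->node_bitmap, job_nodes) > 0;
}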
-
- 12 Jun, 2015 2 commits
-
Brian Christiansen authored
Bug 1739
-
Brian Christiansen authored
Bug 1743
-
- 11 Jun, 2015 5 commits
-
Brian Christiansen authored
Prevent double free.
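The usual way to prevent a double free in C is to clear the pointer at the free site so a second free becomes a harmless no-op; a minimal sketch of that pattern (Slurm's own code wraps it in its xfree() macro, but the names below are illustrative):

#include <stdlib.h>

/* Free p and NULL it, so freeing the same pointer twice is safe. */
#define FREE_AND_NULL(p) \
    do { free(p); (p) = NULL; } while (0)

/* Usage:
 *   char *buf = malloc(64);
 *   FREE_AND_NULL(buf);
 *   FREE_AND_NULL(buf);   -- no-op: free(NULL) is defined to do nothing
 */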
-
Brian Christiansen authored
-
Brian Christiansen authored
Bug 1733
-
Didier GAZEN authored
In your node_mgr fix to keep rebooted nodes down (commit 9cd15dfe), you forgot to consider the case of nodes that are powered up but respond only after ResumeTimeout seconds (the maximum time permitted). Such nodes are marked DOWN (because they did not respond within ResumeTimeout seconds) but should then become silently available when ReturnToService=1 (as stated in the slurm.conf manual). With your modification, when such nodes finally respond they are treated as rebooted nodes and remain in the DOWN state (with the new reason: Node unexpectedly rebooted) even when ReturnToService=1! Correction of commit 3c2b46af.
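A hedged sketch of the intended decision, with invented names (the real logic lives in slurmctld's node_mgr.c and tracks far more state):

struct node_rec {
    int down;                /* marked DOWN after missing ResumeTimeout */
    int responding;          /* node has finally answered */
    int unexpected_reboot;   /* boot time changed outside a requested reboot */
};

/* With ReturnToService=1, a node that was DOWN only because it missed
 * ResumeTimeout should silently return to service once it responds;
 * only a genuinely unexpected reboot should keep it DOWN. */
static int keep_node_down(const struct node_rec *n, int return_to_service)
{
    if (n->down && n->responding && return_to_service == 1 &&
        !n->unexpected_reboot)
        return 0;   /* node becomes available again */
    return n->down;
}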
-
Didier GAZEN authored
-
- 10 Jun, 2015 3 commits
-
Morris Jette authored
-
Didier GAZEN authored
In your node_mgr fix to keep rebooted nodes down (commit 9cd15dfe), you forgot to consider the case of nodes that are powered up but respond only after ResumeTimeout seconds (the maximum time permitted). Such nodes are marked DOWN (because they did not respond within ResumeTimeout seconds) but should then become silently available when ReturnToService=1 (as stated in the slurm.conf manual). With your modification, when such nodes finally respond they are treated as rebooted nodes and remain in the DOWN state (with the new reason: Node unexpectedly rebooted) even when ReturnToService=1! My patch to obtain the correct behaviour:
-
Morris Jette authored
Equivalent fix to e1a00772, but for select/serial rather than select/cons_res.
-
- 09 Jun, 2015 7 commits
-
David Bigagli authored
-
Morris Jette authored
1. I submit a first job that uses 1 GPU:
   $ srun --gres gpu:1 --pty bash
   $ echo $CUDA_VISIBLE_DEVICES
   0
2. While the first one is still running, a 2-GPU job asking for 1 task per node waits (and I don't really understand why):
   $ srun --ntasks-per-node=1 --gres=gpu:2 --pty bash
   srun: job 2390816 queued and waiting for resources
3. Whereas a 2-GPU job requesting 1 core per socket (so just 1 socket) actually gets GPUs allocated from two different sockets:
   $ srun -n 1 --cores-per-socket=1 --gres=gpu:2 -p testk --pty bash
   $ echo $CUDA_VISIBLE_DEVICES
   1,2
With this change #2 works the same way as #3. Bug 1725.
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
David Bigagli authored
option.
-
- 05 Jun, 2015 8 commits
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
Only doing this in master, as it may affect scripts. This reverts commit 454f78e6. Conflicts: NEWS
-
Morris Jette authored
Bug 1724
-
Nicolas Joly authored
-
Nicolas Joly authored
-
Morris Jette authored
-
- 04 Jun, 2015 7 commits
-
David Bigagli authored
-
David Bigagli authored
-
Morris Jette authored
-
Veronique Legrand authored
Previously the test would generate an error if the default partition contained fewer than 3 nodes. Bug 1720.
-
Nicolas Joly authored
-
Nancy Kritkausky authored
-
David Bigagli authored
-