Commits · 3f91f4b2c04526c410f94e94974192af55f2ba53 · Manuel G. Marciani / ces_slurm_simulator

29 Jun, 2015 1 commit
- Display error message when attempting to modify priority of a held job. · 3f91f4b2
  Nathan Yee authored Jun 29, 2015
```
Bug 1745
```
  3f91f4b2
25 Jun, 2015 3 commits
- Fix broken behaviour in sreport · c45774c1
  David Bigagli authored Jun 25, 2015
  
  c45774c1
- Minor change to 7b99dcd0 and change check_connection to always return · 89b4327b
  Danny Auble authored Jun 25, 2015
```
ESLURM_DB_CONNECTION when in error.
```
  89b4327b
- Clarify a NEWS item · 83ed9780
  Morris Jette authored Jun 24, 2015
  
  83ed9780
24 Jun, 2015 2 commits
- Add link to XDMod accounting tools · f16684db
  Morris Jette authored Jun 24, 2015
  
  f16684db
- Fix core dump. · 7b99dcd0
  David Bigagli authored Jun 24, 2015
  
  7b99dcd0
23 Jun, 2015 2 commits
- Set the totalview_stepid to the value of the job step instead of NO_VAL. · 5456f107
  David Bigagli authored Jun 23, 2015
  
  5456f107
- Add note about OpenMPI locked memory limit failure · fdb159e3
  Morris Jette authored Jun 23, 2015
  
  fdb159e3
22 Jun, 2015 9 commits
- Advanced reservation fixes · a6454176
  Morris Jette authored Jun 22, 2015
```
Updates of existing bluegene advanced reservations did not work at all.
Some multi-core configurations resulting in an abort due to creating
  core_bitmaps for the reservation that only had one bit per node rather
  than one bit per core.
These bugs were introduced in commit 5f258072
```
  a6454176
- cosmetic changes to reservation logic · f3a46c60
  Morris Jette authored Jun 22, 2015
  
  f3a46c60
- Disable some reservation tests if cores_per_node==1 · cb2f549a
  Morris Jette authored Jun 22, 2015
  
  cb2f549a
- Update NEWS · c8545598
  David Bigagli authored Jun 22, 2015
  
  c8545598
- Fix the calculation of job energy. · 5a579e4e
  Thomas Cadeau authored Jun 22, 2015
  
  5a579e4e
- Update NEWS · 38007f9b
  David Bigagli authored Jun 22, 2015
  
  38007f9b
- Fix sorting of job arrays. · 5350442f
  Moe Jette authored Jun 22, 2015
  
  5350442f
- core_spec/cray: avoid redundant cgroup binding · 8cead2b7
  Morris Jette authored Jun 22, 2015
  
  8cead2b7
- topology/tree: prevent infinite loop if not tree · a63b16a0
  Morris Jette authored Jun 22, 2015
  
  a63b16a0
19 Jun, 2015 2 commits
- Fix squeue to print according to the man page. · 2973524c
  David Bigagli authored Jun 19, 2015
  
  2973524c
- Prevent slurmctld from dumping core ifjob_resrcs is missing in the · 2d8d92aa
  David Bigagli authored Jun 19, 2015
```
job data structure.
```
  2d8d92aa
18 Jun, 2015 3 commits
- Make debug a bit more useful · 5ad5d66d
  David Bigagli authored Jun 18, 2015
  
  5ad5d66d
- Merge branch 'slurm-14.11' of https://github.com/SchedMD/slurm into slurm-14.11 · 28b6ac9f
  Morris Jette authored Jun 17, 2015
  
  28b6ac9f
- Fix test for front end mode · 6b856313
  Morris Jette authored Jun 17, 2015
  
  6b856313
17 Jun, 2015 3 commits
- Minor doc fixes. · 609a3c7c
  Brian Christiansen authored Jun 17, 2015
  
  609a3c7c
- Lowercase Slurm in docs. · 806a03ce
  Brian Christiansen authored Jun 17, 2015
  
  806a03ce
- Fix typo on FAQ web page · e5c8e899
  Morris Jette authored Jun 16, 2015
  
  e5c8e899
15 Jun, 2015 2 commits

Fix native cray compile error. · 70e5f5de
Brian Christiansen authored Jun 15, 2015

70e5f5de

Prevent abort on update of license-only reservation · 50deadb4

Morris Jette authored Jun 15, 2015

Logic was assuming the reservation had a node bitmap which was
being used to check for overlapping jobs. If there is no node
bitmap (e.g. a licenses only reservation), an abort would result.

50deadb4

12 Jun, 2015 2 commits
- Set job's reason to BadConstaints when job can't run on any node. · 475988f5
  Brian Christiansen authored Jun 12, 2015
```
Bug 1739
```
  475988f5
- Deprecated TICKET_BASED fairshare. · c3a30337
  Brian Christiansen authored Jun 12, 2015
```
Bug 1743
```
  c3a30337
11 Jun, 2015 5 commits

One more addition to 9d20cf02 · 214c5372
Brian Christiansen authored Jun 11, 2015
```
Prevent double free.
```
214c5372
One more addition to 9d20cf02 · 9dcf1444
Brian Christiansen authored Jun 10, 2015

9dcf1444
Use correct slurmd spooldir when creating cpu-frequency locks. · 9d20cf02
Brian Christiansen authored Jun 10, 2015
```
Bug 1733
```
9d20cf02

Fix for node reboot/down state · 4e8545b6

Didier GAZEN authored Jun 10, 2015

In your node_mgr fix to keep rebooted nodes down (commit 9cd15dfe), you
forgot to consider the case of nodes that are powered up but are responding after
ResumeTimeout seconds (the maximum time permitted). Such nodes are
marked DOWN (because they didn't respond within ResumeTimeout seconds) than
should become silently available when ReturnToService=1 (as stated in the slurm.conf manual)

With your modification when such nodes are finally responding, they are seen as
rebooted nodes and remain in the DOWN state (with the new reason: Node
unexpectedly rebooted) even when ReturnToService=1 !

Correction of commit 3c2b46af

4e8545b6

Revert commit 3c2b46af · c85f7484
Didier GAZEN authored Jun 10, 2015

c85f7484

10 Jun, 2015 3 commits

Add NEWS for last commit · 30e50e6c
Morris Jette authored Jun 10, 2015

30e50e6c

Fix for node reboot/down state · 3c2b46af

Didier GAZEN authored Jun 10, 2015

My patch to obtain the correct behaviour:

3c2b46af

select/serial gres scheduling fix · f2a08ce7
Morris Jette authored Jun 09, 2015
```
Equivalent fix as e1a00772
for select/serial rather than select/cons_res
```
f2a08ce7

09 Jun, 2015 3 commits

Search for user in all groups · 93ead71a
David Bigagli authored Jun 09, 2015

93ead71a

Fix scheduling inconsistency with GRES · e1a00772

Morris Jette authored Jun 09, 2015

1. I submit a first job that uses 1 GPU:
$ srun --gres gpu:1 --pty bash
$ echo $CUDA_VISIBLE_DEVICES
0

2. while the first one is still running, a 2-GPU job asking for 1 task per node
waits (and I don't really understand why):
$ srun --ntasks-per-node=1 --gres=gpu:2 --pty bash
srun: job 2390816 queued and waiting for resources

3. whereas a 2-GPU job requesting 1 core per socket (so just 1 socket) actually
gets GPUs allocated from two different sockets!
$ srun -n 1  --cores-per-socket=1 --gres=gpu:2 -p testk --pty bash
$ echo $CUDA_VISIBLE_DEVICES
1,2

With this change #2 works the same way as #3.
bug 1725

e1a00772

Move definitions into alphabetic order · 5f337d38
Morris Jette authored Jun 09, 2015

5f337d38