Commits · 4d4b78d6882e54c94049902ff78a7990547c7fcb · Manuel G. Marciani / ces_slurm_simulator

12 Sep, 2016 9 commits
- Prepare 17.02 RPCs for memory size change to uint64_t. · 4d4b78d6
  Tim Wickberg authored Jun 27, 2016
```
Copy previous format, but do not change yet. Update RPCs involving:
def_mem_per_cpu
max_mem_per_cpu
free_mem
real_memory
actual_real_mem
job_mem_lim
```
  4d4b78d6
- Merge branch 'slurm-16.05' · 840134ea
  Morris Jette authored Sep 12, 2016
  
  840134ea
- Merge branch 'slurm-16.05' of https://github.com/SchedMD/slurm into slurm-16.05 · e145d604
  Morris Jette authored Sep 12, 2016
  
  e145d604
- Updates to cpu and mem binding for man page · 1feb808e
  Morris Jette authored Sep 12, 2016
  
  1feb808e
- Update --mem_bind option description · e2f63dc3
  Morris Jette authored Sep 12, 2016
```
bug 3065
```
  e2f63dc3
- SLUG Agenda - shuffle around abstracts to match new order. · 9f12f6f4
  Tim Wickberg authored Sep 12, 2016
  
  9f12f6f4
- Swap Slurm Overview and Slurm 16.05 Overview in agenda. · d16c62ed
  Tim Wickberg authored Sep 12, 2016
  
  d16c62ed
- Update SLUG16 agenda. · 02edfef8
  Tim Wickberg authored Sep 12, 2016
```
Add Slurm overview by Alex on first day, move KNL to second morning,
shift roadmap to right after lunch.
```
  02edfef8
- Expand description of NUMA memory binding and KNL MCDRAM · bd64287e
  Morris Jette authored Sep 12, 2016
```
bug 3065
```
  bd64287e
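
Commit 4d4b78d6 above prepares the 17.02 RPCs for memory fields (`real_memory`, `free_mem`, and the others listed) to widen to `uint64_t`. As a rough illustration of what version-gated packing of such a field could look like, here is a minimal sketch. The version tags, the helper names, and the saturating behavior for older peers are all assumptions made for this example; they are not taken from the Slurm source.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical protocol version tags, not Slurm's real constants. */
#define PROTO_16_05 1
#define PROTO_17_02 2

/* Write val in big-endian order using `width` bytes (4 or 8). */
static size_t put_be(uint8_t *buf, uint64_t val, size_t width)
{
    assert(width == 4 || width == 8);
    for (size_t i = 0; i < width; i++)
        buf[i] = (uint8_t)(val >> (8 * (width - 1 - i)));
    return width;
}

/* Read a big-endian integer of `width` bytes (4 or 8). */
static uint64_t get_be(const uint8_t *buf, size_t width)
{
    uint64_t val = 0;

    assert(width == 4 || width == 8);
    for (size_t i = 0; i < width; i++)
        val = (val << 8) | buf[i];
    return val;
}

/*
 * Pack a memory size in MB. Newer peers get the full 64 bits.
 * Older peers get 32 bits; saturating at UINT32_MAX is an
 * assumption of this sketch, not documented Slurm behavior.
 */
size_t pack_mem(uint8_t *buf, uint64_t mem_mb, int proto)
{
    if (proto >= PROTO_17_02)
        return put_be(buf, mem_mb, 8);
    if (mem_mb > UINT32_MAX)
        mem_mb = UINT32_MAX;
    return put_be(buf, mem_mb, 4);
}

/* Unpack a memory size written by pack_mem() for the same version. */
uint64_t unpack_mem(const uint8_t *buf, int proto)
{
    return get_be(buf, proto >= PROTO_17_02 ? 8 : 4);
}
```

Gating on the protocol version lets a wider field be introduced without breaking peers that still expect the old layout.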
09 Sep, 2016 10 commits
- Merge branch 'slurm-16.05' · 28725c66
  Morris Jette authored Sep 09, 2016
  
  28725c66
- Cap job termination message staggering at 5 seconds · 3e2251cb
  Morris Jette authored Sep 09, 2016
```
Previous cap was 2 sec (default TCP timeout) times the node count
  and divided by 1000. A 9000 node job would have the messages
  spread out over 18 seconds. This change caps the spread at
  5 seconds and assumes the normal TCP logic can handle the rest
bug 3044
```
  3e2251cb
- Merge branch 'slurm-16.05' · fb4e305b
  Morris Jette authored Sep 09, 2016
  
  fb4e305b
- Don't generate srun hostlist for log for huge task count · 175e9dea
  Morris Jette authored Sep 09, 2016
```
If the overhead of determining the hostlist for a given task list
is too high, then report a hostlist of "Unknown" instead. If the
overhead is too high, then srun will become unresponsive and communications
will time out or fail.
bug 3044
```
  175e9dea
- Cosmetic change, no change in logic · 0d8d1b34
  Morris Jette authored Sep 09, 2016
  
  0d8d1b34
- Modify srun task completion handling · 166e3bec
  Morris Jette authored Sep 09, 2016
```
Modify srun task completion handling to only build the task/node string for
    logging purposes if it is needed. Modified for performance purposes.
bug 3044
```
  166e3bec
- Add function to report current logging level · 469c63fc
  Morris Jette authored Sep 09, 2016
```
Add get_log_level() function to return the highest LOG_LEVEL_* used
for any logging mechanism.
```
  469c63fc
- Revert "Fix issue filtering licenses for output with squeue." · 9c4eabed
  Tim Wickberg authored Sep 09, 2016
```
This reverts commit 1ec2a4ae.
```
  9c4eabed
- Merge branch 'slurm-16.05' · 3e1976dd
  Morris Jette authored Sep 09, 2016
```
Conflicts:
	src/api/step_launch.c
```
  3e1976dd
- Fix issue filtering licenses for output with squeue. · 1ec2a4ae
  Alejandro Sanchez authored Sep 09, 2016
```
Bug 3063.
```
  1ec2a4ae
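
The arithmetic in commit 3e2251cb above (job termination message staggering) can be checked with a short sketch. The function and macro names are hypothetical and the millisecond units are a choice made for this example. Only the numbers come from the commit message: 2 seconds times the node count divided by 1000, capped at 5 seconds.

```c
#include <assert.h>
#include <stdint.h>

#define TCP_TIMEOUT_MSEC 2000u /* 2 sec default TCP timeout, per the commit message */
#define MAX_SPREAD_MSEC  5000u /* new 5 second cap, per the commit message */

/* Previous behavior: 2 sec * node count / 1000, expressed in msec. */
uint64_t spread_old_msec(uint32_t node_cnt)
{
    return (uint64_t)TCP_TIMEOUT_MSEC * node_cnt / 1000;
}

/* New behavior: same formula, but never more than 5 seconds. */
uint64_t spread_new_msec(uint32_t node_cnt)
{
    uint64_t spread = spread_old_msec(node_cnt);

    return (spread > MAX_SPREAD_MSEC) ? MAX_SPREAD_MSEC : spread;
}
```

For a 9000 node job the old formula yields 18000 msec (the 18 seconds quoted in the commit message), while the capped version yields 5000 msec. Jobs below 2500 nodes are unaffected by the cap.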
08 Sep, 2016 10 commits
- Display configured and allocated tres on nodes · 91ea68ad
  Brian Christiansen authored Sep 08, 2016
```
In scontrol show nodes.
```
  91ea68ad
- Restructure srun task_exit logic · 6b6d4e1a
  Morris Jette authored Sep 08, 2016
```
Restructure srun command locking for task_exit processing logic for improved
  parallelism. This change decreases the amount of time consumed by serial
  logic by 2 orders of magnitude.
bug 3044
```
  6b6d4e1a
- Merge branch 'federation' · bd84687b
  Brian Christiansen authored Sep 08, 2016
  
  bd84687b
- Initialize fed_mgr at controller start · dded800a
  Brian Christiansen authored Sep 08, 2016
```
Grab federations from db at startup instead of waiting for db_update and
load from state if the db is down.
```
  dded800a
- Free fed_cond's cluster_list · 61d58586
  Brian Christiansen authored Sep 08, 2016
  
  61d58586
- Refactor fed_mgr to store pointer of federation · 5764e925
  Brian Christiansen authored Sep 08, 2016
```
Instead of making a separate copy. All of the cluster_recs are now in
the federation_rec with a pointer to the local cluster rec.
```
  5764e925
- Refactor removing qos logic to do only one query · c13e2243
  Brian Christiansen authored Sep 08, 2016
  
  c13e2243
- Set federation table row to defaults on removal · ec80b1f8
  Brian Christiansen authored Sep 08, 2016
  
  ec80b1f8
- Add cluster_list to federation_cond · 4bfdaaeb
  Brian Christiansen authored Sep 08, 2016
```
Select federations based on which clusters belong to them.
```
  4bfdaaeb
- Correct comment · e3d6dc68
  Morris Jette authored Sep 08, 2016
  
  e3d6dc68
07 Sep, 2016 11 commits

- Merge branch 'slurm-16.05' · 78274bf4
  Morris Jette authored Sep 07, 2016
  
  78274bf4
- Preserve node "RESERVATION" state · 5eee1d28
  Morris Jette authored Sep 07, 2016

  Preserve node "RESERVATION" state when one of multiple overlapping
  reservations ends. Previous logic would clear the node's
  RESERVATION state flag when any one of the reservations on the
  node ended rather than keeping the node in RESERVATION state
  until the last reservation ended.
  bug 3057

  5eee1d28

- Decrease frequency of reservation purge logic · 062e1ca6
  Morris Jette authored Sep 07, 2016
```
The logic is now heavier weight, so increase interval between tests
  from 2 to 5 seconds
```
  062e1ca6
- Fix memory leak · c44af482
  Brian Christiansen authored Sep 07, 2016
  
  c44af482
- Merge branch 'federation' · ae354f1c
  Brian Christiansen authored Sep 07, 2016
  
  ae354f1c
- Make local sibling list with pointers · 1d00e90b
  Brian Christiansen authored Sep 07, 2016
```
Instead of making copies just use the pointers and stay in the read
locks.
```
  1d00e90b
- Rename fed_elem_t to slurmdb_cluster_fed_t · d10d3784
  Brian Christiansen authored Sep 07, 2016
  
  d10d3784

- Update test to verify flags stay after fed mod · 42bec70d
  Brian Christiansen authored Sep 02, 2016

  Before fixing fed.flags to be initialized to FEDERATION_NOT_SET, the
  federation's flags were being set to 0 when a federation modification
  happened. These tests verify that the flags stay after a federation
  modification.

  42bec70d
- Do commit check after adding federations · 25c7c04a
  Brian Christiansen authored Sep 02, 2016

  This was modeled after cluster adding, where the commit check was being
  done before sending the changes to the dbd. But since the dbd isn't making
  any additional tables for federations -- like it does for clusters -- it
  can send to the dbd first, make sure the changes worked, and then ask to
  commit them.

  25c7c04a
- Handle modify federation with just clusters · 1ae87b91
  Brian Christiansen authored Sep 02, 2016

  Ran into the issue after initializing fed flags to NOTSET. Fed flags
  were always being set to 0 on modification.

  1ae87b91
- Initialize federation flags to NOTSET · 38a11ed7
  Brian Christiansen authored Sep 02, 2016
  
  38a11ed7
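
The federation flag commits above (38a11ed7, 1ae87b91, 42bec70d) describe a sentinel pattern: a modification record starts with its flags marked "not set", so applying it does not overwrite stored flags with 0. A minimal sketch of that idea follows. The struct, the function names, and the sentinel value are hypothetical and chosen only for illustration; the real `FEDERATION_NOT_SET` definition is not shown in this log.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical sentinel meaning "caller did not supply flags". */
#define FED_FLAG_NOT_SET 0x80000000u

/* Hypothetical, stripped-down federation record. */
typedef struct {
    uint32_t flags;
} fed_rec_t;

/* Initialize a record so that its flags read as "not set" rather than 0. */
void fed_rec_init(fed_rec_t *fed)
{
    fed->flags = FED_FLAG_NOT_SET;
}

/* Apply a modification: only touch flags the caller actually set. */
void fed_rec_apply_mod(fed_rec_t *stored, const fed_rec_t *mod)
{
    if (mod->flags != FED_FLAG_NOT_SET)
        stored->flags = mod->flags;
}
```

With a zero-initialized modification record there is no way to tell "clear all flags" apart from "leave flags alone", which matches the symptom the commit messages describe.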