- 12 Sep, 2016 9 commits
-
-
Tim Wickberg authored
Copy previous format, but do not change yet. Update RPCs involving: def_mem_per_cpu max_mem_per_cpu free_mem real_memory actual_real_mem job_mem_lim
-
Morris Jette authored
-
-
Morris Jette authored
-
Morris Jette authored
bug 3065
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Add Slurm overview by Alex on first day, move KNL to second morning, shift roadmap to right after lunch.
-
Morris Jette authored
bug 3065
-
- 09 Sep, 2016 10 commits
-
-
Morris Jette authored
-
Morris Jette authored
Previous cap was 2 sec (default TCP timeout) times the node count and divided by 1000. A 9000 node job would have the messages spread out over 18 seconds. This change caps the spread at 5 seconds and assumes the normal TCP logic can handle the rest bug 3044
-
Morris Jette authored
-
Morris Jette authored
If the overhead of determining the hostlist for a given task list is too high, then report a hostlist of "Unknown" instead. If the overhead is too high, then srun will become unresponsive and communications will timeout/fail. bug 3044
-
Morris Jette authored
-
Morris Jette authored
Modify srun task completion handling to only build the task/node string for logging purposes if it is needed. Modified for performance purposes. bug 3044
-
Morris Jette authored
Add get_log_level() function to return the highest LOG_LEVEL_* used for any logging mechanism.
-
Tim Wickberg authored
This reverts commit 1ec2a4ae.
-
Morris Jette authored
Conflicts: src/api/step_launch.c
-
Alejandro Sanchez authored
Bug 3063.
-
- 08 Sep, 2016 10 commits
-
-
Brian Christiansen authored
In scontrol show nodes.
-
Morris Jette authored
Restructure srun command locking for task_exit processing logic for improved parallelism. This change decreases the amount of time consumed by serial logic by 2 orders of magnitude. bug 3044
-
Brian Christiansen authored
-
Brian Christiansen authored
Grab federations from db at startup instead of waiting for db_update and load from state if the db is down.
-
Brian Christiansen authored
-
Brian Christiansen authored
Instead of making a separate copy. All of the cluster_recs are now in the federation_rec with a pointer to the local cluster rec.
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
Select federations base off which clusters belong to them.
-
Morris Jette authored
-
- 07 Sep, 2016 11 commits
-
-
Morris Jette authored
-
Morris Jette authored
Preserve node "RESERVATION" state when one of multiple overlapping reservations ends. Previous logic would clear the node's RESERVATION state flag when any one of the reservations on the node ended rather than keeping the node in RESERVATION state until the last reservation ended. bug 3057
-
Morris Jette authored
The logic is now heavier weight, so increase interval between tests from 2 to 5 seconds
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
Instead of making copies just use the pointers and stay in the read locks.
-
Brian Christiansen authored
-
Brian Christiansen authored
Before fixing fed.flags to be initialized to FEDERATION_NOT_SET, the federation's flags were being set to 0 when a federation modification happened. These tests verify that the flags stay after a federation modification.
-
Brian Christiansen authored
Was modeled after cluster adding where commit the check was being done before sending the changes to the dbd. But since the dbd isn't making any additional tables for federations -- like it does for clusters -- it can send to the dbd first and make sure they worked and then ask to commit the changes.
-
Brian Christiansen authored
Ran into the issue after initializing fed flags to NOTSET. Fed flags were always being set to 0 on modification.
-
Brian Christiansen authored
-