- 01 Apr, 2017 1 commit
-
-
Morris Jette authored
This should have been part of commit f84456f3
-
- 31 Mar, 2017 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
- 30 Mar, 2017 2 commits
-
-
Morris Jette authored
This fixes test 7.17 to work with commit a152196f
-
Morris Jette authored
Move the working_cluster_rec reference out of the following functions: slurm_send_recv_controller_msg(), slurm_send_recv_controller_rc_msg(), slurm_open_controller_conn_spec() and slurm_send_only_controller_msg() so that communcations to multiple clusters can be performed in parallel (using threads)
-
- 29 Mar, 2017 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
If a cluster in our federation is down, it's "control_host" value is "" and an attempt to communicate with it will abort.
-
Morris Jette authored
-
- 28 Mar, 2017 8 commits
-
-
Morris Jette authored
-
Morris Jette authored
Previous content has news about version 16.05 release. Updated for version 17.02 release.
-
Danny Auble authored
[min|max]_cpus if not already set when a job comes in.
-
Morris Jette authored
-
Morris Jette authored
Change slurm_load_jobs(), slurm_load_job_user() and slurm_load_job() to all use the same underlying function, which was formerly just used by slurm_load_jobs(). This function will get data from all of the clusters in a federation, but in a serial fashion for now.
-
Morris Jette authored
The clang error isn't entirely clear, but this should fix it.
-
Morris Jette authored
-
Danny Auble authored
the job may still point to the qos_id of the removed qos.
-
- 27 Mar, 2017 9 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
the job_table. This would represent the removal of runaway jobs. I also made it work so the versions would be easy to see. I couldn't easily make this into multiple commits or I would had.
-
Morris Jette authored
Add squeue --local and --sibling options to modify filtering of jobs on federated clusters.
-
Morris Jette authored
Modify slurm_load_jobs() function to load job information from all clusters in a federation.
-
Danny Auble authored
-
Alejandro Sanchez authored
Fix double free issue from commit 3851f619 Same issue from fixed in a different spot with b19637bd. Bug 3617
-
Danny Auble authored
This reverts commit 8343365f.
-
Danny Auble authored
Fix double free issue from commit 3851f619 Same issue from fixed in a different spot with b19637bd. Bug 3617
-
- 24 Mar, 2017 14 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Rename some scontrol show job fields to match sview (and keep them in alphabetic order in the sview output): ViableSiblings changed to FedViableSiblings and ActiveSiblings changed to FedActiveSiblings
-
Morris Jette authored
Added FedActiveSiblings, FedOrigin and FedViableSiblings to job info
-
Morris Jette authored
-
Morris Jette authored
-
Dominik Bartkiewicz authored
steps.
-
Morris Jette authored
The wrong function was being called to release memory. Bug introduced yesterday in commit c8bf6b5d
-
Morris Jette authored
The xmalloc was based upon the wrong data structure
-
Morris Jette authored
No changes to logic, just added some parenthesis and brackets
-
Brian Christiansen authored
Reroute federated scancel <jobid>'s to the origin cluster.
-
Brian Christiansen authored
The local cluster will cancel the job if federated job is running on the cluster, otherwise it will route the request, back through the client, to the origin cluster.
-
Brian Christiansen authored
Set the cluster lock even when the cluster is the only viable cluster. The cluster_lock is used to determine if the cluster is running the job or not.
-
Brian Christiansen authored
when seeing if the cluster is the only cluster in the viable list.
-