- 28 Mar, 2017 6 commits
-
-
Danny Auble authored
[min|max]_cpus if not already set when a job comes in.
-
Morris Jette authored
-
Morris Jette authored
Change slurm_load_jobs(), slurm_load_job_user() and slurm_load_job() to all use the same underlying function, which was formerly just used by slurm_load_jobs(). This function will get data from all of the clusters in a federation, but in a serial fashion for now.
-
Morris Jette authored
The clang error isn't entirely clear, but this should fix it.
-
Morris Jette authored
-
Danny Auble authored
the job may still point to the qos_id of the removed qos.
-
- 27 Mar, 2017 9 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
the job_table. This would represent the removal of runaway jobs. I also made it work so the versions would be easy to see. I couldn't easily make this into multiple commits or I would had.
-
Morris Jette authored
Add squeue --local and --sibling options to modify filtering of jobs on federated clusters.
-
Morris Jette authored
Modify slurm_load_jobs() function to load job information from all clusters in a federation.
-
Danny Auble authored
-
Alejandro Sanchez authored
Fix double free issue from commit 3851f619 Same issue from fixed in a different spot with b19637bd. Bug 3617
-
Danny Auble authored
This reverts commit 8343365f.
-
Danny Auble authored
Fix double free issue from commit 3851f619 Same issue from fixed in a different spot with b19637bd. Bug 3617
-
- 24 Mar, 2017 17 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Rename some scontrol show job fields to match sview (and keep them in alphabetic order in the sview output): ViableSiblings changed to FedViableSiblings and ActiveSiblings changed to FedActiveSiblings
-
Morris Jette authored
Added FedActiveSiblings, FedOrigin and FedViableSiblings to job info
-
Morris Jette authored
-
Morris Jette authored
-
Dominik Bartkiewicz authored
steps.
-
Morris Jette authored
The wrong function was being called to release memory. Bug introduced yesterday in commit c8bf6b5d
-
Morris Jette authored
The xmalloc was based upon the wrong data structure
-
Morris Jette authored
No changes to logic, just added some parenthesis and brackets
-
Brian Christiansen authored
Reroute federated scancel <jobid>'s to the origin cluster.
-
Brian Christiansen authored
The local cluster will cancel the job if federated job is running on the cluster, otherwise it will route the request, back through the client, to the origin cluster.
-
Brian Christiansen authored
Set the cluster lock even when the cluster is the only viable cluster. The cluster_lock is used to determine if the cluster is running the job or not.
-
Brian Christiansen authored
when seeing if the cluster is the only cluster in the viable list.
-
Brian Christiansen authored
Adding to be able to route cancel (and other future) requeusts to the origin cluster in a federation.
-
Brian Christiansen authored
Federation Reconciliation
-
Brian Christiansen authored
When a sibling establishes a connection a silbing that sibling will then reconcile jobs with the other sibling.
-
- 23 Mar, 2017 8 commits
-
-
Brian Christiansen authored
when the job is being purged due a the origin job being "cleaned" (e.g slurmctld -c).
-
Brian Christiansen authored
-
Brian Christiansen authored
instead of just all of the viables. I need to be able send the viable list to only a specific sibling. For example if reconciliation finds that a sibling doesn't have a job that the origin thinks it could.
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
The connection's memory wasn't getting free'd which caused issues when the connection needed to be restablished.
-
Brian Christiansen authored
-