- 27 Mar, 2017 1 commit
-
-
Morris Jette authored
Modify slurm_load_jobs() function to load job information from all clusters in a federation.
-
- 22 Mar, 2017 1 commit
-
-
Morris Jette authored
bug 3610
-
- 21 Mar, 2017 8 commits
-
-
Danny Auble authored
Bug 3097
-
Danny Auble authored
-
Tim Shaw authored
-
Michael Meier authored
-
Michael Meier authored
-
Thomas Opfer authored
Fix correct macro this time, Tim Wickberg's fault for prior failed patch. Bug 3605.
-
Thomas Opfer authored
This reverts commit 53523c54.
-
Thomas Opfer authored
Bug 3605.
-
- 20 Mar, 2017 1 commit
-
-
Morris Jette authored
-
- 17 Mar, 2017 4 commits
-
-
Brian Christiansen authored
-
Danny Auble authored
This reverts commit 979db782.
-
Danny Auble authored
Partial backport from 17.11 commit 4874f988. Bug 2484
-
Brian Christiansen authored
-
- 16 Mar, 2017 6 commits
-
-
Danny Auble authored
-
Danny Auble authored
Association. This reverts commits 92d2c645 and 37be42ec. This caused incorrect behavior, original code was correct. This also corrects documentation additions in commit 4cfe6bde. This code caused the first clause never to be correct and if you were over the limit you would get the third clause reporting a huge number available where it should be a negative number. The reality is the first clause should had been triggered and handled correctly.
-
Danny Auble authored
This reverts commit af52111c.
-
Josh Samuelson authored
Association. This reverts commits 92d2c645 and 37be42ec. This caused incorrect behavior, original code was correct. This also corrects documentation additions in commit 4cfe6bde. This code caused the first clause never to be correct and if you were over the limit you would get the third clause reporting a huge number available where it should be a negative number. The reality is the first clause should had been triggered and handled correctly.
-
Danny Auble authored
What this does is removes the join on the step table during rollups. It turns out this was a very costly join if the tables were fairly large. We can now just grab the data directly from the job_table which already contains the data in the tres_alloc column.
-
Dominik Bartkiewicz authored
system. What this patch does is looks at the partition the job is running in to get the cpus per node off the first node in the partition. Before it would just look at the first node on the system which isn't that great on a heterogeneous system. Bug 3486
-
- 15 Mar, 2017 1 commit
-
-
Jacek Budzowski authored
Bug 3577.
-
- 14 Mar, 2017 1 commit
-
-
Danny Auble authored
other half go to libslurmdb.so. Turns out in 17.11 if you are using --with-shared-libslurm sacct will link to both libslurmfull.so and libslurmdb.so. In the case of linking to the accounting_storage/slurmdbd plugin it will callback to slurmdbd_defs_init which would be in libslurmdb.so but the call to acct_storage_p_get_connection would call slurm_open_slurmdbd_conn which is in libslurmfull.so and it would xassert on slurmdbd_defs_inited which was set in libslurmdb.so but not in libslurmfull.so. So the moral of the story is don't export half a file in one lib and the other half in a different lib. Perhaps we should only have 1 lib all together, but that isn't the way it is done today. This fixes the issue and has the entire file exported to libslurmfull.so so we should be good. But just a note for the future. This was an unexpected regression caused by commit 5a5347c7.
-
- 13 Mar, 2017 2 commits
-
-
Alejandro Sanchez authored
Bug 3530 NOTE: there does appear to be other issues here, but we didn't fell comfortable changing this many things in 17.02 for fear of breaking something. This only fixes a bit of the issue as it appears node_scheduler.c has a fuller test for these. In 17.11 I plan to make this a function that will fill in the min, req, max nodes and use it in both places in the code to prevent this from happening again.
-
Alejandro Sanchez authored
Code calls list_find_first to search in resv_list whether the requested name for the new reservation already exists. If it exists, resv_ptr is set with the pointer to the existing reservation. Then the code goto bad_parse label and xfreed that resv_ptr, thus corrupting the list data by freeing the existing reservation. This is fixed by only freeing memory on the new local resv_ptr instead of always freeing memory. xfree is also not sufficient for freeing the memory, we needed to call _del_resv_rec() or we would leak the memory we had transferred from the resv_desc_ptr. This also involved NULLing out the other variables freed after bad_parse, or you would get double frees. Bug 3558.
-
- 11 Mar, 2017 1 commit
-
-
Danny Auble authored
assoc_mgr_make_tres_str_from_array.
-
- 10 Mar, 2017 5 commits
-
-
Danny Auble authored
-
Tim Shaw authored
array task id. This is combination with the previous commit. Bug 3563
-
Danny Auble authored
-
Danny Auble authored
I also made seff work on a non-standard perl install. I would had loved to do this in multiple commits, but it wasn't that possible.
-
Dominik Bartkiewicz authored
-
- 08 Mar, 2017 9 commits
-
-
Danny Auble authored
moment the dbd is assigning a db_index. Bug 3512 This is a definite culprit of creating runaway jobs.
-
Danny Auble authored
because the dbd is setting a db_index.
-
Danny Auble authored
in the middle of setting db_indexes.
-
Danny Auble authored
multifactor. Almost impossible to make happen, but still a bug.
-
Tim Wickberg authored
The stepd->protocol_version field is not set at this point, if not handled here then the following call to stepd_get_uid will fail as a protocol_version of zero is rejected. This appeared to work in 16.05 and older as the stepd_get_uid() API call used to ignore the invalid protocol_version and return the correct result anyways, but that was fixed in c177ff95. Bug .
-
Tim Shaw authored
correctly with Slurm tools. What this does is replace the incoming string with the gres actually chosen. Continuation of last commit. Bug 3521
-
Danny Auble authored
the database.
-
Morris Jette authored
Remove duplicate lines
-
Thomas Opfer authored
Symbol required for read_slurm_cgroup_conf() call to work. Bug 3550.
-