- 23 May, 2017 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
Just in case something goes wrong with the test, this should clear the job relatively soon
-
Morris Jette authored
The RESPONSE_JOB_PACK_ALLOCATION RPC returns a list of RESPONSE_RESOURCE_ALLOCATION data types. Error handling still needs to be added to slurmctld/agent.c, blocking for resources does not work yet, and the salloc code does not handle the List response type yet.
-
Morris Jette authored
Expect seems to be sometimes truncating the output from "salloc --help" resulting in a test failure. This is a bug in Expect not Slurm. Modified the test to not parse all the way to the end of the expected output so the test always passes.
-
- 22 May, 2017 6 commits
-
-
Morris Jette authored
No change in logic
-
Morris Jette authored
No change in logic yet, but added a function to determine if there are pending pack job components and moved the data structure build logic to a separate function so it can be called for each pack job component
-
Morris Jette authored
Use syntax "scancel #+#". For example "scancel 1+2" cancels component number 2 from job 1
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
slurmctld modified to return all components of a pack job if the pack job leader job ID is specified Modify squeue to properly handle job ID filtering for pack jobs
-
- 20 May, 2017 1 commit
-
-
Morris Jette authored
-
- 19 May, 2017 3 commits
-
-
Morris Jette authored
pack_job_id_set shows all job IDs associated with a pack job. Logic added to include in show job RPC and display in scontrol. Logic added to save/restore the field. Logic still needed to validate on slurmctld restart and rebuild pack_job_list list of job pointers.
-
Morris Jette authored
-
Morris Jette authored
-
- 18 May, 2017 1 commit
-
-
Morris Jette authored
Also sort job ID's considering pack job ID and offset
-
- 17 May, 2017 2 commits
-
-
Morris Jette authored
Add pack_job_id and pack_job_offset fields to job record in slurmctld plus job_info structure Add logic to save/restore pack_job_id and pack_job_offset fields on slurmctld restart Add logging of pack_job_id and pack_job_offset fields in slurmctld's jobid2fmt() and jobid2str() functions Add pack_job_id and pack_job_offset fields to "scontrol show job" output
-
Morris Jette authored
cancel vestigial pack jobs when an error happens in some portion of the job request
-
- 16 May, 2017 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
Add a new RPC, REQUEST_JOB_PACK_ALLOCATION, that will send a List of job request descriptors in a single message. Each job descriptor will be logged and an job record created for it.
-
Morris Jette authored
Refactor the salloc option parsing logic to build a separate set of job descriptor data structures for each portion of the heterogeneous job. Only the last job record is current submitted to slurmctld.
-
- 15 May, 2017 1 commit
-
-
Morris Jette authored
-
- 13 May, 2017 3 commits
-
-
Morris Jette authored
-
Isaac Hartung authored
Bug 3695
-
Morris Jette authored
bug 3779
-
- 12 May, 2017 4 commits
-
-
Morris Jette authored
If capmc reports a node name, but not mcdram_cfg for the node, then log the missing data rather than assume the value is zero and report a value mismatch with cnselect.
-
Alejandro Sanchez authored
When requesting an operation on jobs, where the operation permits to specify more than one job in the same request, and a job array appears before a regular job (no-array job) in the list of jobs to operate with, the job_array_resp_msg_t pointer was not properly NULL'ed and thus incorrectly accessed when processing the no-array job. This fix prevents the crash from happening in the following scontrol operations: uhold, hold, suspend, requeue, requeuehold, update, release when the same request has <array_jobid>,<non-array_jobid> in this order in the job list to process. Bug 3759
-
Morris Jette authored
Job expansion example in FAQ enhanced to demonstrate operation in heterogeneous environments. bug 2979
-
Alejandro Sanchez authored
Do not attempt to schedule jobs after changing the power cap if there are already many active threads.
-
- 11 May, 2017 2 commits
-
-
Danny Auble authored
# Conflicts: # META # NEWS
-
Danny Auble authored
-
- 10 May, 2017 3 commits
-
-
Danny Auble authored
-
Dominik Bartkiewicz authored
Bug 3760
-
Danny Auble authored
didn't work at all. Bug 3712.
-
- 09 May, 2017 7 commits
-
-
Danny Auble authored
This reverts commit ecfd007f.
-
Dominik Bartkiewicz authored
-
Brian Christiansen authored
Continuation of 9a1370e3 CID 168995
-
Danny Auble authored
It was noticed that while doing any update to a job the admin comment would be blown away. This patch fixes that.
-
Dominik Bartkiewicz authored
Bug 3789
-
Danny Auble authored
run multiple tasks on multiple nodes. Changing the max nodes setting from 3 to 6 fixes the issue without apparent compromise to the test.
-
Danny Auble authored
destroying a mutex.
-