- 17 Jul, 2017 5 commits
-
-
Morris Jette authored
Avoid interleaving labels and output from various components of a pack job
-
Morris Jette authored
This allocates an array of the proper size (based upon all tasks to be launched). Work is still needed to populate the data structure properly for all tasks.
-
Morris Jette authored
No change in logic
-
Morris Jette authored
The debugger symbol in srun is not being properly handled today. This change does the malloc once, even for a pack job, makes the array over-sized, and range checks before writes. This is enough to let srun make progress without memory errors (writing past the end of an allocated array).
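The allocate-once, range-check-before-write approach described above might look roughly like the following. This is a minimal sketch, not srun's actual code: the `task_table_t` type and the `task_table_*` helpers are hypothetical names invented for illustration.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical stand-in for a per-task debugger table; not srun's real type. */
typedef struct {
    uint32_t *pids;  /* one slot per task */
    size_t    cap;   /* number of usable slots */
} task_table_t;

/* Allocate once, sized for every task across all pack-job components. */
static int task_table_init(task_table_t *t, size_t total_tasks)
{
    t->pids = calloc(total_tasks ? total_tasks : 1, sizeof(*t->pids));
    t->cap = t->pids ? total_tasks : 0;
    return t->pids ? 0 : -1;
}

/* Range check before every write; an out-of-bounds index is rejected. */
static int task_table_set(task_table_t *t, size_t idx, uint32_t pid)
{
    if (!t->pids || idx >= t->cap)
        return -1;
    t->pids[idx] = pid;
    return 0;
}

/* Release the table and reset it so later writes are rejected. */
static void task_table_free(task_table_t *t)
{
    free(t->pids);
    t->pids = NULL;
    t->cap = 0;
}
```

A caller would size the table from the total task count of all components, then treat a `-1` return from `task_table_set` as a logic error to report rather than a write to perform.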
-
Morris Jette authored
-
- 15 Jul, 2017 1 commit
-
-
Morris Jette authored
-
- 14 Jul, 2017 1 commit
-
-
Morris Jette authored
Major rewrite of task state container logic to support a list of containers rather than one container per srun command.
-
- 13 Jul, 2017 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
No changes to logic
-
Isaac Hartung authored
-
Isaac Hartung authored
-
- 08 Jul, 2017 2 commits
-
-
Morris Jette authored
No changes in logic
-
Morris Jette authored
Test was failing sometimes due to varying Expect buffering
-
- 07 Jul, 2017 4 commits
-
-
Morris Jette authored
Now have one launch step per pack job component, so no duplicate launch errors
-
Morris Jette authored
-
Morris Jette authored
Use it for setting output label when the output/error file name includes the "%N" format option (node name, e.g. "--output=out.%N").
-
Morris Jette authored
-
- 06 Jul, 2017 1 commit
-
-
Morris Jette authored
Correct some logic so that the pack_group and pack_grp_bits fields are properly set.
-
- 05 Jul, 2017 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
$ srun --pack-group=1,2 -l tmp
0: JOBID:43740 HOST:nid00002
0: JOBID:43741 HOST:nid00003
1: JOBID:43741 HOST:nid00003
1: JOBID:43740 HOST:nid00002
2: JOBID:43740 HOST:nid00002
2: JOBID:43741 HOST:nid00003
3: JOBID:43741 HOST:nid00003
srun: error: Task 0 reported exit for a second time.
srun: error: Task 1 reported exit for a second time.
srun: error: Task 2 reported exit for a second time.
-
Morris Jette authored
srun now parses for the "--pack-group" option first, then processes the environment variables for that pack group, then processes the remaining options for the pack group. srun will still only spawn tasks for one pack group at a time, but will at least handle forwarded options from the job allocation phase (i.e. sbatch or salloc).
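The three-pass ordering described above can be sketched as follows. This is an illustration under stated assumptions, not srun's parser: `opts_t`, `find_pack_group`, and `parse_opts` are hypothetical names, the environment value is passed in as a plain integer rather than read from a real variable, only a single group number is parsed (not a list such as "1,2"), and the rule that later passes override earlier ones is an assumption about the intended precedence.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical option record; srun's real option structure is far larger. */
typedef struct {
    int pack_group;  /* which pack group these options apply to */
    int ntasks;      /* example option that env or argv may set */
} opts_t;

/* Pass 1: scan argv only for --pack-group=N (defaults to group 0). */
static int find_pack_group(int argc, char **argv)
{
    const char *key = "--pack-group=";
    for (int i = 1; i < argc; i++) {
        if (strncmp(argv[i], key, strlen(key)) == 0)
            return atoi(argv[i] + strlen(key));
    }
    return 0;
}

/* Pass 2 applies the environment value for that group (env_ntasks < 0
 * means "unset"); pass 3 lets the remaining command-line options
 * override it (assumed precedence). */
static opts_t parse_opts(int argc, char **argv, int env_ntasks)
{
    const char *key = "--ntasks=";
    opts_t o = { find_pack_group(argc, argv), 1 };

    if (env_ntasks >= 0)
        o.ntasks = env_ntasks;
    for (int i = 1; i < argc; i++) {
        if (strncmp(argv[i], key, strlen(key)) == 0)
            o.ntasks = atoi(argv[i] + strlen(key));
    }
    return o;
}
```

Finding the pack group before anything else matters because the environment values forwarded by sbatch or salloc are per group, so the group must be known before they can be applied.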
-
- 03 Jul, 2017 1 commit
-
-
Morris Jette authored
-
- 29 Jun, 2017 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
Change (almost) all references to use pointer to structure for pack-job support with multiple components
-
- 28 Jun, 2017 2 commits
-
-
Morris Jette authored
Previous logic could fail on step allocation in a larger job allocation
-
Morris Jette authored
-
- 27 Jun, 2017 3 commits
-
-
Morris Jette authored
Underlying logic not yet available, just the new option parsing and documentation.
-
Morris Jette authored
-
Isaac Hartung authored
-
- 26 Jun, 2017 6 commits
-
-
Morris Jette authored
-
Isaac Hartung authored
-
Isaac Hartung authored
Make packjobid, packjoboffset, and packjobidset fields available in squeue output.
-
Isaac Hartung authored
Test salloc/sbatch job options passed and resulting environment variables.
-
Morris Jette authored
-
Dominik Bartkiewicz authored
Improve backfill scheduling algorithm with respect to starting jobs as soon as possible while avoiding advanced reservations. bug 3757
-
- 24 Jun, 2017 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 23 Jun, 2017 1 commit
-
-
Morris Jette authored
bug 3886
-