- 04 Oct, 2019 2 commits
-
-
Tim Wickberg authored
-
Christopher Samuel authored
Bug 7666.
-
- 03 Oct, 2019 2 commits
-
-
Tim Wickberg authored
-
Dominik Bartkiewicz authored
Bug 7680.
-
- 02 Oct, 2019 3 commits
-
-
Marcin Stolarek authored
pthread_create is not a async-signal-safe function, since it can't be used in a fork (according to POSIX). Calling it may end-up with deadlock in glibc. Bug 7666.
-
Dominik Bartkiewicz authored
Bug 7779
-
Dominik Bartkiewicz authored
Bug 7779
-
- 01 Oct, 2019 5 commits
-
-
Michael Hinton authored
Add some slurm.conf validation to make sure that typed GRES and untyped GRES don't mix (e.g. do not allow gres=gpu:1,gpu:tesla:1). Bug 7517
-
Michael Hinton authored
If a configured GPU matches a GPU on the system, match them together. If a configured GPU has mismatched Cores or Links with the system GPU, then omit that configured GPU from the final list. Bug 7517
-
Michael Hinton authored
Before, data from slurm.conf was only partially merged if there was no corresponding entry in gres.conf. Now, Slurm tries to match up gres.conf records to what is defined in slurm.conf. Bug 7517
-
Christopher Samuel authored
Bug 7169
-
Felip Moll authored
Increase the maximum array len large to be packed/unpacked with one order of magnitude, since the current value demonstrated it is not enough when an MPI program spawns a considerable amount of tasks over a big set of nodes. This limit was introduced in 627928f4. Bug 7495
-
- 30 Sep, 2019 4 commits
-
-
Danny Auble authored
There was never any security to allow for this, so we are just removing it. Bug 7765
-
Albert Gil authored
Admin/Operator users were not able to skip MaxQueryTimeRange when trying to show/fix runaway jobs. This commit uses _validate_operator() instead of _validate_slurm_user() in _get_jobs_cond() as well as check for operators in _fix_runaway_jobs(). Bug 7765
-
Dominik Bartkiewicz authored
Bug 7708
-
Dominik Bartkiewicz authored
Don't remove jobs from preemptee_candidates List. Bug 7708
-
- 26 Sep, 2019 3 commits
-
-
Georg Rath authored
Since this happens inside a the user process, this can inadvertently cause the user's job to die by running out of file descriptors. Bug 7814. Co-authored-by: William Arndt <warndt@lbl.gov>
-
Marshall Garey authored
Bug 7499
-
Dominik Bartkiewicz authored
Regression introduced in fb26b706. Bug 7675
-
- 25 Sep, 2019 1 commit
-
-
Albert Gil authored
Now the signaling of the batch step and the handeling of the flags is totally handled in _kill_all_active_steps() in slurmd, and _handle_signal_container() in stepd to ensure that: - if KILL_JOB_BATCH then only batch container is signaled - if KILL_FULL_JOB then batch script and its children are also signaled - if both of the above then only the batch script and its children are signaled We do not relay anymore on proctrack_g_signal() to handle the batch step signaling anymore, therefore it works the same for all proctrack plugins. This commit also includes minor related fixes in other code handling such signaling flags, and documentation improvement. Bug 7282
-
- 23 Sep, 2019 1 commit
-
-
Tim Wickberg authored
-
- 20 Sep, 2019 2 commits
-
-
Brian Christiansen authored
Signed-off-by: Tim Wickberg <tim@schedmd.com> Bug 7697
-
Michael Hinton authored
1cd43fce Bug 7630
-
- 16 Sep, 2019 1 commit
-
-
Robert Tweedy authored
Bug 7727 This was missed in commit 6ac4ce84.
-
- 12 Sep, 2019 3 commits
-
-
Marcin Stolarek authored
An incorrect logic with the variables holding available cores in the gres_plugin_job_core_filter3() function lead to a potential infinite "while (avail_cores_tot > req_cores)" loop, leaving slurmctld unresponsive. Bug 7685.
-
Brian Christiansen authored
Bug 7719 Signed-off-by: Danny Auble <da@schedmd.com>
-
Dominik Bartkiewicz authored
Regression caused by 72736af2. Bug 7708.
-
- 10 Sep, 2019 1 commit
-
-
Danny Auble authored
FastSchedule will be removed in 20.02. FastSchedule=2 functionality has been moved to SlurmdParameters=config_overrides. Bug 7496. Signed-off-by: Tim Wickberg <tim@schedmd.com>
-
- 06 Sep, 2019 2 commits
-
-
Brian Christiansen authored
Bug 7699
-
Danny Auble authored
Continuation of 64876087 Bug 7698
-
- 04 Sep, 2019 3 commits
-
-
Danny Auble authored
Bug 4781
-
Dominik Bartkiewicz authored
Otherwise, there could be time frames where printed schednodes information could be obsolete. Bug 7676.
-
Dominik Bartkiewicz authored
exclusively to that job. Bug 7510
-
- 03 Sep, 2019 4 commits
-
-
Dominik Bartkiewicz authored
use correct start_time for TIME_FLOAT resevation in _job_overlap() Bug 7458
-
Dominik Bartkiewicz authored
Bug 7458
-
Dominik Bartkiewicz authored
Bug 7458
-
Dominik Bartkiewicz authored
Move _validate_node_choice() before prolog/epilog check Bug 7458
-
- 29 Aug, 2019 3 commits
-
-
Michael Hinton authored
Free the gres_devices list to avoid a valgrind warning on exit. Bug 7644.
-
Brian Christiansen authored
Continuation of 30bbc11d Bug 7445 Signed-off-by: Dominik Bartkiewicz <bart@schedmd.com>
-
Brian Christiansen authored
When --batch=<feature> is used, the batch_host isn't chosen until the job is being launched -- because the features could be different on boot (e.g. KNL nodes). Thus if the job is allocated nodes that need to be booted, it needs to wait till they are all booted so it can make a decision at launch time. Bug 7445 Signed-off-by: Dominik Bartkiewicz <bart@schedmd.com>
-