- 18 Oct, 2018 1 commit
-
-
Tim Wickberg authored
-
- 17 Oct, 2018 1 commit
-
-
Dominik Bartkiewicz authored
a jobs --mem-per-cpu and also -c is greater than 1. Bug 5858
-
- 09 Oct, 2018 1 commit
-
-
Broderick Gardner authored
Also simplify this xstrfmtcat() into an xstrcat() call while here. Bug 5828.
-
- 05 Oct, 2018 1 commit
-
-
Tim Wickberg authored
-
- 04 Oct, 2018 1 commit
-
-
Dominik Bartkiewicz authored
Bug 5594.
-
- 02 Oct, 2018 2 commits
-
-
Marshall Garey authored
Bug 5708
-
Jason Booth authored
Continuation of 06582da8 (17.11.9). Poll was timing out too quickly due to an incorrect conversion of MessageTimeout. Added a multiplier so the timeout reflects the correct millisecond value. Bug 5553.
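The unit mismatch described above can be sketched as follows. This is a minimal illustration, not the Slurm code: it assumes the configured timeout is held in seconds, and the helper names `timeout_sec_to_poll_ms()` and `wait_readable()` are hypothetical. The only real API used is POSIX `poll()`, whose timeout argument is in milliseconds.

```c
#include <assert.h>
#include <limits.h>
#include <poll.h>

/* Hypothetical helper, not Slurm's code: convert a timeout given in
 * seconds into the millisecond value that poll() expects. */
int timeout_sec_to_poll_ms(unsigned int timeout_sec)
{
	/* Clamp instead of overflowing int for very large settings. */
	if (timeout_sec > (unsigned int)(INT_MAX / 1000))
		return INT_MAX;
	return (int)timeout_sec * 1000;
}

/* Wait until fd is readable or the timeout (in seconds) expires.
 * Returns poll()'s result: >0 ready, 0 timed out, -1 error. */
int wait_readable(int fd, unsigned int timeout_sec)
{
	struct pollfd pfd = { .fd = fd, .events = POLLIN };

	assert(fd >= 0);
	return poll(&pfd, 1, timeout_sec_to_poll_ms(timeout_sec));
}
```

Passing the seconds value straight to `poll()` would make a 10 second timeout expire after 10 milliseconds, which matches the "timing out too quickly" symptom.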
-
- 27 Sep, 2018 4 commits
-
-
Michael Hinton authored
Bug 5231. Change slurmctld/controller.c's purge_thread_lock from static to global extern, so the mutex can be used in slurmctld/job_mgr.c. slurm_cond_signal() always needs to be wrapped in the same mutex as slurm_cond_[timed]wait(); otherwise slurm_cond_signal() may fire before slurm_cond_[timed]wait() is even listening, most likely causing a deadlock.
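The locking rule above can be sketched with plain pthreads. This is an illustration under assumptions, not the slurmctld code: the names `purge_lock`, `purge_cond`, `purge_requested`, `request_purge()`, `purge_waiter()` and `run_once()` are hypothetical, and the sketch adds a predicate flag so that a signal sent before the waiter starts is not lost.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical shared state; the signaller and the waiter use one mutex. */
static pthread_mutex_t purge_lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t purge_cond = PTHREAD_COND_INITIALIZER;
static bool purge_requested = false;

/* Signaller: update the predicate and signal while holding the mutex. */
void request_purge(void)
{
	pthread_mutex_lock(&purge_lock);
	purge_requested = true;
	pthread_cond_signal(&purge_cond);
	pthread_mutex_unlock(&purge_lock);
}

/* Waiter: check the predicate under the same mutex before sleeping. */
void *purge_waiter(void *arg)
{
	int *handled = arg;

	assert(handled != NULL);
	pthread_mutex_lock(&purge_lock);
	while (!purge_requested)
		pthread_cond_wait(&purge_cond, &purge_lock);
	purge_requested = false;
	(*handled)++;
	pthread_mutex_unlock(&purge_lock);
	return NULL;
}

/* Start one waiter, send one request, and report how many were handled. */
int run_once(void)
{
	pthread_t tid;
	int handled = 0;

	if (pthread_create(&tid, NULL, purge_waiter, &handled) != 0)
		return -1;
	request_purge();
	pthread_join(tid, NULL);
	return handled;
}
```

Because the flag is only read and written under `purge_lock`, the waiter either sees the request before it sleeps or is already blocked in `pthread_cond_wait()` when the signal arrives.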
-
Michael Hinton authored
initialize old fairshare association pointer correctly. Bug 5744
-
Broderick Gardner authored
Bug 5458
-
Felip Moll authored
Bug 5748
-
- 26 Sep, 2018 1 commit
-
-
Dominik Bartkiewicz authored
Bug 5775
-
- 24 Sep, 2018 1 commit
-
-
Felip Moll authored
Bug 5613
-
- 21 Sep, 2018 1 commit
-
-
Didier GAZEN authored
Bug 5669
-
- 13 Sep, 2018 1 commit
-
-
Dominik Bartkiewicz authored
The backup slurmctld can end up without an agent thread if read_slurm_conf() takes too long to complete. Forcing the agent_init() here again ensures it is restarted if not currently running. This was fixed upstream in c1a537db. Bug 5633.
-
- 10 Sep, 2018 2 commits
-
-
Danny Auble authored
is removed from the system. Bug 5665
-
Danny Auble authored
burst buffer. Bug 5665
-
- 06 Sep, 2018 1 commit
-
-
Marshall Garey authored
Bug 5359
-
- 31 Aug, 2018 1 commit
-
-
Marshall Garey authored
Spaces need to be built into each key definition, otherwise it'll all run together. Bug 5657.
-
- 30 Aug, 2018 1 commit
-
-
Danny Auble authored
%n or %h in the spool dir. Bug 5096
-
- 29 Aug, 2018 2 commits
-
-
Dominik Bartkiewicz authored
Fail early when the layout requirements cannot be satisfied in _task_layout_lllp_block(). Otherwise step layout calculations will fail, leading to a "fatal: _task_layout_lllp_block infinite loop". Bug 5415.
-
Dominik Bartkiewicz authored
The conditional for handling --overcommit was not sufficient to prevent other option combinations from using that code path, resulting in unexpected resource allocation behavior. Correct that to check that overcommit is enabled. Bug 5345.
-
- 27 Aug, 2018 2 commits
-
-
Danny Auble authored
running. Bug 5624
-
Dominik Bartkiewicz authored
for an allocation. Bug 5293. Backport of 9a507584 in 18.08.
-
- 22 Aug, 2018 2 commits
-
-
Danny Auble authored
Bug 5608. Tim approved.
-
Brian Christiansen authored
If the dbd comes up after a job array has been submitted to the controller, the controller calls _update_job_tres(), which calls assoc_mgr_set_tres_cnt_array(), which allocates memory for the job's tres_alloc_cnt. The job array gets scheduled, but job_array_split() doesn't NULL out the pending job's tres_alloc_cnt, so both the array task and the pending array job point to the same memory. The array task calls job_set_alloc_tres(), which frees the running job's tres_alloc_cnt, leaving the pending array job pointing to bad memory. When the array splits again, the new array task tries to free tres_alloc_cnt in job_set_alloc_tres() and segfaults. Bug 5604.
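The aliasing problem described above can be reduced to a short sketch. The struct and function names here (`job_sketch`, `split_job_sketch()`, `set_alloc_tres_sketch()`) are hypothetical stand-ins, not Slurm's job record or functions; the sketch only assumes that the split copies the record byte for byte, so any owned heap pointer must be cleared in one of the two copies.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a job record; only the field discussed above. */
struct job_sketch {
	uint64_t *tres_alloc_cnt;
};

/* Copy src into dst the way a memcpy-based split would, then clear the
 * heap pointer in the copy so the two records never own the same buffer. */
void split_job_sketch(const struct job_sketch *src, struct job_sketch *dst)
{
	assert(src != NULL && dst != NULL);
	memcpy(dst, src, sizeof(*dst));
	dst->tres_alloc_cnt = NULL;
}

/* Replace a record's TRES counts. free(NULL) is a harmless no-op, so a
 * freshly split record is safe to pass in. Returns 0 on success. */
int set_alloc_tres_sketch(struct job_sketch *job, size_t tres_cnt)
{
	free(job->tres_alloc_cnt);
	job->tres_alloc_cnt = calloc(tres_cnt, sizeof(uint64_t));
	return job->tres_alloc_cnt ? 0 : -1;
}
```

Without the `dst->tres_alloc_cnt = NULL` line, calling `set_alloc_tres_sketch()` on both records would free the same buffer twice, which is the crash pattern the commit message walks through.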
-
- 21 Aug, 2018 2 commits
-
-
Danny Auble authored
-
Felip Moll authored
Bug 5072
-
- 20 Aug, 2018 1 commit
-
-
Michael Hinton authored
MySQL permits database names of up to 64 characters, but Slurm was truncating at 33 characters. If we exceed this limit, let the mysql_query fail and give the admin a chance to sort it out, rather than truncating and then failing to query against the un-truncated name later on. While here, correct the fatal() message. Bug 5586.
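For illustration only, here is how silent truncation of a name into a fixed-size buffer can be detected. This is not what the commit does (it stops truncating and lets mysql_query fail); the helper name `copy_name_checked()` and the buffer size used in the example are assumptions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical helper: copy name into buf and report whether it fit.
 * snprintf() returns the length the full string would have needed, so a
 * result >= buf_len means the copy in buf was cut short. */
bool copy_name_checked(char *buf, size_t buf_len, const char *name)
{
	int n;

	assert(buf != NULL && name != NULL && buf_len > 0);
	n = snprintf(buf, buf_len, "%s", name);
	return n >= 0 && (size_t)n < buf_len;
}
```

A caller with a 33-byte buffer would get `true` for a short name and `false` for one longer than 32 characters, instead of carrying a shortened name forward and querying against a different one later.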
-
- 18 Aug, 2018 1 commit
-
-
Brian Christiansen authored
Bug 5554
-
- 16 Aug, 2018 5 commits
-
-
Danny Auble authored
(i.e. gres=gpu/tesla) it would get a count of 0.
-
Dominik Bartkiewicz authored
Note that pipe2() is Linux-specific, but this whole cgroup plugin is already Linux-specific in design, and the eventfd() call itself is Linux-specific as well. Bug 5570.
-
Danny Auble authored
-
Danny Auble authored
Turns out if you don't close everything you open, the hdf5 lib doesn't like that.
-
Danny Auble authored
Fixes regression in 011b2f23.
-
- 15 Aug, 2018 2 commits
-
-
Morris Jette authored
avoid copying strings that may be overwritten. Bug 5547
-
Danny Auble authored
It turns out it is fairly easy with het jobs to hit this and segfault srun. Bug 5390
-
- 14 Aug, 2018 3 commits
-
-
Alejandro Sanchez authored
Bug 5547.
-
Tim Wickberg authored
Causes srun and sattach to fail on big endian platforms. Bug 5556
-
Tim Wickberg authored
Wrong author. This reverts commit 392e7cfc.
-