- 03 May, 2018 2 commits
-
-
Alejandro Sanchez authored
Use setenv() instead of setenvfs(), since setenvfs() allocates its memory with xmalloc() while fini_setproctitle() (which is called on reconfigure) frees that memory with free(), leading to a "free(): invalid size" malloc_printerr error. Continuation of dce83a23. Bug 5095.
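As an illustration of why plain setenv() avoids the allocator mismatch described above: setenv() copies both the name and the value, so no pointer obtained from a custom allocator ends up in the environment for a later free() to trip over. The sketch below is not Slurm code; the helper name export_job_var() and the variable name used in the usage note are hypothetical.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200112L	/* expose setenv() under strict C modes */
#endif

#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

/*
 * Hypothetical helper (not taken from Slurm): format a job id into a
 * stack buffer and export it with setenv(). Because setenv() makes its
 * own copies of name and value, the caller keeps ownership of "value"
 * and nothing allocated here is ever handed to free() by other code.
 */
int export_job_var(const char *name, unsigned int job_id)
{
	char value[32];

	assert(name != NULL);
	snprintf(value, sizeof(value), "%u", job_id);

	/* Third argument 1 means: overwrite an existing value. */
	return setenv(name, value, 1);
}
```

Calling export_job_var("DEMO_JOB_ID", 42) returns 0 on success, after which getenv("DEMO_JOB_ID") yields the string "42".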
-
Felip Moll authored
Due to the current design, the job limits are checked before the allocation is made when one specifies a generic gres and a specific gres type is configured. The workaround for now is to define a job submit plugin to control the user request and successfully apply limits. Bug 4767.
-
- 02 May, 2018 6 commits
-
-
Dominik Bartkiewicz authored
Bug 4960.
-
Dominik Bartkiewicz authored
Bug 4887.
-
Tim Wickberg authored
Can lead to deadlock within malloc depending on the exact timing. Rework thread startup and shutdown code so pthread_cancel is not needed. Bug 5119, 5103.
-
Tim Wickberg authored
happens. Bug 5108
-
Danny Auble authored
This reverts commit de5a4da2.
-
Danny Auble authored
happens. Bug 5108
-
- 01 May, 2018 1 commit
-
-
Danny Auble authored
Turns out the partition's billing tres was working off the sum of the node_ptrs, which contain the max of all partitions they are in. This isn't correct since each partition's billing can be different. Set it correctly here.
-
- 30 Apr, 2018 3 commits
-
-
Tim Wickberg authored
These functions are not async-cancel-safe, and cannot safely be cancelled. This leads to potential deadlock, either between our own locks, or deep inside glibc when the thread held a malloc arena lock when cancelled. Replace with pthread_cond_signal() on the appropriate cond to wake threads up at the appropriate time instead. Bug 5103.
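The general pattern described here (waking a thread through a condition variable rather than cancelling it) can be sketched as follows. This is a minimal illustration under stated assumptions, not the code from the commit: every demo_* name is hypothetical, and the real threads do actual work between waits.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>

/* Hypothetical names for illustration; none are taken from Slurm. */
static pthread_mutex_t demo_lock = PTHREAD_MUTEX_INITIALIZER;
static pthread_cond_t demo_cond = PTHREAD_COND_INITIALIZER;
static bool demo_shutdown = false;
static int demo_clean_exits = 0;

/* Worker sleeps on the condition variable until shutdown is requested. */
static void *demo_worker(void *arg)
{
	(void) arg;

	pthread_mutex_lock(&demo_lock);
	while (!demo_shutdown)
		pthread_cond_wait(&demo_cond, &demo_lock);
	/* We only leave the wait loop once shutdown has been requested. */
	assert(demo_shutdown);
	demo_clean_exits++;
	pthread_mutex_unlock(&demo_lock);

	/* The thread returns normally, so no lock is left held. */
	return NULL;
}

/*
 * Start one worker, request shutdown by setting the flag and signalling
 * the condition, then join. Returns the number of clean exits so far,
 * or -1 if the thread could not be created.
 */
int demo_run_and_stop(void)
{
	pthread_t tid;

	if (pthread_create(&tid, NULL, demo_worker, NULL) != 0)
		return -1;

	pthread_mutex_lock(&demo_lock);
	demo_shutdown = true;
	pthread_cond_signal(&demo_cond);
	pthread_mutex_unlock(&demo_lock);

	pthread_join(tid, NULL);
	return demo_clean_exits;
}
```

Because the flag is checked under the mutex in a loop, the shutdown request is not lost even if it is signalled before the worker first reaches pthread_cond_wait().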
-
Marshall Garey authored
Otherwise the extern step will disappear after 11.5 days. Bug 5000.
-
Dominik Bartkiewicz authored
to be sure if it is created under job write lock. Bug 4901
-
- 28 Apr, 2018 2 commits
-
-
Brian Christiansen authored
Bug 5053
-
Brian Christiansen authored
This allows the suspend script to be triggered even if Slurm has the node(s) in a power_save state. Bug 5053
-
- 23 Apr, 2018 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
When any of these --exclusive modes couldn't be satisfied, Slurm was returning an incorrect ESLURM_NODE_NOT_AVAIL, which caused the scheduling problems described in the bug. The fix sets the error code to ESLURM_NODES_BUSY, which also resolves the scheduling problems, and works over the correct share_node_bitmap. Continuation of commits from bug 4932: e2a14b8d fc4e5ac9. Bug 5047.
-
- 19 Apr, 2018 2 commits
-
-
Marshall Garey authored
Fix an issue in the bit manipulation logic introduced in commit 892ffa89. Bug 4997.
-
Tim Wickberg authored
Replace select_p_select_jobinfo_sprint() with the same NO-OP that the other plugins (except alps and bluegene) have implemented. Bug 5077.
-
- 17 Apr, 2018 1 commit
-
-
Morris Jette authored
1. Identifies nodes which are unavailable to a specific job, adding a call to filter_by_node_owner() in select_nodes() where the node list is generated. 2. Removes the "unavail_node_str" argument to select_nodes() as it is no longer useful. This string was originally generated once at the start of the job scheduling logic for all jobs, but since each job can have a different set of unavailable nodes (dedicated to user, group, etc.), the same string for all jobs can be misleading. Bug 4932.
-
- 16 Apr, 2018 3 commits
-
-
Tim Wickberg authored
-
Dominik Bartkiewicz authored
See commit 0dabf4e7. Bug 4932.
-
Dominik Bartkiewicz authored
regression from ef1f3e73. Bug 4885.
-
- 11 Apr, 2018 4 commits
-
-
Morris Jette authored
over any other nodes. Bug 4968
-
Danny Auble authored
allocated time than 100% (Flex reservations). Bug 4968
-
Danny Auble authored
Bug 4968
-
Morris Jette authored
Bug 4968
-
- 10 Apr, 2018 1 commit
-
-
Morris Jette authored
In configurator.html web pages change default configuration from task/none to task/affinity plugin and from select/linear plugin to select/cons_res plus CR_Core.
-
- 04 Apr, 2018 1 commit
-
-
Morris Jette authored
-
- 03 Apr, 2018 2 commits
-
-
Ben Matthews authored
Bug 4952.
-
Ole H Nielsen authored
Bug 5011.
-
- 02 Apr, 2018 1 commit
-
-
Alejandro Sanchez authored
If an update for a job that is being requeued[hold] happens, we don't want to modify the start_time of the old record. Pending + Completing is equivalent to Requeue. Bug 4855.
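The "Pending + Completing is equivalent to Requeue" rule reads naturally as a bit test on the job state word. The sketch below shows one way to express that check and use it to leave start_time alone; the DEMO_* constants, their numeric values, and both function names are assumptions made for illustration and are not taken from the Slurm headers or from this commit.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical state encoding; the values are assumptions. */
#define DEMO_STATE_BASE  0x00ffu	/* low bits hold the base state */
#define DEMO_PENDING     0x0000u	/* base state: pending */
#define DEMO_RUNNING     0x0001u	/* base state: running */
#define DEMO_COMPLETING  0x8000u	/* flag: job is still completing */

static_assert((DEMO_COMPLETING & DEMO_STATE_BASE) == 0,
	      "flag must not overlap the base state bits");

/* A job that is both pending and completing is treated as a requeue. */
bool demo_is_requeue(uint32_t state)
{
	return ((state & DEMO_STATE_BASE) == DEMO_PENDING) &&
	       ((state & DEMO_COMPLETING) != 0);
}

/*
 * Return the start_time to store after an update: the old value is kept
 * while the job is being requeued, otherwise the new value is taken.
 */
int64_t demo_update_start_time(uint32_t state, int64_t old_start,
			       int64_t new_start)
{
	return demo_is_requeue(state) ? old_start : new_start;
}
```

With these definitions, an update arriving for a job in state DEMO_PENDING | DEMO_COMPLETING keeps the old start_time, while the same update for a DEMO_RUNNING job replaces it.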
-
- 27 Mar, 2018 4 commits
-
-
Brian Christiansen authored
Missed from 500e23b5 Bug 4988
-
Brian Christiansen authored
job_ptr was still being referenced after releasing the read locks. Also prolog_running_decr() needs to have a job write lock and a fed read lock. Bug 4947
-
Dominik Bartkiewicz authored
Bug 4990.
-
Felip Moll authored
-
- 26 Mar, 2018 1 commit
-
-
Felip Moll authored
While restarting slurmctld with the -R flag, features were emptied. This patch does nothing when recover=2 (-R), which means active and available features are preserved from the state files. Needed for the backup slurmctld controller. Bug 4927
-
- 23 Mar, 2018 3 commits
-
-
Boris Karasev authored
Bug 4882
-
Felip Moll authored
-
Tim Wickberg authored
If block_map_size is zero (as seen on FreeBSD when not using HWLOC), xmalloc(0) will return NULL. Writing to str[0] then is invalid. Bug 4845.
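One common way to make a zero-length case like this safe is to always reserve room for the string terminator, so that writing the first byte is valid even when there are no entries. The sketch below illustrates that idea with the standard allocator; it is not the fix applied in the commit, and demo_flags_to_str() is a hypothetical stand-in for the real code.

```c
#include <assert.h>
#include <stdlib.h>

/*
 * Hypothetical helper (not taken from Slurm): render "count" flags as a
 * string of '0'/'1' characters. The buffer is always count + 1 bytes, so
 * the terminator write is valid even when count is zero.
 * Returns a heap string the caller must free(), or NULL on failure.
 */
char *demo_flags_to_str(const unsigned char *flags, size_t count)
{
	char *str;
	size_t i;

	/* A NULL array is only acceptable when there is nothing to read. */
	assert(flags != NULL || count == 0);

	str = calloc(count + 1, 1);
	if (str == NULL)
		return NULL;

	for (i = 0; i < count; i++)
		str[i] = flags[i] ? '1' : '0';
	str[count] = '\0';

	return str;
}
```

With count equal to zero the function returns an empty string rather than NULL, so callers that index str[0] stay within the allocation.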
-
- 22 Mar, 2018 1 commit
-
-
Brian Christiansen authored
Bug 4971
-