- 21 Oct, 2015 4 commits
-
-
Morris Jette authored
sbatch --ntasks option to take precedence over --ntasks-per-node plus node count, as documented. Set SLURM_NTASKS/SLURM_NPROCS environment variables accordingly. bug 2015
-
David Bigagli authored
-
Danny Auble authored
-
Danny Auble authored
-
- 20 Oct, 2015 13 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Deric Sullivan authored
Previous logic would report an error and stop the power_save thread. bug 2042
-
Morris Jette authored
Not sure when that went away...
-
Morris Jette authored
-
Morris Jette authored
If a suspended job is manually resumed and gang scheduling is configured, but no time slices are available for the job being resumed, then just resume it without adding it to a time slice. The jobs previously running on those nodes will be replaced with new jobs as resources become available and the resumed job will basically be treated like a stray job. bug 2031
-
Morris Jette authored
Avoid reporting more allocated CPUs than exist on a node. This can be triggered by resuming a previously suspended job, resulting in oversubscription of CPUs. bug 2021
-
Morris Jette authored
bug 2031
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
Add scancel -f/--full option to signal all steps including batch script and all of its child processes. bug 2031
-
Morris Jette authored
-
Morris Jette authored
-
- 19 Oct, 2015 10 commits
-
-
Brian Christiansen authored
Bug 1888
-
Danny Auble authored
out. Remove unneeded code that commit 8274ea54 fixed. This code would zero out all GRES/TRES on a reconfig, which isn't what we want. 8274ea54 does the right thing by itself.
-
Hongjia Cao authored
bug 2032
-
Deric Sullivan authored
bug 2039
-
Deric Sullivan authored
bug 2037
-
Morris Jette authored
-
Deric Sullivan authored
backport of commit 4f2e2801 from v16.05
-
Morris Jette authored
Needed to change a couple of variables from 32- to 64-bit.
-
Morris Jette authored
Add new burst_buffer.conf parameters: ValidateTimeout and OtherTimeout. See man page for details.
-
Morris Jette authored
-
- 16 Oct, 2015 2 commits
-
-
David Bigagli authored
-
Josko Plazonic authored
-
- 15 Oct, 2015 1 commit
-
-
Danny Auble authored
previously take 2 restarts of the slurmdbd to make it stick correctly.
-
- 14 Oct, 2015 1 commit
-
-
Danny Auble authored
single-threaded cores. A regression caused only 1 socket to be used on this kind of node instead of all that were available.
-
- 09 Oct, 2015 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
If a job allocation returns some invalid contents, the pointer to the job structure may be NULL. This change preserves the error message and avoids a segv.
-
- 08 Oct, 2015 2 commits
-
-
Brian Christiansen authored
Fix case where the backup slurmdbd would be killed if it had existing connections when it gave up control. In that situation it would try to signal the existing threads by using pthread_kill to send SIGKILL to them. The problem is that SIGKILL doesn't go to the thread but to the main process, so the backup dbd would be killed.
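The behaviour described in this commit message can be reproduced outside Slurm. The following is a minimal sketch in Python rather than the slurmdbd C code, assuming a POSIX system; the names `CHILD` and `run_child` are made up for this illustration. A child process sends SIGKILL to one of its own threads with `signal.pthread_kill`, and the parent observes that the whole child process was terminated by that signal.

```python
import signal
import subprocess
import sys

# Hypothetical child program: starts a worker thread, then targets only that
# thread with SIGKILL via pthread_kill. SIGKILL cannot be caught or blocked
# and its action applies to the whole process, so the entire child dies.
CHILD = """
import signal, threading, time
worker = threading.Thread(target=time.sleep, args=(30,), daemon=True)
worker.start()
signal.pthread_kill(worker.ident, signal.SIGKILL)
time.sleep(30)  # not reached: the process is already gone
"""


def run_child():
    """Run the child and return its exit status (negative = killed by signal)."""
    return subprocess.run([sys.executable, "-c", CHILD], timeout=20).returncode


if __name__ == "__main__":
    # A negative return code -N means the child was terminated by signal N.
    print(run_child())
```

A return code equal to `-signal.SIGKILL` indicates that the process, not just the targeted thread, was killed. This sketch only demonstrates the failure mode; it does not show how the commit changed the slurmdbd code.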
-
Danny Auble authored
when a cold-start (-c) happens to the slurmctld.
-
- 07 Oct, 2015 5 commits
-
-
Danny Auble authored
Conflicts: src/sacct/options.c
-
Danny Auble authored
-
Danny Auble authored
from a user. This would cause the slurmctld to cache the old default, which wasn't valid, and force the user to always request the association.
-
Danny Auble authored
Conflicts: NEWS src/plugins/accounting_storage/mysql/as_mysql_job.c
-
Morris Jette authored
bug 2009
-