- 08 Dec, 2011 2 commits
-
-
Morris Jette authored
-
Danny Auble authored
-
- 06 Dec, 2011 2 commits
-
-
Morris Jette authored
One of our testers discovered a regression in version 2.3.1. If a job is pending due to PartitionNodeLimit and the limit is relieved with a 'sacctmgr modify qos name=<qos name> set flags=partitionmaxnodes' new jobs exceeding the partition limit (but not the QOS limit) are allowed to run. However, the pending job is never allowed to run. Attached is a patch to address this problem. FYI, this problem doesn't exist in version 2.4. Patch from Bill Brophy, Bull.
-
Yuri D'Elia authored
-
- 05 Dec, 2011 3 commits
-
-
Morris Jette authored
Patch by Alexander Bersenev (Institute of Mathematics and Mechanics, Russia).
-
Morris Jette authored
-
Morris Jette authored
-
- 02 Dec, 2011 2 commits
-
-
Danny Auble authored
There was also some bad code that would reset the conn_type of a block to SMALL no matter what type of SMALL it was.
-
Morris Jette authored
Patsh from Rod Schulz, Bull.
-
- 01 Dec, 2011 1 commit
-
-
jette authored
This was due to a bug in select/cons_res with some configuration optiions and job options, especially if there is more than one thread per core and the job option includes "--threads-per-core=1". Fixes problem reported by CSCS.
-
- 30 Nov, 2011 3 commits
-
-
Danny Auble authored
lock was deemed not necessary because the information (db_index) was only internal and was only modified in the same function later which is protected by the write lock.
-
Danny Auble authored
qos on the cluster to fill that in correctly.
-
Danny Auble authored
incorrectly when getting fairshare from the parent.
-
- 23 Nov, 2011 2 commits
-
-
Morris Jette authored
-
Danny Auble authored
wasn't started at the time the eligible message was sent but started before the db_index was returned information like start time would be lost.
-
- 22 Nov, 2011 2 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
- 21 Nov, 2011 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Fix Lua link order
-
- 18 Nov, 2011 3 commits
- 16 Nov, 2011 1 commit
-
-
Pär Andersson authored
Put -llua* in LIBS rather than LDFLAGS to get correct link order. Without this the configure test for Lua fails when using GCC 4.6, the default compiler on recent Linux distributions like Ubuntu 11.10.
-
- 08 Nov, 2011 1 commit
-
-
Morris Jette authored
Note this is an old bug. The new code keeps slurmstepd alive and it keeps trying to send step completion message to slurmctld.
-
- 07 Nov, 2011 4 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
This make the same patch to select/linear as Carles Fenoy's patch to select/cons_res plugin.
-
Carles Fenoy authored
Added gres_cpus test. Without this test it could lead to the error "fatal: cons_res: sync loop not progressing" With this patch a job will be rejected if asking for unavailable configuration.
-
- 04 Nov, 2011 3 commits
-
-
Morris Jette authored
Print an error rather than setting CUDA_VISIBLE_DEVICES environment variable to "NoDevFiles" if no device files defined.
-
Morris Jette authored
Patch 4f68cde5bd6b4fcf839f6694457373c81d9548ba from chaos/slurm by Don Lipari, LLNL
-
Morris Jette authored
The change in function call order of commit e60abe43 resulted in slurmd daemons on front-end systems not registering with the proper node name.
-
- 02 Nov, 2011 1 commit
-
-
Morris Jette authored
-
- 31 Oct, 2011 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- 28 Oct, 2011 3 commits
-
-
Morris Jette authored
Backfill scheduling - Add SchedulerParameters configuration parameter of "bf_res" to control the resolution in the backfill scheduler's data about when jobs begin and end. Default value is 60 seconds (used to be 1 second).
-
Morris Jette authored
-
Morris Jette authored
Do not drain the compute or front-end node when trying to start a job for which the UID is not found
-