Commits · fbb89f5ed74b5f7630eab3fee854fdafa362f630 · Manuel G. Marciani / ces_slurm_simulator

08 Dec, 2011 2 commits
- Eliminate warning message when srun --pty option used · fbb89f5e
  Morris Jette authored Dec 08, 2011
  
  fbb89f5e
- BLUEGENE - Fixed preemption issue. · bcc3c6a9
  Danny Auble authored Dec 07, 2011
  
  bcc3c6a9
06 Dec, 2011 2 commits

Permit pending job to exceed partition limit with QOS flag change. · 0e1abeda

Morris Jette authored Dec 06, 2011

One of our testers discovered a regression in version 2.3.1.  If a job is
pending due to PartitionNodeLimit and the limit is relieved with a
'sacctmgr modify qos name=<qos name> set flags=partitionmaxnodes' new jobs
exceeding the partition limit (but not the QOS limit) are allowed to run.
However, the pending job is never allowed to run.  Attached is a patch to
address this problem.  FYI, this problem doesn't exist in version 2.4.
Patch from Bill Brophy, Bull.

0e1abeda

Fix leaked iterator in slurmdb_pack_job_cond. · bf9943fe
Yuri D'Elia authored Dec 06, 2011

bf9943fe

05 Dec, 2011 3 commits
- Fix task/cgroup plugin error when used with GRES · 6443e89f
  Morris Jette authored Dec 05, 2011
```
Patch by Alexander Bersenev (Institute of Mathematics and Mechanics, Russia).
```
  6443e89f
- Update NEWS for start of v2.3.2 work · 75bd6efe
  Morris Jette authored Dec 05, 2011
  
  75bd6efe
- Update META for v2.3.2 tag · c758925a
  Morris Jette authored Dec 05, 2011
  
  c758925a
02 Dec, 2011 2 commits
- BLUEGENE - Fixed issue with handling HTC modes and rebooting. · 04bb8ebc
  Danny Auble authored Dec 02, 2011
```
There was also some bad code that would reset the conn_type of a block
to SMALL no matter what type of SMALL it was.
```
  04bb8ebc
- Improve man pages for FSIZE propagation · d1cf49b0
  Morris Jette authored Dec 02, 2011
```
Patch from Rod Schulz, Bull.
```
  d1cf49b0
01 Dec, 2011 1 commit

Fix for "fatal: cons_res: sync loop not progressing" · d70a9ac4

jette authored Dec 01, 2011

This was due to a bug in select/cons_res with some configuration
options and job options, especially if there is more than one
thread per core and the job option includes "--threads-per-core=1".
Fixes problem reported by CSCS.

d70a9ac4

30 Nov, 2011 3 commits
- Revert write lock back to a read lock for performance reasons. A write · 1cb9dabd
  Danny Auble authored Nov 30, 2011
```
lock was deemed not necessary because the information (db_index) was only
internal and was only modified in the same function later which is
protected by the write lock.
```
  1cb9dabd
- Fixed case of not enforcing associations but wanting QOS support, so that a default · 391d8e05
  Danny Auble authored Nov 30, 2011
```
qos on the cluster is filled in correctly.
```
  391d8e05
- Fix issue in accounting where normalized shares could be updated · 2ac2662f
  Danny Auble authored Nov 30, 2011
```
incorrectly when getting fairshare from the parent.
```
  2ac2662f
23 Nov, 2011 2 commits
- Fix test for SLURM_CPUS_PER_TASK env var mod · 4d32100f
  Morris Jette authored Nov 23, 2011
  
  4d32100f
- Fixed race condition when using the DBD in accounting where if a job · a3bb2409
  Danny Auble authored Nov 23, 2011
```
wasn't started at the time the eligible message was sent but started
before the db_index was returned, information like start time would be lost.
```
  a3bb2409
22 Nov, 2011 2 commits
- Fix for fatal error managing GRES. Patch by Carles Fenoy, BSC. · 837be271
  Morris Jette authored Nov 22, 2011
  
  837be271
- Set SLURM_CPUS_PER_TASK=1 when user specifies --cpus-per-task=1 · 85a5a8d0
  Morris Jette authored Nov 22, 2011
  
  85a5a8d0
21 Nov, 2011 4 commits
- clarify location of configuration files · 1972c22a
  Morris Jette authored Nov 21, 2011
  
  1972c22a
- Add new contributor to web page · ab555ca4
  Morris Jette authored Nov 21, 2011
  
  ab555ca4
- Run autogen.sh after Lua link patch · 8f222137
  Morris Jette authored Nov 21, 2011
  
  8f222137
- Merge pull request #10 from paran1/lua-fix · e807f0ef
  Morris Jette authored Nov 21, 2011
```
Fix Lua link order
```
  e807f0ef
18 Nov, 2011 3 commits
- Fix a couple of pgsql bugs · f2e9ef1a
  jette authored Nov 18, 2011
```
Patch from Yuri D'Elia
```
  f2e9ef1a
- Discourage use of pgsql in the configurator web page · c9e170c7
  jette authored Nov 18, 2011
  
  c9e170c7
- Added accounting notes about requeued jobs · e2a04389
  jette authored Nov 18, 2011
```
Note that if a job is requeued, its submit time is changed and the record appears
as a duplicate job with a different submit time. Patch by Bill Brophy, Bull.
```
  e2a04389
16 Nov, 2011 1 commit

Fix Lua link order · 1ef73876

Pär Andersson authored Nov 15, 2011

Put -llua* in LIBS rather than LDFLAGS to get correct link order.
Without this the configure test for Lua fails when using GCC 4.6,
the default compiler on recent Linux distributions like Ubuntu
11.10.

1ef73876

08 Nov, 2011 1 commit

Avoid orphan job step if slurmctld is down when a job step completes · 9e71fd08

Morris Jette authored Nov 07, 2011

Note this is an old bug. The new code keeps slurmstepd alive and it
keeps trying to send the step completion message to slurmctld.

9e71fd08
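
The retry behaviour described above can be sketched as follows. This is a minimal Python illustration, not slurmstepd's actual code: the helper `send_until_delivered`, the `send` callable, and the simulated outage are all hypothetical, and the sketch assumes that an unreachable controller shows up as a `ConnectionError`.

```python
import time


def send_until_delivered(send, retry_delay: float = 1.0, sleep=time.sleep) -> int:
    """Call `send()` until it succeeds; return the number of attempts made.

    `send` is a hypothetical callable that returns True once the
    controller has accepted the step completion message and raises
    ConnectionError while the controller is unreachable.
    """
    attempts = 0
    while True:
        attempts += 1
        try:
            if send():
                return attempts
        except ConnectionError:
            pass  # controller is down; stay alive and try again
        sleep(retry_delay)


# Simulated controller that is down for the first two attempts.
outcomes = iter([ConnectionError(), ConnectionError(), True])


def fake_send():
    result = next(outcomes)
    if isinstance(result, Exception):
        raise result
    return result


# The sleep function is replaced with a no-op so the example runs instantly.
attempts = send_until_delivered(fake_send, retry_delay=0.0, sleep=lambda _: None)
```

The point of the design is that the sender never gives up on its own, so the completion record cannot be dropped while the controller is down.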

07 Nov, 2011 4 commits
- Add missing bracket, bug introduced in fd83389434b · d6376385
  Morris Jette authored Nov 07, 2011
  
  d6376385
- Cache current time in squeue for improved performance · 5515f1bd
  Morris Jette authored Nov 07, 2011
  
  5515f1bd
- GRES allocation ignoring some job parameters · 90862249
  Morris Jette authored Nov 07, 2011
```
This makes the same change to select/linear as Carles Fenoy's patch to the
select/cons_res plugin.
```
  90862249
- Added gres_cpus test. Without this test it could lead to the error "fatal:... · fd838943
  Carles Fenoy authored Nov 07, 2011
```
Added gres_cpus test. Without this test, a job could trigger the error "fatal: cons_res: sync loop not progressing". With this patch, a job is rejected if it asks for an unavailable configuration.
```
  fd838943
04 Nov, 2011 3 commits

Don't set CUDA_VISIBLE_DEVICES from gres/gpu if no files defined · 26e93d97

Morris Jette authored Nov 04, 2011

Print an error rather than setting CUDA_VISIBLE_DEVICES environment
variable to "NoDevFiles" if no device files defined.

26e93d97
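
As a rough illustration of this behaviour change, here is a minimal Python sketch. It is not the gres/gpu plugin code: the function `set_cuda_visible_devices`, its arguments, and the log message text are hypothetical, and the sketch assumes the variable is built as a comma-separated list of allocated device indices.

```python
import logging


def set_cuda_visible_devices(env: dict, device_files: list, allocated: list) -> bool:
    """Set CUDA_VISIBLE_DEVICES in `env` from the allocated device indices.

    Returns False and logs an error, leaving `env` untouched, when no
    device files are defined (the old behaviour described above would
    have stored the placeholder "NoDevFiles" instead).
    """
    if not device_files:
        logging.error("gres/gpu: no device files defined")  # hypothetical message
        return False
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(i) for i in allocated)
    return True


# Device files are defined: the variable is set from the allocation.
env = {}
ok = set_cuda_visible_devices(env, ["/dev/nvidia0", "/dev/nvidia1"], [0, 1])

# No device files defined: an error is logged and the variable is not set.
empty_env = {}
failed = set_cuda_visible_devices(empty_env, [], [])
```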

Updated set_oomadj.c, replacing deprecated oom_adj reference with oom_score_adj. · 9820986e
Morris Jette authored Nov 04, 2011
```
Patch 4f68cde5bd6b4fcf839f6694457373c81d9548ba from chaos/slurm by Don Lipari, LLNL
```
9820986e
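
For context, Linux exposes `/proc/<pid>/oom_score_adj` (range -1000 to 1000) as the replacement for the deprecated `/proc/<pid>/oom_adj` (range -17 to 15). The Python sketch below is not the code in set_oomadj.c. It only illustrates the idea of preferring the new file and falling back to the old one. The function names and the `proc_dir` parameter are hypothetical, and the linear scaling between the two ranges is an assumption modelled on how the kernel is understood to convert legacy values.

```python
import os


def scale_oom_adj(oom_adj: int) -> int:
    """Map a legacy oom_adj value (-17..15) onto the oom_score_adj scale (-1000..1000)."""
    if not -17 <= oom_adj <= 15:
        raise ValueError("oom_adj must be between -17 and 15")
    if oom_adj == 15:
        return 1000  # legacy maximum maps to the new maximum
    return int(oom_adj * 1000 / 17)  # assumed linear mapping, truncated toward zero


def set_oom_adjust(oom_adj: int, proc_dir: str = "/proc/self") -> str:
    """Write the adjustment, preferring oom_score_adj; return the file written."""
    new_path = os.path.join(proc_dir, "oom_score_adj")
    old_path = os.path.join(proc_dir, "oom_adj")
    if os.path.exists(new_path):
        path, value = new_path, scale_oom_adj(oom_adj)
    else:
        # Older kernels only provide the deprecated interface.
        path, value = old_path, oom_adj
    with open(path, "w") as f:
        f.write(f"{value}\n")
    return path
```

Lowering the value for a real process normally requires elevated privileges, so a daemon would typically do this at startup.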

Partial revert of commit · e76a0c9b

Morris Jette authored Nov 04, 2011

The change in function call order of commit e60abe43
resulted in slurmd daemons on front-end systems not registering with the
proper node name.

e76a0c9b

02 Nov, 2011 1 commit
- Cray - Remove the "family" specification from the GPU reservation request. · ccb8b419
  Morris Jette authored Nov 02, 2011
  
  ccb8b419
31 Oct, 2011 3 commits
- Do not look for the script file for completed jobs · 8e6ee500
  Morris Jette authored Oct 31, 2011
  
  8e6ee500
- Clarify use of EnforcePartLimits configuration parameter · e11edced
  Morris Jette authored Oct 31, 2011
  
  e11edced
- Add QOS to the information logged when a job is submitted. · a42a0dda
  Morris Jette authored Oct 31, 2011
  
  a42a0dda
28 Oct, 2011 3 commits

Add backfill scheduler resolution parameter · b86bc225

Morris Jette authored Oct 28, 2011

Backfill scheduling - Add SchedulerParameters configuration parameter of
"bf_res" to control the resolution in the backfill scheduler's data about
when jobs begin and end. Default value is 60 seconds (used to be 1 second).

b86bc225
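
To show what a coarser resolution means in practice, here is a minimal Python sketch. It is not Slurm's backfill code: it assumes that expected job end times are rounded up to a multiple of the resolution, so that nearby end times collapse into a single time slot the scheduler has to consider. The function name `round_up_to_resolution` and the sample end times are hypothetical.

```python
def round_up_to_resolution(timestamp: int, resolution: int = 60) -> int:
    """Round a time in seconds up to the next multiple of `resolution`."""
    if resolution <= 0:
        raise ValueError("resolution must be positive")
    return -(-timestamp // resolution) * resolution  # ceiling division


# Hypothetical expected end times (seconds from now) of running jobs.
end_times = [61, 95, 120, 121, 179]

# With the old 1-second resolution every distinct end time is its own slot.
slots_1s = sorted({round_up_to_resolution(t, 1) for t in end_times})

# With a 60-second resolution, nearby end times share a slot.
slots_60s = sorted({round_up_to_resolution(t, 60) for t in end_times})
```

In this example five distinct end times shrink to two slots, which is the kind of reduction in bookkeeping that a larger resolution trades against timing precision.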

cosmetic mods · 1397892c
Morris Jette authored Oct 28, 2011

1397892c

Don't drain node if job has UID not found · a183f2ed

Morris Jette authored Oct 28, 2011

Do not drain the compute or front-end node when trying to start a job
for which the UID is not found.

a183f2ed