- 10 May, 2016 24 commits
-
Tim Wickberg authored
Remove local mutex_* macros. Require <pthread.h> and fixup includes.
-
Tim Wickberg authored
Remove list_mutex_* macros. Require <pthread.h> and remove WITH_PTHREAD
-
Tim Wickberg authored
Remove cbuf_mutex_* macros. Require <pthread.h> and remove WITH_PTHREAD
-
Tim Wickberg authored
Replace all __CURRENT_FUNC__ references with C99 __func__. Remove NULL and bool definitions; use <stddef.h> and <stdbool.h> instead.
-
Tim Wickberg authored
-
Tim Wickberg authored
AF_SLURM => AF_INET; AF_INET is already commonly used. The SLURM_INADDR_ANY macro was never used.
-
Tim Wickberg authored
No functional change in theory. Clean up headers to remove configure-era checks that have been mostly useless for some time now.
1) Remove common/{getopt.[ch],getopt1.c} files and remove from build. Use C99-required functions from <unistd.h> and <getopt.h>.
2) <inttypes.h> is required by C99 and has been required for some time. Remove #ifdef blocks and replace some older <stdint.h> includes.
3) <pthread.h> isn't optional at this point. PTHREAD_MUTEX_INITIALIZER is required throughout.
4) Use <limits.h> instead of <values.h> or <float.h>.
5) <string.h> is required by C99. Remove long-deprecated <strings.h> includes.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Alejandro Sanchez authored
-
Tim Wickberg authored
-
Danny Auble authored
# Conflicts:
#	src/plugins/select/cray/select_cray.c
#	testsuite/expect/test1.84
-
Brian Christiansen authored
-
Marlys Kohnke authored
for better robustness. This cray/select plugin code has been modified to remove a possible timing window where two aeld pthreads could exist, interfering with each other through the global aeld_running variable. An additional validity check has been added to the data provided to aeld through an alpsc_ev_set_application_info() call. If an error is returned from that call, only certain errors require closing the current socket connection to aeld and establishing a new one. Other error returns will log an error message and keep the current session with aeld established.
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Morris Jette authored
This might possibly be related to bug 2334, but it's a long shot.
-
Danny Auble authored
slurm.conf instead of all. If looking for specific addresses, use the TopologyParam options No*InAddrAny. This was broken in 15.08 with the advent of the referenced TopologyParams; commits 9378f195 and c5312f52 are no longer needed. Bug 2696.
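As a hedged illustration of the options referenced above, such a setting would appear in slurm.conf roughly as follows (option names per the slurm.conf documentation; verify the exact spelling against your Slurm version):

```
# slurm.conf: avoid binding communications to INADDR_ANY
TopologyParam=NoInAddrAny
```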
-
Brian Christiansen authored
Thread names can only be 16 characters long, and we already know that the threads are from the slurmctld.
-
Brian Christiansen authored
-
- 09 May, 2016 6 commits
-
Danny Auble authored
-
Moe Jette authored
at the same time. Bug 2683. It turns out that making a variable static inside a function makes it unsafe when dealing with threads.
-
Morris Jette authored
Used for development and testing purposes
-
Morris Jette authored
-
Morris Jette authored
burst_buffer/cray - Add support for rounding up the size of a buffer request if the DataWarp configuration "equalize_fragments" is used. That option can significantly increase the size of a burst buffer allocation in order to create equal-sized buffers on all server nodes for performance reasons. Support is currently only provided by Cray for job buffers, not persistent burst buffers.
-
Brian Christiansen authored
-
- 06 May, 2016 8 commits
-
Morris Jette authored
If the node_features/knl_cray plugin is configured and a GresType of "hbm" is not defined, then add it to the GRES tables. Without this, references to a GRES of "hbm" (either by a user or Slurm's internal logic) will generate error messages. Bug 2708.
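For illustration, the manual slurm.conf declaration this commit makes unnecessary on KNL systems would look roughly like the following (node names are hypothetical; syntax per the slurm.conf documentation):

```
# slurm.conf: declare the hbm GRES so references to it are valid
GresTypes=hbm
NodeName=nid00000 Gres=hbm
```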
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
John Thiltges authored
With slurm-15.08.10, we're seeing occasional segfaults in slurmstepd. The logs point to the following line:

    slurm-15.08.10/src/slurmd/slurmstepd/mgr.c:2612

On that line, _get_primary_group() is accessing the result of getpwnam_r():

    *gid = pwd0->pw_gid;

If getpwnam_r() cannot find a matching password record, it will set the result (pwd0) to NULL, but still return 0. When the pointer is accessed, it will cause a segfault. Checking the result variable (pwd0) to determine success should fix the issue.
-
Morris Jette authored
Note that Slurm cannot support heterogeneous core counts across NUMA nodes. Bug 2704.
-
Marco Ehlert authored
I would like to mention a problem which seems to be a calculation bug of used_cores in slurm version 15.08.7.

If a node is divided into 2 partitions using MaxCPUsPerNode, like this slurm.conf configuration:

    NodeName=n1 CPUs=20
    PartitionName=cpu NodeName=n1 MaxCPUsPerNode=16
    PartitionName=gpu NodeName=n1 MaxCPUsPerNode=4

I run into a strange scheduling situation. The situation occurs after a fresh restart of the slurmctld daemon. I start jobs one by one:

case 1:

    systemctl restart slurmctld.service
    sbatch -n 16 -p cpu cpu.sh
    sbatch -n 1 -p gpu gpu.sh
    sbatch -n 1 -p gpu gpu.sh
    sbatch -n 1 -p gpu gpu.sh
    sbatch -n 1 -p gpu gpu.sh

=> Problem now: The gpu jobs are kept in PENDING state.

This picture changes if I start the jobs this way:

case 2:

    systemctl restart slurmctld.service
    sbatch -n 1 -p gpu gpu.sh
    scancel <gpu job_id>
    sbatch -n 16 -p cpu cpu.sh
    sbatch -n 1 -p gpu gpu.sh
    sbatch -n 1 -p gpu gpu.sh
    sbatch -n 1 -p gpu gpu.sh
    sbatch -n 1 -p gpu gpu.sh

and all jobs are running fine.

By looking into the code I figured out a wrong calculation of 'used_cores' in function _allocate_sc() in plugins/select/cons_res/job_test.c:

    _allocate_sc(...)
    ...
    for (c = core_begin; c < core_end; c++) {
            i = (uint16_t) (c - core_begin) / cores_per_socket;
            if (bit_test(core_map, c)) {
                    free_cores[i]++;
                    free_core_count++;
            } else {
                    used_cores[i]++;
            }
            if (part_core_map && bit_test(part_core_map, c))
                    used_cpu_array[i]++;

This part of the code seems to work only if the part_core_map exists for a partition or on a completely free node. But in case 1 there is no part_core_map for gpu created yet. When a gpu job starts, the core_map contains the 4 cores left over from the cpu job. Now all non-free cores of the cpu partition are counted as used cores in the gpu partition, and this condition will match in the next code parts:

    free_cpu_count + used_cpu_count > job_ptr->part_ptr->max_cpus_per_node

which is definitely wrong.

As soon as a part_core_map appears, meaning a gpu job was started on a free node (case 2), there is no problem at all. To get case 1 to work I changed the above code to the following, and all works fine:

    for (c = core_begin; c < core_end; c++) {
            i = (uint16_t) (c - core_begin) / cores_per_socket;
            if (bit_test(core_map, c)) {
                    free_cores[i]++;
                    free_core_count++;
            } else {
                    if (part_core_map && bit_test(part_core_map, c)) {
                            used_cpu_array[i]++;
                            used_cores[i]++;
                    }
            }

I am not sure this code change is really good, but it fixes my problem.
-
Brian Christiansen authored
-
- 05 May, 2016 2 commits
-
Morris Jette authored
-
Tim Wickberg authored
-