- 24 Oct, 2016 2 commits
-
-
Morris Jette authored
-
Dorian Krause authored
This commit fixes a bug in the multi-prog handling. When running

    salloc -N 2 srun -O --multi-prog mp.conf

where mp.conf reads

    0-192 true

srun crashes can be observed. valgrind reports:

    ==6857== Invalid read of size 4
    ==6857==    at 0x45938D: bit_realloc (bitstring.c:189)
    ==6857==    by 0x5977A9: _update_task_mask (multi_prog.c:335)
    ==6857==    by 0x597A5E: _validate_ranks (multi_prog.c:403)
    ==6857==    by 0x597D1E: verify_multi_name (multi_prog.c:469)
    ==6857==    by 0x6E7B4BE: launch_p_handle_multi_prog_verify (launch_slurm.c:453)
    ==6857==    by 0x58A25D: launch_g_handle_multi_prog_verify (launch.c:493)
    ==6857==    by 0x58E556: _opt_args (opt.c:1927)
    ==6857==    by 0x58A3B9: initialize_and_process_args (opt.c:270)
    ==6857==    by 0x591F82: init_srun (srun_job.c:459)
    ==6857==    by 0x427E70: srun (srun.c:193)
    ==6857==    by 0x428E23: main (srun.wrapper.c:17)
    ==6857==  Address 0x5ace440 is 16 bytes inside a block of size 28 free'd
    ==6857==    at 0x4C2BB4A: realloc (in /usr/lib64/valgrind/vgpreload_memcheck-amd64-linux.so)
    ==6857==    by 0x446886: slurm_xrealloc (xmalloc.c:139)
    ==6857==    by 0x45944C: bit_realloc (bitstring.c:191)
    ==6857==    by 0x5977A9: _update_task_mask (multi_prog.c:335)
    ==6857==    by 0x597A5E: _validate_ranks (multi_prog.c:403)
    ==6857==    by 0x597D1E: verify_multi_name (multi_prog.c:469)
    ==6857==    by 0x6E7B4BE: launch_p_handle_multi_prog_verify (launch_slurm.c:453)
    ==6857==    by 0x58A25D: launch_g_handle_multi_prog_verify (launch.c:493)
    ==6857==    by 0x58E556: _opt_args (opt.c:1927)
    ==6857==    by 0x58A3B9: initialize_and_process_args (opt.c:270)
    ==6857==    by 0x591F82: init_srun (srun_job.c:459)
    ==6857==    by 0x427E70: srun (srun.c:193)
-
- 21 Oct, 2016 5 commits
-
-
Morris Jette authored
Do not process SALLOC_HINT, SBATCH_HINT or SLURM_HINT environment variables if any of the following salloc, sbatch or srun command line options are specified: -B, --cpu_bind, --hint, --ntasks-per-core, or --threads-per-core. bug 3118
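The precedence rule described in this commit can be sketched as follows. This is a minimal illustration in Python, not Slurm's actual C option parser: the function name `effective_hint`, the simplified argument matching, and the pairing of each command with one variable (salloc with SALLOC_HINT, sbatch with SBATCH_HINT, srun with SLURM_HINT) are assumptions made for the example.

```python
# Options that, per the commit message, suppress processing of the *_HINT variables.
OVERRIDING_OPTIONS = (
    "-B",
    "--cpu_bind",
    "--hint",
    "--ntasks-per-core",
    "--threads-per-core",
)

# Assumed pairing of command and variable; the commit message only lists the names.
HINT_ENV = {
    "salloc": "SALLOC_HINT",
    "sbatch": "SBATCH_HINT",
    "srun": "SLURM_HINT",
}


def effective_hint(command, argv, environ):
    """Return the hint taken from the environment, or None if it is ignored.

    Hypothetical helper: matching is a plain string comparison on each
    argument, which is much cruder than real getopt-style parsing.
    """
    for arg in argv:
        name = arg.split("=", 1)[0]  # "--hint=compute_bound" -> "--hint"
        # "-B4:2" (value attached to the short option) also counts as -B.
        if name in OVERRIDING_OPTIONS or arg.startswith("-B"):
            return None
    return environ.get(HINT_ENV[command])
```

For example, `effective_hint("srun", ["--threads-per-core=1"], {"SLURM_HINT": "nomultithread"})` ignores the environment variable because an overriding option is present on the command line.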
-
Tim Wickberg authored
-
Morris Jette authored
Without this change, only the error was available, but no identification of the specific plugin that failed.
-
Morris Jette authored
Coverity was complaining that the return value of s_p_get_* was ignored. I added typecasting of the return value to (void) where needed. No change in logic, just making Coverity happy ;)
-
Morris Jette authored
-
- 20 Oct, 2016 5 commits
-
-
Tim Wickberg authored
_select_nodes_parts() was resetting state_reason to an admin hold without regard to admin vs user hold state. state_reason is the only place that user vs. admin is distinguished, so this prevented users from releasing these jobs. Bug introduced by commit fb46c84b in 16.05.5. Bug 3197.
-
Tim Wickberg authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
This is an addition to commit cb7ed937
-
- 19 Oct, 2016 8 commits
-
-
Morris Jette authored
Related to commit 974878cd
-
Morris Jette authored
The kmem limit was added in commit 084c9308
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Ole H Nielsen authored
bug 3191
-
Ole H Nielsen authored
bug 3192
-
Morris Jette authored
-
- 18 Oct, 2016 6 commits
-
-
Martin Perry authored
-
Tim Wickberg authored
Continuation of commit 2fd4d7a6. MySQL dropped support for 'ignore'; remove it unconditionally to avoid issues with mismatched client and server versions, as in bug 3189.
-
Dominik Bartkiewicz authored
Improve reported estimates of start and end times for pending jobs. bug 3184
-
Morris Jette authored
Cray: Prevent abort in backfill scheduling logic for requeued job that has been cancelled while NHC is running. bug 3185
-
Joseph Mingrone authored
The old command line would not work with FreeBSD. The new command line should work with all systems. bug 3021
-
Tomasz Abramczyk authored
Add mechanism to constrain kernel memory allocation using cgroups. New cgroup.conf parameters added: ConstrainKmemSpace, MaxKmemPercent, and MinKmemSpace. bug 2917
-
- 17 Oct, 2016 9 commits
-
-
Morris Jette authored
-
Morris Jette authored
Also replace string copies with setting of pointers for better performance.
-
Morris Jette authored
Modify DataWarp example to use an environment variable rather than an absolute path
-
Danny Auble authored
older versions. This is fixed differently going forward, so we can start warning about deprecated declarations again.
-
Danny Auble authored
-
Danny Auble authored
the way we use it and it also is deprecated, most likely since it really isn't needed.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
new glibc 2.24+ that deprecates readdir_r.
-
- 15 Oct, 2016 3 commits
-
-
Morris Jette authored
Bug reported by Coverity
-
Morris Jette authored
-
Morris Jette authored
-
- 14 Oct, 2016 2 commits
-
-
Morris Jette authored
Add support for --mem_bind=prefer option to prefer, but not restrict, memory use to the identified NUMA node. bug 3126
-
Morris Jette authored
Modify cpu_bind and mem_bind map and mask options to accept a repetition count to better support large task counts. For example: "mask_mem:0x0f*2,0xf0*2" is equivalent to "mask_mem:0x0f,0x0f,0xf0,0xf0". bug 3065
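The repetition-count syntax from this commit message can be illustrated with a short expansion routine. This is a sketch in Python of the equivalence stated above, not the parsing code added to Slurm; the function name `expand_repetitions` is hypothetical, and error handling for malformed input is left out.

```python
def expand_repetitions(spec):
    """Expand 'type:value*count,...' into 'type:value,value,...'.

    Hypothetical helper: each comma-separated item may carry an optional
    '*count' suffix, and items without a suffix are kept once.
    """
    kind, _, values = spec.partition(":")
    expanded = []
    for item in values.split(","):
        value, star, count = item.partition("*")
        repeat = int(count) if star else 1
        expanded.extend([value] * repeat)
    return f"{kind}:{','.join(expanded)}"


# The example from the commit message.
print(expand_repetitions("mask_mem:0x0f*2,0xf0*2"))
```

With a large task count, the compact form keeps the option readable: a mask repeated for 64 tasks is written once with `*64` instead of 64 times.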
-