- 19 Feb, 2016 2 commits
-
-
Morris Jette authored
Conflicts: META
-
Morris Jette authored
-
- 18 Feb, 2016 17 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
a new account and making it a default all at once. Bug 2428
-
Alejandro Sanchez authored
Match acct_gather_energy/rapl plugin. Bug 2397.
-
Tim Wickberg authored
Control whether the scheduler will continue to try to run jobs in a partition if a higher priority job is stuck due to an association limit. Can cause starvation for larger jobs, but will improve throughput and utilization for systems that have extensively divvyed up their resources through association/QOS limits. Bug 2388 and 2452.
-
Danny Auble authored
Bug 2453
-
Morris Jette authored
-
Morris Jette authored
This should have no effect, but is a belt-and-suspenders approach to checking node state.
-
Morris Jette authored
libpmi was previously using the slurm_mutex_un/lock functions, which are dependent upon other slurm functions (e.g. "fatal()"). Since this library is used by user applications and outside of slurm proper, we want to us the pthread_mutex_un/lock functions instead. Previous use of slurm functions was invoking glibc error() function rather than slurm's error() function and causing test7.2 to fail.
-
Jeff White authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Make srun logic work more like sbatch/salloc.
-
Morris Jette authored
Jenkins was reporting unused function otherwise
-
Tim Wickberg authored
have been selected set the time limit appropriately if the job didn't request one. If the partition has no DefaultTime setting, and no time_limit was given for the job, job_ptr->time_limit == NO_VAL. With AccountingStorageEnforce=safe this will prevent jobs from ever starting if the association has any limit set for CPUMins. (NO_VAL * cpus is a very large number, but if no time_limit is given anywhere that is what they get :)) Bug 2388.
-
Morris Jette authored
Fix for when a feature is available, but not active on any node.
-
Danny Auble authored
-
- 17 Feb, 2016 15 commits
-
-
Morris Jette authored
Enforce a job's timelimit based upon when it actually beings execution, after node reboot completes. However the job will be changed for resource allocation from the start of allocation, include boot time.
-
Danny Auble authored
-
Danny Auble authored
# Conflicts: # src/slurmctld/node_mgr.c
-
Morris Jette authored
-
Morris Jette authored
Move logic that re-calculates a job's end time based upon delayed launch into a separate function.
-
Tim Wickberg authored
Please submit patches through our bugzilla instance. (This would also show up on the issues page if we hadn't disabled it.)
-
Danny Auble authored
a parsing option. Dynamically set columns widths based on largest number in column.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Previous logic was failing if feature name not found
-
Morris Jette authored
Add PID to both the slumctld code and cray capmc_suspend/resume programs
-
Morris Jette authored
-
Morris Jette authored
Previous logic would cause slurmctld shutdown to wait for completion of all power save (suspend and resume) programs according to their time limits, which could be huge. The new logic waits up to 10 second then orphans the processes.
-
Morris Jette authored
-
- 16 Feb, 2016 6 commits
-
-
Morris Jette authored
The parsing of the configuration parameter failed with a prefix of "node_features" due to vestigial logic for a plugin type of "knl".
-
Morris Jette authored
Was trying to set NUMA mode as MCNUMA and vise-versa Also change capmc to specify mode before nids, which seems more robust.
-
Tim Wickberg authored
-
Morris Jette authored
the "fputs" function was aborting, trying to write a NULL string pointer. Also found the log message was printing the configured and read names in the wrong order.
-
Morris Jette authored
-
Tim Wickberg authored
abort() rather than continue if pthread_mutex_ calls fail. better to die early rather than continue on and risk corruption. mirrors the (now removed) macro definitions from cbuf/hostlist/list.
-