- 15 Apr, 2016 6 commits
-
-
Danny Auble authored
-
Marlys Konhke authored
-
Morris Jette authored
If a job was submitted with a deadline and no time_limit or min_time, but the system has a QOS MaxWall, the job's time_limit would be set to the QOS limit. Since no min_time is specified, the QOS MaxWall would be treated as both the min and max time limit for the job, potentially making the deadline impossible to satisfy. Now we set the min_time to 1 minute if there is a deadline but no time_limit or min_time.
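The rule described above can be sketched roughly as follows. This is an illustrative Python model only, not the scheduler's actual C code; the function name `resolve_time_limits` and its parameters are hypothetical, and all times are assumed to be in minutes.

```python
from typing import Optional, Tuple


def resolve_time_limits(
    time_limit: Optional[int],
    min_time: Optional[int],
    has_deadline: bool,
    qos_max_wall: Optional[int],
) -> Tuple[Optional[int], Optional[int]]:
    """Return (min_time, time_limit) in minutes. Hypothetical helper."""
    # A deadline with neither limit set: give the job a 1 minute floor so the
    # QOS MaxWall is not treated as both the minimum and the maximum.
    if has_deadline and time_limit is None and min_time is None:
        min_time = 1
    # With no explicit time_limit, the QOS MaxWall becomes the upper limit.
    if time_limit is None and qos_max_wall is not None:
        time_limit = qos_max_wall
    return min_time, time_limit
```

With a deadline, no limits, and a QOS MaxWall of 120, this model yields a range of 1 to 120 minutes, which leaves room to shrink the job so it can finish before the deadline.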
-
Morris Jette authored
Also make sure the job is cancelled at the end of the test
-
Morris Jette authored
-
Morris Jette authored
Add TopologyParam option of "TopoOptional" to optimize network topology only for jobs requesting it. bug 2567
-
- 14 Apr, 2016 23 commits
-
-
Tim Wickberg authored
Time out stalled transfers and clean up related data structures. Default to waiting five minutes since the last update. Hook onto the registration/ping message type to trigger cleanup in a minimally invasive manner. While here, restructure certain functions to use list_* functions rather than iterating on the structures.
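A minimal sketch of the cleanup idea, assuming each in-flight transfer records the time of its last update and that the purge runs whenever a periodic message such as a ping arrives. The names `purge_stalled` and `STALE_AFTER_SECONDS` are hypothetical and do not come from the source tree.

```python
from typing import Dict, List

# Five minutes since the last update, matching the default described above.
STALE_AFTER_SECONDS = 5 * 60


def purge_stalled(
    last_update: Dict[str, float],
    now: float,
    timeout: float = STALE_AFTER_SECONDS,
) -> List[str]:
    """Drop transfers idle for longer than `timeout`; return their ids."""
    stale = [tid for tid, seen in last_update.items() if now - seen > timeout]
    for tid in stale:
        # Releasing the per-transfer state is the "cleanup" step.
        del last_update[tid]
    return stale
```

Calling this from an existing periodic handler avoids a dedicated timer thread, which is one way to read "minimally invasive" here.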
-
Tim Wickberg authored
Otherwise --mail-type=ALL will send an unexpected stage_out message back to the user. Bug 2541.
-
Tim Wickberg authored
Otherwise --mail-type=ALL will send an unexpected stage_out message back to the user. Bug 2541.
-
Morris Jette authored
-
Janne Blomqvist authored
SipHash is a state-of-the-art keyed hash function whose performance is competitive with the usual non-cryptographic hash functions. It is used as the default hash function backing hash tables in, e.g., Perl, Python, and Rust. Here we initially use it for the gid cache hash table and in the common xhash implementation.
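For readers unfamiliar with the function, the following is a compact Python sketch of SipHash-2-4 (two compression rounds per 8-byte word, four finalization rounds) with a 128-bit key and 64-bit output. It is written from the published algorithm description for illustration and is not the C code added by this commit; the name `siphash24` is chosen here.

```python
MASK = 0xFFFFFFFFFFFFFFFF  # keep all arithmetic in 64 bits


def _rotl(x: int, b: int) -> int:
    return ((x << b) | (x >> (64 - b))) & MASK


def _sipround(v0: int, v1: int, v2: int, v3: int):
    v0 = (v0 + v1) & MASK
    v1 = _rotl(v1, 13) ^ v0
    v0 = _rotl(v0, 32)
    v2 = (v2 + v3) & MASK
    v3 = _rotl(v3, 16) ^ v2
    v0 = (v0 + v3) & MASK
    v3 = _rotl(v3, 21) ^ v0
    v2 = (v2 + v1) & MASK
    v1 = _rotl(v1, 17) ^ v2
    v2 = _rotl(v2, 32)
    return v0, v1, v2, v3


def siphash24(key: bytes, data: bytes) -> int:
    """Return the 64-bit SipHash-2-4 of `data` under a 16-byte `key`."""
    if len(key) != 16:
        raise ValueError("key must be exactly 16 bytes")
    k0 = int.from_bytes(key[:8], "little")
    k1 = int.from_bytes(key[8:], "little")
    v0 = k0 ^ 0x736F6D6570736575
    v1 = k1 ^ 0x646F72616E646F6D
    v2 = k0 ^ 0x6C7967656E657261
    v3 = k1 ^ 0x7465646279746573

    full = len(data) - len(data) % 8
    words = [int.from_bytes(data[i:i + 8], "little") for i in range(0, full, 8)]
    # Final word: leftover bytes, with the message length in the top byte.
    words.append(int.from_bytes(data[full:], "little") | ((len(data) & 0xFF) << 56))

    for m in words:
        v3 ^= m
        for _ in range(2):
            v0, v1, v2, v3 = _sipround(v0, v1, v2, v3)
        v0 ^= m

    v2 ^= 0xFF
    for _ in range(4):
        v0, v1, v2, v3 = _sipround(v0, v1, v2, v3)
    return v0 ^ v1 ^ v2 ^ v3
```

A hash table would typically draw the key at random at startup and reduce the result modulo the table size, which makes bucket collisions hard for an outside party to predict.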
-
Jean-Philippe Aumasson authored
-
Danny Auble authored
sacctmgr list events
-
Tim Wickberg authored
step_ptr->job_ptr is already dereferenced several times by now, so null check is unnecessary here.
-
Brian Christiansen authored
-
Morris Jette authored
Conflicts: NEWS src/plugins/accounting_storage/mysql/as_mysql_resv.c
-
Morris Jette authored
If a job fails stage in, set its reason to BurstBufferOperation with a string describing what happened. Previously the reason was set to AdminHeld on stage-in failure.
-
Brian Christiansen authored
For commits: f980c588 510abf23
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
-
Brian Christiansen authored
MB for memory and bb.
-
Brian Christiansen authored
Bug 1783
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Morris Jette authored
select/cray - Initiate step node health check at start of step termination rather than after application completely ends so that NHC can capture information about hung (non-killable) processes. bug 2192
-
Danny Auble authored
-
Danny Auble authored
the message. This was needed for sbcast or srun --bcast since the tree width is different than the default. Bug 2512
-
Danny Auble authored
-
- 13 Apr, 2016 11 commits
-
-
Tim Wickberg authored
Make default compression vary based on library availability.
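One way to picture this selection, as a hedged sketch: walk a preference list and take the first compression library found at build time. The function name, the preference order, and the library names used below are assumptions for illustration, not details taken from the commit.

```python
from typing import Iterable, Sequence


def pick_default_compression(
    available: Iterable[str],
    preference: Sequence[str] = ("lz4", "zlib"),  # assumed order
) -> str:
    """Return the first preferred library that is available, else "none"."""
    found = set(available)
    for name in preference:
        if name in found:
            return name
    return "none"
```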
-
Morris Jette authored
A node's CPULoad was getting reset to NoVal on reconfiguration. This change preserves the old value, which can fix a possible race condition with respect to test4.13
-
Tim Wickberg authored
Calling functions all moved to src/bcast/
-
Morris Jette authored
-
Morris Jette authored
power/cray - Fix bug introduced in 15.08.10 preventing operation in many cases. bug 2628
-
Morris Jette authored
-
Morris Jette authored
Was printing integer using %u format
-
Morris Jette authored
burst_buffer/cray - Fix case where a script creating or deleting a persistent buffer would fail the "paths" operation and hold the job. bug 2624
-
Danny Auble authored
and it doesn't meet basic requirements.
-
Tim Wickberg authored
-
Morris Jette authored
Test previously failed with heterogeneous nodes (i.e. different core counts on different nodes). bug 2597
-