- 22 Aug, 2013 1 commit
-
-
jette authored
This prevents possible confusion for the backup controller when it switches from primary back to backup mode, since those pthread IDs are no longer valid. Note that thread_id_rpc could be used by the backup controller after returning to backup mode.
-
- 21 Aug, 2013 2 commits
-
-
Hongjia Cao authored
If there are completing jobs, a reconfigure sets the wrong job/node state: all nodes of the completing job are set to allocated, and the job is not removed even after the completing nodes are released. The state can only be restored by restarting slurmctld after the completing nodes are released.
-
Morris Jette authored
-
- 20 Aug, 2013 4 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
jette authored
-
- 19 Aug, 2013 1 commit
-
-
jette authored
-
- 18 Aug, 2013 1 commit
-
-
jette authored
-
- 17 Aug, 2013 5 commits
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
- 16 Aug, 2013 5 commits
-
-
David Bigagli authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-> 2.6 slurmd
-
Danny Auble authored
-
- 15 Aug, 2013 5 commits
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
could end up before the job started. Bug 371
-
Danny Auble authored
-
Danny Auble authored
-
- 14 Aug, 2013 13 commits
-
-
https://github.com/SchedMD/slurm
jette authored
-
jette authored
We now reject jobs with an invalid accounting frequency at submit time rather than launch time, so the error is slightly different and the test needs to change for that.
-
Morris Jette authored
-
Morris Jette authored
This avoids waiting for the job's initiation to fail.
-
Morris Jette authored
Only cancel the job.
-
Morris Jette authored
Fix job state recovery logic in which a job's accounting frequency was not set. This would result in a value of 65534 seconds being used (the equivalent of NO_VAL in uint16_t), which could result in the job being requeued or aborted.
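The 65534 figure follows from narrowing a 32-bit sentinel to 16 bits. A minimal sketch of that arithmetic, assuming the sentinel is 0xfffffffe; the names `NO_VAL_32`, `truncate_to_u16`, and `recover_acct_freq` are hypothetical and are not taken from the Slurm source:

```c
#include <stdint.h>

/* Assumption: a 32-bit "no value" sentinel of 0xfffffffe. */
#define NO_VAL_32 0xfffffffeU

/* Narrowing keeps only the low 16 bits: 0xfffffffe -> 0xfffe (65534). */
static uint16_t truncate_to_u16(uint32_t v)
{
    return (uint16_t)v;
}

/* Hypothetical recovery helper: treat the truncated sentinel as "unset"
 * and substitute a default instead of using it as a number of seconds. */
static uint16_t recover_acct_freq(uint16_t stored, uint16_t dflt)
{
    return (stored == truncate_to_u16(NO_VAL_32)) ? dflt : stored;
}
```

With these definitions, `truncate_to_u16(NO_VAL_32)` yields 65534, and `recover_acct_freq(65534, 30)` returns the default of 30 rather than the sentinel.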
-
Morris Jette authored
-
Danny Auble authored
-
jette authored
-
David Bigagli authored
-
Morris Jette authored
Problem reported by BYU. slurm.conf included a file one byte in length. The logic created a buffer one byte long and used fgets() to read the file. fgets() reads at most one byte less than the buffer size to leave room for a trailing '\0', so it failed to read the file.
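The off-by-one described above can be reproduced in isolation. The following is a sketch under stated assumptions, not the Slurm code: `read_first_line` and `demo_one_byte_file` are hypothetical helpers, and `extra` is the number of bytes allocated beyond the file size.

```c
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: read the first line of fp into a buffer of
 * file_size + extra bytes and return how many characters were read. */
static size_t read_first_line(FILE *fp, size_t file_size, size_t extra)
{
    size_t buf_size = file_size + extra;
    size_t len = 0;
    char *buf = malloc(buf_size ? buf_size : 1);

    if (!buf)
        return 0;
    /* fgets() stores at most buf_size - 1 characters plus a '\0'. */
    if (buf_size > 0 && fgets(buf, (int)buf_size, fp) != NULL)
        len = strlen(buf);
    free(buf);
    return len;
}

/* Write a one-byte temporary file and read it back.
 * Returns the number of characters read, or -1 if setup failed. */
static long demo_one_byte_file(size_t extra)
{
    FILE *fp = tmpfile();
    size_t n;

    if (!fp)
        return -1;
    fputc('x', fp);
    rewind(fp);
    n = read_first_line(fp, 1, extra);
    fclose(fp);
    return (long)n;
}
```

Calling `demo_one_byte_file(0)` uses a buffer exactly as large as the file and reads nothing, while `demo_one_byte_file(1)` allocates one extra byte for the terminator and reads the single character.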
-
Danny Auble authored
The system size has to be set up before the priority/multifactor plugin is called. If a job was finishing while slurmctld was starting, slurmctld would hit a fatal error during the plugin's init because the size was not yet set up.
-
Danny Auble authored
-
- 13 Aug, 2013 3 commits
-
-
Morris Jette authored
-
Morris Jette authored
core reservations and reservation prolog/epilog
-
John Thiltges authored
-