This file describes changes in recent versions of SLURM. It primarily documents those changes that are of interest to users and admins. * Changes in SLURM 2.1.0-pre10 ============================= -- Improve sview layout of blocks in use. -- A user can not change the dimensions of the grid in sview. * Changes in SLURM 2.1.0-pre9 ============================= -- Added the "scontrol update SlurmctldDebug" as the preferred alternative to the "scontrol setdebug" command. -- BLUEGENE - made it so when removing a block in an error state the nodes in the block are set correctly in accounting as not in error. -- Fixed issue where if slurmdbd is not up qos' are set up correctly for associations off of cache. -- scontrol, squeue, sview all display the correct node, cpu count along with correct corresponding nodelist on completing jobs. -- Patch (Mark Grondona) fixes serious security vulnerability in SLURM in the spank_job_env functionality. -- Improve spank_job_env interface and documentation -- Add ESPANK_NOT_LOCAL error code to spank_err_t -- Made the #define DECAY_INTERVAL used in the priority/multifactor plugin a slurm.conf variable (PriorityCalcPeriod) -- Added new macro SLURM_VERSION for use in autoconf scripts to determine current version of slurm installed on system when building against the api. -- Patch from Matthieu Hautreux that adds an entry into the error file when a job or step receives a TERM or KILL signal. -- Make it so env var SLURM_SRUN_COMM_HOST is overwritten if already in existence in the slurmd. * Changes in SLURM 2.1.0-pre8 ============================= -- Rearranged the "scontrol show job" output into functional groupings -- Change the salloc/sbatch/srun -P option to -d (dependency) -- Removed the srun -d option; must use srun --slurmd-debug instead -- When running the mysql plugin natively MUNGE errors are now eliminated when sending updates to slurmctlds. -- Check to make sure we have a default account before looking to fill in default association. -- Accounting - Slurmctld and slurmdbd will now set uids of users which were created after the start of the daemons on reconfig. Slurmdbd will attempt to set previously non-existant uids every hour. -- Patch from Aaron Knister and Mark Grondona, to parse correctly quoted #SBATCH options in a batch script. -- job_desc_msg_t - in, out, err have been changed to std_in, std_out, and std_err respectfully. Needed for PySLURM, since Python sees (in) as a keyword. -- Changed the type of addr to struct sockaddr_in in _message_socket_accept() in sattach.c, step_launch.c, and allocate_msg.c, and moved the function into a common place for all the calls since the code was very similar. -- proctrack/lua support has been added see contribs/lua/protrack.lua -- replaced local gtk m4 test with AM_PATH_GTK_2_0 -- changed AC_CHECK_LIB to AC_SEARCH_LIBS to avoid extra libs in compile lines. -- Patch from Matthieu Hautreux to improve error message in slurmd/req.c -- Added support for split groups from (Matthiu Hautreux CEA) -- Patch from Mark Grondona to move blcr scripts into pkglibexecdir -- Patch from Doug Parisek to calculate a job's projected start time under the builtin scheduler. -- Removed most global variables out of src/common/jobacct_common.h * Changes in SLURM 2.1.0-pre7 ============================= -- BLUEGENE - make 2.1 run correctly on a real bluegene cluster -- sacctmgr - Display better debug for when an admin specifies a non-existant parent account when changing parent accounts. -- Added a mechanism to the slurmd to defer the epilog from starting until after a running prolog has finished. -- If a node reboots inbetween checking status the node is marked down unless ReturnToService=2 -- Added -R option to slurmctld to recover partition state also when restarting or reconfiguring. * Changes in SLURM 2.1.0-pre6 ============================= -- When getting information about nodes in hidden partitions, return a node name of NULL rather than returning no information about the node so that node index information is still valid. -- When querying database for jobs in certain state and a time period is given only jobs in that state during the period will be returned, previously if a time period was given in sacct jobs eligible to run or running would be displayed, which is still the default if no states are requested. -- One can now query jobs based on size (nodes and or cpus) (mysql plugin only) -- Applied patch from Mark Grondona that tests for a missing config file before any other processing in spank_init(). This now prevents fatal errors from being mistakenly treated as recoverable. -- --enable-debug no longer has to be stated at configure time to have the slurmctld or slurmstepd dump core on a seg fault. -- Moved the errant slurm_job_node_ready() declaration from job_info.h to slurm.h and deleted job_info.h. -- Added the slurm_job_cpus_allocated_on_node_id() slurm_job_cpus_allocated_on_node() API for working with the job_resources_t structure. -- BLUEGENE - speed up start up for systems that have many blocks (100+) configured on the system. * Changes in SLURM 2.1.0-pre5 ============================= -- Add squeue option "--start" to report expected start time of pending jobs. -- Sched/backfill plugin modified to set expected start time of pending jobs. -- Add SchedulerParameters option of "max_job_bf=#" to control how far down the queue of pending jobs that SLURM searches in an attempt backfill schedule them. The default value is 50 jobs. -- Fixed cause of squeue -o "%C" seg fault. -- Add -"-signal=@