jette | 17 Oct 23:08 2014

Slurm versions 14.03.9 and 14.11.0-rc2 are now available


Slurm versions 14.03.9 and 14.11.0-rc2 are now available.
Version 14.03.9 includes quite a few relatively minor bug fixes.
Version 14.11.0-rc2 includes a few bug fixes discovered in recent testing.
Thanks to everyone participating in the testing!
Version 14.11.0 is no longer under active development, but is undergoing
testing for a planned release in early November.

Slurm downloads are available from
http://www.schedmd.com/#repos

* Changes in Slurm 14.03.9
==========================
  -- If slurmd fails to stat(2) the configuration, print the string describing
     the error code.
  -- Fix for mixing core base reservations with whole node based reservations
     to avoid overlapping erroneously.
  -- BLUEGENE - Remove references to Base Partition.
  -- sview - Fix to display blocks correctly when sview is compiled on a
     non-BlueGene system and then used to view a BGQ system.
  -- Fix bug in update reservation. When modifying the reservation the end time
     was set incorrectly.
  -- The start time of a reservation that is in ACTIVE state cannot be
     modified.
  -- Update the cgroup documentation about release agent for devices.
  -- MYSQL - fix for setting up preempt list on a QOS for multiple QOS.
  -- Correct a minor error in the scancel.1 man page related to the
     --signal option.
  -- Enhance the scancel.1 man page to document the sequence of signals sent
  -- Fix slurmstepd core dump if the cgroup hierarchy is not completed

jette | 29 Sep 18:38 2014

Slurm User Group Meeting 2014: Presentations now online


About 70 people attended the Slurm User Group Meeting last week in
Lugano, Switzerland. There were a lot of good presentations and
discussions. Copies of the presentations are now available online at
http://slurm.schedmd.com/publications.html

NOTE: A few of the presentations are missing, but will be posted when  
available.
-- 
Morris "Moe" Jette
CTO, SchedMD LLC

Danny Auble | 17 Sep 22:58 2014

Slurm versions 14.03.8 and 14.11.0-pre5 are now available


Slurm versions 14.03.8 and 14.11.0-pre5 are now available. Version 
14.03.8 includes quite a few relatively minor bug fixes.

Version 14.11.0 is under active development and its release is planned
for November 2014. Many of its features and performance enhancements
will be discussed next week at SLUG 2014 in Lugano, Switzerland.

Note to all developers: the code freeze for new features in 14.11 will be at
the end of this month (September).

Slurm downloads are available from http://www.schedmd.com/#repos.

Highlights of the two versions:

* Changes in Slurm 14.03.8
==========================
  -- Fix minor memory leak when a job doesn't have nodes on it (meaning the job
     has finished).
  -- Fix sinfo/sview to be able to query against nodes in reserved and other
     states.
  -- Make sbatch/salloc read in (SLURM|(SBATCH|SALLOC))_HINT in order to
     handle sruns in the script that will use it.
  -- srun properly interprets a leading "." in the executable name based upon
     the working directory of the compute node rather than the submit host.
  -- Fix Lustre misspellings in hdf5 guide
  -- Fix wrong reference in slurm.conf man page to what --profile option
     should
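
The (SLURM|(SBATCH|SALLOC))_HINT notation in the changelog entry above reads
as a regular expression over environment variable names. As a minimal sketch
(Python is used only for illustration, the helper name is hypothetical, and
nothing here reflects how sbatch or salloc read the variables internally), the
names it covers can be checked like this:

```python
import re

# The pattern exactly as written in the changelog entry.
HINT_PATTERN = re.compile(r"(SLURM|(SBATCH|SALLOC))_HINT")


def is_hint_variable(name):
    """Return True if `name` is one of the variable names the pattern covers."""
    return HINT_PATTERN.fullmatch(name) is not None


# The last candidate is a made-up name, included only to show a non-match.
candidates = ["SLURM_HINT", "SBATCH_HINT", "SALLOC_HINT", "SLURM_HINT_X"]
matched = [name for name in candidates if is_hint_variable(name)]
print(matched)
```

The pattern covers three names: SLURM_HINT, SBATCH_HINT and SALLOC_HINT.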

jette | 15 Sep 21:01 2014

Slurm User Group Meeting 2014 - Last call to register


Registration for the Slurm User Group Meeting in Lugano, Switzerland  
closes in two hours. If you plan to attend and have not yet  
registered, please do so now.

http://slurm.schedmd.com/slurm_ug_agenda.html#registration
-- 
Morris "Moe" Jette
CTO, SchedMD LLC

Slurm User Group Meeting
September 23-24, Lugano, Switzerland
Find out more http://slurm.schedmd.com/slurm_ug_agenda.html

jette | 20 Aug 00:11 2014

New Slurm releases and Slurm User Group Meeting


The 2014 Slurm User Group Meeting will be held on September 23 and 24 in
Lugano, Switzerland. The meeting will include an assortment of tutorials,
technical presentations, and site reports. Prof. Felix Schürmann with the
European Human Brain Project will be our keynote speaker. Early registration
ends this week. For more information, see
http://slurm.schedmd.com/slurm_ug_agenda.html

Slurm versions 14.03.7 and 14.11.0-pre4 are now available.
Version 14.03.7 includes quite a few relatively minor bug fixes.
Version 14.11.0-pre4 includes a new job array data structure and APIs for
managing job arrays. These changes provide vastly improved scalability with
respect to job arrays. Version 14.11.0 is under active development and its
release is planned in November 2014.

Slurm downloads are available from
http://www.schedmd.com/#repos

Highlights of changes in Slurm version 14.03.7 include:
  -- Correct typos in man pages.
  -- Add note to MaxNodesPerUser and multiple jobs running on the same node
     counting as multiple nodes.
  -- PerlAPI - fix renamed call from slurm_api_set_conf_file to
     slurm_conf_reinit.
  -- Fix gres race condition that could result in job deallocation error
     message.
  -- Correct NumCPUs count for jobs with --exclusive option.
  -- When creating reservation with CoreCnt, check that Slurm uses
     SelectType=select/cons_res, otherwise don't send the request to slurmctld

jette | 9 Aug 01:10 2014

Slurm User Group Meeting 2014 updates


The 2014 Slurm User Group Meeting will be held on September 23 and 24  
in Lugano, Switzerland. The meeting will include an assortment of  
tutorials, technical presentations, and site reports.

We are pleased to announce the keynote speaker this year will be Prof.  
Felix Schürmann from Ecole Polytechnique Fédérale de Lausanne (EPFL),  
co-director of the Blue Brain Project and involved in several research  
challenges of the European Human Brain Project.

Early registration ends on 23 August. For more information, please see:
http://slurm.schedmd.com/slurm_ug_agenda.html

-- 
Morris "Moe" Jette
CTO, SchedMD LLC


jette | 18 Jul 17:41 2014

Slurm User Group Meeting 2014, Schedule and Registration


The fifth annual Slurm User Group Meeting will be held on September 23  
and 24, hosted by the Swiss National Supercomputing Centre in Lugano,  
Switzerland. The meeting will include an assortment of tutorials,  
technical presentations, and site reports. This is an excellent  
opportunity to learn more about how Slurm works and help to set future  
directions.

The schedule, registration and hotel information are now available online:
http://slurm.schedmd.com/slurm_ug_agenda.html

Thank you for your continued interest and support. We hope to see you  
in Lugano!

Sincerely,
Moe Jette
CTO, SchedMD LLC

jette | 17 Jul 00:57 2014

Slurm version 14.03.6 is now available


Slurm version 14.03.6 is now available. Version 14.03.6 includes a few
bug fixes, including one for a bug related to generic resources
that can result in the slurmctld daemon aborting.

Slurm downloads are available from
http://www.schedmd.com/#repos

Highlights of changes in Slurm version 14.03.6 include:

  -- Added examples to demonstrate the use of the sacct -T option to the man
     page.
  -- Fix for regression in 14.03.5 with sacctmgr load when Parent has "'"
     around it.
  -- Update comments in sacctmgr dump header.
  -- Fix for possible abort on change in GRES configuration.
  -- CRAY - fix modules file (backport from 14.11 commit 78fe86192b).
  -- Fix race condition which could result in requeue if batch job exit and
     node registration occur at the same time.
  -- switch/nrt - Unload job tables (in addition to windows) in user space
     mode.
  -- Differentiate between two identical debug messages about purging vestigial
     job scripts.
  -- If the socket used by slurmstepd to communicate with slurmd exists when
     slurmstepd attempts to create it, for example left over from a previous
     requeue or crash, delete it and recreate it.
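
The last entry describes a common pattern for Unix domain sockets: if a
socket file is left over at the path, remove it and bind a new one. The
following is a minimal sketch of that idea in Python, not Slurm's actual C
implementation; the function name and the socket file name are hypothetical.

```python
import os
import socket
import stat
import tempfile


def bind_fresh_unix_socket(path):
    """Bind a listening Unix socket at `path`, removing a stale one first."""
    try:
        # Only delete the existing entry if it really is a socket file.
        if stat.S_ISSOCK(os.lstat(path).st_mode):
            os.unlink(path)
    except FileNotFoundError:
        pass  # Nothing left over from an earlier run: the normal case.
    server = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    server.bind(path)
    server.listen(1)
    return server


# Simulate a leftover socket: closing the first server does not remove the
# file, so a plain bind() on the same path would fail with "address in use".
workdir = tempfile.mkdtemp()
sock_path = os.path.join(workdir, "step.sock")
first = bind_fresh_unix_socket(sock_path)
first.close()
leftover_existed = os.path.exists(sock_path)
second = bind_fresh_unix_socket(sock_path)
rebound_ok = stat.S_ISSOCK(os.lstat(sock_path).st_mode)
second.close()
```

Checking the file type before unlinking is a design choice in this sketch: it
avoids deleting an unrelated regular file that happens to sit at the same
path.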

jette | 10 Jul 23:24 2014

Slurm versions 14.03.5 and 14.11.0-pre2 are now available


Slurm versions 14.03.5 and 14.11.0-pre2 are now available. Version  
14.03.5 includes about 40 relatively minor bug fixes and enhancements  
as described below. Version 14.11.0-pre2 is the second pre-release of  
the next major release of Slurm scheduled for November 2014. This is  
very much a work in progress and not intended for production use.

Slurm downloads are available from http://www.schedmd.com/#repos.

Highlights of changes in Slurm version 14.03.5 include:
  -- If an srun runs in an exclusive allocation, doesn't use the entire
     allocation, and CR_PACK_NODES is set, lay out tasks appropriately.
  -- Correct Shared field in job state information seen by scontrol, sview,
     etc.
  -- Print Slurm error string in scontrol update job and reset the Slurm errno
     before each call to the API.
  -- Fix task/cgroup to handle -mblock:fcyclic correctly
  -- Fix for core-based advanced reservations where the distribution of cores
     across nodes is not even.
  -- Fix issue where association maxnodes wouldn't be evaluated correctly if a
     QOS had a GrpNodes set.
  -- GRES fix with multiple files defined per line in gres.conf.
  -- When a job is requeued make sure accounting marks it as such.
  -- Print the state of requeued job as REQUEUED.
  -- Fix if a job's partition was taken away from it don't allow a requeue.
  -- Make sure we lock on the conf when sending slurmd's conf to the
     slurmstepd.
  -- Fix issue with sacctmgr 'load' not able to gracefully handle a badly
     formatted file.
  -- sched/backfill: Correct job start time estimate with advanced  

jette | 16 Jun 23:59 2014

Slurm versions 14.03.4 and 14.11.0-pre1 are now available


Slurm versions 14.03.4 and 14.11.0-pre1 are now available.
Version 14.03.4 includes about 40 relatively minor bug fixes and enhancements
as described below. Of particular note, there are several enhancements to
control layout of tasks across resources and significant performance
improvements for backfill scheduling.

Version 14.11.0-pre1 is the first pre-release of the next major release of
Slurm scheduled for November 2014. This is very much a work in progress and
not intended for production use.

Slurm downloads are available from
http://www.schedmd.com/#repos

Highlights of changes in Slurm version 14.03.4 include:

  -- Fix issue where not enforcing QOS but a partition either allows or denies
     them.
  -- CRAY - Make switch/cray default when running on a Cray natively.
  -- CRAY - Make job_container/cncu default when running on a Cray natively.
  -- Disable job time limit change if its preemption is in progress.
  -- Correct logic to properly enforce job preemption GraceTime.
  -- Fix sinfo -R to print each down/drained node once, rather than once per
     partition.
  -- If a job has non-responding node, retry job step create rather than
     returning with DOWN node error.
  -- Support SLURM_CONF path which does not have "slurm.conf" as the file name.
  -- CRAY - make job_container/cncu default when running on a Cray natively
  -- Fix issue where batch cpuset wasn't looked at correctly in

jette | 27 May 23:11 2014

CFP: Slurm User Group Meeting 2014, Due 6 June


You are invited to submit an abstract of a tutorial, technical  
presentation or site report to be given at the Slurm User Group  
Meeting 2014. This event is sponsored and organized by The Swiss  
National Supercomputing Centre and will be held in Lugano, Switzerland  
on 23-24 September 2014.

This international event is open to everyone who wants to:
Learn more about Slurm, a highly scalable Resource Manager and Job Scheduler
Share their knowledge and experience with other users and administrators
Get detailed information about the latest features and developments
Share requirements and discuss future developments

Everyone who wants to present their own usage, developments, site  
report, or tutorial about Slurm is invited to send an abstract to  
sugc@...

Important Dates:
6 June 2014: Abstracts due
27 June 2014: Notification of acceptance
23-24 September 2014: Slurm User Group Meeting 2014

Program Committee:
Yiannis Georgiou (Bull)
Matthieu Hautreux (CEA)
Morris Jette (SchedMD)
Donald Lipari (LLNL, Lawrence Livermore National Laboratory)
Colin McMurtrie (CSCS, Swiss National Supercomputing Centre)
Stephen Trofinoff (CSCS, Swiss National Supercomputing Centre)


