Limiting a queue to a certain number of processor cores?
Mario Lang <mlang <at> tugraz.at>
2015-04-21 13:33:33 GMT
We are trying to figure out how to use Torque to create a queue which
would only allow a certain maximum number of processors to be used at
any given time. On the system we want to establish this, we have 16
core nodes. Our goal would be to limit the number of concurrently used
processor cores in that queue to something like 800, on a cluster with a
little more then 2000 cores available in total. Initial investigation seems to
indicate that the version of Torque we are currently using (2.4.16) does
not support this at all. The resources_available.procct queue attribute looks
like what I want, but it is only available from Torque 2.5 onwards. So
we installed Torque 4.2.9 on a test system. We couldn't quite get it to
work as expected, so upgraded to 4.2.10. Here, however, the
resources_available.procct queue attribte seems no longer available. I
can set it with something like
set queue foo resources_available.procct = 40
but when I later invoke "print queue foo" the attribute is not shown.
However, I also do not get a warning that this queue attribute is not
supported. Looks like a bug to me.
Question is: Is resources_available.procct gone again? If so, how are
we supposed to get the (IMHO rather basic) feature described above
otherwise. Is there any other queue attribute that does an equivalent
job to procct? Note that I do not want to limit the number of nodes, I
really want to limit the number of total CPU cores in use, across all
nodes of the system.
Graz University of Technology
IT Services - Computing
Steyrergasse 30/1, 8010 Graz, Austria - Europe
Phone: +43 316 873 6897
Mobile: +43 664 60 873 6897
Email: mlang <at> tugraz.at