Re: [RFC] CPU controllers?

Srivatsa Vaddagiri wrote:
Hello,
	There have been several proposals so far on this subject and no
consensus seems to have been reached on what an acceptable CPU controller
for Linux needs to provide. I am hoping this mail will trigger some
discussions in that regard. In particular I am keen to know what the
various maintainers think about this subject.

The various approaches proposed so far are:

	- CPU rate-cap (limit CPU execution rate per-task)
		http://lkml.org/lkml/2006/5/26/7	

	- f-series CKRM controller (CPU usage guarantee for a task-group)
		http://lkml.org/lkml/2006/4/27/399

	- e-series CKRM controller (CPU usage guarantee/limit for a task-group)
		http://prdownloads.sourceforge.net/ckrm/cpu.ckrm-e18.v10.patch.gz?download

	- OpenVZ controller (CPU usage guarantee/hard-limit for a task-group)
		http://openvz.org/

	- vserver controller (CPU usage guarantee(?)/limit for a task-group)
		http://linux-vserver.org/

(I apologize if I have missed any other significant proposal for Linux)

Their salient features and limitations/drawbacks, as I could gather, are
summarized later below. Note that each controller varies in its degree of
complexity and addresses its own set of requirements. In going forward
towards an acceptable controller in mainline it would help, IMHO, if we put
together the set of requirements which the Linux CPU controller should
support. Some questions that arise in this regard are:

	- Do we need mechanisms to control CPU usage of tasks, further to what
	  already exists (like nice)?  IMO yes.

	- What are the requirements of such a CPU controller? Some of them to
	  consider are:

		- Should it operate on a per-task basis or on a per-task-group
	  	  basis?
		- Should it support more than one level of task-groups?
		- If we want to allow control on a per-task-group basis, which
		  mechanism do we use for grouping tasks (Resource Groups, PAGG,
		  uid/session id ..)?
		- Should it support limit and guarantee both? In case of limit,
		  should it support both soft and hard limit?
		- What interface do we choose for the user to specify
		  limits/guarantees? A system call, or filesystem based (ex:
		  /proc or Resource Group's rcfs)?
		- Over what interval should guarantee/limit be monitored and
		  controlled?
		- With what accuracy should we allow the limit/guarantee to be
		  expressed?
		- Co-existence with CPUsets - should guarantee/limit be enforced
		  only on the set of CPUs attached to the cpuset?
		- Should real-time tasks be outside the purview of this control?
		- Should load balancing be made aware of the guarantee/limit of
		  tasks (or task-groups)? Of course yes!

One possibility is to add a basic controller, that addresses some minimal
requirements, to begin with and progressively enhance its capabilities.

I would amend this to say "provide the basic controllers and let more complex
management mechanisms use them (from outside the scheduler) to provide higher
level control." An essential part of this will be the provision of statistics
for these external controllers to use.

From this POV, both the f-series resource group controller and CPU rate-cap
seem to be good candidates for a minimal controller to begin with.

Thoughts?

Salient features of various CPU controllers that have been proposed so far are
summarized below. I have not captured OpenVZ and Vserver controller aspects
well. Request the maintainers to fill-in!

1. CPU Rate Cap	(by Peter Williams)

Features:

	* Limit CPU execution rate on a per-task basis.
	* Limit specified in terms of parts-per-thousand. Limit is set through
	  a /proc interface.

The /proc interface is not an essential part of this patch; the reason it was
implemented is that it was simple, easy and useful for testing. The patch
"proper" provides four functions for setting/getting the soft/hard caps and
exports these so that they can be used from modules.

I.e. it would be very easy to replace the /proc interface with another one (or
more), or to keep it and add another interface as well. All the essential
checking/processing required for setting the caps properly is inside the
functions, NOT the /proc interface.

	* Supports hard limit and soft limit
	* Introduces new task priorities where tasks that have exceeded their
	  soft limit can be "parked" until the O(1) scheduler picks them for
	  execution
	* Load balancing on SMP systems made aware of tasks whose execution
	  rate is limited by this feature
	* Patch is simple

Limitations:
	* Does not support guarantee

Why would a capping mechanism support guarantees? The two mechanisms can be implemented separately. The only interaction between them that is required is a statement about which has precedence. I.e. if a cap is less than a guarantee is it enforced? I would opine that it should be.

BTW if "nice" works properly, guarantees can be implemented by suitable fiddling of task "nice" values.


Drawbacks:
	* Limiting CPU execution rate of a group of tasks has to be tackled from
	  an external module (user or kernel space) which may make this approach
	  somewhat inconvenient to implement for task-groups.

Nevertheless it can be done and it has the advantage that the cost is only borne by those who wish to use such high level controls.

The caps provided by this (simple) patch provide functionality that ordinary
users can find useful. In particular, using a soft cap of zero to effectively
put a task (and all of its children) in the background is very handy for doing
software builds on a workstation. Con Kolivas's SCHED_IDLE scheduling class in
his staircase scheduler provides the same functionality and is (from all
reports) very popular.

The key difference between soft caps and the SCHED_IDLE mechanism is that soft
caps are more general: limits other than zero can be specified. This provides
more flexibility.

Peter
--
Peter Williams                                   [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
 -- Ambrose Bierce