Linus Torvalds wrote:
On Mon, 23 Apr 2007, Ingo Molnar wrote:
The "give scheduler money" transaction can be both an "implicit
transaction" (for example when writing to UNIX domain sockets or
blocking on a pipe, etc.), or it could be an "explicit transaction":
sched_yield_to(). This latter i've already implemented for CFS, but it's
much less useful than the really significant implicit ones, the ones
which will help X.
Yes. It would be wonderful to get it working automatically, so please say
something about the implementation..
The "perfect" situation would be that when somebody goes to sleep, any
extra points it had could be given to whoever it woke up last. Note that
for something like X, it means that the points are 100% ephemeral: it gets
points when a client sends it a request, but it would *lose* the points
again when it sends the reply!
So it would only accumulate "scheduling points" while multiple clients
are actively waiting for it, which actually sounds like exactly the right
thing. However, I don't really see how to do it well, especially since the
kernel cannot actually match up the client that gave some scheduling
points to the reply that X sends back.
There are subtle semantics with these kinds of things: especially if the
scheduling points are only awarded when a process goes to sleep, if X is
busy and continues to use the CPU (for another client), it wouldn't give
any scheduling points back to clients and they really do accumulate with
the server. Which again sounds like it would be exactly the right thing
(both in the sense that the server that runs more gets more points, but
also in the sense that we *only* give points at actual scheduling events).
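The scheme described above can be sketched as a toy user-space model (all names such as `toy_task`, `toy_wake`, and `toy_sleep` are invented for illustration; this is not kernel code): points travel to the last-woken task only at sleep time, so a busy server keeps accumulating while clients wait, and hands everything back when it sleeps after replying.

```c
#include <stddef.h>

/* Toy model of ephemeral "scheduling point" donation. All names here
 * are invented for illustration; nothing below is a kernel API. */
struct toy_task {
	const char *name;
	int points;                  /* transferable scheduling credit */
	struct toy_task *last_woken; /* task we most recently woke up */
};

/* A wakeup (e.g. a client sending X a request) just records the edge;
 * no points move yet. */
static void toy_wake(struct toy_task *waker, struct toy_task *wakee)
{
	waker->last_woken = wakee;
}

/* Points move only at an actual scheduling event: when a task sleeps,
 * any surplus goes to whoever it woke last. While X stays busy serving
 * other clients, it never sleeps, so donated points accumulate. */
static void toy_sleep(struct toy_task *t)
{
	if (t->last_woken && t->points > 0) {
		t->last_woken->points += t->points;
		t->points = 0;
	}
	t->last_woken = NULL;
}
```

In this model, two clients waking X before it sleeps leave it holding both donations, and when X replies to a client and then sleeps the points flow straight back — the "100% ephemeral" behaviour described above.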
But how do you actually *give/track* points? A simple "last woken up by
this process" thing that triggers when it goes to sleep? It might work,
but on the other hand, especially with more complex things (and networking
tends to be pretty complex) the actual wakeup may be done by a software
irq. Do we just say "it ran within the context of X, so we assume X was
the one that caused it?" It probably would work, but we've generally tried
very hard to avoid accessing "current" from interrupt context, including
bh's.
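One conservative answer to the softirq question is to record a waker only when the wakeup happens in process context, and leave the attribution empty otherwise — at the cost of never crediting network-driven wakeups. A toy sketch (the `in_interrupt` flag is a stand-in for the kernel's real context test; none of these names are actual kernel code):

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy wakeup attribution. Illustrative only, not kernel code. */
struct wk_task {
	struct wk_task *last_waker; /* NULL if unknown (softirq wakeup) */
};

static void wk_wakeup(struct wk_task *wakee, struct wk_task *who,
		      bool in_interrupt)
{
	/* Conservative rule: never trust "current" from interrupt
	 * context, so a softirq wakeup records no waker at all. */
	wakee->last_waker = in_interrupt ? NULL : who;
}
```

This sidesteps the problem of reading "current" from bh context, but it also means X would get no credit for wakeups that arrive via the network stack.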
Within reason, it's not the number of clients that X has that causes its
CPU bandwidth use to skyrocket and cause problems; it's more to do with
what type of clients they are. Most GUIs cause very little load on the
X server, even ones that constantly update visual data -- e.g. I can
open quite a large number of gkrellm instances without increasing X's
CPU usage very much. The exceptions are the various terminal emulators
(xterm, gnome-terminal, etc.) when used to run output-intensive command
line programs -- e.g. try "ls -lR /" in an xterm. The other way (that
I've noticed) to make X's CPU bandwidth use skyrocket is to grab a
large window and wiggle it about a lot, but hopefully that doesn't
happen often, so the problem that needs to be addressed is the one
caused by text output in xterm and its ilk.
So I think that an elaborate scheme for distributing "points" between X
and its clients would be overkill. A good scheduler will give tasks
such as audio streamers higher priority because their CPU bandwidth use
is low, making sure they get the CPU with good responsiveness even when
X takes off.
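The idea that low-bandwidth tasks earn better effective priority can be sketched as a simple rule (the function name, threshold, and boost values below are invented for illustration, not any scheduler's real heuristic):

```c
/* Toy dynamic priority: lower value = runs sooner. A task whose recent
 * CPU use is below a threshold gets a boost, so an audio streamer using
 * 2% of the CPU outranks an X server chewing 95%, even when both share
 * the same base priority. All numbers are invented for illustration. */
#define TOY_LOW_USAGE_PCT 10 /* below this, the task counts as "light" */
#define TOY_BOOST          5 /* priority levels gained by light tasks  */

static int toy_effective_prio(int base_prio, int recent_cpu_pct)
{
	if (recent_cpu_pct < TOY_LOW_USAGE_PCT)
		return base_prio - TOY_BOOST;
	return base_prio;
}
```

Under this rule a 2%-CPU streamer at base priority 20 runs at effective priority 15, ahead of a 95%-CPU X server stuck at 20 — no point transfers required.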
The one problem that might still be apparent in these cases is the mouse
becoming jerky while X is working like crazy to spew out text too fast
for anyone to read. But the only way to fix that is to give X more
bandwidth, and if it's already running at about 95% of a CPU that's
unlikely to help. To fix this you would probably need to modify X so
that it knows re-rendering the cursor is more important than rendering
text in an xterm.
In normal circumstances, the re-rendering of the mouse happens quickly
enough for the user to experience good responsiveness because X's normal
CPU use is low enough for it to be given high priority.
Just because the O(1) tried this model and failed doesn't mean that the
model is bad. O(1) was a flawed implementation of a good model.
Peter
PS Doing a kernel build in an xterm isn't an example of high enough
output to cause a problem, as (on my system) it only raises X's
consumption from 0-2% to 2-5%. The type of output that causes the
problem usually flies past too fast to read.
--
Peter Williams [email protected]
"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce