Re: [ANNOUNCE][RFC] PlugSched-6.2 for 2.6.16-rc1 and 2.6.16-rc1-mm1

Paolo Ornati wrote:
On Sun, 22 Jan 2006 10:06:43 +1100
Peter Williams <[email protected]> wrote:


---- spa_ebs: great! (as expected)

(sched_fooler)
 PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
5418 paolo     34   0  2392  288  228 R 51.4  0.1   1:06.47 a.out
5419 paolo     34   0  2392  288  228 R 43.7  0.1   0:54.60 a.out
5448 paolo     11   0  4952 1468  372 D  3.0  0.3   0:00.12 dd

(transcode)
 PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
5456 paolo     34   0  115m  18m 2432 R 51.9  3.7   0:23.34 transcode
5470 paolo     12   0 51000 4472 1872 S  5.7  0.9   0:02.38 tcdecode
5480 paolo     11   0  4948 1468  372 D  3.5  0.3   0:00.33 dd

Very good DD test performance in both cases.

Good.  How do you find the interactive responsiveness with this one?


It seems generally good.

However I've noticed that the priority of X fluctuates a lot for unknown
reasons...

When doing almost nothing it gets prio 6/7, but if I only move the
cursor a bit it jumps up to ~29.
If I'm running glxgears (with direct rendering ON) the priority stays at
6/7 and moving the cursor I'm only able to get it to 8.

This is a function of the "entitlement based" fairness part of the scheduler. Conceptually, it allocates each SCHED_NORMAL task "shares" based on its nice value (19 -> 1, 0 -> 20, -20 -> 420) and calculates an entitlement from the ratio of a task's shares to the total shares in play. It then compares the task's recent average CPU usage rate with its entitlement and sets the dynamic priority so as to try to match the CPU usage rate to the entitlement.
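
To make that concrete: this mail doesn't give the exact formula, and the mapping below is just one that reproduces the three anchor points quoted above, so treat the shape of nice_to_shares() as an assumption:

#include <stdio.h>

/* Hypothetical nice -> shares mapping: linear above nice 0, steeper
 * below it.  It matches 19 -> 1, 0 -> 20 and -20 -> 420. */
static unsigned int nice_to_shares(int nice)
{
	return nice >= 0 ? 20 - nice : 20 * (1 - nice);
}

/* A task's entitlement is its fraction of all shares in play. */
static double entitlement(unsigned int shares, unsigned int total_shares)
{
	return (double)shares / total_shares;
}

int main(void)
{
	/* e.g. two nice 0 tasks: each is entitled to half the CPU */
	printf("%u %u %u %.2f\n", nice_to_shares(19), nice_to_shares(0),
	       nice_to_shares(-20), entitlement(20, 40));
	return 0;
}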

To implement this concept efficiently (i.e. avoiding maths, especially divides, as much as possible) a slightly different approach is taken in practice. For each run queue, a recent maximum average CPU usage rate per share for tasks on that queue (a "yardstick") is kept, and each task's usage per share is compared to it. If it is greater it becomes the new yardstick and the task is given a base dynamic priority of 34; otherwise the task is given a priority between 11 and 34 in proportion to the ratio of its usage per share to the yardstick.
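
In (hypothetical) C the comparison looks roughly like this; the names are mine, not spa_ebs's, and the divide below is exactly what the real code tries to avoid:

#define MIN_BASE_PRIO	11	/* offset leaves room for the bonus */
#define MAX_BASE_PRIO	34

/* yardstick: per queue recent max average CPU usage rate per share */
static int base_prio(unsigned long usage_per_share, unsigned long *yardstick)
{
	if (usage_per_share > *yardstick) {
		*yardstick = usage_per_share;	/* new busiest task */
		return MAX_BASE_PRIO;
	}
	if (*yardstick == 0)		/* nothing has run recently */
		return MIN_BASE_PRIO;
	return MIN_BASE_PRIO + (int)((MAX_BASE_PRIO - MIN_BASE_PRIO) *
				     usage_per_share / *yardstick);
}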

Tasks are also awarded an interactive bonus, based on the amount of interactive sleeping that they've been doing recently, and this is subtracted from the base priority. The 11 point offset in the base priority is there to allow the bonus to be applied without encroaching on the RT priority range.
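
Continuing the sketch (hypothetical names again):

/* Effective dynamic priority: the interactive bonus, earned by recent
 * interactive sleeping, is subtracted from the base.  The 11 point
 * floor on the base is what keeps the result out of the RT range. */
static int effective_prio(int base, int bonus)
{
	return base - bonus;
}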

To cater for periods of inactivity, the yardstick is decayed towards zero each tick.
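
For example (the shift based decay factor below is an arbitrary illustration, not the value the scheduler actually uses):

/* Called once per tick: decay the yardstick towards zero so a past
 * burst of activity doesn't dominate comparisons once the system has
 * gone quiet. */
static void decay_yardstick(unsigned long *yardstick)
{
	*yardstick -= *yardstick >> 5;	/* roughly multiply by 31/32 */
}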

In general, this means that the busiest task on the system (in terms of CPU usage per share) at any particular time will have a priority of (34 - interactive bonus), but when the system is idle this may not be the case if the yardstick had been quite high and hasn't yet decayed enough.

This is why, when the system is idle, the X priority jumps to 29 when you move the mouse: X is now the new yardstick even with a relatively low usage rate. But when glxgears is running it becomes the yardstick with quite a high CPU usage rate per share, and when you move the mouse the X server's usage per share is still small compared to the yardstick, so it retains a small priority value.


Under load X priority goes up and it suffers (cursor jumps a bit).

IOW: strangeness!


I hope I've explained the strangeness :-) but I'm still concerned that the cursor is jumping a bit. In general, the entitlement based mechanism is quite good for interactive response, as most interactive tasks have very low CPU usage rates. Under heavy load, however, their usage rate per share can approach the yardstick (mainly because the yardstick tends to get smaller under load), so some help is required in the form of interactive bonuses. It looks like this component still needs a little work.

One area that I'm looking at is reducing the time slice size for the first CPU run after a task is forked. From the above it should be apparent that a task with a recent average CPU usage rate of zero (such as a newly forked process) will get a priority of (11 - bonus). This is usually a good thing, as it means that these tasks have good latency, but if they are CPU bound they will block out most other runnable tasks for a full time slice, which is quite long (120 msecs). (The occasions where this effect would be most noticeable are when doing something like a kernel build, where lots of CPU intensive tasks are being forked.) Shortening this first time slice won't have much effect on non CPU intensive tasks, as they would generally have surrendered the CPU voluntarily within a few msecs anyway, and it will allow the scheduler to give CPU intensive tasks an appropriate priority early in their life.
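
A sketch of what I have in mind (the names and the 10 msec value are illustrative, not settled):

#define DEF_TIMESLICE_MSECS	120	/* current default, quoted above */
#define FIRST_TIMESLICE_MSECS	10	/* assumed value, for illustration */

/* Give a task a short slice on its first CPU run so that a CPU bound
 * child can't hold the CPU for the full default slice before the
 * scheduler has any usage data on it. */
static unsigned int task_timeslice_msecs(int first_cpu_run)
{
	return first_cpu_run ? FIRST_TIMESLICE_MSECS : DEF_TIMESLICE_MSECS;
}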

Peter
--
Peter Williams                                   [email protected]

"Learning, n. The kind of ignorance distinguishing the studious."
 -- Ambrose Bierce
