Ingo Molnar wrote:
> * Peter Williams <[email protected]> wrote:
>> I've now done this test on a number of kernels: 2.6.21 and
>> 2.6.22-rc1 with and without CFS; and the problem is always present.
>> It's not "nice" related as all four tasks are run at nice == 0.
>
> could you try -v13 and did this behavior get better in any way?
It's still there, but I've got a theory about what the problem is
that is supported by some other tests I've done.
What I'd forgotten is that I had gkrellm running as well as top (to
observe which CPU tasks were on) at the same time as the spinners were
running. This meant that, between them, top, gkrellm and X were using
about 2% of the CPU -- not much, but enough to make it possible that
at least one of them was running when the load balancer was trying to
do its thing.
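For reference, the spinners are just busy loops. The actual test
program isn't reproduced here, so this is only a minimal sketch of
that kind of load (the optional nice argument is illustrative, added
so the same binary covers the nice == -10 variation below):

/* spinner.c -- minimal sketch of a CPU-bound test task; details of
 * the real test program are assumed.  An optional nice value may be
 * given as argv[1] (negative values need root). */
#include <stdio.h>
#include <stdlib.h>
#include <sys/resource.h>

int main(int argc, char **argv)
{
	if (argc > 1) {
		/* PRIO_PROCESS with who == 0 means "this process" */
		if (setpriority(PRIO_PROCESS, 0, atoi(argv[1])) == -1) {
			perror("setpriority");
			return 1;
		}
	}

	for (;;)
		;	/* spin: pure CPU use, no sleeping, no I/O */
}

Four of these started together give the 4-spinner/2-CPU setup
described above.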
This raises two possibilities:

1. the system looked balanced; or
2. the system didn't look balanced, but one of top, gkrellm or X was
   moved instead of one of the spinners.
If it's 1 then there's not much we can do about it except say that it
only happens in these strange circumstances. If it's 2 then we may
have to modify the way move_tasks() selects which tasks to move (if
we think that the circumstances warrant it -- I'm not sure that they
do); a rough sketch of one possible selection bias follows.
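To illustrate what I mean by 2 (purely hypothetical -- this is not
the real move_tasks() code and all the names are made up), the
selection could be biased towards heavy CPU users so that a near-idle
periodic task never gets picked ahead of a spinner:

#include <stdbool.h>

struct task_stats {
	unsigned long recent_cpu_ns;	/* CPU time used in the window */
	unsigned long window_ns;	/* length of the stats window */
};

/* Only treat a task as worth migrating if it's a heavy CPU user;
 * tasks like top, gkrellm and X, which run briefly at a more or less
 * fixed interval, would be passed over in favour of a spinner. */
static bool worth_migrating(const struct task_stats *ts)
{
	/* "heavy" == used more than half of its recent window */
	return ts->recent_cpu_ns * 2 > ts->window_ns;
}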
To examine these possibilities I tried two variations of the test.
a. run the spinners at nice == -10 instead of nice == 0. When I did
this the load balancing was perfect on 10 consecutive runs, which
according to my calculations makes it 99.9999997% certain that this
didn't happen by chance. This supports theory 2 above.
b. run the tests without gkrellm running but use nice == 0 for the
spinners. When I did this the load balancing was mostly perfect but
quite volatile (switching between 2/2 and 1/3 allocations of spinners
to CPUs); however, the %CPU allocation was quite good, with each
spinner getting approximately 49% of a CPU. This also supports
theory 2 above and gives weak support to theory 1 above.
This leaves the question of what to do about it. Given that most CPU
intensive tasks on a real system probably only run for a few tens of
milliseconds, it probably won't matter much in practice -- except
that a malicious user could exploit it to disrupt a system.
So my opinion is that we probably do need to do something about it but
that it's not urgent.
One thing that might work is to jitter the load balancing interval a
bit. The reason I say this is that top and gkrellm both run at a more
or less constant interval (and, in this case, X would be following
the same pattern as it's doing screen updates for top and gkrellm),
which makes it possible for the load balancing interval to
synchronize with theirs, which in turn causes the observed problem.
A jittered load balancing interval should break the synchronization.
This would certainly be simpler than trying to change the
move_tasks() logic for selecting which tasks to move.
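As a rough illustration of the jitter idea (plain userspace C, not
actual scheduler code; the +/- 12.5% spread is an arbitrary choice):

#include <stdio.h>
#include <stdlib.h>

/* Spread each interval over roughly +/- 12.5% of its nominal value
 * so the balancer can't stay phase-locked with a fixed-period task. */
static long jittered_interval(long nominal)
{
	long jitter = nominal / 8;

	if (jitter)
		nominal += (rand() % (2 * jitter + 1)) - jitter;
	return nominal;
}

int main(void)
{
	srand(12345);	/* fixed seed so the demo is repeatable */
	for (int i = 0; i < 5; i++)
		printf("next balance in %ld ms\n", jittered_interval(64));
	return 0;
}

Even a small spread like that should be enough to stop the balancer
staying in lock-step with top's or gkrellm's refresh interval.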
What do you think?
Peter
--
Peter Williams [email protected]
"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce