Re: [PATCH] sched: prevent high load weight tasks suppressing balancing

Siddha, Suresh B wrote:

On Tue, Mar 28, 2006 at 10:21:38AM +1100, Peter Williams wrote:
Siddha, Suresh B wrote:
This breaks HT and MC optimizations.. Consider a DP system with each
physical processor having two HT logical threads.. if there are two
runnable processes running on package-0, with this patch scheduler
will never move one of those processes to package-1..
Is this an active_load_balance() issue?
No. find_busiest_group() doesn't find an imbalance in this case..

But active_load_balance() is the only code that would want to move the
only runnable task off a CPU, isn't it? No other load balancing code
will try to do this that I can see.

If it is then I suggest that the solution is to fix the
active_load_balance() and associated code so that it works with this
patch in place.
It would be possible to modify find_busiest_group() and
find_busiest_queue() so that they just PREFER the busiest group to have
at least one CPU with more than one running task and the busiest queue
to have more than one task. However, this would make the code

Please don't do that... It's not for the complexity I say NO but we
are kind of patching the code instead of addressing the root issue..

considerably more complex and I'm reluctant to impose that on all
architectures just to satisfy HT and MC requirements. Are there
configuration macros or other means that I can use to exclude this
(proposed) code on systems where it isn't needed, i.e. non HT and MC
systems or HT and MC systems with only one package?
There is no config option to disable that portion of the code. Interaction
of this code with mainstream code is very small. Look at the
active_load_balance() and how this comes into play with the help of
migration thread (which gets activated through load_balance).

Yes, I've read that, which is why I say (see below) that it's backwards
and haphazard.

I'll make a temporary patch that does the PREFER I mentioned above to
tide us over until a proper rewrite of the active load balancing
functionality can be done. After giving it some more thought I think I
can keep the extra complexity fairly small.

Personally, I think that the optimal performance of the load balancing
code has already been considerably compromised by its unconditionally
accommodating the requirements of active_load_balance() (which you have
said is now only required by HT and MC systems) and that it might be
better if active load balancing was separated out into a separate
mechanism that could be excluded from the build on architectures that
don't need it. I can't help thinking that this would result in a more
efficient active load balancing mechanism as well because the current
one is very inefficient.
No. Up to now, this has been encapsulated very generically using cpu_power
and that's the reason why adding a sched domain for multi-core was simple.

It seems to me that it's being done backwards and haphazardly. As far
as I can see the problem that's trying to be solved is there is a
package that has two or more CPUs that have exactly one runnable task
and there are other packages that have all of their CPUs idle and we
want to move one task to each idle package, right?

If any of the CPUs in the package have more than one runnable task then
normal load balancing will kick in, which is why I say this special code
is only required for the case where there's exactly one task for two or
more of the CPUs in the package.

So why not write code that (every so many ticks) checks to see if a
package meets these criteria and if it does then looks for idle packages
(that's packages not groups or queues) and if it finds them initiates
active load balancing? Or some variation of that theme.
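
The criteria above could be sketched roughly as follows. This is a
standalone model only, not kernel code: struct package_model,
MAX_CPUS_PER_PACKAGE and all three function names are made up for
illustration, and a real version would have to walk the sched domain
and run queue structures instead of a plain array.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_CPUS_PER_PACKAGE 8	/* assumption, for the model only */

/* Hypothetical model of a physical package: runnable tasks per logical CPU. */
struct package_model {
	size_t nr_cpus;
	unsigned int nr_running[MAX_CPUS_PER_PACKAGE];
};

/* True if every CPU in the package is idle. */
static bool package_is_idle(const struct package_model *pkg)
{
	assert(pkg != NULL);
	for (size_t i = 0; i < pkg->nr_cpus; i++)
		if (pkg->nr_running[i] != 0)
			return false;
	return true;
}

/*
 * True if two or more CPUs have exactly one runnable task and no CPU
 * has more than one (in which case normal load balancing applies).
 */
static bool package_needs_spreading(const struct package_model *pkg)
{
	size_t single = 0;

	assert(pkg != NULL);
	for (size_t i = 0; i < pkg->nr_cpus; i++) {
		if (pkg->nr_running[i] > 1)
			return false;
		if (pkg->nr_running[i] == 1)
			single++;
	}
	return single >= 2;
}

/* Index of the first idle package other than `self`, or -1 if none. */
static int find_idle_package(const struct package_model *pkgs,
			     size_t nr_pkgs, size_t self)
{
	for (size_t i = 0; i < nr_pkgs; i++)
		if (i != self && package_is_idle(&pkgs[i]))
			return (int)i;
	return -1;
}
```

For the DP example from earlier in the thread (two packages, two HT
threads each, both tasks on package-0), package_needs_spreading() would
be true for package-0 and find_idle_package() would return package-1.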

At the end of scheduler_tick() you could do (every so many ticks):

if rebalance_tick() didn't pull any tasks and this run queue has exactly
one runnable task then
        if the package that this run queue is in meets the criteria then
                set the run queue's active_balance flag and let the
                migration thread know that it has work to do.

Properly packaged, this code could be excluded from the build on
architectures that don't need it.
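
A minimal sketch of how such a tick-time check might be packaged so that
it compiles away where it isn't wanted. Everything here is an
assumption for illustration: PACKAGE_SPREAD_CHECK is not an existing
config option, struct rq_model stands in for the real run queue, and the
"pulled" and "package_ok" arguments stand in for the result of
rebalance_tick() and the package criteria test respectively.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical build switch; not an existing kernel config option. */
#ifndef PACKAGE_SPREAD_CHECK
#define PACKAGE_SPREAD_CHECK 1
#endif

/* Hypothetical stand-in for a run queue: only the fields the check needs. */
struct rq_model {
	unsigned int nr_running;
	bool active_balance;
};

#if PACKAGE_SPREAD_CHECK
/*
 * Meant to be called at the end of the tick. Acts only every `interval`
 * ticks, and only if the regular rebalance pulled nothing, this run queue
 * has exactly one runnable task, and its package meets the criteria.
 * Returns true if active balancing was requested; the caller would then
 * let the migration thread know that it has work to do.
 */
static bool package_spread_tick(struct rq_model *rq, unsigned long tick,
				unsigned long interval, bool pulled,
				bool package_ok)
{
	assert(rq != NULL && interval > 0);

	if (tick % interval != 0)
		return false;
	if (pulled || rq->nr_running != 1 || !package_ok)
		return false;

	rq->active_balance = true;
	return true;
}
#else
/* Excluded from the build: the check becomes a no-op. */
static inline bool package_spread_tick(struct rq_model *rq, unsigned long tick,
				       unsigned long interval, bool pulled,
				       bool package_ok)
{
	(void)rq; (void)tick; (void)interval; (void)pulled; (void)package_ok;
	return false;
}
#endif
```

With the switch off, callers keep the same call site and the compiler
can drop the check entirely.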

Peter
PS I don't think that this issue is sufficiently important to prevent
the adoption of the smpnice patches while it's being resolved.
Scheduler is a very critical piece in the kernel. We need to understand and fix
all the cases..

Yes, but this particular problem is very minor, especially when
compared to the general breakage of "nice" on SMP systems without the
smpnice patch.

Peter
--
Peter Williams                                   pwil3058@bigpond.net.au

"Learning, n. The kind of ignorance distinguishing the studious."
 -- Ambrose Bierce