Re: better wake-balancing: respin

Ingo Molnar wrote:

* Nick Piggin <nickpiggin@yahoo.com.au> wrote:
Chen, Kenneth W wrote:
Once upon a time, this patch was in -mm tree (2.6.13-mm1):
http://marc.theaimsgroup.com/?l=linux-kernel&m=112265450426975&w=2

It is neither in Linus's official tree, nor it is in -mm anymore.

I guess I missed the objection for dropping the patch.  I'm bringing
My objection for the patch is that it seems to be designed just toimprove your TPC - and I don't think we've seen results yet... or didI miss that?
Also - by no means do I think improving TPC is wrong, but I think sucha patch may not be the right way to go. It doesn't seem to solve yourproblem well.
Nick, the TPC workload is simple and has been described before: lots ofinterrupts arriving on many CPUs, and waking up tasks randomly, which doshort amount of work and then go back to sleep again. There is nocorrelation between the CPU the interrupt arrives on and the CPU thetask gets woken up on. There is no point in immediate balancing either:the IRQs are well-balanced themselves so there are no load transients totake care of (except for idle CPUs, which my patch handles), and thenext wakeup for that task wont arrive on the same CPU anyway.
in such a workload, my patch will clearly improve things, by notbouncing tasks around wildly.

Ingo, I wasn't aware that tasks are bouncing around wildly; does
your patch improve things? Then by definition it must penalise
workloads where the pairings are more predictable?

I would prefer to try fixing wake balancing before giving up and
turning it off for busy CPUs.

Now you may have one of two problems. Well it definitely looks likeyou are taking a lot of cache misses in try_to_wake_up - however thiswon't be due to the load balancing stuff, but rather from locking theremote CPUs runqueue and touching its runqueues, and cachelines in thetask_struct that had been last touched by the remote CPU.
no, because you are not considering a fundamentally random workload likeTPC. There is only a 1:8 chance to hit the right CPU with the interrupt,and there is no benefit from moving the task to the CPU it got woken upfrom. In fact, it hurts by doing pointless migrations.

It doesn't always migrate though. That's the point of all the heuristics.

my patch adds the rule that we only consider 'fast' migration whenprovably beneficial: if the target CPU is idle. Any other case will haveto go over the 'slow' migration paths.

wrong. There is no way you can "prove" that a migration is beneficial!

In fact, if the balancing stuff in try_to_wake_up is working as itshould, then it will result in fewer "remote wakups" because taskswill be moved to the same CPU that wakes them. Schedstats can tell usa lot about this, BTW.
wrong. Even if the balancing stuff in try_to_wake_up is working as itshould, it can easily happen that moving a task is not worthwhile: ifthere is little or no further relationship between the wakeup CPU andthe IRQ CPU, i.e. when the migration cost is larger than therelationship-win between the wakeup CPU and the IRQ CPU.
so for me the decision logic is simple: the balancing code logic ismigrating over-eagerly, and this simple and straightforward patch makesit less eager for an important workload class. You are welcome tosuggest other approaches, but simply saying "I dont like this" wontbring us further, as the damage on TPC workloads is clearlydemonstrated. If this patch hurts other workloads (and please

Ken mentioned it was worth 2%. Not a bad improvement, but if our
performance "sucks" then it sounds like we need to look elsewhere.

demonstrate them instead of calling my patch a hammer - the patch hasbeen in -mm for many months already) then simply provide the logic thatwill do the balancing for those workloads only, without hurting thisworkload!

No doubt that if it is doing pointless migrations that your patch
prevents, then that will improve performance here. However I'd rather
try to fix the actual balancing code.

Without any form of wake balancing, then a multiprocessor system will
tend to have a completely random distribution of tasks over CPUs over
time. I prefer to add a driver so it is not completely random for
amenable workloads.

--
SUSE Labs, Novell Inc.

Send instant messages to your online friends http://au.messenger.yahoo.com-

To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: better wake-balancing: respin
  - From: Ingo Molnar <mingo@elte.hu>

References:
- better wake-balancing: respin
  - From: "Chen, Kenneth W" <kenneth.w.chen@intel.com>
- Re: better wake-balancing: respin
  - From: Nick Piggin <nickpiggin@yahoo.com.au>
- Re: better wake-balancing: respin
  - From: Ingo Molnar <mingo@elte.hu>

Prev by Date: Re: kernel BUG at mm/slab.c:1488! (2.6.13.2)
Next by Date: Re: [PATCH RFC] big reader semaphore take#2
Previous by thread: Re: better wake-balancing: respin
Next by thread: Re: better wake-balancing: respin
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]