Andrew Morton wrote:
Con Kolivas <[email protected]> wrote:
On Wednesday 08 February 2006 01:28, Nick Piggin wrote:
I'd like to get some comments on removing smpnice for 2.6.16. I don't
think the code is quite ready, which is why I asked for Peter's additions
to also be merged before I acked it (although it turned out that it still
isn't quite ready with his additions either).
I think the issues have now been resolved (see Martin J Bligh's message
of 2006/01/29 11:52 "Re: -mm seems significantly slower than mainline on
kernbench thread").
Basically I have had similar observations to Suresh in that it does not
play nicely with the rest of the balancing infrastructure (and raised
similar concerns in my review).
The samples (group of 4) I got for "maximum recorded imbalance" on a 2x2
SMP+HT Xeon are as follows:
           | Following boot | hackbench 20        | hackbench 40
-----------+----------------+---------------------+---------------------
2.6.16-rc2 | 30,37,100,112  | 5600,5530,6020,6090 | 6390,7090,8760,8470
+nosmpnice |  3,  2,  4,  2 |   28, 150, 294, 132 |  348, 348, 294, 347
Hackbench raw performance is down around 15% with smpnice (but that in
itself isn't a huge deal because it is just a benchmark). However, the
samples show that the imbalance passed into move_tasks is increased by
about a factor of 10-30. I think this would also go some way to
explaining latency blips turning up in the balancing code (though I
haven't actually measured that).
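For reference, here is a minimal, self-contained sketch of how such a
maximum could be extracted from logged imbalance values (one value per
line on stdin, e.g. emitted by a debug printk in the balancer). The
logging format is an assumption made for illustration; this is not the
instrumentation actually used for the numbers above.

/*
 * Illustrative post-processing sketch: report the largest imbalance
 * seen in a stream of logged values.
 */
#include <stdio.h>

int main(void)
{
	unsigned long imbalance, max_imbalance = 0;

	/* One imbalance value per line on stdin. */
	while (scanf("%lu", &imbalance) == 1) {
		if (imbalance > max_imbalance)
			max_imbalance = imbalance;
	}
	printf("maximum recorded imbalance: %lu\n", max_imbalance);
	return 0;
}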
We'll probably have to revert this in the SUSE kernel.
The other option for 2.6.16 would be to fast track Peter's stuff, which
I could put some time into...
See below. Should only need testing with hackbench.
but that seems a bit risky at this stage
of the game.
I'd like to hear any other suggestions though. Patch included to aid
discussion at this stage, rather than to encourage any rash decisions.
I see the demonstrable imbalance, but I was wondering if there is a
real-world benchmark that is currently affected?
Well was any real-world workload (or benchmark) improved by the smpnice
change?
Because if we have one workload which slowed and none which sped up,
it's a pretty easy decision..
Not really, as the "smp nice" mechanism isn't meant to speed things up.
It's meant to ensure that nice still works more or less as expected on
SMP systems. So the real question is "Does it achieve this without
undue cost or unacceptable side effects?". Even though there are no
free lunches, the extra cost of this mechanism when there are no
non-zero niced tasks is expected to be almost too small to measure.
Similarly, there should be no side effects unless there are non-zero
niced tasks, and those side effects should be predictable (or, at
least, explainable) and due solely to the fact that nice is being better enforced.
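To make that concrete, here is a minimal userspace sketch of the general
idea of priority-weighted load. The NICE_0_WEIGHT value and the linear
mapping below are invented for illustration; they are not the table or
formula used by the actual patches.

#include <stdio.h>

#define NICE_0_WEIGHT 1024

/*
 * Lower nice (higher priority) => larger weight, so the balancer sees
 * heavily niced tasks as "lighter".  Linear mapping chosen purely for
 * illustration.
 */
static unsigned int nice_to_weight(int nice)
{
	if (nice < -20)
		nice = -20;
	if (nice > 19)
		nice = 19;
	return NICE_0_WEIGHT * (20 - nice) / 20;
}

int main(void)
{
	int nices[] = { 0, 5, 10, 15 };
	unsigned int i, total = 0;

	for (i = 0; i < 4; i++) {
		printf("nice %2d -> weight %u\n",
		       nices[i], nice_to_weight(nices[i]));
		total += nice_to_weight(nices[i]);
	}
	printf("run queue load with one task at each nice level: %u\n", total);
	return 0;
}

With a flat count each task would contribute 1 to the load; with weights
the nice 0 task dominates the run queue's apparent load, which is what
gives the balancer the information it needs to enforce nice when moving tasks.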
I think that the problems with excessive idling with the early versions
of my modifications to Con's patch showed that the load balancing code
is fairly sensitive to the average load per normal task not being
approximately 1. My latest patches restore this state of affairs and
kernbench testing indicates that the excessive idling has gone away (see
Martin J Bligh's message of 2006/01/29 11:52 "Re: -mm seems
significantly slower than mainline on kernbench thread").
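As a purely arithmetic illustration of that sensitivity (the weights and
NICE_0_WEIGHT below are assumptions for illustration, not values taken
from the patches): once "load" is a weighted sum rather than a task
count, the average per runnable task is no longer approximately 1 unless
it is scaled back into nice-0-task units.

#include <stdio.h>

#define NICE_0_WEIGHT 1024

int main(void)
{
	/* Four runnable tasks: two at nice 0 and two niced down. */
	unsigned long weights[] = { 1024, 1024, 512, 256 };
	unsigned long sum = 0;
	int i, nr = 4;

	for (i = 0; i < nr; i++)
		sum += weights[i];

	printf("raw weighted load:          %lu\n", sum);
	printf("avg per task, raw units:    %lu\n", sum / nr);
	printf("avg per task, nice-0 units: %.2f\n",
	       (double)sum / NICE_0_WEIGHT / nr);
	return 0;
}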
My testing indicates that niceness works well on dual CPU SMP systems
but it would be nice if those who have commented (from time to time) on
the need for this (I forget their names -- Con?) could also test it.
Some testing of this on larger systems would also be appreciated. (A
quick test is to start 4 reasonably CPU-intensive (e.g. 80%) tasks per
CPU with nice values of 0, 5, 10 and 15 and then observe their CPU
usage rates after the system has had a chance to settle down. Their
usage rates should be in accord with their nice values. The aspin
program in the simloads package
<http://prdownloads.sourceforge.net/cpuse/simloads-0.1.1.tar.gz?download>
is suitable for the purpose as it has mechanisms for setting CPU demand
and niceness. E.g. "aspin -c 0.08 -p 0.02 -n 4 -N 10 &" would start 4
copies of aspin with "nice" == 10; each copy would loop for 6 minutes,
and during each loop it would run until it had consumed 0.08 seconds of
CPU and then sleep until 0.02 seconds had elapsed, giving a demand of
about 80%. If you do this on a system without the patches, you may be
lucky and have the tasks distributed among the CPUs so that their usage
rates match their nice values, but more often than not they will be
distributed in ways that cause their usage rates to not match their
nice values. Users complain when this results in heavily niced tasks
still getting higher CPU usage rates than tasks with nice of 0.)
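A small launcher along those lines, for convenience (this assumes aspin
from simloads is on PATH and reuses the demand settings from the example
command above; it is a sketch, not part of simloads):

#include <stdio.h>
#include <stdlib.h>

int main(void)
{
	/* One group of 4 aspin tasks at each nice level, ~80% demand each. */
	const int nices[] = { 0, 5, 10, 15 };
	char cmd[128];
	int i;

	for (i = 0; i < 4; i++) {
		snprintf(cmd, sizeof(cmd),
			 "aspin -c 0.08 -p 0.02 -n 4 -N %d &", nices[i]);
		if (system(cmd) != 0)
			fprintf(stderr, "failed to start nice %d group\n",
				nices[i]);
	}
	return 0;
}

After the system has settled, top(1) should show the groups' CPU usage
rates ordered by their nice values.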
Finally, the issue of whether the unacceptable side effects with
hackbench described by Nick are still present when my patch is applied
on top of Con's patch needs to be tested. I think that this is the only
issue that would prevent my patch from going into the mainline, as the
other issues have been resolved.
Peter
--
Peter Williams [email protected]
"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce