Peter Williams wrote:
Andrew Morton wrote:
"Siddha, Suresh B" <[email protected]> wrote:
On Tue, Feb 07, 2006 at 03:36:17PM -0800, Andrew Morton wrote:
Suresh, Martin, Ingo, Nick and Con: please drop everything, triple-check and test this:
From: Peter Williams <[email protected]>
This is a modified version of Con Kolivas's patch to add "nice" support to load balancing across physical CPUs on SMP systems.
I have a couple of issues with this patch.
a) On a lightly loaded system, this will result in a higher priority job hopping around from one processor to another. This is because of the code in find_busiest_group(), which assumes that SCHED_LOAD_SCALE represents a unit process load; with the nice_to_bias calculations this is no longer true (in the presence of non nice-0 tasks).
My testing showed that 178.galgel in SPECfp2000 is down by ~10% when run with nice -20 on a 4P (8-way with HT) system, compared to a nice-0 run.
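To restate that objection concretely before responding: the complaint is that the balancer treats SCHED_LOAD_SCALE as one task's worth of load when it sizes an imbalance, and that biasing breaks this. A toy userspace illustration follows; every numeric value in it (the SCHED_LOAD_SCALE value, the bias factor) is assumed for illustration only and not taken from the patch.

/*
 * Toy userspace illustration of the objection quoted above: if the
 * balancer treats SCHED_LOAD_SCALE as one task's worth of load, a
 * biased (non nice-0) task no longer counts as exactly one task.
 * All values here are assumptions for illustration only.
 */
#include <stdio.h>

#define SCHED_LOAD_SCALE 128UL	/* assumed "one unit task" value */

int main(void)
{
	/* a CPU running one nice-0 task */
	unsigned long cpu0_load = 1 * SCHED_LOAD_SCALE;
	/* a CPU running one nice -20 task, assuming bias doubles its load */
	unsigned long cpu1_load = 2 * SCHED_LOAD_SCALE;

	/* unit-load accounting sees a phantom second task on cpu1 */
	printf("cpu0 apparent task count: %lu\n", cpu0_load / SCHED_LOAD_SCALE);
	printf("cpu1 apparent task count: %lu\n", cpu1_load / SCHED_LOAD_SCALE);
	return 0;
}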
This is a bit of a surprise. Surely, even with this mod, a task shouldn't be moved if it's the only runnable one on its CPU. And if it isn't the only runnable one on its CPU, isn't actually on the CPU and isn't cache hot, then moving it to another (presumably) idle CPU should be a gain?
Presumably the delay waiting for the current task to exit the CPU is less than the time taken to move the task to the new CPU? I'd guess that this means the task about to be moved is either: a) higher priority than the current task on the CPU and waiting for it to be preempted off, or b) equal in priority (or at least next due to be scheduled) to the current task, waiting for it to surrender the CPU, with that surrender going to happen pretty quickly due to the current task's natural behaviour?
After a little lie down :-), I now think that this problem has been misdiagnosed and that the actual problem is that movement of high priority tasks on lightly loaded systems is suppressed by this patch, rather than the patch causing such tasks to hop from CPU to CPU.
The reason that I think this is that the implementation of biased_load() makes it a likely outcome. As a shortcut for converting weighted load to biased load, it assumes that the average weighted load per runnable task is 1 (or, equivalently, that the average biased prio per runnable task is NICE_TO_BIAS_PRIO(0)). This means that if there's only one task to move and its nice value is less than zero (i.e. it's high priority), the biased load to be moved that gets calculated will be smaller than that task's bias_prio, causing it to NOT be moved by move_tasks().
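To make the shortcut explicit, biased_load() as I read it amounts to something like the sketch below. This is my paraphrase, not the patch verbatim: the division by SCHED_LOAD_SCALE is my assumption about how the units are normalised, and the two macro definitions are assumed values included only so the sketch is self-contained.

/* assumed values for the sketch only; the real definitions are in the patch */
#define SCHED_LOAD_SCALE	128UL
#define NICE_TO_BIAS_PRIO(nice)	(20 - (nice))	/* assumed form, not verified */

/*
 * Sketch of the shortcut (my paraphrase): pretend every runnable task
 * carries the bias of a nice-0 task, so weighted load expressed in
 * SCHED_LOAD_SCALE units converts to bias units by multiplying by
 * NICE_TO_BIAS_PRIO(0).
 */
static inline unsigned long biased_load(unsigned long wload)
{
	return (wload * NICE_TO_BIAS_PRIO(0)) / SCHED_LOAD_SCALE;
}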
Do you have any direct evidence to support your "hopping" hypothesis or
is my hypothesis equally likely?
If my hypothesis holds, there is a relatively simple fix that would involve modifying biased_load() to take rq->prio_bias and rq->nr_running into account during its calculations. Basically, in that function, (wload * NICE_TO_BIAS_PRIO(0)) would be replaced by (wload * rq->prio_bias / rq->nr_running), which would, in turn, require rq to be passed in as an argument (see the sketch below).
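A sketch of what that fix might look like, under the same normalisation assumption as the sketch above; the runqueue_t type name and the guard against an empty runqueue are my additions, not part of the patch.

/*
 * Sketch of the suggested fix (my paraphrase, untested): scale by the
 * runqueue's actual average bias per runnable task, rq->prio_bias /
 * rq->nr_running, instead of the nice-0 constant.  The SCHED_LOAD_SCALE
 * divisor and the nr_running guard are my assumptions/additions.
 */
static inline unsigned long biased_load(runqueue_t *rq, unsigned long wload)
{
	if (!rq->nr_running)
		return 0;
	return (wload * rq->prio_bias / rq->nr_running) / SCHED_LOAD_SCALE;
}

For a runqueue whose only runnable task is high priority, rq->prio_bias / rq->nr_running is just that task's own bias_prio, so the calculated biased load should no longer come out systematically smaller than the task's bias_prio.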
If there is direct evidence of hopping then, because of my hypothesis
above, I would shift the suspicion from find_busiest_group() to
try_to_wake_up().
Peter
--
Peter Williams [email protected]
"Learning, n. The kind of ignorance distinguishing the studious."
-- Ambrose Bierce