Re: -mm seems significanty slower than mainline on kernbench

Peter Williams wrote:

Con Kolivas wrote:

On Wednesday 11 January 2006 23:24, Peter Williams wrote:

Martin J. Bligh wrote:

That seems broken to me ?


But, yes, given that the problem goes away when the patch is removed
(which we're still waiting to see) it's broken.  I think the problem is
probably due to the changed metric (i.e. biased load instead of simple
load) causing idle_balance() to fail more often (i.e. it decides to not
bother moving any tasks more often than it otherwise would) which would
explain the increased idle time being seen.  This means that the fix
would be to review the criteria for deciding whether to move tasks in
idle_balance().

Look back on my implementation. The problem as I saw it was that one task alone with a biased load would suddenly make a runqueue look much busier than it was supposed to so I special cased the runqueue that had precisely one task.
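To make the effect concrete, here is a minimal user-space sketch of how a biased (weighted) load can make a one-task queue look busier than a two-task queue. The per-task weights are hypothetical, since the actual bias values are not given in this thread, and biased_load() is an illustrative helper, not a kernel function.

```c
#include <stddef.h>

/*
 * Sum of per-task weights on one queue. The weights are hypothetical
 * stand-ins for whatever bias the -mm patch applies per task.
 */
static unsigned long biased_load(const unsigned long *weights,
				 size_t nr_running)
{
	unsigned long sum = 0;
	size_t i;

	for (i = 0; i < nr_running; i++)
		sum += weights[i];
	return sum;
}
```

With one task of weight 300 on queue A and two tasks of weight 100 on queue B, the simple load (task count) ranks B above A, but the biased load ranks A (300) above B (200), even though A has nothing that can be stolen without leaving it idle.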


OK.  I'll look at that.

OK. I agree that this mechanism increases the chances that a queue with only one runnable task is selected as the target for stealing tasks from. The attached patch addresses this issue in two ways:

1. in find_busiest_group(), only groups that have at least one queue with more than one task running are considered, and
2. in find_busiest_queue(), only queues with more than one runnable task are considered.
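The queue-level check in item 2 can be sketched in user space as follows. This is only an illustration under simplifying assumptions: struct toy_rq and toy_find_busiest_queue() are hypothetical names, and the load field stands in for whatever source_load() would report.

```c
#include <stddef.h>

/* Hypothetical stand-in for runqueue_t with only the fields needed here. */
struct toy_rq {
	unsigned long load;      /* biased load of this queue */
	unsigned int nr_running; /* number of runnable tasks */
};

/*
 * Return the index of the most loaded queue that has more than one
 * runnable task, or -1 if no queue qualifies.
 */
static int toy_find_busiest_queue(const struct toy_rq *rqs, size_t n)
{
	unsigned long max_load = 0;
	int busiest = -1;
	size_t i;

	for (i = 0; i < n; i++) {
		/* A queue with a single task is never a stealing target. */
		if (rqs[i].load > max_load && rqs[i].nr_running > 1) {
			max_load = rqs[i].load;
			busiest = (int)i;
		}
	}
	return busiest;
}
```

Given queues with (load, nr_running) of (300, 1), (120, 2) and (100, 3), the sketch skips the first queue despite its higher load and picks the second.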

As I see it, this patch is a bit iffy as it is affected by race conditions in two ways:

1. just because there's more than one task runnable when these checks are made there's no guarantee that this will be the case when you try to move some of them, and
2. just because there's only one task runnable when these checks are made it's possible that there will be more than one when you attempt the move.
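The usual guard against case 1 is to re-check the condition once the queue locks are held. A minimal user-space sketch, assuming POSIX mutexes in place of the kernel's runqueue locks; struct locked_rq, lock_pair() and try_pull_one() are hypothetical names, not kernel code:

```c
#include <pthread.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical queue: a lock plus a count of runnable tasks. */
struct locked_rq {
	pthread_mutex_t lock;
	unsigned int nr_running;
};

/* Take both locks in address order so two pullers cannot deadlock. */
static void lock_pair(struct locked_rq *a, struct locked_rq *b)
{
	if ((uintptr_t)a > (uintptr_t)b) {
		struct locked_rq *tmp = a;

		a = b;
		b = tmp;
	}
	pthread_mutex_lock(&a->lock);
	pthread_mutex_lock(&b->lock);
}

/*
 * Move one task from src to dst, but only if src still has more than
 * one runnable task now that both locks are held. Returns true if a
 * task was moved.
 */
static bool try_pull_one(struct locked_rq *dst, struct locked_rq *src)
{
	bool moved = false;

	if (dst == src)
		return false;

	lock_pair(dst, src);
	if (src->nr_running > 1) {	/* unlocked check may be stale */
		src->nr_running--;
		dst->nr_running++;
		moved = true;
	}
	pthread_mutex_unlock(&src->lock);
	pthread_mutex_unlock(&dst->lock);
	return moved;
}
```

A re-check like this only covers case 1. It does nothing for case 2, where a queue was filtered out earlier and is never locked at all.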

I don't think that this patch makes case 1 any worse than it already is but case 2 could cause potential moves to be missed that otherwise wouldn't be and I assume this is the reason why there is no similar code in the original. Whether the increased probability of choosing queues with only one runnable task changes this reasoning is up for debate.

Signed-off-by: Peter Williams <pwil3058@bigpond.com.au>

Peter
--
Peter Williams                                   pwil3058@bigpond.net.au

"Learning, n. The kind of ignorance distinguishing the studious."
 -- Ambrose Bierce

Index: MM-2.6.X/kernel/sched.c
===================================================================
--- MM-2.6.X.orig/kernel/sched.c	2006-01-12 10:44:50.000000000 +1100
+++ MM-2.6.X/kernel/sched.c	2006-01-12 10:47:01.000000000 +1100
@@ -2052,6 +2052,7 @@ find_busiest_group(struct sched_domain *
 		unsigned long load;
 		int local_group;
 		int i;
+		unsigned int eligible_qs = 0;
 
 		local_group = cpu_isset(this_cpu, group->cpumask);
 
@@ -2065,8 +2066,11 @@ find_busiest_group(struct sched_domain *
 			/* Bias balancing toward cpus of our domain */
 			if (local_group)
 				load = target_load(i, load_idx);
-			else
+			else {
 				load = source_load(i, load_idx);
+				if (cpu_rq(i)->nr_running > 1)
+					++eligible_qs;
+			}
 
 			avg_load += load;
 		}
@@ -2080,7 +2084,7 @@ find_busiest_group(struct sched_domain *
 		if (local_group) {
 			this_load = avg_load;
 			this = group;
-		} else if (avg_load > max_load) {
+		} else if (avg_load > max_load && eligible_qs) {
 			max_load = avg_load;
 			busiest = group;
 		}
@@ -2181,8 +2185,12 @@ static runqueue_t *find_busiest_queue(st
 		load = source_load(i, 0);
 
 		if (load > max_load) {
-			max_load = load;
-			busiest = cpu_rq(i);
+			runqueue_t *tmprq = cpu_rq(i);
+
+			if (tmprq->nr_running > 1) {
+				max_load = load;
+				busiest = tmprq;
+			}
 		}
 	}
