Re: [patch] smpnice: don't consider sched groups which are lightly loaded for balancing

Siddha, Suresh B wrote:
> On Thu, Apr 20, 2006 at 03:19:52PM +1000, Peter Williams wrote:
>>> This patch doesn't fix this issue for example:
>>> 4-way simple MP system. P0 containing two high priority tasks, P1 containing
>>> one high priority and two normal priority tasks, one high priority task
>>> each on P2, P3. Current load balance doesn't detect/fix the
>>> imbalance by moving one of the normal priority tasks running on P1 to P2 or P3.
>>
>> Is this always the case or just a possibility? Please describe the hole it
>> slips through (and please do that every time you provide a scenario).
>
> I thought a scenario was enough to show the hole :) Anyhow, I brought this
> issue up before, too:
>
> http://www.ussg.iu.edu/hypermail/linux/kernel/0604.0/0517.html
>
> Load balance on P2 or P3 will always show P0 as max load, but it will not
> be able to move any load from P0, as imbalance will always be
> < busiest_load_per_task, max_load - this_load will be
> < imbn (2 in this case) * busiest_load_per_task, and pwr_move will be
> <= pwr_now.

This will depend on how high the priority of the high priority tasks is
relative to normal tasks.  E.g. it's quite possible to have two high
priority tasks whose combined load weight is less than that of two
normal tasks plus a high priority task.

> Basically sched groups with the highest priority tasks can mask the
> imbalance between the other sched groups within the same domain.

Sometimes.
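
To put rough numbers on that (the weights below are invented purely for
illustration and are not the actual smpnice values), here is a quick
userspace sketch of the scenario above:

/* Not kernel code: a toy calculation of the per-CPU weighted loads in
 * Suresh's scenario.  "hi" and "lo" are invented load weights for a
 * high priority and a normal task respectively. */
#include <stdio.h>

static void scenario(double hi, double lo)
{
	double p0 = 2 * hi;		/* two high priority tasks */
	double p1 = hi + 2 * lo;	/* one high + two normal tasks */
	double p2 = hi;			/* one high priority task (P3 is the same) */

	printf("hi=%.1f lo=%.1f: P0=%.1f P1=%.1f P2=P3=%.1f -> busiest looks like %s\n",
	       hi, lo, p0, p1, p2, p0 > p1 ? "P0 (P1 is masked)" : "P1");
}

int main(void)
{
	scenario(3.0, 1.0);	/* high priority weight well above normal */
	scenario(1.4, 1.0);	/* two high priority tasks outweighed by P1's three */
	return 0;
}

With the first set of weights P0 always looks like the busiest group even
though nothing can usefully be moved off it, which is the hole described
above; with the second set P1 is the busiest and a normal task can be
pulled to P2 or P3.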

I don't think that this stable state is so bad that anything special
needs to be done, especially as high priority tasks tend to use the CPU
only in short bursts, which means the state probably won't persist for
very long.

To paraphrase Ingo (from another thread), load balancing is a
probabilistic exercise.  For a start, achieving a deterministically
optimal distribution would require solving an NP-hard problem, and by
the time you had determined the correct distribution (which could take a
very long time) the "state" information on which the determination was
based would have changed, possibly a lot.  The latter is true anyway
(though probably without the "a lot"), as find_busiest_group() and
find_busiest_queue() are called without locks, meaning that the state on
which their results are based may change before move_tasks() is called.

I think this justifies saying that this scenario probably doesn't matter 
and, therefore, fixing it isn't urgent.
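
For reference, the unlocked sequence mentioned above is roughly as
follows (a userspace mock, not the real kernel/sched.c; the type, field
name and load values are stand-ins):

/* Userspace mock of the ordering, not real scheduler code: the "busiest"
 * decision is made from an unlocked read of the loads, which may be
 * stale by the time any tasks are actually moved. */
#include <stdio.h>

struct rq { unsigned long raw_weighted_load; };

static struct rq cpu_rq[4] = { { 6 }, { 5 }, { 3 }, { 3 } };

static struct rq *mock_find_busiest_queue(void)
{
	struct rq *busiest = &cpu_rq[0];

	for (int i = 1; i < 4; i++)
		if (cpu_rq[i].raw_weighted_load > busiest->raw_weighted_load)
			busiest = &cpu_rq[i];
	return busiest;		/* no lock held: this is only a snapshot */
}

int main(void)
{
	struct rq *busiest = mock_find_busiest_queue();
	unsigned long seen = busiest->raw_weighted_load;

	/* before the runqueue locks are taken and tasks are moved, the load
	 * the decision was based on can change, e.g. a high priority task
	 * on that CPU goes back to sleep */
	cpu_rq[0].raw_weighted_load = 1;

	printf("decision used load %lu, load at move time is %lu\n",
	       seen, busiest->raw_weighted_load);
	return 0;
}

Which is just to say that the balancer is always working from slightly
stale information anyway.
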
BTW I agree with your earlier statements that the modification to 
move_tasks() to circumvent the skip mechanism in some circumstances 
needs to be refined so that it doesn't move the highest priority task of 
the busiest queue.  I'll be submitting a patch later today.
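
For what it's worth, the rule I have in mind looks something like this
(a hypothetical userspace sketch of the intent, not the actual patch;
can_pull(), best_prio and rem_load_move are invented names):

#include <stdbool.h>
#include <stdio.h>

/* Hypothetical sketch only: may move_tasks() pull task p from the
 * busiest runqueue?  Lower prio value means higher priority. */
struct task { int prio; int load_weight; };

static bool can_pull(const struct task *p, int best_prio, int rem_load_move)
{
	if (p->load_weight <= rem_load_move)
		return true;	/* light enough for the remaining imbalance */

	/* the "don't skip" escape hatch may pull an over-weight task, but
	 * it should never take the busiest queue's highest priority task */
	return p->prio != best_prio;
}

int main(void)
{
	struct task top = { .prio = 100, .load_weight = 9 };	/* highest prio */
	struct task other = { .prio = 120, .load_weight = 9 };

	printf("top: %d, other: %d\n",
	       can_pull(&top, 100, 3), can_pull(&other, 100, 3));
	return 0;
}

I.e. the escape hatch still gets to do its job, it just can't strip the
busiest CPU of its top priority task.
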
I think that the next thing that needs to be addressed after that is a 
modification to try_to_wake_up() to improve the distribution of high 
priority tasks across CPUs.  I think that just sticking them on any CPU 
and waiting for the load balancing code to kick in and move them 
unnecessarily increases their latency.
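
Something along these lines, purely as a sketch of the idea (the names
and loads are invented; this is not a real try_to_wake_up() change):
pick the least loaded allowed CPU at wake-up time rather than leaving
the task where it last ran.

#include <stdio.h>

#define NR_CPUS 4

/* invented per-CPU weighted loads, just for the example */
static unsigned long weighted_cpuload[NR_CPUS] = { 6, 5, 3, 3 };

/* Hypothetical: at wake-up, choose the allowed CPU with the lowest
 * weighted load instead of waiting for load balancing to move the task. */
static int wake_idlest_cpu(unsigned long allowed_mask)
{
	int best = -1;

	for (int cpu = 0; cpu < NR_CPUS; cpu++) {
		if (!(allowed_mask & (1UL << cpu)))
			continue;	/* respect the task's cpu affinity */
		if (best < 0 || weighted_cpuload[cpu] < weighted_cpuload[best])
			best = cpu;
	}
	return best;
}

int main(void)
{
	/* a high priority task allowed on all four CPUs would wake on CPU 2 */
	printf("wake on cpu %d\n", wake_idlest_cpu(0xfUL));
	return 0;
}
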
Peter
--
Peter Williams                                   pwil3058@bigpond.net.au

"Learning, n. The kind of ignorance distinguishing the studious."
 -- Ambrose Bierce