Re: RT task scheduling — Linux Kernel

Ingo Molnar wrote:

* Darren Hart <darren@dvhart.com> wrote:
My last mail specifically addresses preempt-rt, but I'd like to knowpeople's thoughts regarding this issue in the mainline kernel. Pleasesee my previous post "realtime-preempt scheduling - rt_overloadbehavior" for a testcase that produces unpredictable schedulingresults.
the rt_overload feature i intend to push upstream-wards too, i justdidnt separate it out of -rt yet.
"RT overload scheduling" is a totally orthogonal mechanism to the SMPload-balancer (and this includes smpnice too) that is more or lessequivalent to having a 'global runqueue' for real-time tasks, withoutthe SMP overhead associated with that. If there is no "RT overload" [thecommon case even on Linux systems that _do_ make use of RT tasksoccasionally], the new mechanism is totally inactive and there's nooverhead. But once there are more RT tasks than CPUs, the scheduler willdo "global" decisions for what RT tasks to run on which CPU. To put evenless overhead on the mainstream kernel, i plan to introduce a newSCHED_FIFO_GLOBAL scheduling policy to trigger this behavior. [it doesntmake much sense to extend SCHED_RR in that direction.]
my gut feeling is that it would be wrong to integrate this feature intosmpnice: SCHED_FIFO is about determinism, and smpnice is a fundamentallystatistical approach. Also, smpnice doesnt have to try as hard to pickthe right task as rt_overload does, so there would be constant'friction' between "overhead" optimizations (dont be over-eager) and"latency" optimizations (dont be _under_-eager). So i'm quite sure wewant this feature separate. [nevertheless i'd happy to be proven wrongvia some good and working smpnice based solution]

I was thinking about this over night and came to similar conclusions.I.e. for RT tasks it's really a problem of selecting the right CPU atwake up time rather than a general load balancing problem. The solutionthat I thought of was different (though) and involved modifyingwake_idle() so that when the woken task was high priority as well aslooking for idle CPUs it looked for the one with the lowest prioritycurrent task and if it couldn't find an idle CPU it returned the onewith the lowest priority current task. The aim was to maximize theprobability that the newly woken task went straight onto a CPU (eitherby finding an idle one or preemption).

Although aimed at this specific problem, this solution would also helpsmpnice to attain equal "average load per task" values for groups/queueswhich I think is a desirable secondary aim to equatable distribution ofweighted load. If both of these aims are met I think a natural outcomewould be that the highest priority tasks are well distributed among theCPUs (but, as you imply, this would be a statistical trend rather thanan deterministic).

In summary, I think that smpnice can be modified in ways that will helpwith this problem but if you need determinism then special measures areprobably necessary.

Peter
---
Peter Williams                                   pwil3058@bigpond.net.au

"Learning, n. The kind of ignorance distinguishing the studious."
 -- Ambrose Bierce
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

References:
- RT task scheduling
  - From: Darren Hart <darren@dvhart.com>
- Re: RT task scheduling
  - From: Ingo Molnar <mingo@elte.hu>

Prev by Date: Re: RT task scheduling
Next by Date: Re: problem building UML kernel with 2.6.16.1 -- dies when linking vmlinux
Previous by thread: Re: RT task scheduling
Next by thread: Re: RT task scheduling
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]