OK, I found this bug on an older version of Ingo's RT kernel with my own
customizations. This is a very hard to get race condition but my logging
traced it pretty good and this looks like it may also be a bug for both
Ingo's RT kernel and the vanilla kernel. This was on an SMP machine.
Here's the race (since this was initiated with XFree86, I'll use it as
the userland process that started this):
XFree86: calls sys_call
-> sys_setitimer
-> do_setitimer
(grabs tsk->sighand->siglock)
-> del_timer_sync
which has the following code:
for_each_online_cpu(i) {
base = &per_cpu(tvec_bases, i);
if (base->running_timer == timer) {
while (base->running_timer == timer) {
cpu_relax();
preempt_check_resched();
}
break;
}
}
If the timer hasn't gone off yet on another cpu, it will spin until it
is finished. Now here's the problem:
ksoftirqd: calls do_softirq -> ... -> run_timer_softirq
-> __run_timers
-> it_real_fn
-> send_group_sig_info
-> group_send_sig_info
(grabs p->sighand->siglock)
Now, since the ksoftirqd is what changes running_timer, we have a
deadlock!
What would be the harm in doing something like:
--- linux-2.6.12-rc6/kernel/itimer.c.orig 2005-06-15 12:14:13.000000000 -0400
+++ linux-2.6.12-rc6/kernel/itimer.c 2005-06-15 12:18:31.000000000 -0400
@@ -153,11 +153,15 @@
switch (which) {
case ITIMER_REAL:
+ try_again:
spin_lock_irq(&tsk->sighand->siglock);
interval = tsk->signal->it_real_incr;
val = it_real_value(tsk->signal);
- if (val)
+ if (val) {
+ spin_unlock_irq(&tsk->sighand->siglock);
del_timer_sync(&tsk->signal->real_timer);
+ goto try_again;
+ }
tsk->signal->it_real_incr =
timeval_to_jiffies(&value->it_interval);
it_real_arm(tsk, timeval_to_jiffies(&value->it_value));
-- Steve
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]