On Wed, 2005-12-14 at 09:43 +0100, Ingo Molnar wrote:
> * Ingo Molnar <[email protected]> wrote:
>
> > hm, in that case it should be base->running that protects against this
> > case. Did you run an UP PREEMPT_RT kernel? In that case could you
> > check whether the fix below resolves the crash too?
>
> try the patch below instead. (Thomas pointed out that the correct
> dependency in this case is PREEMPT_SOFTIRQS)
>
> Ingo
>
> Index: linux/kernel/hrtimer.c
> ===================================================================
> --- linux.orig/kernel/hrtimer.c
> +++ linux/kernel/hrtimer.c
> @@ -118,7 +118,7 @@ static DEFINE_PER_CPU(struct hrtimer_bas
> * Functions and macros which are different for UP/SMP systems are kept in a
> * single place
> */
> -#ifdef CONFIG_SMP
> +#if defined(CONFIG_SMP) || defined(CONFIG_PREEMPT_SOFTIRQS)
>
> #define set_curr_timer(b, t) do { (b)->curr_timer = (t); } while (0)
>
OK Thomas and Ingo, I hit the bug again! And this time I found the other
race. Again let me explain where in the code I have a problem, and I
feel really uncomfortable with the "implied logic" to protect the race.
In __remove_hrtimer we have:
#ifdef CONFIG_HIGH_RES_TIMERS
if (timer->state == HRTIMER_PENDING_CALLBACK) {
list_del(&timer->list);
return;
}
#endif
And in run_hrtimer_hres_queue (and something similar with
run_hrtimer_queue) we have:
set_curr_timer(base, timer);
list_del(&timer->list);
spin_unlock_irq(&base->lock);
__remove_hrtimer is protected with the base-lock, so in this case we
have the logic of set_curr_timer to protect against
hrtimer_try_to_cancel which has:
if (base->curr_timer != timer)
ret = remove_hrtimer(timer, base);
And this _is_ protected, but I just discovered that this does _not_
protect against hrtimer_start!
In hrtimer_start we have:
base = lock_hrtimer_base(timer, &flags);
/* Remove an active timer from the queue: */
remove_hrtimer(timer, base);
Which can be called after that spin_unlock_irq is done by the
run_hrtimer_queue, and we will hit a bug (as I did). This is not an
easy race to hit. Here's an example of a race for this problem:
In posix-timers.c: commen_timer_set:
If we get preempted between hrtimer_try_to_cancel and
hrtimer_restart.
Then a new thread adds the timer back (by a threaded program).
Now the timer goes off for the softirq, and the softirq runs
and just after it removes the timer and unlocks the base spinlock, it
gets preempted by some high prio task that blocks on the original
thread and boosts it's priority higher than the softirq.
Now the first thread runs and then calls hrtimer_start, and the timer
is now in the state PENDING_CALLBACK but is _not_ on the list. So
this will error when it tries to delete the timer again.
This is a really hard race to hit. But it is possible, and I hit it with
my kernel because I have some crazy dynamic scheduling going on.
Now the question is what's the right solution? Can hrtimer_start
schedule? Probably not, maybe we should add something to check this and
have hrtimer_start return -1 if it is running, and let who ever called
it figure out what to do? Maybe have a hrtimer_cancel_start atomic
operation? As well as a hrtimer_try_to_cancel_start?
-- Steve
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]