Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race

Oleg Nesterov a écrit :

On 12/03, Eric Dumazet wrote:

Oleg Nesterov a ?crit :

Yes, but how is it related to RCU ?
I mean, rcu_do_batch() is just a loop like others in kernel.
The loop itself is not buggy, but can call a buggy function, you are right.


	int start_me_again;

	struct rcu_head rcu_head;

	void rcu_func(struct rcu_head *rcu)
	{
		start_me_again = 1;
	}

	// could be called on arbitrary CPU
	void check_start_me_again(void)
	{
		static spinlock_t lock;

		spin_lock(lock);
		if (start_me_again) {
			start_me_again = 0;
			call_rcu(&rcu_head, rcu_func);
		}
		spin_unlock(lock);
	}

I'd say this code is not buggy.


Are you sure ? Can you prove it ? :)

I do think your rcu_func() misses some sync primitive, *after* start_me_again=1;
You seem to rely on some undocumented side effect.
Adding smp_rmb() before calling rcu_func() wont help.


In case it was not clear. I do not claim we need this patch (I don't know).
And yes, I very much doubt we can hit this problem in practice (even if I am
right).

What I don't agree with is that it is callback which should take care of this
problem.

A smp_rmb() wont avoid all possible bugs...


For example?

A smp_rmb() wont avoid stores pending on this CPU to be committed to memoryafter another cpu takes the object for itself. Those stores could overwritestores done by the other cpu as well.

So in theory you could design a buggy callback function even after your patchapplied.

Any function that can transfer an object from CPU A scope to CPU B scope musttake care of memory barrier by itself. The caller *cannot* possibly do thejob, especially if it used an indirect call. However, in some cases it ispossible some clever algos are doing the reverse, ie doing the memory barrierin the callers.


Kernel is full of such constructs :

for (ptr = head; ptr != NULL ; ptr = next) {
	next = ptr->next;
	some_subsys_delete(ptr);
}

And we dont need to add smp_rmb() before the call to some_subsys_delete(), itwould be a nightmare, and would slow down modern cpus.

When the callback function dont need a memory barrier, why should we force ageneric one in the caller ?

AFAIK most kfree() calls dont need a barrier, since slab just queue thepointer into the cpu's array_cache if there is one available slot.



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
  - From: Oleg Nesterov <[email protected]>

References:
- PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
  - From: Oleg Nesterov <[email protected]>
- Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
  - From: Eric Dumazet <[email protected]>
- Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
  - From: Oleg Nesterov <[email protected]>
- Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
  - From: Eric Dumazet <[email protected]>
- Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
  - From: Oleg Nesterov <[email protected]>

Prev by Date: Re: [LinuxBIOS] #57: libusb host program for PLX NET20DC debug device
Next by Date: Re: Device naming randomness (udev?)
Previous by thread: Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
Next by thread: Re: PATCH? rcu_do_batch: fix a pure theoretical memory ordering race
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]