Re: [PATCH] make atomic_t volatile on all architectures

Herbert Xu wrote:

Chris Snook <csnook@redhat.com> wrote:

Some architectures currently do not declare the contents of an atomic_t to be
volatile.  This causes confusion since atomic_read() might not actually read
anything if an optimizing compiler re-uses a value stored in a register, which
can break code that loops until something external changes the value of an
atomic_t.  Avoiding such bugs requires using barrier(), which causes re-loads

Such loops should always use something like cpu_relax() which comes
with a barrier.

If they're not doing anything, sure. Plenty of loops actually do some sort ofreal work while waiting for their halt condition, possibly even work which isnecessary for their halt condition to occur, and you definitely don't want to bedoing cpu_relax() in this case. On register-rich architectures you can do quitea lot of work without needing to reuse the register containing the result of theatomic_read(). Those are precisely the architectures where barrier() hurts themost.

of all registers used in the loop, thus hurting performance instead of helping
it, particularly on architectures where it's unnecessary.  Since we generally

Do you have an example of such a loop where performance is hurt by this?

Not handy. Perhaps more interesting are cases where we access the same atomic_ttwice in a hot path. If we can remove some of those barriers, those hot pathswill get faster.

Performance was only part of the motivation. The IPVS bug was an example of howatomic_t is assumed (not always correctly) to work, and other recent discussionson this list have made it clear that most people assume atomic_read() actuallyreads something every time, and don't even think to consult the documentationuntil they find out the hard way that it does not. I'm not saying we shouldencourage lazy programming, but in this case the assumption is reasonablebecause that's how people actually use atomic_t, and making this behavioruniform across all architectures makes it more convenient to do things the rightway, which we should encourage.

The IPVS code that led to this patch was simply broken and has been
fixed to use cpu_relax().

I agree, busy-waiting should be done with cpu_relax(), if at all. I'm moreconcerned about cases that are not busy-waiting, but could still get compiledwith the same optimization.

	-- Chris
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: [PATCH] make atomic_t volatile on all architectures
  - From: Herbert Xu <herbert.xu@redhat.com>

References:
- Re: [PATCH] make atomic_t volatile on all architectures
  - From: Herbert Xu <herbert@gondor.apana.org.au>

Prev by Date: Re: [PATCH 2/3] UIO: Documentation
Next by Date: Re: early boot lockup with 2.6.23-rc1
Previous by thread: Re: [PATCH] make atomic_t volatile on all architectures
Next by thread: Re: [PATCH] make atomic_t volatile on all architectures
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]