Re: [rfc][patch] remove racy sync_page?

On Wed, 31 May 2006, Nick Piggin wrote:
> 
> The requests can only get merged if contiguous requests from the upper
> layers come down, right?

It has nothing to do with merging. It has to do with IO patterns.

Seeking.

Seeking is damn expensive - much more so than command issue. People forget 
that sometimes.

If you can sort the requests so that you don't have to seek back and 
forth, that's often a HUGE win. 

Yes, the requests will still be small, and yes, the IO might happen in 4kB 
chunks, but it happens a lot faster if you do it in a good elevator 
ordering and if you hit the track cache than if you seek back and forth.

And part of that is that you have to submit multiple requests when you 
start, and allow the elevator to work on it.

Now, of course, if you have tons of reqeusts already in flight, you don't 
care (you already have lots of work for the elevator), but at least in 
desktop loads the "starting from idle" case is pretty common. Getting just 
a few requests to start up with is good.

(Yes, tagged queueing makes it less of an issue, of course. I know, I 
know. But I _think_ a lot of disks will start seeking for an incoming 
command the moment they see it, just to get the best latency, rather than 
wait a millisecond or two to see if they get another request. So even 
with tagged queuing, the elevator can help, _especially_ for the initial 
request).

> Why would plugging help if the requests can't get merged, though?

Why do you think we _have_ an elevator in the first place?

And just how well do you think it works if you submit one entry at a time 
(regardless of how _big_ it is) and start IO on it immediately? Vs trying 
to get several IO's out there, so that we can say "do this one first".

Sometimes I think harddisks have gotten too quiet - people no longer hear 
it when access patters are horrible. But the big issue with plugging was 
only partially about request coalescing, and was always about trying to 
get the _order_ right when you start to actually submit the requests to 
the hardware.

And yes, I realize that modern disks do remapping, and that we will never 
do a "perfect" job. But it's still true that the block number has _some_ 
(fairly big, in fact) relationship to the actual disk layout, and that 
avoiding seeking is a big deal.

Rotational latency is often an even bigger issue, of course, but we can't 
do much about that. We really can't estimate where the head is, like 
people used to try to do three decades ago. _That_ time is long past, but 
we can try to avoid long seeks, and it's still true that you can get 
blocks that are _close_ faster (if only because they may end up being on 
the same cylinder and not need a seek).

Even better than "same cylinder" is sometimes "same cache block" - disks 
often do track caching, and they aren't necessarily all that smart about 
it, so even if you don't read one huge contiguous block, it's much better 
to read an area _close_ to another than seek back and forth, because 
you're more likely to hit the disks own track cache.

And I know, disks aren't as sensitive to long seeks as they used to be (a 
short seek is almost as expensive as a long one, and a lot of it is the 
head settling time), but as another example - I think for CD-ROMs you can 
still have things like the motor spinning faster or slower depending on 
where the read head is, for example, meaning that short seeks are cheaper 
than long ones.

(Maybe constant angular velocity is what people use, though. I dunno).

		Linus
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Helge Hafting <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Mark Lord <[email protected]>

References:
- [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Andrew Morton <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Andrew Morton <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Linus Torvalds <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Linus Torvalds <[email protected]>
- Re: [rfc][patch] remove racy sync_page?
  - From: Nick Piggin <[email protected]>

Prev by Date: Re: memcpy_toio on i386 using byte writes even when n%2==0
Next by Date: Re: [PATCHSET] block: fix PIO cache coherency bug
Previous by thread: Re: [rfc][patch] remove racy sync_page?
Next by thread: Re: [rfc][patch] remove racy sync_page?
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]