Re: [rfc patch] optimize o_direct on block device

On Nov 30, 2006, at 10:16 PM, Chen, Kenneth W wrote:

Zach Brown wrote on Thursday, November 30, 2006 1:45 PM
At that time, a patch was written for raw device to demonstrate that
large performance head room is achievable (at ~20% speedup formicro-
benchmark and ~2% for db transaction processing benchmark) with a
tight I/O submission processing loop.
Where exactly does the benefit come from?  icache misses?  "atomic"
ops leading to pipeline flushes?
It benefit from shorter path length. It takes much shorter time toprocessone I/O request, both in the submit and completion path. I alwaysthink interms of how many instructions, or clock ticks does it take toconvert userrequest into bio, submit it and in the return path, to process thebio call
back function and do the appropriate io completion (sync or async).

Sure.

What I'm hoping for is an understanding of what exactly the path isdoing with those cycles. Do we have any more detailed measurementsthan, say, get_cycles() before and after the call?

Maybe it's time for me to have a good sit down with systemtap :)

- z
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

References:
- RE: [rfc patch] optimize o_direct on block device
  - From: "Chen, Kenneth W" <kenneth.w.chen@intel.com>

Prev by Date: Re: [GFS2] Change argument of gfs2_dinode_out [17/70]
Next by Date: Re: [GFS2] Change argument of gfs2_dinode_out [17/70]
Previous by thread: RE: [rfc patch] optimize o_direct on block device
Next by thread: man-pages-2.43 is released
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]