Re: 2.6.12-rc2-mm3 — Linux Kernel

Andrew Morton wrote:

Nick Piggin <[email protected]> wrote:

AS basically does its own TCQ strangulation, which IIRC involves things

> >  like completing all reads before issuing new writes, and completing all
> >  reads from one process before reads from another. As well as the
> >  fundamental way that waiting for a 'dependant read' throttles TCQ.

>> My (mpt-fusion-based) workstation is still really slow when there's a lot

> of writeout happening.  Just from a quick test:

>> > 2.6.12-rc2, as, tcq depth=2: 7.241 seconds

> > 2.6.12-rc2, 	as,	tcq depth=64:		12.172 seconds
> > 2.6.12-rc2+patch,as,	tcq depth=64:		7.199 seconds
> > 2.6.12-rc2, 	cfq2,	tcq depth=64:		much more than 5 minutes
> > 2.6.12-rc2, 	cfq3,	tcq depth=64:		much more than 5 minutes

>> 2.6.11-rc4-mm1, as, mpt-f 39.349 seconds>> That was really really slow but had a sudden burst of read I/O at the end

> which made the thing look better than it really is.  I wouldn't have a clue
> what tag depth it's using, and it's the only mpt-fusion based machine I
> have handy...

>Heh.


Well with my current lineup on the mpt-fusion driver and no
as-limit-queue-depth.patch that test takes 17 seconds.  With
as-limit-queue-depth.patch it's down to 10 seconds.  Which is pretty darn
good btw.  I assume from this:

scsi0 : ioc0: LSI53C1030, FwRev=01030600h, Ports=1, MaxQ=222, IRQ=25
scsi1 : ioc1: LSI53C1030, FwRev=01030600h, Ports=1, MaxQ=222, IRQ=26

that it's using a tag depth of 222.

	int			 req_depth;	/* Number of request frames */

I wonder if that's true...


One thing which changed is that this kernel now has the fixed-up mpt-fusion
chipset tuning.  That doubles the IO bandwidth, which would pretty well
account for that difference.  I'll wait and see how irritating things get
under writeout load.

Yes, we'll need to decide if we want to retain as-limit-queue-depth.patch
and toss out some of the older AS logic which was designed to address the
TCQ problem.

Steve, could you help to identify a not-too-hard-to-set-up workload at
which AS was particularly poor?  Thanks.

AS with XFS was pretty bad on a couple of workloads. random 4k readsand "metadata" which was 40%create, 40%append, 20%delete multithreadedworkloads. I'll try to run a few tests with and without this patch onmy hardware setup over the next day or so and see how it does. I havenot really looked at AS performance since about 2.6.6/7. Our databaseteam recently re-checked IO Scheduler performance, and on the Ad HocDecision Support Workload we still saw a 15-20% lower throughput onRHEL4 with AS compared to other schedulers which were all within acouple of %.


Steve
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

References:
- 2.6.12-rc2-mm3
  - From: Andrew Morton <[email protected]>
- Re: 2.6.12-rc2-mm3
  - From: Andrew Morton <[email protected]>
- Re: 2.6.12-rc2-mm3
  - From: Nick Piggin <[email protected]>
- Re: 2.6.12-rc2-mm3
  - From: Andrew Morton <[email protected]>
- Re: 2.6.12-rc2-mm3
  - From: Nick Piggin <[email protected]>
- Re: 2.6.12-rc2-mm3
  - From: Andrew Morton <[email protected]>

Prev by Date: [PATCH] Fix reproducible SMP crash in security/keys/key.c
Next by Date: [patch 136/198] Direct IO async short read fix
Previous by thread: Re: 2.6.12-rc2-mm3
Next by thread: Re: 2.6.12-rc2-mm3
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]