Re: Is BIO_RW_FAILFAST really usable?

Neil Brown wrote:

I've been looking at using BIO_RW_FAILFAST in md/raid to improve
handling of some error cases.

This is particularly significant for the DASD driver (s390 specific).
I believe it uses optic fibre to connect to the drives.  When one of
these paths is unplugged, IO requests will block until an operator
runs a command to reset the card (or until it is plugged back in).
The only way to avoid this blockage is to use BIO_RW_FAILFAST.  So
we really need BIO_RW_FAILFAST for a reliable RAID1 configuration on
DASD drives.

However, I just tested BIO_RW_FAILFAST on my SATA drives: controller

02:06.0 RAID bus controller: Silicon Image, Inc. SiI 3114 [SATALink/SATARaid] Serial ATA Controller (rev 02)

(not using the card's minimal RAID functionality) and requests fail
immediately and always with e.g.

sd 2:0:0:0: [sdc] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
end_request: I/O error, dev sdc, sector 2048

So fail fast obviously isn't generally usable.

What is the answer here?  Is the Silicon Image driver doing the wrong
thing, or is DASD doing the wrong thing, or is BIO_RW_FAILFAST
under-specified and we really need multiple flags or what?


It's a hard thing to implement, in general, for scalability reasons.

To make it work, you need to examine each driver's error handling to
figure out what "fail fast" really means.

Most storage drivers are written to try as hard as possible to complete
a request, where "try as hard as possible" can often mean internal
retries while trying various multi-path configurations and hardware mode
changes.  You might be catching SATA in the middle of error handling,
for example.

So each driver really has a /slightly different/ version of "try to
complete this request", which has the obvious effects on BIO_RW_FAILFAST.
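
To make that concrete, here is a rough userspace sketch (not kernel
code) of one policy a driver might adopt: retry internally unless the
caller asked for fail-fast.  Every name in it (toy_request, toy_device,
toy_submit, MAX_RETRIES) is made up for illustration, and the "failfast"
field merely stands in for BIO_RW_FAILFAST.  Real drivers differ in
exactly the places this sketch glosses over.

```c
/* Userspace illustration only; none of these names exist in the kernel. */
#include <stdbool.h>

#define MAX_RETRIES 3	/* assumed internal retry budget */

/* Hypothetical request; 'failfast' stands in for BIO_RW_FAILFAST. */
struct toy_request {
	bool failfast;
	int attempts;		/* filled in by toy_submit() */
};

/* Hypothetical device that fails its next 'failures_left' attempts. */
struct toy_device {
	int failures_left;
};

/* A single attempt at the hardware: 0 on success, -1 on error. */
static int toy_try_once(struct toy_device *dev)
{
	if (dev->failures_left > 0) {
		dev->failures_left--;
		return -1;
	}
	return 0;
}

/*
 * One possible driver policy: a fail-fast request gets exactly one
 * attempt, anything else gets up to MAX_RETRIES attempts.
 */
static int toy_submit(struct toy_device *dev, struct toy_request *rq)
{
	int limit = rq->failfast ? 1 : MAX_RETRIES;

	for (rq->attempts = 1; rq->attempts <= limit; rq->attempts++) {
		if (toy_try_once(dev) == 0)
			return 0;
	}
	rq->attempts = limit;	/* loop overshot by one on failure */
	return -1;
}
```

With a device that fails twice before recovering, the fail-fast request
errors out after one attempt while the normal request succeeds on the
third.  A driver whose link is only briefly down would therefore look
"broken" to a fail-fast caller even though ordinary I/O works.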

No clue about DASD, but in SATA's case I bet that a media or transfer
error could be returned to the system more rapidly, while we continue to
try to recover in the background.  libata doesn't have any direct
knowledge of fail-fast at this point, IIRC.

But overall it's a job where you must examine each driver, or set of
drivers :/


	Jeff

