Re: libata FUA revisited

Tejun Heo wrote:

On the NCQ side, I think it's pretty safe to assume that allcontrollers will handle it. Obviously I've verified it with sata_nv(at least that it doesn't blow up obviously), and the other two NCQdrivers we have, ahci and sata_sil24 just feed raw FIS data into thecontroller so there should be no issue with not supporting it.
FWIW, ICH6/7/8 ahci's clear PMP field when transmitting FIS. The reasonwhy I'm hesitant is because there is no way to tell whether the FUA bitgot honored or ignored. With extra opcode, it's okay because barrierexplicitly fails but if NCQ FUA is not supported, it will succeedsilently as normal write. Everything will be okay generally but thebarrier is done incorrectly and on a really bad day it will lead tojournal corruption.

Well, we should be able to determine that experimentally (at least onspecific controllers) with a little test program that just writes littlebits of data and fsyncs repeatedly (assuming that does in fact triggerFUAs currently..) If it runs faster than the drive could possibly berewriting the physical disk then obviously the FUA bit is not gettingthrough and/or not respected and we can blacklist FUA on that controller.

Also, the FUA bit in the NCQ commands is in the device register, so it'snot like the PMP fields where it's not used for anything else and so thecontroller messing with it wouldn't be otherwise noticed..

So, actually, I was thinking about *always* using the non-NCQ FUAopcode. As currently implemented, FUA request is always issued byitself, so NCQ doesn't make any difference there. So, I think it wouldbe better to turn on FUA on driver-by-driver basis whether thecontroller supports NCQ or not.

Unfortunately not all drives that support NCQ support the non-NCQ FUAcommands (my Seagates are like this).

There's definitely a potential advantage to FUA with NCQ - if you havenon-synchronous accesses going on concurrently with synchronous ones, ifyou have to use non-NCQ FUA or flush cache commands, you have to waitfor all the IOs of both types to drain out before you can issue theflush (since those can't be overlapped with the NCQ read/writes). And ifyou can only use flush cache, then you're forcing all the writes to beflushed including the non-synchronous ones you didn't care about.Whether or not the block layer currently exploits this I don't know, butit definitely could.

Well, I might be being too paranoid but silent FUA failure would bereally hard to diagnose if that ever happens (and I'm fairly certainthat it will on some firmwares).

Well, there are also probably drives that ignore flush cache commands orfail to do other things that they should. There's only so far we cango in coping if the firmware authors are being retarded. If any drive isbroken like that we should likely just blacklist NCQ on it entirely asobviously little thought or testing went into the implementation..


--
Robert Hancock      Saskatoon, SK, Canada
To email, remove "nospam" from [email protected]
Home Page: http://www.roberthancock.com/


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: libata FUA revisited
  - From: Tejun Heo <[email protected]>

References:
- Re: libata FUA revisited
  - From: Robert Hancock <[email protected]>
- Re: libata FUA revisited
  - From: Tejun Heo <[email protected]>

Prev by Date: Re: [RFC] [PATCH] more support for memory-less-node.
Next by Date: Re: User tools for March 11
Previous by thread: Re: libata FUA revisited
Next by thread: Re: libata FUA revisited
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]