Re: Linux does not care for data integrity

Helge Hafting wrote:

Bill Davidsen wrote:
Matthias Andree wrote:
On Sun, 29 May 2005, Greg Stark wrote:
Oracle, Sybase, Postgres, other databases have hard requirements. They
guarantee that when they acknowledge a transaction commit the datahas beenwritten to non-volatile media and will be recoverable even in theface of a
routine power loss.
They meet this requirement just fine on SCSI drives (where writecachinggenerally ships disabled) and on any OS where fsync issues a cacheflush. If
I don't know what facts "generally ships disabled" is based on, all of
the more recent SCSI drives (non SCA type though) I acquired came with
write cache enabled and some also with queue algorithm modifier setto 1.
Worse, if the disk flushes the data to disk out of order it's quite
likely the entire database will be corrupted on any simple power
outage. I'm not clear whether that's the case for any common drives.
It's a matter of enforcing write order. In how far such ordering
constraints are propagated by file systems, VFS layer, down to the
hardware, is the grand question.
The problem is that in many options required to make that happen inthe o/s, hardware, and application are going to kill performance. Andeven if you can control order of write, unless you can get write tofinal non-volatile media control you can get a sane database butstill lose transactions.
If there was a way for the o/s to know when a physical write was doneother than using flushes to force completion, then overallperformance could be higher, but individual transaction might havegreater latency. And the app could use fsync to force order of writeas needed. In many cases groups of writes can be done in any order aslong as they are all done before the next logical step takes place.
There is a workaround. Get an UPS just for the disks. It don't haveto be
big, just enough to keep the disks going long enough to commit their
caches after the rest of the machine died from a power loss. Such asmall
unit could possibly fit inside the cabinet, avoiding the trouble with
people stepping on the power cord.

With this in place, any write that makes it from the controller to the
disk is safely stored for all practical purposes.

Unfortunately even drives in a dual power tray with redundany power fromseparate UPS sources will occasionally have a power failure. Proved thatlast month, the power strip in the rack failed, dumped all the load onthe other leg, the surge tripped a breaker. Had an APC UPS in my officefail in a mode which dropped power, waited for the battery to tricklecharge to charge the battery a bit, then repeat. Looks to be losing halfof a full wave rectifier.

The point is that power failures WILL HAPPEN, even with good backups.The goal should be to prevent excessive and avoidable data damage whenit does.

Shameless plug: for office use I changed from APC to Belkin on all newunits, they have had Linux drivers for some time now, and I like tosupport those who support Linux.


--
bill davidsen <[email protected]>
 CTO TMR Associates, Inc
 Doing interesting things with small computers since 1979

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: Linux does not care for data integrity
  - From: [email protected] (Lennart Sorensen)

References:
- Re: Disk write cache (Was: Hyper-Threading Vulnerability)
  - From: Jeff Garzik <[email protected]>
- Re: Disk write cache
  - From: Kenichi Okuyama <[email protected]>
- Re: Disk write cache
  - From: Jeff Garzik <[email protected]>
- Linux does not care for data integrity (was: Disk write cache)
  - From: Matthias Andree <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Arjan van de Ven <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Matthias Andree <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Arjan van de Ven <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Matthias Andree <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Alan Cox <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Greg Stark <[email protected]>
- Re: Linux does not care for data integrity (was: Disk write cache)
  - From: Matthias Andree <[email protected]>
- Re: Linux does not care for data integrity
  - From: Bill Davidsen <[email protected]>
- Re: Linux does not care for data integrity
  - From: Helge Hafting <[email protected]>

Prev by Date: Re: DL10038D chip datasheet
Next by Date: Re: 2.6.12-rc5-mm1 Oops
Previous by thread: Re: Linux does not care for data integrity
Next by thread: Re: Linux does not care for data integrity
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]