All right... Now I'm really confused. There are, obviously, some individuals on this list who are a LOT more knowledgeable about the internal workings of flash, so I'm hoping for a clear(er) understanding of just WHAT is going on here. On Tue, 2005-05-17 at 16:43 -0400, Richard B. Johnson wrote: > On Tue, 17 May 2005 [email protected] wrote: > >> It can also respond to loosing power during write by getting it's state > >> so mixed up the whole card is dead (it identifies but all sectors fail > >> to read). > > Gee, that just happened to me! Well, actually, thanks to Linux's > > *insistence* on reading the partition table, I haven't managed to > > get I/O errors on anything bit sectors 0 through 7, but I am quite > > sure I wasn't writing those sectors when I pulled the plug: > > hda: read_intr: status=0x51 { DriveReady SeekComplete Error } > > hda: read_intr: error=0x10 { SectorIdNotFound }, LBAsect=6, sector=6 > > hda: read_intr: status=0x51 { DriveReady SeekComplete Error } > > hda: read_intr: error=0x10 { SectorIdNotFound }, LBAsect=6, sector=6 > > hda: read_intr: status=0x51 { DriveReady SeekComplete Error } > > hda: read_intr: error=0x10 { SectorIdNotFound }, LBAsect=6, sector=6 > > hda: read_intr: status=0x51 { DriveReady SeekComplete Error } > > hda: read_intr: error=0x10 { SectorIdNotFound }, LBAsect=6, sector=6 > > ide0: reset: success > > hda: read_intr: status=0x51 { DriveReady SeekComplete Error } > > hda: read_intr: error=0x10 { SectorIdNotFound }, LBAsect=6, sector=6 > > end_request: I/O error, dev 03:00 (hda), sector 6 > > unable to read partition table > [SNIPPED...] > You can "fix" this by writing all sectors. Although the data is lost, > the flash-RAM isn't. This can (read will) happen if you pull the > flash-RAM out of its socket with the power ON. I'm the original poster and someone in another message remarked about not having enough details on the damage to the card... So I just did some spot checking on the card for some details. Block checking with dd bs=512 if=/dev/sda gave me some indicators... Blocks 0-7 DOA, hard read errors, 0+0 records in. Blocks 8-31 would read 8 blocks at a time and then give me an error, but the next 8 blocks would read fine. So 24 consecutive blocks SEEMED to read but strangely. Blocks 32-39 DOA Blocks 40-71 would read 8 blocks at a time. Roughly 1/3 of the blocks seem to be dead in multiples of 8 blocks on 8 block boundries early on. No real pattern to which ones were dead and which ones would read 8 and then error. Once past block 512, huge blocks would be readable but eventually give me an error. Dead fields (0 records read) were always multiples of 8 512 byte blocks, 4KBytes falling on an 4K boundry. Reading with dd bs=4096 gave similar results for 4K blocks with skip count less than 64. Skip count 64 and greater gave me large swaths that were readable. No time did I see a partial record read (indicating a failure off a 4K boundry). Basically, that re-enforced my option that it was block wear-out from uneven wear leveling when copying that 700 Meg file and beating the bejesus out of the FAT tables. Front part of the flash was heavily damaged with sporatic damage deeper in the flash. Now, I saw this message... Well... I didn't remove the key when it was being written to but, what the hell... The key is dead, I've got nothing to loose, and it might yield some more information as to the nature of the failure. So I copied zeros to the entire key with "dd if=/dev/zero of=/dev/sda bs=16M". I'll be a son of a bitch but that key recovered. I've partitioned it and read the whole damn thing back end to end and it's perfect. Ok... So, WTF? It wasn't (AFAICT) due to loss of power or pulling it while writing. What was this failure and why did overwriting it fix it? Did the stick just flaw out all the burned out blocks or did it really recover the ECC errors? I'm really baffled now. BTW... I've killed the "sync" option in hal (you just have to create an XML policy file in the right location to specify that option as false in all cases) and have been beating the crap out of several other keys without a single failure. I'm going to try this key again... Thank you very VERY much for this hint to recover the damaged key. That's a trick I've used for damaged IDE & SCSI hard drives (recover head drift and soft errors) and I never thought to try it with a flash key. I'll be damned if I understand just what has happened at this point but I really appreciate that trick. > Cheers, > Dick Johnson > Penguin : Linux version 2.6.11.9 on an i686 machine (5537.79 BogoMips). > Notice : All mail here is now cached for review by Dictator Bush. > 98.36% of all statistics are fiction. Regards, Mike -- Michael H. Warfield | (770) 985-6132 | [email protected] /\/\|=mhw=|\/\/ | (678) 463-0932 | http://www.wittsend.com/mhw/ NIC whois: MHW9 | An optimist believes we live in the best of all PGP Key: 0xDF1DD471 | possible worlds. A pessimist is sure of it!
Attachment:
signature.asc
Description: This is a digitally signed message part
- Follow-Ups:
- Re: Sync option destroys flash! Now I'm confused...
- From: "Richard B. Johnson" <[email protected]>
- Re: Sync option destroys flash! Now I'm confused...
- References:
- Re: Sync option destroys flash!
- From: [email protected]
- Re: Sync option destroys flash!
- From: "Richard B. Johnson" <[email protected]>
- Re: Sync option destroys flash!
- Prev by Date: Re: [Lse-tech] Re: [RFT PATCH] Dynamic sched domains (v0.6)
- Next by Date: Re: NUMA aware slab allocator V3
- Previous by thread: Re: Sync option destroys flash!
- Next by thread: Re: Sync option destroys flash! Now I'm confused...
- Index(es):