Re: FYI: RAID5 unusably unstable through 2.6.14


 



Martin Drab wrote:
On Thu, 2 Feb 2006, Bill Davidsen wrote:

Just to state clearly in the first place: I've already solved the problem by
low-level formatting the entire disk that this inconsistent array was part of.
So now everything is back to normal, and unfortunately I will not be able
to do any more tests on the device in its non-working state.
I mentioned this problem here just to let you know that there is such
problematic (and IMO flawed) Linux behaviour in these circumstances, and
that perhaps it may make you think of such situations when doing further
improvements and development in the design of the block device layer (or
wherever the problem may actually come from).
  
It looks like the problem is in that controller card and its driver.
Was this a proprietary, closed-source driver?  Linux is perfectly happy
to access the rest of a disk when some parts of it have gone bad;
people do this all the time.  It looks like your RAID controller decided
to take the entire virtual disk that it presents to the kernel offline
because it detected errors.
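
For illustration, a minimal user-space sketch of that behaviour (the
device path and chunk size here are placeholders, not taken from the
report): read the device a chunk at a time and simply skip any region
that fails with EIO.  With a sane driver only the bad sectors error out
and everything else stays readable; your case differed in that the
whole virtual disk went away.

/* Sketch: read a block device 4 KiB at a time, skipping regions that
 * fail with EIO.  "/dev/sda" and the chunk size are placeholders. */
#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

#define BLK 4096

int main(void)
{
        char buf[BLK];
        off_t off = 0;
        long bad = 0;
        int fd = open("/dev/sda", O_RDONLY);

        if (fd < 0) { perror("open"); return 1; }

        for (;;) {
                ssize_t n = pread(fd, buf, BLK, off);
                if (n > 0) {                    /* good data, keep going */
                        off += n;
                } else if (n == 0) {            /* end of device */
                        break;
                } else if (errno == EIO) {      /* unreadable block: skip it */
                        bad++;
                        off += BLK;
                } else {
                        perror("pread");
                        break;
                }
        }
        printf("%ld unreadable blocks of %d bytes\n", bad, BLK);
        close(fd);
        return 0;
}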
<snip>
The 0,0,0 device is /dev/sda. And even though this output is from now,
after low-level formatting the previously inconsistent disk, the
indications back then were just the same: every indication behaved as
usual, and both arrays were properly identified. But whenever I accessed
the inconsistent one, i.e. /dev/sda, in any way (even just reading raw
bytes; this has nothing to do with any filesystem), the error messages
mentioned above appeared. I'm not sure what exactly was generating them,
but I've CC'd Mark Salyzyn; maybe he can explain more about it.
  
How did you low level format the drive?  These days disk manufacturers
ship drives already low-level formatted, and end users cannot perform a
low-level format.  The last time I remember being able to low-level
format a drive was with MFM and RLL drives, prior to IDE.  My guess is
that what you actually did was simply write zeros to every sector on the
disk, which replaced the corrupted data in the affected sectors with good
data, effectively repairing them.  Usually a drive will fail reads of a
bad sector, but when you write to that sector it will write and then
verify it to see whether it is fine after being rewritten; if the media
is bad, it will remap the sector to a spare.
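
In practice that would look something like the sketch below: write zeros
over every sector of the device.  The device path and chunk size are my
own placeholders, and obviously a run like this destroys all data on the
target; the point is just that a write gives the drive a chance to
rewrite a pending bad sector in place or remap it to a spare, which is
why the array looked healthy afterwards.

/* Sketch: overwrite a whole block device with zeros.  DESTROYS ALL DATA.
 * "/dev/sda" and the 1 MiB chunk size are placeholders. */
#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

#define CHUNK (1024 * 1024)

int main(void)
{
        static char zeros[CHUNK];       /* static => zero-filled */
        int fd = open("/dev/sda", O_WRONLY);

        if (fd < 0) { perror("open"); return 1; }

        for (;;) {
                ssize_t n = write(fd, zeros, CHUNK);
                if (n < 0) {
                        if (errno == ENOSPC)    /* hit the end of the device */
                                break;
                        perror("write");        /* genuine media/transport error */
                        break;
                }
                if (n == 0)
                        break;
        }
        fsync(fd);                      /* flush everything to the disk */
        close(fd);
        return 0;
}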

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/
