VJ, this is not the first time similar inquiries have
appeared here, and I encountered the same problem with a completely different
hardware setup. I saw this happening on a SCSI raid w/caching
controller. I discovered a work around by disabling cache writes to
filesystem in the mount. Some of the newer ide/sata drives are coming with
large caches (~8meg), so it makes me wonder about a possible correlation
there. Maybe completely off target - just FYI...
Paul
----- Original Message -----
Sent: Saturday, November 27, 2004 2:33
PM
Subject: HDD DMA error and system hangs -
FC3.
Hi,
My PC suffers from HDD dma problem
almost everyday. Motherboard is Gigabyte GA7DXR, HDD is Seagate
ST3160023A.
Output of lspci is:
00:00.0 Host bridge: Advanced Micro Devices [AMD]
AMD-760 [IGD4-1P] System Controller (rev 13) 00:01.0 PCI bridge: Advanced
Micro Devices [AMD] AMD-760 [IGD4-1P] AGP Bridge 00:07.0 ISA bridge: VIA
Technologies, Inc. VT82C686 [Apollo Super South] (rev 40) 00:07.1 IDE interface: VIA Technologies, Inc. VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus
Master IDE (rev 06) 00:07.4 SMBus: VIA Technologies, Inc. VT82C686 [Apollo
Super ACPI] (rev 40) 00:09.0 Multimedia video controller: Internext
Compression Inc iTVC15 MPEG-2 Encoder (rev 01) 00:0d.0 PCI bridge: Digital
Equipment Corporation DECchip 21152 (rev 03) 00:0e.0 Multimedia audio controller: Ensoniq 5880 AudioPCI (rev 02) 00:10.0 Unknown mass storage
controller: Promise Technology, Inc. PDC20265 (FastTrak100 Lite/Ultra100) (rev
02) 01:05.0 VGA compatible controller: ATI Technologies Inc Rage 128 RF/SG
AGP 02:04.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100]
(rev 05) 02:05.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro
100] (rev 05)
This is a portion of my logwatch mail message (i
can provied the /var/log/messages as well if needed).
Buffer I/O error on device hdh4,
...: 10 Time(s) RPC: error 5 connecting to ...: 1
Time(s) end_request: I/O error, dev hdh, sector...: 468
Time(s) hdh: DMA timeout error...: 2
Time(s) hdh: dma timeout error: status=0x00 { }...: 1
Time(s) hdh: dma timeout error: status=0xd0 { B...: 1
Time(s) hdh: read_intr: error=0x04 { DriveStat...: 4
Time(s) hdh: read_intr: status=0x51 { DriveReady SeekComplete
Error }...: 4 Time(s) ide3: reset: master: error
(0x00?)...: 2 Time(s) lost page write due to I/O error
on hdh4...: 10 Time(s)
And a lot of following messages
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071628 Nov
26 04:07:23 end_request: I/O error, dev hdh, sector 13071636 Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071644 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071652 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071660 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071668 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071676 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071684 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071692 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071700 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071708 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071716 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071724 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071732 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071740 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071748 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071756 Nov 26 04:07:23
end_request: I/O error, dev hdh, sector 13071764
I am running kernel "2.6.9-1.681_FC3" with "noapic nolapic
acpi=off".
I ran Segate's own Seagate Tools for surface scan - No problem found. I ran smartctl -t long /dev/hdh for SMART extensive test. No problems there either.
Any ideas???
Regards from
VJ
Apologies. I sent MIME mail by
mistake.
VJ
***************************************************************************
This message is intended only for the use of the Addressee and may
contain information that is PRIVILEGED and CONFIDENTIAL.
If you are not the intended recipient, you are hereby notified that any
dissemination of this communication is strictly prohibited. If you have
received this communication in error, please erase all copies of the
message and its attachments and notify Space Imaging immediately.
***************************************************************************
|