----- Original Message -----
Sent: Saturday, November 27, 2004 6:14
PM
Subject: RE: HDD DMA error and system
hangs - FC3.
VJ, this is not the first time similar inquiries have
appeared here, and I encountered the same problem with a completely different
hardware setup. I saw this happening on a SCSI raid w/caching
controller. I discovered a work around by disabling cache writes to
filesystem in the mount. Some of the newer ide/sata drives are coming
with large caches (~8meg), so it makes me wonder about a possible correlation
there. Maybe completely off target - just FYI...
Paul
----- Original Message -----
Sent: Saturday, November 27, 2004 2:33
PM
Subject: HDD DMA error and system hangs
- FC3.
Hi,
My PC suffers from HDD dma problem
almost everyday. Motherboard is Gigabyte GA7DXR, HDD is Seagate
ST3160023A.
Output of lspci is:
00:00.0 Host bridge: Advanced Micro Devices
[AMD] AMD-760 [IGD4-1P] System Controller (rev 13) 00:01.0 PCI bridge:
Advanced Micro Devices [AMD] AMD-760 [IGD4-1P] AGP Bridge 00:07.0 ISA
bridge: VIA Technologies, Inc. VT82C686 [Apollo Super South] (rev
40) 00:07.1 IDE interface: VIA Technologies, Inc.
VT82C586A/B/VT82C686/A/B/VT823x/A/C PIPC Bus Master IDE (rev 06) 00:07.4
SMBus: VIA Technologies, Inc. VT82C686 [Apollo Super ACPI] (rev
40) 00:09.0 Multimedia video controller: Internext Compression Inc iTVC15
MPEG-2 Encoder (rev 01) 00:0d.0 PCI bridge: Digital Equipment Corporation
DECchip 21152 (rev 03) 00:0e.0 Multimedia audio controller: Ensoniq 5880
AudioPCI (rev 02) 00:10.0 Unknown mass storage controller: Promise
Technology, Inc. PDC20265 (FastTrak100 Lite/Ultra100) (rev 02) 01:05.0
VGA compatible controller: ATI Technologies Inc Rage 128 RF/SG
AGP 02:04.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro 100]
(rev 05) 02:05.0 Ethernet controller: Intel Corp. 82557/8/9 [Ethernet Pro
100] (rev 05)
This is a portion of my logwatch mail message
(i can provied the /var/log/messages as well if needed).
Buffer I/O error on device hdh4,
...: 10 Time(s) RPC: error 5 connecting to ...:
1 Time(s) end_request: I/O error, dev hdh, sector...:
468 Time(s) hdh: DMA timeout error...: 2
Time(s) hdh: dma timeout error: status=0x00 { }...: 1
Time(s) hdh: dma timeout error: status=0xd0 { B...: 1
Time(s) hdh: read_intr: error=0x04 { DriveStat...: 4
Time(s) hdh: read_intr: status=0x51 { DriveReady
SeekComplete Error }...: 4 Time(s) ide3: reset:
master: error (0x00?)...: 2 Time(s) lost page write
due to I/O error on hdh4...: 10 Time(s)
And a lot of following messages
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071628
Nov 26 04:07:23 end_request: I/O error, dev hdh, sector 13071636 Nov
26 04:07:23 end_request: I/O error, dev hdh, sector 13071644 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071652 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071660 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071668 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071676 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071684 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071692 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071700 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071708 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071716 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071724 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071732 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071740 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071748 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071756 Nov 26
04:07:23 end_request: I/O error, dev hdh, sector 13071764
I am running kernel "2.6.9-1.681_FC3" with "noapic nolapic
acpi=off".
I ran Segate's own Seagate Tools for surface scan - No problem
found. I ran smartctl -t long /dev/hdh for SMART extensive test. No
problems there either.
Any ideas???
Regards from
VJ
Apologies. I sent MIME mail by
mistake.
VJ
*************************************************************************** This
message is intended only for the use of the Addressee and may contain
information that is PRIVILEGED and CONFIDENTIAL.
If you are not the
intended recipient, you are hereby notified that any dissemination of this
communication is strictly prohibited. If you have received this
communication in error, please erase all copies of the message and its
attachments and notify Space Imaging
immediately. ***************************************************************************
Hi Paul,
Are you talking about -W
option in hadparm? Please comment.
Thanks
VJ
|