Thanks to mcelog, I am now regularly seeing messages like this on an amd64 machine: kernel: Machine check events logged bit46 = corrected ecc error Data cache ECC error (syndrome 5b) memory/cache error 'data read mem transaction, data transaction, level 2' ADDR 38ed9200 CPU 0 0 data cache TSC fe4f9128ade MCE 0 STATUS 942dc00000000136 MCGSTATUS 0 The RAM modules are *not* ECC modules, nor does the Asus K8V Deluxe motherboard support ECC to my knowledge. I've turned ECC support on and off in the Bios without any effect. I've already run memtest86+ for hours without finding any problems, and I've removed each of the two memory modules for a while, but I still saw these errors appearing. Before I go out and buy a new motherboard (as I assume that it's a L1/L2 cache problem), I'd like to know how I am to interpret these MCE dumps and how I could use them to actually pinpoint the source of the problem. Cheers, -- martin; (greetings from the heart of the sun.) \____ echo mailto: !#^."<*>"|tr "<*> mailto:" net@madduck spamtraps: [email protected] "america may be unique in being a country which has leapt from barbarism to decadence without touching civilization." -- john o'hara
Attachment:
signature.asc
Description: Digital signature (GPG/PGP)
- Follow-Ups:
- Re: How to interpret MCE messages?
- From: [email protected]
- Re: How to interpret MCE messages?
- From: Alan Cox <[email protected]>
- Re: How to interpret MCE messages?
- Prev by Date: Re: 2.6.19-rc1: Volanomark slowdown
- Next by Date: Re: [PATCH] sysctl: Undeprecate sys_sysctl
- Previous by thread: [PATCH] initramfs : handle more than one source dir or file list
- Next by thread: Re: How to interpret MCE messages?
- Index(es):