Re: RFC: outb 0x80 in inb_p, outb_p harmful on some modern AMD64 with MCP51 laptops

Pardon my ignorance, but is port 0xed really safe to push an out cycleat across the entire x86_64 family? How long must real _p pauses be inreality? (and who cares about what the code calls "really slow i/o").

Why are we waiting at all? I read the comments in io_64.h, and am a bitmystified. Does Windoze or DOS do this magical mystery wait?

Anyway, the virtualization hooks in 32-bit x86 almost make it possibleto isolate this simply - maybe - after the merge of 32/64 beingcontemplated.

And anyone who knows what the chipset might be doing with the 80 portrather than POST codes, perhaps could contribute. Any nvidia folks whoknow what's happening under NDA? Any Phoenix BIOS types?

I think the worst of the problems would be fixed by changing just theCMOS_READ/CMOS_WRITE macros. But the danger lingers in the *_p code.


Rene Herman wrote:

On 07-12-07 01:23, Robert Hancock wrote:
David P. Reed wrote:
After much, much testing (months, off and on, pursuing hypotheses),I've discovered that the use of "outb al,0x80" instructions to"delay" after inb and outb instructions causes solid freezes on myHP dv9000z laptop, when ACPI is enabled.
It takes a fair number of out's to 0x80, but the hard freeze isreliably reproducible by writing a driver that solely does a loop of50 outb's to 0x80 and calling it in a loop 1000 times from userspace. !!!
The serious impact is that the /dev/rtc and /dev/nvram devices arevery unreliable - thus "hwclock" freezes very reliably while loopingwaiting for a new second value and calling "cat /dev/nvram" in aloop freezes the machine if done a few times in a row.
This is reproducible, but requires a fair number of outb's to the0x80 diagnostic port, and seems to require ACPI to be on.
io_64.h is the source of these particular instructions, via theCMOS_READ and CMOS_WRITE macros, which are defined inmc146818_64.h. (I wonder if the same problem occurs in 32-bit mode).
I'm happy to complete and test a patch, but I'm curious what theright approach ought to be. I have to say I have no clue as to whatACPI is doing on this chipset (nvidia MCP51) that would make port80 do this. A raw random guess is that something is logging POSTcodes, but if so, not clear what is problematic in ACPI mode.
ANy help/suggestions?
Changing the delay instruction sequence from the outb to short jumpsmight be the safe thing. But Linus, et al. may have experience withthat on other architectures like older Pentiums etc.
The fact that these "pausing" calls are needed in the first placeseems rather cheesy. If there's hardware that's unable to respond toIO port writes as fast as possible, then surely there's a bettersolution than trying to stall the IOs by an arbitrary andhardware-dependent amount of time, like udelay calls, etc. Does anyremotely recent hardware even need this?
The idea is that the delay is not in fact hardware dependent. With inthe the absense of a POST board port 0x80 being sort of guaranteeed tonot be decoded on PCI but forwarded to and left to die on ISA/LPC oneshould get the effect that the _next_ write will have survived anISA/LPC bus address cycle acknowledgement timeout.
I believe.
And no, I don't believe any remotely recent hardware needs it and havein fact wondered about it since actual 386 days, having since thattime never found a device that wouldn't in fact take back to back I/Oeven. Even back then (ie, legacy only systems, no forwarding from PCIor anything) BIOSes provided ISA bus wait-state settings which shouldbe involved in getting insanely stupid and old hardware to behave...
Port 0xed has been suggested as an alternate port. Probably not agreat "fix" but if replacing the out with a simple udelay() isn't thatsimple (during early boot I gather) then it might at least besomething for you to try. I'd hope that the 0x80 ininclude/asm/io.h:native_io_delay() would be the only one you arerunning into, so you could change that to 0xed and see what catches fire.
If there are no sensible fixes, an 0x80/0xed choice could I assume behung of DMI or something (if that _is_ parsed soon enough).
Rene.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: RFC: outb 0x80 in inb_p, outb_p harmful on some modern AMD64 with MCP51 laptops
  - From: Rene Herman <[email protected]>

References:
- Re: RFC: outb 0x80 in inb_p, outb_p harmful on some modern AMD64 with MCP51 laptops
  - From: Robert Hancock <[email protected]>
- Re: RFC: outb 0x80 in inb_p, outb_p harmful on some modern AMD64 with MCP51 laptops
  - From: Rene Herman <[email protected]>

Prev by Date: Re: [PATCH] scheduler: fix x86 regression in native_sched_clock
Next by Date: Re: PS3: trouble with SPARSEMEM_VMEMMAP and kexec
Previous by thread: Re: RFC: outb 0x80 in inb_p, outb_p harmful on some modern AMD64 with MCP51 laptops
Next by thread: Re: RFC: outb 0x80 in inb_p, outb_p harmful on some modern AMD64 with MCP51 laptops
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]