Linus Torvalds wrote:
On Sun, 23 Dec 2007, Carlos Corbacho wrote:
Fix suspend-to-RAM on nForce 4 (CK804) boards by increasing
PCIBIOS_MIN_IO.
Fixes kernel bugzilla #9528
Problem:
Linus' patch (52ade9b3b97fd3bea42842a056fe0786c28d0555) to re-order
suspend (and fix fall out from Rafael's earlier suspend reordering work)
broke suspend-to-RAM on nForce 4 (CK804) boards.
Why:
After debugging _PTS() in the DSDT, it turns out these nVidia boards are
trying to write to an IO port > 0x1000 (0x142E) during suspend. Before the
re-ordering, we got away with this.
Very interesting.
HOWEVER.
I'd much rather figure out what the magic IO resource is that clashes.
It's almost certainly some hidden and undocumented (or badly documented)
ACPI IO area that the kernel doesn't know about, because it's not a
regular PCI BAR resource, but some northbridge (or southbridge) magic
register range.
Those ranges *should* be reserved by the BIOS in the ACPI tables, but this
would definitely not be the first time that doesn't happen.
I'm having trouble sorting out which report is for which BIOS (and some
of them don't have any dmesg posted), but I believe in these cases that
memory region is indeed reported as reserved by the BIOS, and no PCI
resources should end up allocated there. So I'm not sure why fiddling
with PCIBIOS_MIN_IO would have any effect (other than by accident).
I wonder if this is the culprit (from Arthur Erhardt's dmesg):
pnpacpi: exceeded the max number of mem resources: 12
pnpacpi: exceeded the max number of mem resources: 12
which means we're ignoring some of the memory reservations. I wonder if
some IO reservations are also being ignored?
Why do we have this silly hard limit of number of resources anyway? If
we just ignore random reservations provided by the BIOS, we shouldn't be
surprised if things break randomly. This warning at the very least
should be much louder (i.e. "Warning: This problem may break your system")..
But the right fix would be for us to just figure out what the range is ass
a PCI quirk, and just know to avoid it on purpose, ratehr than just being
lucky and happen to avoid it because PCIBIOS_MIN_IO just happens to be
bigger than the particular address.
So can you:
- show what your /proc/ioports contains (*with* the bug triggering, ie
non-working suspend, so we see what it is that actually ends up using
that area)
- send out 'dmesg' for a boot (same deal)
- add "lspci -xxxvv" output to the deal too.
and also make them part of the bugzilla history (I'm cc'ing bugzilla here,
and added the bug number to the subject, so hopefully this thread ends up
being archived there too).
There was some previous work in the PCIBIOS_MIN_IO area over two years ago
(71db63acff69618b3d9d3114bd061938150e146b) which bumped this to 0x4000,
but this was reverted (2ba84684e8cf6f980e4e95a2300f53a505eb794e) after
causing new and entirely different problems on another nForce board.
The problem here is classic: these magic ranges tend to be *different* on
different boards (because they don't tend to be fixed by hardware, they
are programmed regions set up by firmware), so trying to change
PCIBIOS_MIN_IO to avoid a problem on one board is almost certain to just
introduce it on another board instead.
On *your* particular board, 0x142E is used for something, but on somebody
elses board it might be 0x162E, and now changing PCIBIOS_MIN_IO to 0x1500
might make that other board hang instead.
So you seem to have debugged this very successfully, and I'm wondering if
you might be able to find out where that 0x142e comes from, and we could
fix it for *all* boards using that chipset by just figuring out what the
*hardware* rules (rather than the random firmware setup that will be
different on different boards) for that chipset actually are!
I suspect it's board specific. Looking at the DSDT for my A8N-SLI
Deluxe, that SMIP region is defined at 0x442E (and is reported as
reserved). This BIOS doesn't write there in the _PTS method like the
ones in the report apparently do though.
--
Robert Hancock Saskatoon, SK, Canada
To email, remove "nospam" from [email protected]
Home Page: http://www.roberthancock.com/
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
[Index of Archives]
[Kernel Newbies]
[Netfilter]
[Bugtraq]
[Photo]
[Stuff]
[Gimp]
[Yosemite News]
[MIPS Linux]
[ARM Linux]
[Linux Security]
[Linux RAID]
[Video 4 Linux]
[Linux for the blind]
[Linux Resources]