Hello,
I have a problem with kernel 2.6.10 (Debian sources): on several servers
it disables the IRQ of my MegaRAID controller (megaraid_mbox driver)
after days of uptime and several million interrupts.
When the IRQ is disabled, the servers log the following:
May 22 03:02:20 relternet-01 kernel: irq 17: nobody cared!
May 22 03:02:20 relternet-01 kernel: [__report_bad_irq+42/160] __report_bad_irq+0x2a/0xa0
May 22 03:02:20 relternet-01 kernel: [handle_IRQ_event+48/112] handle_IRQ_event+0x30/0x70
May 22 03:02:20 relternet-01 kernel: [note_interrupt+112/176] note_interrupt+0x70/0xb0
May 22 03:02:20 relternet-01 kernel: [__do_IRQ+304/320] __do_IRQ+0x130/0x140
May 22 03:02:20 relternet-01 kernel: [do_IRQ+25/48] do_IRQ+0x19/0x30
May 22 03:02:20 relternet-01 kernel: [common_interrupt+26/32] common_interrupt+0x1a/0x20
May 22 03:02:20 relternet-01 kernel: [mwait_idle+51/80] mwait_idle+0x33/0x50
May 22 03:02:20 relternet-01 kernel: [cpu_idle+59/80] cpu_idle+0x3b/0x50
May 22 03:02:20 relternet-01 kernel: handlers:
May 22 03:02:20 relternet-01 kernel: [pg0+944120576/1069794304] (megaraid_isr+0x0/0x1e0 [megaraid_mbox])
May 22 03:02:20 relternet-01 kernel: Disabling IRQ #17
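Since the failure only shows up after days, it may help to scan the logs
of every server for these two messages. Below is a minimal Python sketch
that does this; the function name scan_irq_events is hypothetical, and
the patterns are only taken from the messages quoted above, so they may
need adjusting for other kernel versions.

```python
import re
from collections import Counter

# Patterns copied from the kernel messages quoted in this mail.
NOBODY_CARED = re.compile(r"kernel: irq (\d+): nobody cared!")
DISABLING = re.compile(r"kernel: Disabling IRQ #(\d+)")


def scan_irq_events(lines):
    """Count 'nobody cared' reports and 'Disabling IRQ' events per IRQ number.

    `lines` is any iterable of syslog lines (for example an open log file).
    Returns two Counters keyed by IRQ number: (reported, disabled).
    """
    reported, disabled = Counter(), Counter()
    for line in lines:
        match = NOBODY_CARED.search(line)
        if match:
            reported[int(match.group(1))] += 1
            continue
        match = DISABLING.search(line)
        if match:
            disabled[int(match.group(1))] += 1
    return reported, disabled
```

It could be fed an open log file, for instance
scan_irq_events(open("/var/log/kern.log")); that path is an assumption
and depends on the syslog configuration of each server.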
I haven't noticed anything else unusual on the servers.
Hardware of these servers:
- dual Intel Xeon 2.4 GHz
- 4.5 GB of RAM
- MegaRAID SCSI 320-2 [1]
The kernel is built with SMP, HT and 64 GB high memory support, uses the
megaraid_mbox driver (v2.20.4.1, as a module loaded from the initrd until
yesterday) and does not use preemption.
As I could not find anything on the internet relating a disabled IRQ to
megaraid, I have tried several things:
- At the beginning, eth0 and megaraid shared an IRQ. I have fixed
this [2], but the problem remains.
- I have tried a newer kernel version, moving from 2.6.9 to 2.6.10. No
improvement.
- I have tried the boot option "acpi=ht". No improvement.
- The firmware of the MegaRAID has been upgraded (to version 1L37).
No improvement.
Since yesterday, on all servers that have the problem, I have been
trying kernel 2.6.11.10 (sources from kernel.org) with megaraid_mbox
built in (v2.20.4.5) and with different boot options across the servers:
- "noirqdebug" and "acpi=ht"
- "noirqdebug" and "acpi=off"
- "acpi=off"
- "acpi=ht"
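To keep track of which server actually booted with which combination,
the options can be read back from /proc/cmdline. Here is a small Python
sketch; the helper names boot_options and describe are hypothetical,
and the parsing is simplified (it does not handle quoted values that
contain spaces).

```python
def boot_options(cmdline):
    """Split a kernel command line into {option: value}.

    Bare flags such as 'noirqdebug' map to None.
    """
    options = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        options[key] = value if sep else None
    return options


def describe(cmdline):
    """Report only the two options varied in this test: acpi and noirqdebug."""
    options = boot_options(cmdline)
    return {
        "acpi": options.get("acpi"),          # 'ht', 'off', or None if unset
        "noirqdebug": "noirqdebug" in options,
    }
```

On a running server this would be called as
describe(open("/proc/cmdline").read()).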
I have had this problem since I installed the servers a few weeks ago
with my new FAI (Debian Fully Automatic Installation) setup. The problem
is hard to track down because the servers can run without trouble for
one or two weeks. These servers are not in production yet; they are
still under load testing. They should go into production in one or two
weeks... if I find a way to fix this problem!
Do you have any ideas about where I should look?
Regards,
Julien
[1] 0000:03:08.0 RAID bus controller: LSI Logic / Symbios Logic MegaRAID (rev 01)
        Subsystem: LSI Logic / Symbios Logic MegaRAID 518 SCSI 320-2 Controller
        Flags: bus master, fast Back2Back, 66MHz, slow devsel, latency 32, IRQ 17
        Memory at d0500000 (32-bit, prefetchable) [size=64K]
        Capabilities: [80] Power Management version 2
[2]
irq  0:  88018734 timer          irq 16:  21659113 eth0
irq  1:         9 i8042          irq 17:  12532775 megaraid
irq  2:         0 cascade [4]    irq 18:        30 aic79xx
irq 12:         3                irq 19:        30 aic79xx
irq 14:         1 ide0           irq 23:         0 ehci_hcd
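
Once an IRQ has been disabled, its counter in a summary like [2] should
stop increasing, so comparing two snapshots is one way to spot an
affected server. The Python sketch below parses the two-column format
shown in [2] and lists the watched IRQs whose count did not grow. The
names parse_summary and stalled are hypothetical, and the regular
expression assumes exactly the "irq N: COUNT NAME" layout used here.

```python
import re

# One entry is "irq N: COUNT" followed by optional name words; a name word
# may not itself be the start of the next "irq N:" entry on the same line.
ENTRY = re.compile(r"irq\s+(\d+):\s+(\d+)((?:\s+(?!irq\s+\d+:)\S+)*)")


def parse_summary(text):
    """Map IRQ number -> (count, name) from a summary like the one in [2]."""
    table = {}
    for match in ENTRY.finditer(text):
        irq = int(match.group(1))
        table[irq] = (int(match.group(2)), match.group(3).strip())
    return table


def stalled(before, after, watch):
    """Return the watched IRQs whose counter did not increase."""
    return [
        irq
        for irq in watch
        if irq in before and irq in after and after[irq][0] <= before[irq][0]
    ]
```

Taking one snapshot, waiting under disk load, taking a second one and
calling stalled(first, second, [17]) would then return [17] only if the
megaraid counter had stopped.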