On Fri, 2006-05-26 at 16:35 -0600, Ashley M. Kirchner wrote: > I have one machine that's consistently crashing with what appears to > be a kernel bug. At the moment it's running kernel-2.6.16-1.2111_FC4 . > Before that it was running 2107_FC4 (I skipped 2108_FC4). Both kernels > produced the same type of kernel message. The weird thing is that > sometimes it will lock up the machine and other times it will continue > to work just fine. I've attached a txt file with the latest one (and I > realize that because of it being text, it will probably just get > attached inline into this message.) > > Anyway, if someone can shed some light on this, I would very much > appreciate it. Thanks. > > plain text document attachment (kernel bug.txt) > May 26 05:35:23 avalon kernel: ------------[ cut here ]------------ > May 26 05:35:23 avalon kernel: kernel BUG at include/linux/list.h:167! > May 26 05:35:23 avalon kernel: invalid opcode: 0000 [#1] > May 26 05:35:23 avalon kernel: last sysfs file: /class/vc/vcsa2/dev > May 26 05:35:23 avalon kernel: Modules linked in: autofs4 w83627hf hwmon_vid hwmon eeprom i2c_isa nfs lockd nfs_acl sunrpc ipv6 ipt_REJECT xt_state ip_conntrack nfnetlink xt_tcpudp iptable_filter ip_tables x_tables dm_mod uhci_hcd hw_random i2c_i801 i2c_core 3c59x mii tulip floppy ext3 jbd > May 26 05:35:23 avalon kernel: CPU: 0 > May 26 05:35:23 avalon kernel: EIP: 0060:[<c012e12f>] Not tainted VLI > May 26 05:35:23 avalon kernel: EFLAGS: 00010097 (2.6.16-1.2111_FC4 #1) > May 26 05:35:23 avalon kernel: EIP is at remove_wait_queue+0x3f/0x53 > May 26 05:35:23 avalon kernel: eax: d99b0888 ebx: c5227028 ecx: ffffffff edx: c5227034 > May 26 05:35:23 avalon kernel: esi: db9b0878 edi: 00000296 ebp: c5227000 esp: da600f48 > May 26 05:35:23 avalon kernel: ds: 007b es: 007b ss: 0068 > May 26 05:35:23 avalon kernel: Process hald (pid: 1655, threadinfo=da600000 task=da544000) > May 26 05:35:23 avalon kernel: Stack: <0>c522703c c5227024 c5227008 c016ac96 00000000 00000000 00000005 09000198 > May 26 05:35:23 avalon kernel: c016b4aa da600fb0 090001a0 090001a0 00000000 00000000 00000000 c5e83e68 > May 26 05:35:23 avalon kernel: 00000005 c5e83e60 c016acc1 c5227000 00000000 09000178 000007d0 00c54ff4 > May 26 05:35:23 avalon kernel: Call Trace: > May 26 05:35:23 avalon kernel: [<c016ac96>] poll_freewait+0x21/0x4c [<c016b4aa>] do_sys_poll+0x2dd/0x394 > May 26 05:35:23 avalon kernel: [<c016acc1>] __pollwait+0x0/0x96 [<c016b7e9>] sys_poll+0x3c/0x4a > May 26 05:35:23 avalon kernel: [<c0102d35>] syscall_call+0x7/0xb <0>Code: 3b 10 75 27 8b 4b 0c 3b 51 04 75 29 89 41 04 89 08 c7 43 0c 00 01 10 00 c7 42 04 00 02 20 00 89 fa 89 f0 5b 5e 5f e9 59 44 1e 00 <0f> 0b a7 00 4f d3 32 c0 eb cf 0f 0b a8 00 4f d3 32 c0 eb cd 57 > May 26 05:35:23 avalon kernel: Continuing in 120 seconds. ^MContinuing in 119 seconds. ^MContinuing in 118 seconds. > > [... this repeats till ...] > > May 26 05:35:23 avalon kernel: tinuing in 10 seconds. ^MContinuing in 9 seconds. ^MContinuing in 8 seconds. ^MContinuing in 7 seconds. ^MContinuing in 6 seconds. ^MContinuing in 5 seconds. ^MContinuing in 4 seconds. ^MContinuing in 3 seconds. ^MContinuing in 2 seconds. ^MContinuing in 1 seconds. > May 26 05:35:23 avalon kernel: <3>Debug: sleeping function called from invalid context at include/linux/rwsem.h:43 > May 26 05:35:23 avalon kernel: in_atomic():0, irqs_disabled():1 > May 26 05:35:23 avalon kernel: [<c011d64e>] profile_task_exit+0x13/0x43 > May 26 05:35:23 avalon kernel: [<c011e604>] do_exit+0x1b/0x81e [<c011d07f>] printk+0x1f/0xb9 > May 26 05:35:23 avalon kernel: [<c010445d>] do_simd_coprocessor_error+0x0/0x155 [<c0104742>] do_invalid_op+0x0/0xab > May 26 05:35:23 avalon kernel: [<c01047e4>] do_invalid_op+0xa2/0xab [<c012e12f>] remove_wait_queue+0x3f/0x53 > May 26 05:35:23 avalon kernel: [<c03125a8>] _spin_unlock_irq+0x5/0x7 [<c0311312>] schedule+0x31c/0x5ce > May 26 05:35:23 avalon kernel: [<c01037ef>] error_code+0x4f/0x54 [<c012e12f>] remove_wait_queue+0x3f/0x53 > May 26 05:35:23 avalon kernel: [<c016ac96>] poll_freewait+0x21/0x4c [<c016b4aa>] do_sys_poll+0x2dd/0x394 > May 26 05:35:23 avalon kernel: [<c016acc1>] __pollwait+0x0/0x96 [<c016b7e9>] sys_poll+0x3c/0x4a > -- fedora-list mailing list fedora-list@xxxxxxxxxx To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list The upper bug seems like a memory error. A. Check the CPU/case cooling. B. Run memtest. If all fails, file a bug report at http://bugzilla.redhat.com Gilboa P.S. Can you post the machine configuration?