Bugzilla – Bug 137976
nvidia related X server crash
Last modified: 2006-01-12 14:47:32 UTC
Hi all, My system is locking up, sometimes as much as 4 times in one hour. I've installed SUSE 10.0 OSS in non ACPI mode. I ran memtest86 for 12 hours, and found no errors. The system is: P4b 2.4GHz Asus P4P800-E deluxe 1 GB RAM DDR 266, in dual channel configuration. NVIDIA GeforceFX5500 Terratec Phase88 1 HD on IDE channel, 1 on promise 378 in IDE mode. (which SUSE is not picking up properly, incidentally). This crash also happened under FC4. However, it hasn't occured under XP, or Knoppix 4.0. Does SUSE have a problem with dual channel configurations? A section of var/log/messages gives: Dec 10 23:42:28 linux kernel: invalid operand: 0000 [#1] Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core Dec 10 23:42:28 linux kernel: CPU: 0 Dec 10 23:42:28 linux kernel: EIP: 0060:[<c02f8598>] Tainted: P U VLI Dec 10 23:42:28 linux kernel: EFLAGS: 00010246 (2.6.13-15.7-default) Dec 10 23:42:28 linux kernel: EIP is at krb5_decrypt+0x1c8/0x1e0 Dec 10 23:42:28 linux kernel: eax: 00000404 ebx: 00000200 ecx: ca3b8000 edx: 00000000 Dec 10 23:42:28 linux kernel: esi: cf261f70 edi: cf261f6c ebp: 7fffffff esp: cf261f3c Dec 10 23:42:28 linux kernel: ds: 007b es: 007b ss: 0068 Dec 10 23:42:28 linux kernel: Process nautilus (pid: 18992, threadinfo=cf260000 task=f2a13a40) Dec 10 23:42:28 linux kernel: Stack: c016ab8c cf261f6c dc8e1488 00000609 dc8e1480 cf261f70 cf261f6c 00001000 Dec 10 23:42:28 linux kernel: cf261f70 c016ac8a cf261f9c dc8e1480 00000000 00000000 dc8e1480 081584d0 Dec 10 23:42:28 linux kernel: 00000000 081584d0 c016adba 7fffffff 08158518 fffffff7 dc8e1480 dc8e3480 Dec 10 23:42:28 linux kernel: Call Trace: Dec 10 23:42:28 linux kernel: [<c016ab8c>] do_pollfd+0x5c/0xb0 Dec 10 23:42:28 linux kernel: [<c016ac8a>] do_poll+0xaa/0xd0 Dec 10 23:42:28 linux kernel: [<c016adba>] sys_poll+0x10a/0x220 Dec 10 23:42:28 linux kernel: [<c016a2c0>] __pollwait+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c0100d1b>] calibrate_delay+0x3db/0x6c0 Dec 10 23:42:28 linux kernel: Code: 32 c0 e9 1f ff ff ff 68 c4 90 33 c0 e8 d2 42 e2 ff 58 eb a3 0f 0b 7e 01 3c 0d 32 c0 e9 75 ff ff ff 0f 0b 7d 01 3c 0d 32 c0 e9 62 <ff> ff ff 0f 0b 14 01 3c 0d 32 c0 e9 b5 fe ff ff 90 8d b4 26 00 Dec 10 23:42:28 linux kernel: ------------[ cut here ]------------ Dec 10 23:42:28 linux kernel: kernel BUG at mm/rmap.c:493! Dec 10 23:42:28 linux kernel: invalid operand: 0000 [#2] Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core Dec 10 23:42:28 linux kernel: CPU: 0 Dec 10 23:42:28 linux kernel: EIP: 0060:[<c014f512>] Tainted: P U VLI Dec 10 23:42:28 linux kernel: EFLAGS: 00010286 (2.6.13-15.7-default) Dec 10 23:42:28 linux kernel: EIP is at page_remove_rmap+0x32/0x40 Dec 10 23:42:28 linux kernel: eax: ffffffff ebx: eb44b058 ecx: c03ee500 edx: c1568900 Dec 10 23:42:28 linux kernel: esi: c1568900 edi: 00000020 ebp: 40016000 esp: cf261d74 Dec 10 23:42:28 linux kernel: ds: 007b es: 007b ss: 0068 Dec 10 23:42:28 linux kernel: Process nautilus (pid: 18992, threadinfo=cf260000 task=f2a13a40) Dec 10 23:42:28 linux kernel: Stack: c0149427 00000000 40017000 c03ee500 40017000 d0ca0404 40017000 40016fff Dec 10 23:42:28 linux kernel: c01495c3 40017000 00000000 c03ee500 40017000 00002000 f35d7074 40017000 Dec 10 23:42:28 linux kernel: c0149716 40017000 00000000 cf260000 f6e6fd00 cf261e04 00006000 00000000 Dec 10 23:42:28 linux kernel: Call Trace: Dec 10 23:42:28 linux kernel: [<c0149427>] zap_pte_range+0xe7/0x200 Dec 10 23:42:28 linux kernel: [<c01495c3>] unmap_page_range+0x83/0xd0 Dec 10 23:42:28 linux kernel: [<c0149716>] unmap_vmas+0x106/0x220 Dec 10 23:42:28 linux kernel: [<c014dabb>] exit_mmap+0x5b/0x120 Dec 10 23:42:28 linux kernel: [<c011a63e>] mmput+0x1e/0x70 Dec 10 23:42:28 linux kernel: [<c011e78d>] do_exit+0xdd/0x350 Dec 10 23:42:28 linux kernel: [<c010468e>] die+0x13e/0x140 Dec 10 23:42:28 linux kernel: [<c0104940>] do_invalid_op+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c01049d1>] do_invalid_op+0x91/0xa0 Dec 10 23:42:28 linux kernel: [<c02f8598>] krb5_decrypt+0x1c8/0x1e0 Dec 10 23:42:28 linux kernel: [<c0284c2d>] sock_aio_read+0xdd/0x130 Dec 10 23:42:28 linux kernel: [<c0140c19>] buffered_rmqueue+0xb9/0x1f0 Dec 10 23:42:28 linux kernel: [<c0103f0f>] error_code+0x4f/0x60 Dec 10 23:42:28 linux kernel: [<c02f8598>] krb5_decrypt+0x1c8/0x1e0 Dec 10 23:42:28 linux kernel: [<c016ab8c>] do_pollfd+0x5c/0xb0 Dec 10 23:42:28 linux kernel: [<c016ac8a>] do_poll+0xaa/0xd0 Dec 10 23:42:28 linux kernel: [<c016adba>] sys_poll+0x10a/0x220 Dec 10 23:42:28 linux kernel: [<c016a2c0>] __pollwait+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c0100d1b>] calibrate_delay+0x3db/0x6c0 Dec 10 23:42:28 linux kernel: Code: 75 1f 83 42 08 ff 0f 98 c0 84 c0 75 01 c3 8b 42 08 40 78 17 83 ca ff b8 10 00 00 00 e9 d8 1f ff ff 0f 0b ea 01 37 52 31 c0 eb d7 <0f> 0b ed 01 37 52 31 c0 eb df 8d 74 26 00 55 57 56 53 57 89 d3 Dec 10 23:42:28 linux kernel: <1>Fixing recursive fault but reboot is needed! Dec 10 23:42:28 linux kernel: scheduling while atomic: nautilus/0x00000001/18992 Dec 10 23:42:28 linux kernel: [<c02f9dfc>] schedule+0x55c/0x610 Dec 10 23:42:28 linux kernel: [<c012b7fc>] __kernel_text_address+0x1c/0x30 Dec 10 23:42:28 linux kernel: [<c01041f7>] show_trace+0x27/0x70 Dec 10 23:42:28 linux kernel: [<c014f526>] try_to_unmap_one+0x6/0x180 Dec 10 23:42:28 linux kernel: [<c014f527>] try_to_unmap_one+0x7/0x180 Dec 10 23:42:28 linux kernel: [<c011c863>] printk+0x13/0x20 Dec 10 23:42:28 linux kernel: [<c011e999>] do_exit+0x2e9/0x350 Dec 10 23:42:28 linux kernel: [<c010468e>] die+0x13e/0x140 Dec 10 23:42:28 linux kernel: [<c0104940>] do_invalid_op+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c01049d1>] do_invalid_op+0x91/0xa0 Dec 10 23:42:28 linux kernel: [<c014f512>] page_remove_rmap+0x32/0x40 Dec 10 23:42:28 linux kernel: [<c02597e2>] elv_queue_empty+0x12/0x20 Dec 10 23:42:28 linux kernel: [<f8855f9b>] ide_do_request+0xbb/0x3a0 [ide_core] Dec 10 23:42:28 linux kernel: [<f885d1d0>] ide_dma_intr+0x0/0xb0 [ide_core] Dec 10 23:42:28 linux kernel: [<c013bd53>] handle_IRQ_event+0x33/0x60 Dec 10 23:42:28 linux kernel: [<c013be23>] __do_IRQ+0xa3/0xd0 Dec 10 23:42:28 linux kernel: [<c0103f0f>] error_code+0x4f/0x60 Dec 10 23:42:28 linux kernel: [<c014f512>] page_remove_rmap+0x32/0x40 Dec 10 23:42:28 linux kernel: [<c0149427>] zap_pte_range+0xe7/0x200 Dec 10 23:42:28 linux kernel: [<c01495c3>] unmap_page_range+0x83/0xd0 Dec 10 23:42:28 linux kernel: [<c0149716>] unmap_vmas+0x106/0x220 Dec 10 23:42:28 linux kernel: [<c014dabb>] exit_mmap+0x5b/0x120 Dec 10 23:42:28 linux kernel: [<c011a63e>] mmput+0x1e/0x70 Dec 10 23:42:28 linux kernel: [<c011e78d>] do_exit+0xdd/0x350 Dec 10 23:42:28 linux kernel: [<c010468e>] die+0x13e/0x140 Dec 10 23:42:28 linux kernel: [<c0104940>] do_invalid_op+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c01049d1>] do_invalid_op+0x91/0xa0 Dec 10 23:42:28 linux kernel: [<c02f8598>] krb5_decrypt+0x1c8/0x1e0 Dec 10 23:42:28 linux kernel: [<c0284c2d>] sock_aio_read+0xdd/0x130 Dec 10 23:42:28 linux kernel: [<c0140c19>] buffered_rmqueue+0xb9/0x1f0 Dec 10 23:42:28 linux kernel: [<c0103f0f>] error_code+0x4f/0x60 Dec 10 23:42:28 linux kernel: [<c02f8598>] krb5_decrypt+0x1c8/0x1e0 Dec 10 23:42:28 linux kernel: [<c016ab8c>] do_pollfd+0x5c/0xb0 Dec 10 23:42:28 linux kernel: [<c016ac8a>] do_poll+0xaa/0xd0 Dec 10 23:42:28 linux kernel: [<c016adba>] sys_poll+0x10a/0x220 Dec 10 23:42:28 linux kdm[4794]: X server for display :0 terminated unexpectedly Dec 10 23:42:28 linux kernel: [<c016a2c0>] __pollwait+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c0100d1b>] calibrate_delay+0x3db/0x6c0 Dec 10 23:42:28 linux kernel: Unable to handle kernel paging request at virtual address 010483b6 Dec 10 23:42:28 linux kernel: printing eip: Dec 10 23:42:28 linux kernel: 010483b6 Dec 10 23:42:28 linux kernel: *pde = 00000000 Dec 10 23:42:28 linux kernel: Oops: 0000 [#3] Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core Dec 10 23:42:28 linux kernel: CPU: 0 Dec 10 23:42:28 linux kernel: EIP: 0060:[<010483b6>] Tainted: P U VLI Dec 10 23:42:28 linux kernel: EFLAGS: 00010087 (2.6.13-15.7-default) Dec 10 23:42:28 linux kernel: EIP is at 0x10483b6 Dec 10 23:42:28 linux kernel: eax: e2e5408c ebx: e2e5408c ecx: 00000000 edx: 00000001 Dec 10 23:42:28 linux kernel: esi: 82380000 edi: 105d8b00 ebp: f5283de8 esp: f5283dc8 Dec 10 23:42:28 linux kernel: ds: 007b es: 007b ss: 0068 Dec 10 23:42:28 linux kernel: Process smpppd (pid: 4673, threadinfo=f5282000 task=f7c52530) Dec 10 23:42:28 linux kernel: Stack: c01195c7 00000000 00000001 00000001 e2e56098 00000202 e00ec080 f5283eb4 Dec 10 23:42:28 linux kernel: f5283dfc c0119601 00000000 00000000 e2a13500 e00ec080 c02e306a 00000000 Dec 10 23:42:28 linux kernel: e2a13500 c0287468 00000000 c0288a67 00000001 00000001 c02e50b9 f36c6c80 Dec 10 23:42:28 linux kernel: Call Trace: Dec 10 23:42:28 linux kernel: [<c01195c7>] __wake_up_common+0x37/0x60 Dec 10 23:42:28 linux kernel: [<c0119601>] __wake_up+0x11/0x20 Dec 10 23:42:28 linux kernel: [<c02e306a>] unix_write_space+0x2a/0x60 Dec 10 23:42:28 linux kernel: [<c0287468>] sock_wfree+0x38/0x40 Dec 10 23:42:28 linux kernel: [<c0288a67>] __kfree_skb+0x57/0x180 Dec 10 23:42:28 linux kernel: [<c02e50b9>] unix_stream_recvmsg+0x1c9/0x420 Dec 10 23:42:28 linux kernel: [<c0284c2d>] sock_aio_read+0xdd/0x130 Dec 10 23:42:28 linux kernel: [<c0159136>] do_sync_read+0xb6/0x110 Dec 10 23:42:28 linux kernel: [<c016a77a>] do_select+0x30a/0x340 Dec 10 23:42:28 linux kernel: [<c012df70>] autoremove_wake_function+0x0/0x30 Dec 10 23:42:28 linux kernel: [<c01592d1>] vfs_read+0x141/0x170 Dec 10 23:42:28 linux kernel: [<c01595bc>] sys_read+0x3c/0x70 Dec 10 23:42:28 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79 Dec 10 23:42:28 linux kernel: Code: Bad EIP value. Dec 10 23:42:28 linux kernel: <1>Unable to handle kernel paging request at virtual address 010483b6 Dec 10 23:42:28 linux kernel: printing eip: Dec 10 23:42:28 linux kernel: 010483b6 Dec 10 23:42:28 linux kernel: *pde = 00000000 Dec 10 23:42:28 linux kernel: Oops: 0000 [#4] Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core Dec 10 23:42:28 linux kernel: CPU: 0 Dec 10 23:42:28 linux kernel: EIP: 0060:[<010483b6>] Tainted: P U VLI Dec 10 23:42:28 linux kernel: EFLAGS: 00010087 (2.6.13-15.7-default) Dec 10 23:42:28 linux kernel: EIP is at 0x10483b6 Dec 10 23:42:28 linux kernel: eax: e2e5408c ebx: e2e5408c ecx: 00000000 edx: 00000001 Dec 10 23:42:28 linux kernel: esi: 82380000 edi: 105d8b00 ebp: f5283bf0 esp: f5283bd0 Dec 10 23:42:28 linux kernel: ds: 007b es: 007b ss: 0068 Dec 10 23:42:28 linux kernel: Process smpppd (pid: 4673, threadinfo=f5282000 task=f7c52530) Dec 10 23:42:28 linux kernel: Stack: c01195c7 00000000 00000000 00000001 e2e56098 00000246 f36c6c80 f36c6ccc Dec 10 23:42:28 linux kernel: f5283c04 c0119601 00000000 00000000 e2a13500 dfff4f80 c0287bc9 00000000 Dec 10 23:42:28 linux kernel: c02e328d 00000000 01000286 f6007164 cde93b00 00000000 dfff4600 cde93b24 Dec 10 23:42:28 linux kernel: Call Trace: Dec 10 23:42:28 linux kernel: [<c01195c7>] __wake_up_common+0x37/0x60 Dec 10 23:42:28 linux kernel: [<c0119601>] __wake_up+0x11/0x20 Dec 10 23:42:28 linux kernel: [<c0287bc9>] sock_def_wakeup+0x19/0x20 Dec 10 23:42:28 linux kernel: [<c02e328d>] unix_release_sock+0xcd/0x1d0 Dec 10 23:42:28 linux kernel: [<c0284841>] sock_release+0x11/0x70 Dec 10 23:42:28 linux kernel: [<c0285289>] sock_close+0x19/0x30 Dec 10 23:42:28 linux kernel: [<c015a180>] __fput+0x80/0x160 Dec 10 23:42:28 linux kernel: [<c0158b0b>] filp_close+0x3b/0x60 Dec 10 23:42:28 linux kernel: [<c011db88>] put_files_struct+0x58/0xd0 Dec 10 23:42:28 linux kernel: [<c011e7ad>] do_exit+0xfd/0x350 Dec 10 23:42:28 linux kernel: [<c010468e>] die+0x13e/0x140 Dec 10 23:42:28 linux kernel: [<c011764c>] do_page_fault+0x2dc/0x5ef Dec 10 23:42:28 linux kernel: [<c0125c15>] __group_send_sig_info+0x65/0x80 Dec 10 23:42:28 linux kernel: [<c0169754>] send_sigio_to_task+0xb4/0xf0 Dec 10 23:42:28 linux kernel: [<c0140c19>] buffered_rmqueue+0xb9/0x1f0 Dec 10 23:42:28 linux kernel: [<c0274502>] psmouse_interrupt+0xb2/0x2b0 Dec 10 23:42:28 linux kernel: [<c0117370>] do_page_fault+0x0/0x5ef Dec 10 23:42:28 linux kernel: [<c0103f0f>] error_code+0x4f/0x60 Dec 10 23:42:28 linux kernel: [<c01195c7>] __wake_up_common+0x37/0x60 Dec 10 23:42:28 linux kernel: [<c0119601>] __wake_up+0x11/0x20 Dec 10 23:42:28 linux kernel: [<c02e306a>] unix_write_space+0x2a/0x60 Dec 10 23:42:28 linux kernel: [<c0287468>] sock_wfree+0x38/0x40 Dec 10 23:42:28 linux kernel: [<c0288a67>] __kfree_skb+0x57/0x180 Dec 10 23:42:28 linux kernel: [<c02e50b9>] unix_stream_recvmsg+0x1c9/0x420 Dec 10 23:42:28 linux kernel: [<c0284c2d>] sock_aio_read+0xdd/0x130 Dec 10 23:42:28 linux kernel: [<c0159136>] do_sync_read+0xb6/0x110 Dec 10 23:42:28 linux kernel: [<c016a77a>] do_select+0x30a/0x340 Dec 10 23:42:28 linux kernel: [<c012df70>] autoremove_wake_function+0x0/0x30 Dec 10 23:42:28 linux kernel: [<c01592d1>] vfs_read+0x141/0x170 Dec 10 23:42:28 linux kernel: [<c01595bc>] sys_read+0x3c/0x70 Dec 10 23:42:28 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79 Dec 10 23:42:28 linux kernel: Code: Bad EIP value. Dec 10 23:42:28 linux kernel: <1>Fixing recursive fault but reboot is needed! Dec 10 23:42:28 linux kernel: Unable to handle kernel paging request at virtual address 00800000 Dec 10 23:42:28 linux kernel: printing eip: Dec 10 23:42:28 linux kernel: c0372680 Dec 10 23:42:28 linux kernel: *pde = 00000000 Dec 10 23:42:28 linux kernel: Oops: 0002 [#5] Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core Dec 10 23:42:28 linux kernel: CPU: 0 Dec 10 23:42:28 linux kernel: EIP: 0060:[<c0372680>] Tainted: P U VLI Dec 10 23:42:28 linux kernel: EFLAGS: 00213282 (2.6.13-15.7-default) Dec 10 23:42:28 linux kernel: EIP is at 0xc0372680 Dec 10 23:42:28 linux kernel: eax: d39c9480 ebx: c0372660 ecx: 00000000 edx: cfe63680 Dec 10 23:42:28 linux kernel: esi: 00800000 edi: 00000017 ebp: 00000009 esp: f5287ef8 Dec 10 23:42:28 linux kernel: ds: 007b es: 007b ss: 0068 Dec 10 23:42:28 linux kernel: Process X (pid: 18152, threadinfo=f5286000 task=f756d020) Dec 10 23:42:28 linux kernel: Stack: c0285242 d39c9480 c016a5f3 f5287fa4 f5287f8c 00000000 00000018 00006c8c Dec 10 23:42:28 linux kernel: f75fdbe0 f75fdc00 f75fdc20 f75fdb84 f75fdba4 f75fdbc4 009fff1a 00000000 Dec 10 23:42:28 linux kernel: 00000000 009fff1a 00002000 00000000 00000000 00000000 00000304 00000182 Dec 10 23:42:28 linux kernel: Call Trace: Dec 10 23:42:28 linux kernel: [<c0285242>] sock_poll+0x12/0x20 Dec 10 23:42:28 linux kernel: [<c016a5f3>] do_select+0x183/0x340 Dec 10 23:42:28 linux kernel: [<c016a2c0>] __pollwait+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c016a9b6>] sys_select+0x1e6/0x360 Dec 10 23:42:28 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79 Dec 10 23:42:28 linux kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <80> 26 37 c0 80 26 37 c0 00 00 00 00 e0 76 2e c0 f0 76 2e c0 00 Dec 10 23:42:28 linux kernel: <1>general protection fault: 9280 [#6] Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core Dec 10 23:42:28 linux kernel: CPU: 0 Dec 10 23:42:28 linux kernel: EIP: 0060:[<c043a3c2>] Tainted: P U VLI Dec 10 23:42:28 linux kernel: EFLAGS: 00010282 (2.6.13-15.7-default) Dec 10 23:42:28 linux kernel: EIP is at 0xc043a3c2 Dec 10 23:42:28 linux kernel: eax: d39c9280 ebx: c0372f60 ecx: 00008000 edx: cfe63c80 Dec 10 23:42:28 linux kernel: esi: 00010000 edi: 00000010 ebp: 00000010 esp: e2acfef8 Dec 10 23:42:28 linux kernel: ds: 007b es: 007b ss: 0068 Dec 10 23:42:28 linux kernel: Process dcopserver (pid: 18270, threadinfo=e2ace000 task=e7009a40) Dec 10 23:42:28 linux kernel: Stack: c0285242 d39c9280 c016a5f3 e2acffa4 e2acff8c 00000000 00000011 7fffffff Dec 10 23:42:28 linux kernel: f461d4cc f461d4d0 f461d4d4 f461d4c4 f461d4c8 f461d4cc 00013ed8 00000000 Dec 10 23:42:28 linux kernel: 00000000 00013ed8 00000400 00000000 00000000 00000000 00000304 00000182 Dec 10 23:42:28 linux kernel: Call Trace: Dec 10 23:42:28 linux kernel: [<c0285242>] sock_poll+0x12/0x20 Dec 10 23:42:28 linux kernel: [<c016a5f3>] do_select+0x183/0x340 Dec 10 23:42:28 linux kernel: [<c016a2c0>] __pollwait+0x0/0xa0 Dec 10 23:42:28 linux kernel: [<c016a9b6>] sys_select+0x1e6/0x360 Dec 10 23:42:28 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79 Dec 10 23:42:28 linux kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 9b ca df 00 12 f6 df 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 d5 <ca> df 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 I also had one crash, after which I could find no evidence of in the logs. Hoping you can help, richardben
Many of these oopses have krb5_decrypt in them. Are you using NFS with RPCSEC_GSS and kerberos? If so, do these crashes disappear if you disable it?
(In reply to comment #1) > Many of these oopses have krb5_decrypt in them. Are you using NFS with > RPCSEC_GSS and kerberos? If so, do these crashes disappear if you disable > it? > Sorry, but I'm not sure what all these are. However: I checked Yast2 NFS configuration, and the NFS Server Do Not Start option is selected. Selecting Kerberos Client in Yast2 results in it telling me that installation of pam_krb5 and krb5-client is required. Is this relevant to what you're asking? The motherboard has a Marvell Gigabit LAN controller chipset, but this is unused. (The machine is not networked). Hope this helps. On perhaps a different tack I am currently using fluxbox instead of KDE, and the crashes are much less frequent.
Looking at the oopses more closely I notice you're using the nvidia module. I strongly suspect this is the main culprit. Can you please try running with the stock nv driver and see if the problem persists?
Right. I ran the NVIDIA uninstaller program, and it told me that there was no NVIDIA driver installed! Also I found that some software that needs 3D hardware acceleration tells me that it can't find any. What is yast doing?! So, instead I just changed the driver to "nv" in Xorg.conf. At the moment it's working, and so far so good; no crash yet. I think if I try the NVIDIA driver again I'll use its own installer. Incidentally, down amongst the oopses is a mention of ac97. The terratec soundcard doesn't use this. It's only for onboard sound, isn't it? which is disabled. richard@linux:~> dmesg | grep ac97 ALSA sound/pci/ac97/ac97_codec.c:1959: AC'97 0 does not respond - RESET ALSA sound/pci/ac97/ac97_codec.c:1968: AC'97 0 access is not valid [0xffffffff], removing mixer. ice1712: cannot initialize ac97 for consumer, skipped I'll let you know if the nv switch remains crash-free. Richard.
Hi again, It's much more stable now running with the nv driver, but I have had one lock-up since the last posting. I was running firefox with multiple tabs, while connected to the net. X died and I ended up at a text page with a login prompt. The display had broken green lines across it. The machine had to be reset. I could find no reports of this in the logs. What's the story with the nvidia driver? What makes you suspect it?
Hi Richard, The nvidia driver is known to mess with a lot of things - including the memory subsystem. And since it is closed source, we cannot support configurations with this driver loaded. The crashes you reported just confirmed that it is always a good idea to see if kernel crashes go away by removing all closed source third party drivers. Please report this issue to Nvidia. Concerning the crash you reported in comment #5, this sounds like an X server bug, not a kernel bug. When the X server crashes, it will leave the video hardware in a funky state. I'm re-assigning this report to the X people.
Please attach the results of "nvidia-bug-report.sh". Thanks.
Sorry, but where is nvidia-bug-report.sh? updatedb followed by locate can't find it. The same day as my last comment, X died. I reinstalled the nvidia driver (and tried to at least get X running again. It didn't work.) and then uninstalled it with its installer, and then removed the glx module from Xorg.conf. Leaving /usr/X11R6/lib/modules/drivers/nvidia_drv.o intact and running Xorg -configure results in a could not load module fatal error. Removing it and running Xorg -configure results in "Could not init font path element" errors, and "fatal server error could not open default font 'fixed'", or "cursor". There is much wailing and gnashing of gears here at the moment. Richard
nvidia-bug-report.sh is part of the nvidia driver installation. I'm afraid I can't help you much in your now messed up system. :-( Hopefully at least you'll manage to get at least a nv driver ocnfiguration running again.
Finally got X up and running again. The thing that's bugging (sic) me is that the last crash WAS with the nv driver. So there won't be an nvidia-bug-report. Sorry. I wish I'd known about that script. It appears that when X crashed, it corrupted a number of font files and XKB related stuff. So I ripped them out and reinstalled/updated in yast. Is it true that when X crashes it dumps stuff into a cache which it hopes will get written to disk? If the machine is locked up, how does it expect this stuff to get written?
The Xserver does nothing special when writing to its logfile. Anyway, since you can't provide the output of nvidia-bug-report.sh me (and NVIDIA) can't investigate this issue. :-( I'm sorry! (--> WONTFIX)