Bug 137976

Summary: nvidia related X server crash
Product: [openSUSE] SUSE LINUX 10.0
Reporter: Richard Brown <richardbb>
Component: X11 3rd Party
Assignee: Stefan Dirsch <sndirsch>
Status: RESOLVED WONTFIX
QA Contact: E-mail List <qa-bugs>
Severity: Normal
Priority: P5 - None
CC: aritger, eich
Version: Final
Target Milestone: ---
Hardware: x86
OS: SuSE Linux 10.0
Whiteboard:
Found By: Other
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---

Description Richard Brown 2005-12-11 14:06:32 UTC
Hi all,

My system is locking up, sometimes as often as 4 times in one hour.  I've installed SUSE 10.0 OSS in non-ACPI mode.  I ran memtest86 for 12 hours and found no errors.  The system is:

P4b 2.4GHz
Asus P4P800-E deluxe
1 GB RAM DDR 266, in dual channel configuration.
NVIDIA GeforceFX5500
Terratec Phase88
1 HD on IDE channel, 1 on promise 378 in IDE mode.  (which SUSE is not picking up properly, incidentally).

This crash also happened under FC4.  However, it hasn't occurred under XP or Knoppix 4.0.  Does SUSE have a problem with dual channel configurations?

A section of /var/log/messages gives:

Dec 10 23:42:28 linux kernel: invalid operand: 0000 [#1]
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core
Dec 10 23:42:28 linux kernel: CPU:    0
Dec 10 23:42:28 linux kernel: EIP:    0060:[<c02f8598>]    Tainted: P     U VLI
Dec 10 23:42:28 linux kernel: EFLAGS: 00010246   (2.6.13-15.7-default) 
Dec 10 23:42:28 linux kernel: EIP is at krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel: eax: 00000404   ebx: 00000200   ecx: ca3b8000   edx: 00000000
Dec 10 23:42:28 linux kernel: esi: cf261f70   edi: cf261f6c   ebp: 7fffffff   esp: cf261f3c
Dec 10 23:42:28 linux kernel: ds: 007b   es: 007b   ss: 0068
Dec 10 23:42:28 linux kernel: Process nautilus (pid: 18992, threadinfo=cf260000 task=f2a13a40)
Dec 10 23:42:28 linux kernel: Stack: c016ab8c cf261f6c dc8e1488 00000609 dc8e1480 cf261f70 cf261f6c 00001000 
Dec 10 23:42:28 linux kernel:        cf261f70 c016ac8a cf261f9c dc8e1480 00000000 00000000 dc8e1480 081584d0 
Dec 10 23:42:28 linux kernel:        00000000 081584d0 c016adba 7fffffff 08158518 fffffff7 dc8e1480 dc8e3480 
Dec 10 23:42:28 linux kernel: Call Trace:
Dec 10 23:42:28 linux kernel:  [<c016ab8c>] do_pollfd+0x5c/0xb0
Dec 10 23:42:28 linux kernel:  [<c016ac8a>] do_poll+0xaa/0xd0
Dec 10 23:42:28 linux kernel:  [<c016adba>] sys_poll+0x10a/0x220
Dec 10 23:42:28 linux kernel:  [<c016a2c0>] __pollwait+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c0100d1b>] calibrate_delay+0x3db/0x6c0
Dec 10 23:42:28 linux kernel: Code: 32 c0 e9 1f ff ff ff 68 c4 90 33 c0 e8 d2 42 e2 ff 58 eb a3 0f 0b 7e 01 3c 0d 32 c0 e9 75 ff ff ff 0f 0b 7d 01 3c 0d 32 c0 e9 62 <ff> ff ff 0f 0b 14 01 3c 0d 32 c0 e9 b5 fe ff ff 90 8d b4 26 00 
Dec 10 23:42:28 linux kernel:  ------------[ cut here ]------------
Dec 10 23:42:28 linux kernel: kernel BUG at mm/rmap.c:493!
Dec 10 23:42:28 linux kernel: invalid operand: 0000 [#2]
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core
Dec 10 23:42:28 linux kernel: CPU:    0
Dec 10 23:42:28 linux kernel: EIP:    0060:[<c014f512>]    Tainted: P     U VLI
Dec 10 23:42:28 linux kernel: EFLAGS: 00010286   (2.6.13-15.7-default) 
Dec 10 23:42:28 linux kernel: EIP is at page_remove_rmap+0x32/0x40
Dec 10 23:42:28 linux kernel: eax: ffffffff   ebx: eb44b058   ecx: c03ee500   edx: c1568900
Dec 10 23:42:28 linux kernel: esi: c1568900   edi: 00000020   ebp: 40016000   esp: cf261d74
Dec 10 23:42:28 linux kernel: ds: 007b   es: 007b   ss: 0068
Dec 10 23:42:28 linux kernel: Process nautilus (pid: 18992, threadinfo=cf260000 task=f2a13a40)
Dec 10 23:42:28 linux kernel: Stack: c0149427 00000000 40017000 c03ee500 40017000 d0ca0404 40017000 40016fff 
Dec 10 23:42:28 linux kernel:        c01495c3 40017000 00000000 c03ee500 40017000 00002000 f35d7074 40017000 
Dec 10 23:42:28 linux kernel:        c0149716 40017000 00000000 cf260000 f6e6fd00 cf261e04 00006000 00000000 
Dec 10 23:42:28 linux kernel: Call Trace:
Dec 10 23:42:28 linux kernel:  [<c0149427>] zap_pte_range+0xe7/0x200
Dec 10 23:42:28 linux kernel:  [<c01495c3>] unmap_page_range+0x83/0xd0
Dec 10 23:42:28 linux kernel:  [<c0149716>] unmap_vmas+0x106/0x220
Dec 10 23:42:28 linux kernel:  [<c014dabb>] exit_mmap+0x5b/0x120
Dec 10 23:42:28 linux kernel:  [<c011a63e>] mmput+0x1e/0x70
Dec 10 23:42:28 linux kernel:  [<c011e78d>] do_exit+0xdd/0x350
Dec 10 23:42:28 linux kernel:  [<c010468e>] die+0x13e/0x140
Dec 10 23:42:28 linux kernel:  [<c0104940>] do_invalid_op+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c01049d1>] do_invalid_op+0x91/0xa0
Dec 10 23:42:28 linux kernel:  [<c02f8598>] krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel:  [<c0284c2d>] sock_aio_read+0xdd/0x130
Dec 10 23:42:28 linux kernel:  [<c0140c19>] buffered_rmqueue+0xb9/0x1f0
Dec 10 23:42:28 linux kernel:  [<c0103f0f>] error_code+0x4f/0x60
Dec 10 23:42:28 linux kernel:  [<c02f8598>] krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel:  [<c016ab8c>] do_pollfd+0x5c/0xb0
Dec 10 23:42:28 linux kernel:  [<c016ac8a>] do_poll+0xaa/0xd0
Dec 10 23:42:28 linux kernel:  [<c016adba>] sys_poll+0x10a/0x220
Dec 10 23:42:28 linux kernel:  [<c016a2c0>] __pollwait+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c0100d1b>] calibrate_delay+0x3db/0x6c0
Dec 10 23:42:28 linux kernel: Code: 75 1f 83 42 08 ff 0f 98 c0 84 c0 75 01 c3 8b 42 08 40 78 17 83 ca ff b8 10 00 00 00 e9 d8 1f ff ff 0f 0b ea 01 37 52 31 c0 eb d7 <0f> 0b ed 01 37 52 31 c0 eb df 8d 74 26 00 55 57 56 53 57 89 d3 
Dec 10 23:42:28 linux kernel:  <1>Fixing recursive fault but reboot is needed!
Dec 10 23:42:28 linux kernel: scheduling while atomic: nautilus/0x00000001/18992
Dec 10 23:42:28 linux kernel:  [<c02f9dfc>] schedule+0x55c/0x610
Dec 10 23:42:28 linux kernel:  [<c012b7fc>] __kernel_text_address+0x1c/0x30
Dec 10 23:42:28 linux kernel:  [<c01041f7>] show_trace+0x27/0x70
Dec 10 23:42:28 linux kernel:  [<c014f526>] try_to_unmap_one+0x6/0x180
Dec 10 23:42:28 linux kernel:  [<c014f527>] try_to_unmap_one+0x7/0x180
Dec 10 23:42:28 linux kernel:  [<c011c863>] printk+0x13/0x20
Dec 10 23:42:28 linux kernel:  [<c011e999>] do_exit+0x2e9/0x350
Dec 10 23:42:28 linux kernel:  [<c010468e>] die+0x13e/0x140
Dec 10 23:42:28 linux kernel:  [<c0104940>] do_invalid_op+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c01049d1>] do_invalid_op+0x91/0xa0
Dec 10 23:42:28 linux kernel:  [<c014f512>] page_remove_rmap+0x32/0x40
Dec 10 23:42:28 linux kernel:  [<c02597e2>] elv_queue_empty+0x12/0x20
Dec 10 23:42:28 linux kernel:  [<f8855f9b>] ide_do_request+0xbb/0x3a0 [ide_core]
Dec 10 23:42:28 linux kernel:  [<f885d1d0>] ide_dma_intr+0x0/0xb0 [ide_core]
Dec 10 23:42:28 linux kernel:  [<c013bd53>] handle_IRQ_event+0x33/0x60
Dec 10 23:42:28 linux kernel:  [<c013be23>] __do_IRQ+0xa3/0xd0
Dec 10 23:42:28 linux kernel:  [<c0103f0f>] error_code+0x4f/0x60
Dec 10 23:42:28 linux kernel:  [<c014f512>] page_remove_rmap+0x32/0x40
Dec 10 23:42:28 linux kernel:  [<c0149427>] zap_pte_range+0xe7/0x200
Dec 10 23:42:28 linux kernel:  [<c01495c3>] unmap_page_range+0x83/0xd0
Dec 10 23:42:28 linux kernel:  [<c0149716>] unmap_vmas+0x106/0x220
Dec 10 23:42:28 linux kernel:  [<c014dabb>] exit_mmap+0x5b/0x120
Dec 10 23:42:28 linux kernel:  [<c011a63e>] mmput+0x1e/0x70
Dec 10 23:42:28 linux kernel:  [<c011e78d>] do_exit+0xdd/0x350
Dec 10 23:42:28 linux kernel:  [<c010468e>] die+0x13e/0x140
Dec 10 23:42:28 linux kernel:  [<c0104940>] do_invalid_op+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c01049d1>] do_invalid_op+0x91/0xa0
Dec 10 23:42:28 linux kernel:  [<c02f8598>] krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel:  [<c0284c2d>] sock_aio_read+0xdd/0x130
Dec 10 23:42:28 linux kernel:  [<c0140c19>] buffered_rmqueue+0xb9/0x1f0
Dec 10 23:42:28 linux kernel:  [<c0103f0f>] error_code+0x4f/0x60
Dec 10 23:42:28 linux kernel:  [<c02f8598>] krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel:  [<c016ab8c>] do_pollfd+0x5c/0xb0
Dec 10 23:42:28 linux kernel:  [<c016ac8a>] do_poll+0xaa/0xd0
Dec 10 23:42:28 linux kernel:  [<c016adba>] sys_poll+0x10a/0x220
Dec 10 23:42:28 linux kdm[4794]: X server for display :0 terminated unexpectedly
Dec 10 23:42:28 linux kernel:  [<c016a2c0>] __pollwait+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c0100d1b>] calibrate_delay+0x3db/0x6c0
Dec 10 23:42:28 linux kernel: Unable to handle kernel paging request at virtual address 010483b6
Dec 10 23:42:28 linux kernel:  printing eip:
Dec 10 23:42:28 linux kernel: 010483b6
Dec 10 23:42:28 linux kernel: *pde = 00000000
Dec 10 23:42:28 linux kernel: Oops: 0000 [#3]
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core
Dec 10 23:42:28 linux kernel: CPU:    0
Dec 10 23:42:28 linux kernel: EIP:    0060:[<010483b6>]    Tainted: P     U VLI
Dec 10 23:42:28 linux kernel: EFLAGS: 00010087   (2.6.13-15.7-default) 
Dec 10 23:42:28 linux kernel: EIP is at 0x10483b6
Dec 10 23:42:28 linux kernel: eax: e2e5408c   ebx: e2e5408c   ecx: 00000000   edx: 00000001
Dec 10 23:42:28 linux kernel: esi: 82380000   edi: 105d8b00   ebp: f5283de8   esp: f5283dc8
Dec 10 23:42:28 linux kernel: ds: 007b   es: 007b   ss: 0068
Dec 10 23:42:28 linux kernel: Process smpppd (pid: 4673, threadinfo=f5282000 task=f7c52530)
Dec 10 23:42:28 linux kernel: Stack: c01195c7 00000000 00000001 00000001 e2e56098 00000202 e00ec080 f5283eb4 
Dec 10 23:42:28 linux kernel:        f5283dfc c0119601 00000000 00000000 e2a13500 e00ec080 c02e306a 00000000 
Dec 10 23:42:28 linux kernel:        e2a13500 c0287468 00000000 c0288a67 00000001 00000001 c02e50b9 f36c6c80 
Dec 10 23:42:28 linux kernel: Call Trace:
Dec 10 23:42:28 linux kernel:  [<c01195c7>] __wake_up_common+0x37/0x60
Dec 10 23:42:28 linux kernel:  [<c0119601>] __wake_up+0x11/0x20
Dec 10 23:42:28 linux kernel:  [<c02e306a>] unix_write_space+0x2a/0x60
Dec 10 23:42:28 linux kernel:  [<c0287468>] sock_wfree+0x38/0x40
Dec 10 23:42:28 linux kernel:  [<c0288a67>] __kfree_skb+0x57/0x180
Dec 10 23:42:28 linux kernel:  [<c02e50b9>] unix_stream_recvmsg+0x1c9/0x420
Dec 10 23:42:28 linux kernel:  [<c0284c2d>] sock_aio_read+0xdd/0x130
Dec 10 23:42:28 linux kernel:  [<c0159136>] do_sync_read+0xb6/0x110
Dec 10 23:42:28 linux kernel:  [<c016a77a>] do_select+0x30a/0x340
Dec 10 23:42:28 linux kernel:  [<c012df70>] autoremove_wake_function+0x0/0x30
Dec 10 23:42:28 linux kernel:  [<c01592d1>] vfs_read+0x141/0x170
Dec 10 23:42:28 linux kernel:  [<c01595bc>] sys_read+0x3c/0x70
Dec 10 23:42:28 linux kernel:  [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 10 23:42:28 linux kernel: Code:  Bad EIP value.
Dec 10 23:42:28 linux kernel:  <1>Unable to handle kernel paging request at virtual address 010483b6
Dec 10 23:42:28 linux kernel:  printing eip:
Dec 10 23:42:28 linux kernel: 010483b6
Dec 10 23:42:28 linux kernel: *pde = 00000000
Dec 10 23:42:28 linux kernel: Oops: 0000 [#4]
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core
Dec 10 23:42:28 linux kernel: CPU:    0
Dec 10 23:42:28 linux kernel: EIP:    0060:[<010483b6>]    Tainted: P     U VLI
Dec 10 23:42:28 linux kernel: EFLAGS: 00010087   (2.6.13-15.7-default) 
Dec 10 23:42:28 linux kernel: EIP is at 0x10483b6
Dec 10 23:42:28 linux kernel: eax: e2e5408c   ebx: e2e5408c   ecx: 00000000   edx: 00000001
Dec 10 23:42:28 linux kernel: esi: 82380000   edi: 105d8b00   ebp: f5283bf0   esp: f5283bd0
Dec 10 23:42:28 linux kernel: ds: 007b   es: 007b   ss: 0068
Dec 10 23:42:28 linux kernel: Process smpppd (pid: 4673, threadinfo=f5282000 task=f7c52530)
Dec 10 23:42:28 linux kernel: Stack: c01195c7 00000000 00000000 00000001 e2e56098 00000246 f36c6c80 f36c6ccc 
Dec 10 23:42:28 linux kernel:        f5283c04 c0119601 00000000 00000000 e2a13500 dfff4f80 c0287bc9 00000000 
Dec 10 23:42:28 linux kernel:        c02e328d 00000000 01000286 f6007164 cde93b00 00000000 dfff4600 cde93b24 
Dec 10 23:42:28 linux kernel: Call Trace:
Dec 10 23:42:28 linux kernel:  [<c01195c7>] __wake_up_common+0x37/0x60
Dec 10 23:42:28 linux kernel:  [<c0119601>] __wake_up+0x11/0x20
Dec 10 23:42:28 linux kernel:  [<c0287bc9>] sock_def_wakeup+0x19/0x20
Dec 10 23:42:28 linux kernel:  [<c02e328d>] unix_release_sock+0xcd/0x1d0
Dec 10 23:42:28 linux kernel:  [<c0284841>] sock_release+0x11/0x70
Dec 10 23:42:28 linux kernel:  [<c0285289>] sock_close+0x19/0x30
Dec 10 23:42:28 linux kernel:  [<c015a180>] __fput+0x80/0x160
Dec 10 23:42:28 linux kernel:  [<c0158b0b>] filp_close+0x3b/0x60
Dec 10 23:42:28 linux kernel:  [<c011db88>] put_files_struct+0x58/0xd0
Dec 10 23:42:28 linux kernel:  [<c011e7ad>] do_exit+0xfd/0x350
Dec 10 23:42:28 linux kernel:  [<c010468e>] die+0x13e/0x140
Dec 10 23:42:28 linux kernel:  [<c011764c>] do_page_fault+0x2dc/0x5ef
Dec 10 23:42:28 linux kernel:  [<c0125c15>] __group_send_sig_info+0x65/0x80
Dec 10 23:42:28 linux kernel:  [<c0169754>] send_sigio_to_task+0xb4/0xf0
Dec 10 23:42:28 linux kernel:  [<c0140c19>] buffered_rmqueue+0xb9/0x1f0
Dec 10 23:42:28 linux kernel:  [<c0274502>] psmouse_interrupt+0xb2/0x2b0
Dec 10 23:42:28 linux kernel:  [<c0117370>] do_page_fault+0x0/0x5ef
Dec 10 23:42:28 linux kernel:  [<c0103f0f>] error_code+0x4f/0x60
Dec 10 23:42:28 linux kernel:  [<c01195c7>] __wake_up_common+0x37/0x60
Dec 10 23:42:28 linux kernel:  [<c0119601>] __wake_up+0x11/0x20
Dec 10 23:42:28 linux kernel:  [<c02e306a>] unix_write_space+0x2a/0x60
Dec 10 23:42:28 linux kernel:  [<c0287468>] sock_wfree+0x38/0x40
Dec 10 23:42:28 linux kernel:  [<c0288a67>] __kfree_skb+0x57/0x180
Dec 10 23:42:28 linux kernel:  [<c02e50b9>] unix_stream_recvmsg+0x1c9/0x420
Dec 10 23:42:28 linux kernel:  [<c0284c2d>] sock_aio_read+0xdd/0x130
Dec 10 23:42:28 linux kernel:  [<c0159136>] do_sync_read+0xb6/0x110
Dec 10 23:42:28 linux kernel:  [<c016a77a>] do_select+0x30a/0x340
Dec 10 23:42:28 linux kernel:  [<c012df70>] autoremove_wake_function+0x0/0x30
Dec 10 23:42:28 linux kernel:  [<c01592d1>] vfs_read+0x141/0x170
Dec 10 23:42:28 linux kernel:  [<c01595bc>] sys_read+0x3c/0x70
Dec 10 23:42:28 linux kernel:  [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 10 23:42:28 linux kernel: Code:  Bad EIP value.
Dec 10 23:42:28 linux kernel:  <1>Fixing recursive fault but reboot is needed!
Dec 10 23:42:28 linux kernel: Unable to handle kernel paging request at virtual address 00800000
Dec 10 23:42:28 linux kernel:  printing eip:
Dec 10 23:42:28 linux kernel: c0372680
Dec 10 23:42:28 linux kernel: *pde = 00000000
Dec 10 23:42:28 linux kernel: Oops: 0002 [#5]
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core
Dec 10 23:42:28 linux kernel: CPU:    0
Dec 10 23:42:28 linux kernel: EIP:    0060:[<c0372680>]    Tainted: P     U VLI
Dec 10 23:42:28 linux kernel: EFLAGS: 00213282   (2.6.13-15.7-default) 
Dec 10 23:42:28 linux kernel: EIP is at 0xc0372680
Dec 10 23:42:28 linux kernel: eax: d39c9480   ebx: c0372660   ecx: 00000000   edx: cfe63680
Dec 10 23:42:28 linux kernel: esi: 00800000   edi: 00000017   ebp: 00000009   esp: f5287ef8
Dec 10 23:42:28 linux kernel: ds: 007b   es: 007b   ss: 0068
Dec 10 23:42:28 linux kernel: Process X (pid: 18152, threadinfo=f5286000 task=f756d020)
Dec 10 23:42:28 linux kernel: Stack: c0285242 d39c9480 c016a5f3 f5287fa4 f5287f8c 00000000 00000018 00006c8c 
Dec 10 23:42:28 linux kernel:        f75fdbe0 f75fdc00 f75fdc20 f75fdb84 f75fdba4 f75fdbc4 009fff1a 00000000 
Dec 10 23:42:28 linux kernel:        00000000 009fff1a 00002000 00000000 00000000 00000000 00000304 00000182 
Dec 10 23:42:28 linux kernel: Call Trace:
Dec 10 23:42:28 linux kernel:  [<c0285242>] sock_poll+0x12/0x20
Dec 10 23:42:28 linux kernel:  [<c016a5f3>] do_select+0x183/0x340
Dec 10 23:42:28 linux kernel:  [<c016a2c0>] __pollwait+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c016a9b6>] sys_select+0x1e6/0x360
Dec 10 23:42:28 linux kernel:  [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 10 23:42:28 linux kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <80> 26 37 c0 80 26 37 c0 00 00 00 00 e0 76 2e c0 f0 76 2e c0 00 
Dec 10 23:42:28 linux kernel:  <1>general protection fault: 9280 [#6]
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st sr_mod ppp_deflate zlib_deflate bsd_comp pppoatm ppp_generic slhc nls_utf8 nvidia ipt_pkttype ipt_LOG ipt_limit hfsplus subfs snd_pcm_oss snd_mixer_oss snd_seq_midi snd_seq_midi_event snd_seq edd speedtch firmware_class usbatm atm snd_ice1712 snd_ice17xx_ak4xxx snd_ak4xxx_adda snd_cs8427 snd_ac97_codec snd_pcm snd_timer snd_page_alloc snd_ac97_bus snd_i2c snd_mpu401_uart snd_rawmidi snd_seq_device i2c_i801 i2c_core snd soundcore generic ehci_hcd hw_random intel_agp agpgart uhci_hcd pci_hotplug usbcore parport_pc lp parport ip6t_REJECT ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 nls_iso8859_1 nls_cp437 vfat fat dm_mod reiserfs processor ide_cd cdrom sata_promise libata piix sd_mod scsi_mod ide_disk ide_core
Dec 10 23:42:28 linux kernel: CPU:    0
Dec 10 23:42:28 linux kernel: EIP:    0060:[<c043a3c2>]    Tainted: P     U VLI
Dec 10 23:42:28 linux kernel: EFLAGS: 00010282   (2.6.13-15.7-default) 
Dec 10 23:42:28 linux kernel: EIP is at 0xc043a3c2
Dec 10 23:42:28 linux kernel: eax: d39c9280   ebx: c0372f60   ecx: 00008000   edx: cfe63c80
Dec 10 23:42:28 linux kernel: esi: 00010000   edi: 00000010   ebp: 00000010   esp: e2acfef8
Dec 10 23:42:28 linux kernel: ds: 007b   es: 007b   ss: 0068
Dec 10 23:42:28 linux kernel: Process dcopserver (pid: 18270, threadinfo=e2ace000 task=e7009a40)
Dec 10 23:42:28 linux kernel: Stack: c0285242 d39c9280 c016a5f3 e2acffa4 e2acff8c 00000000 00000011 7fffffff 
Dec 10 23:42:28 linux kernel:        f461d4cc f461d4d0 f461d4d4 f461d4c4 f461d4c8 f461d4cc 00013ed8 00000000 
Dec 10 23:42:28 linux kernel:        00000000 00013ed8 00000400 00000000 00000000 00000000 00000304 00000182 
Dec 10 23:42:28 linux kernel: Call Trace:
Dec 10 23:42:28 linux kernel:  [<c0285242>] sock_poll+0x12/0x20
Dec 10 23:42:28 linux kernel:  [<c016a5f3>] do_select+0x183/0x340
Dec 10 23:42:28 linux kernel:  [<c016a2c0>] __pollwait+0x0/0xa0
Dec 10 23:42:28 linux kernel:  [<c016a9b6>] sys_select+0x1e6/0x360
Dec 10 23:42:28 linux kernel:  [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 10 23:42:28 linux kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 9b ca df 00 12 f6 df 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 d5 <ca> df 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 

I also had one crash for which I could find no evidence in the logs.


Hoping you can help,


richardben
Comment 1 Olaf Kirch 2005-12-12 09:53:05 UTC
Many of these oopses have krb5_decrypt in them. Are you using NFS with
RPCSEC_GSS and kerberos? If so, do these crashes disappear if you disable
it?
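
One way to check which of the oopses actually mention a given symbol is a small script along these lines. This is only a rough sketch: it assumes the log format shown in the description, the sample text is an abbreviated excerpt of that log, and the helper names (symbols_per_oops, oopses_mentioning) are made up for illustration.

```python
import re

# Abbreviated excerpt of the log from the description (three of the six oopses,
# one trace line each). A real run would read the full log text instead.
SAMPLE = """\
Dec 10 23:42:28 linux kernel: invalid operand: 0000 [#1]
Dec 10 23:42:28 linux kernel: EIP is at krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel:  [<c016ab8c>] do_pollfd+0x5c/0xb0
Dec 10 23:42:28 linux kernel: invalid operand: 0000 [#2]
Dec 10 23:42:28 linux kernel: EIP is at page_remove_rmap+0x32/0x40
Dec 10 23:42:28 linux kernel:  [<c02f8598>] krb5_decrypt+0x1c8/0x1e0
Dec 10 23:42:28 linux kernel: Oops: 0000 [#3]
Dec 10 23:42:28 linux kernel: EIP is at 0x10483b6
Dec 10 23:42:28 linux kernel:  [<c01195c7>] __wake_up_common+0x37/0x60
"""

# "[#N]" marks the start of oops number N.
OOPS_MARK = re.compile(r"\[#(\d+)\]")
# Matches "symbol+0xoffset/0xsize" as printed in EIP and call trace lines.
SYMBOL = re.compile(r"(\w+)\+0x[0-9a-f]+/0x[0-9a-f]+")


def symbols_per_oops(log_text):
    """Map each oops number to the set of kernel symbols printed for it."""
    result = {}
    current = None
    for line in log_text.splitlines():
        mark = OOPS_MARK.search(line)
        if mark:
            current = int(mark.group(1))
            result[current] = set()
        elif current is not None:
            result[current].update(SYMBOL.findall(line))
    return result


def oopses_mentioning(log_text, symbol):
    """Return the sorted oops numbers whose output mentions the symbol."""
    per_oops = symbols_per_oops(log_text)
    return sorted(n for n, syms in per_oops.items() if symbol in syms)


if __name__ == "__main__":
    print(oopses_mentioning(SAMPLE, "krb5_decrypt"))
```

Run against the complete log rather than the excerpt, the same function would show whether the symbol appears in every oops or only in the first few.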
Comment 2 Richard Brown 2005-12-12 17:40:21 UTC
(In reply to comment #1)
> Many of these oopses have krb5_decrypt in them. Are you using NFS with
> RPCSEC_GSS and kerberos? If so, do these crashes disappear if you disable
> it?
> 

Sorry, but I'm not sure what all these are.  However:
I checked Yast2 NFS configuration, and the NFS Server Do Not Start option is selected.  Selecting Kerberos Client in Yast2 results in it telling me that installation of pam_krb5 and krb5-client is required.  Is this relevant to what you're asking?
The motherboard has a Marvell Gigabit LAN controller chipset, but this is unused.  (The machine is not networked).

Hope this helps.

On perhaps a different tack I am currently using fluxbox instead of KDE, and the crashes are much less frequent.
Comment 3 Olaf Kirch 2005-12-13 10:09:12 UTC
Looking at the oopses more closely I notice you're using the nvidia
module. I strongly suspect this is the main culprit.

Can you please try running with the stock nv driver and see if the problem
persists?
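
For reference, the two hints in an oops header that point at a proprietary driver are the module list and the taint flag ("P" means a proprietary module has been loaded). A minimal sketch of that check follows. It assumes the header format seen in the description, the module list in the sample is shortened, and the function names are hypothetical.

```python
import re

# Two lines copied from the first oops in the description; the module list is
# shortened here for readability.
OOPS_HEADER = """\
Dec 10 23:42:28 linux kernel: Modules linked in: joydev sg st nls_utf8 nvidia ipt_pkttype snd_ice1712 ide_core
Dec 10 23:42:28 linux kernel: EIP:    0060:[<c02f8598>]    Tainted: P     U VLI
"""


def loaded_modules(log_text):
    """Return the module names from the first 'Modules linked in:' line."""
    for line in log_text.splitlines():
        _, found, rest = line.partition("Modules linked in:")
        if found:
            return rest.split()
    return []


def has_proprietary_taint(log_text):
    """True if the first taint flag is 'P' (a proprietary module was loaded)."""
    match = re.search(r"Tainted: (\S)", log_text)
    return bool(match) and match.group(1) == "P"


if __name__ == "__main__":
    print("nvidia loaded:", "nvidia" in loaded_modules(OOPS_HEADER))
    print("proprietary taint:", has_proprietary_taint(OOPS_HEADER))
```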
Comment 4 Richard Brown 2005-12-14 17:00:00 UTC
Right.

I ran the NVIDIA uninstaller program, and it told me that there was no NVIDIA driver installed!  Also I found that some software that needs 3D hardware acceleration tells me that it can't find any.  What is yast doing?!
So, instead I just changed the driver to "nv" in Xorg.conf.  At the moment it's working, and so far so good; no crash yet.
I think if I try the NVIDIA driver again I'll use its own installer.
Incidentally, down amongst the oopses is a mention of ac97.  The terratec soundcard doesn't use this.  That's only for onboard sound, which is disabled, isn't it?

richard@linux:~> dmesg | grep ac97
ALSA sound/pci/ac97/ac97_codec.c:1959: AC'97 0 does not respond - RESET
ALSA sound/pci/ac97/ac97_codec.c:1968: AC'97 0 access is not valid [0xffffffff], removing mixer.
ice1712: cannot initialize ac97 for consumer, skipped

I'll let you know if the nv switch remains crash-free.


Richard.
Comment 5 Richard Brown 2005-12-17 22:12:13 UTC
Hi again,


It's much more stable now running with the nv driver, but I have had one lock-up since the last posting.  I was running firefox with multiple tabs, while connected to the net.  X died and I ended up at a text page with a login prompt.  The display had broken green lines across it.  The machine had to be reset.  I could find no reports of this in the logs.

What's the story with the nvidia driver?  What makes you suspect it?

Comment 6 Olaf Kirch 2005-12-19 10:42:14 UTC
Hi Richard,

The nvidia driver is known to mess with a lot of things - including the memory
subsystem. And since it is closed source, we cannot support configurations
with this driver loaded. The crashes you reported just confirmed that it
is always a good idea to see if kernel crashes go away by removing all
closed source third party drivers.

Please report this issue to Nvidia.

Concerning the crash you reported in comment #5, this sounds like an X server
bug, not a kernel bug. When the X server crashes, it will leave the video
hardware in a funky state. I'm re-assigning this report to the X people.
Comment 7 Stefan Dirsch 2005-12-19 12:27:24 UTC
Please attach the results of "nvidia-bug-report.sh". Thanks.
Comment 8 Richard Brown 2005-12-19 17:20:30 UTC
Sorry, but where is nvidia-bug-report.sh? updatedb followed by locate can't find it.


The same day as my last comment, X died.  I reinstalled the nvidia driver to at least try to get X running again, but that didn't work.  I then uninstalled it with its own installer and removed the glx module from Xorg.conf.
Leaving /usr/X11R6/lib/modules/drivers/nvidia_drv.o intact and running Xorg -configure results in a fatal "could not load module" error.  Removing it and running Xorg -configure results in "Could not init font path element" errors, and "fatal server error could not open default font 'fixed'", or "cursor".

There is much wailing and gnashing of gears here at the moment.

Richard
Comment 9 Stefan Dirsch 2005-12-19 23:44:23 UTC
nvidia-bug-report.sh is part of the nvidia driver installation. I'm afraid I can't help you much with your now messed-up system. :-( Hopefully you'll at least manage to get an nv driver configuration running again.
Comment 10 Richard Brown 2005-12-21 18:23:53 UTC
Finally got X up and running again.  The thing that's bugging (sic) me is that the last crash WAS with the nv driver.  So there won't be an nvidia-bug-report.  Sorry.  I wish I'd known about that script.  
It appears that when X crashed, it corrupted a number of font files and XKB related stuff.  So I ripped them out and reinstalled/updated in yast.  Is it true that when X crashes it dumps stuff into a cache which it hopes will get written to disk?  If the machine is locked up, how does it expect this stuff to get written?
Comment 11 Stefan Dirsch 2006-01-12 14:47:32 UTC
The Xserver does nothing special when writing to its logfile. Anyway, since you can't provide the output of nvidia-bug-report.sh, neither NVIDIA nor I can investigate this issue. :-( I'm sorry! (--> WONTFIX)