Bugzilla – Bug 137805
Pundit-R (RS350, P4R8L2) and Linux oops
Last modified: 2005-12-18 17:18:36 UTC
Several times I have had a kernel-panic-like crash. After the crash, the process that was performing the file operation hangs in the 'D' (uninterruptible sleep) state in the ps table, and every other access to the affected mounted drive blocks as well. Because of that, reboot and shutdown do not work; you have to reset the machine. I suppose it is somehow related to reiserfs and the SATA driver.

I can reproduce the error reliably with the following method: copying really big files (more than 2 GB) via scp to a reiserfs partition.

There are two interesting things about it. The same OS (OpenSUSE 10.0) runs on another machine without showing this crash. The XFS filesystem seems to work and does not crash.

Maybe this error is related to (or the same as) bug #129623.

Crash machine hw configuration:
-----------------------------------------
Mainboard: ASUS Pundit-R P4R8L
Chipset: ATI RS300 / IXP200
HDD: Seagate ST380817AS
(Memory was tested with memtest86)
Kernel: OpenSUSE 10.0 with 2.6.13 and also with 2.6.15rc5

From /var/log/messages 1.
----------------------------------
Dec 6 12:55:29 linux kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000000
Dec 6 12:55:29 linux kernel: printing eip:
Dec 6 12:55:29 linux kernel: f9554905
Dec 6 12:55:29 linux kernel: *pde = 00000000
Dec 6 12:55:29 linux kernel: Oops: 0000 [#1]
Dec 6 12:55:29 linux kernel: Modules linked in: ipt_pkttype ipt_LOG ipt_limit speedstep_lib freq_table it87 i2c_sensor i2c_isa i2c_core snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device nls_cp850 nls_utf8 smbfs button battery ac radeon drm af_packet edd rfcomm hidp l2cap hci_usb bluetooth pcmcia firmware_class ide_cd cdrom yenta_socket rsrc_nonstatic pcmcia_core ohci1394 ieee1394 snd_atiixp snd_ac97_codec snd_ac97_bus snd_pcm snd_timer snd soundcore snd_page_alloc e100 ohci_hcd mii generic ehci_hcd usbcore ip6t_REJECT ati_agp agpgart shpchp pci_hotplug ipt_REJECT ipt_state iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 parport_pc lp parport ext3 jbd dm_mod reiserfs fan thermal processor atiixp ide_disk ide_core
Dec 6 12:55:29 linux kernel: CPU: 0
Dec 6 12:55:29 linux kernel: EIP: 0060:[<f9554905>] Tainted: G U VLI
Dec 6 12:55:29 linux kernel: EFLAGS: 00210202 (2.6.13-15-default)
Dec 6 12:55:29 linux kernel: EIP is at reiserfs_allocate_blocks_for_region+0xad5/0x14b0 [reiserfs]
Dec 6 12:55:29 linux kernel: eax: dfff0a00 ebx: d6981ea8 ecx: 00000000 edx: 00000000
Dec 6 12:55:29 linux kernel: esi: 00001000 edi: 00000001 ebp: d6981e54 esp: d6981d38
Dec 6 12:55:29 linux kernel: ds: 007b es: 007b ss: 0068
Dec 6 12:55:29 linux kernel: Process kasablanca (pid: 5982, threadinfo=d6980000 task=d1148a40)
Dec 6 12:55:29 linux kernel: Stack: c9c04694 f3f4d5c8 c113ca80 00000000 f3f4d528 d6981f48 033af001 00000000
Dec 6 12:55:29 linux kernel: c9e4f5c0 f3f03e00 00000001 00000000 00000000 00001000 00000000 00000000
Dec 6 12:55:29 linux kernel: 00000000 f3f4d4dc 00000001 d6981ea8 00000000 000004d0 c9840de4 00000000
Dec 6 12:55:29 linux kernel: Call Trace:
Dec 6 12:55:29 linux kernel: [<f9556907>] reiserfs_file_write+0x687/0x6b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<c0284a1d>] sock_aio_read+0xdd/0x130
Dec 6 12:55:29 linux kernel: [<c0159006>] do_sync_read+0xb6/0x110
Dec 6 12:55:29 linux kernel: [<c0130a6a>] wake_futex+0x3a/0x50
Dec 6 12:55:29 linux kernel: [<f9556280>] reiserfs_file_write+0x0/0x6b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<c015936d>] vfs_write+0x8d/0x170
Dec 6 12:55:29 linux kernel: [<c01594fc>] sys_write+0x3c/0x70
Dec 6 12:55:29 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 6 12:55:29 linux kernel: Code: 0f 8e 3a 06 00 00 8b 4d 18 c7 85 0c ff ff ff 00 00 00 00 89 8d 30 ff ff ff c7 85 10 ff ff ff 00 00 00 00 8b 9d 30 ff ff ff 8b 13 <8b> 02 f6 c4 08 0f 84 9a 09 00 00 8b 42 0c 89 85 e4 fe ff ff 8b
Dec 6 12:55:29 linux kernel: Badness in do_exit at kernel/exit.c:790
Dec 6 12:55:29 linux kernel: [<c011e9e8>] do_exit+0x338/0x340
Dec 6 12:55:29 linux kernel: [<c010468e>] die+0x13e/0x140
Dec 6 12:55:29 linux kernel: [<f9554905>] reiserfs_allocate_blocks_for_region+0xad5/0x14b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<c011764c>] do_page_fault+0x2dc/0x5ef
Dec 6 12:55:29 linux kernel: [<f9554905>] reiserfs_allocate_blocks_for_region+0xad5/0x14b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<c0117370>] do_page_fault+0x0/0x5ef
Dec 6 12:55:29 linux kernel: [<c0103f0f>] error_code+0x4f/0x60
Dec 6 12:55:29 linux kernel: [<f9554905>] reiserfs_allocate_blocks_for_region+0xad5/0x14b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<f9556907>] reiserfs_file_write+0x687/0x6b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<c0284a1d>] sock_aio_read+0xdd/0x130
Dec 6 12:55:29 linux kernel: [<c0159006>] do_sync_read+0xb6/0x110
Dec 6 12:55:29 linux kernel: [<c0130a6a>] wake_futex+0x3a/0x50
Dec 6 12:55:29 linux kernel: [<f9556280>] reiserfs_file_write+0x0/0x6b0 [reiserfs]
Dec 6 12:55:29 linux kernel: [<c015936d>] vfs_write+0x8d/0x170
Dec 6 12:55:29 linux kernel: [<c01594fc>] sys_write+0x3c/0x70
Dec 6 12:55:29 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79

From /var/log/messages 2.
----------------------------------------
Dec 8 22:48:19 linux kernel: Unable to handle kernel paging request at virtual address 00340190
Dec 8 22:48:19 linux kernel: printing eip:
Dec 8 22:48:19 linux kernel: f95672d9
Dec 8 22:48:19 linux kernel: *pde = 00000000
Dec 8 22:48:19 linux kernel: Oops: 0000 [#1]
Dec 8 22:48:19 linux kernel: Modules linked in: ipt_limit ipt_pkttype ipt_LOG vmnet vmmon nls_cp850 nls_utf8 smbfs it87 i2c_sensor i2c_isa i2c_core radeon drm snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device button battery ac af_packet edd ipt_state ip6t_REJECT ipt_REJECT rfcomm hidp l2cap iptable_mangle iptable_nat iptable_filter pcmcia firmware_class hci_usb bluetooth ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 ehci_hcd ohci1394 ieee1394 yenta_socket rsrc_nonstatic pcmcia_core shpchp pci_hotplug snd_atiixp snd_ac97_codec snd_ac97_bus snd_pcm snd_timer snd soundcore snd_page_alloc e100 mii generic ohci_hcd usbcore ati_agp agpgart ide_cd cdrom parport_pc lp parport ext3 jbd dm_mod reiserfs fan thermal processor atiixp ide_disk ide_core
Dec 8 22:48:19 linux kernel: CPU: 0
Dec 8 22:48:19 linux kernel: EIP: 0060:[<f95672d9>] Tainted: PF U VLI
Dec 8 22:48:19 linux kernel: EFLAGS: 00010202 (2.6.13-15-default)
Dec 8 22:48:19 linux kernel: EIP is at reiserfs_paste_into_item+0x29/0x1f0 [reiserfs]
Dec 8 22:48:19 linux kernel: eax: d3e21f48 ebx: ddf074e0 ecx: 00340030 edx: d3e21d98
Dec 8 22:48:19 linux kernel: esi: d3e21e14 edi: c3ad0001 ebp: d3e21f48 esp: d3e21b4c
Dec 8 22:48:19 linux kernel: ds: 007b es: 007b ss: 0068
Dec 8 22:48:19 linux kernel: Process kio_file (pid: 16782, threadinfo=d3e20000 task=ee804020)
Dec 8 22:48:19 linux kernel: Stack: d3e21d98 0000d286 00000004 00000000 00000070 00000000 f5e71c00 d3e21f48
Dec 8 22:48:19 linux kernel: d3e21d98 00000000 00000000 00000000 00000000 00000000 00000000 00000000
Dec 8 22:48:19 linux kernel: c3ac47cc c015b541 00188002 00000000 00188002 00000002 c3ac47cc 00000002
Dec 8 22:48:19 linux kernel: Call Trace:
Dec 8 22:48:19 linux kernel: [<c015b541>] __find_get_block+0xb1/0xd0
Dec 8 22:48:19 linux kernel: [<f9563fe1>] search_by_key+0x1d1/0xed0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f954724f>] scan_bitmap_block+0x24f/0x2c0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9547503>] scan_bitmap+0x153/0x240 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9564d13>] search_for_position_by_key+0x33/0x350 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9564df3>] search_for_position_by_key+0x113/0x350 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9555038>] reiserfs_allocate_blocks_for_region+0x1208/0x14b0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9563a2d>] pathrelse+0x2d/0x40 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9556907>] reiserfs_file_write+0x687/0x6b0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c0284a1d>] sock_aio_read+0xdd/0x130
Dec 8 22:48:19 linux kernel: [<c0159006>] do_sync_read+0xb6/0x110
Dec 8 22:48:19 linux kernel: [<f9556280>] reiserfs_file_write+0x0/0x6b0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c015936d>] vfs_write+0x8d/0x170
Dec 8 22:48:19 linux kernel: [<c01594fc>] sys_write+0x3c/0x70
Dec 8 22:48:19 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 8 22:48:19 linux kernel: Code: 00 00 55 57 56 53 81 ec bc 01 00 00 89 ce 89 c5 89 14 24 8b bc 24 d0 01 00 00 8b 48 10 85 c9 0f 84 c0 01 00 00 8b 8f 94 00 00 00 <8b> 81 60 01 00 00 8b 40 3c 89 44 24 04 8b 84 24 d8 01 00 00 89
Dec 8 22:48:19 linux kernel: Badness in do_exit at kernel/exit.c:790
Dec 8 22:48:19 linux kernel: [<c011e9e8>] do_exit+0x338/0x340
Dec 8 22:48:19 linux kernel: [<c010468e>] die+0x13e/0x140
Dec 8 22:48:19 linux kernel: [<f95672d9>] reiserfs_paste_into_item+0x29/0x1f0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c011764c>] do_page_fault+0x2dc/0x5ef
Dec 8 22:48:19 linux kernel: [<f95672d9>] reiserfs_paste_into_item+0x29/0x1f0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c02ba852>] tcp_rcv_established+0x652/0x820
Dec 8 22:48:19 linux kernel: [<f98b7018>] ipt_hook+0x18/0x20 [iptable_filter]
Dec 8 22:48:19 linux kernel: [<f955a238>] unfix_nodes+0x58/0x100 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f956d287>] reiserfs_prepare_for_journal+0x47/0x60 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c0117370>] do_page_fault+0x0/0x5ef
Dec 8 22:48:19 linux kernel: [<c0103f0f>] error_code+0x4f/0x60
Dec 8 22:48:19 linux kernel: [<f95672d9>] reiserfs_paste_into_item+0x29/0x1f0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c015b541>] __find_get_block+0xb1/0xd0
Dec 8 22:48:19 linux kernel: [<f9563fe1>] search_by_key+0x1d1/0xed0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f954724f>] scan_bitmap_block+0x24f/0x2c0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9547503>] scan_bitmap+0x153/0x240 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9564d13>] search_for_position_by_key+0x33/0x350 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9564df3>] search_for_position_by_key+0x113/0x350 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9555038>] reiserfs_allocate_blocks_for_region+0x1208/0x14b0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9563a2d>] pathrelse+0x2d/0x40 [reiserfs]
Dec 8 22:48:19 linux kernel: [<f9556907>] reiserfs_file_write+0x687/0x6b0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c0284a1d>] sock_aio_read+0xdd/0x130
Dec 8 22:48:19 linux kernel: [<c0159006>] do_sync_read+0xb6/0x110
Dec 8 22:48:19 linux kernel: [<f9556280>] reiserfs_file_write+0x0/0x6b0 [reiserfs]
Dec 8 22:48:19 linux kernel: [<c015936d>] vfs_write+0x8d/0x170
Dec 8 22:48:19 linux kernel: [<c01594fc>] sys_write+0x3c/0x70
Dec 8 22:48:19 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79
XFS behaves a bit better, but the NULL pointer error is still there.

From /var/log/messages:
---------------------------------------------
Dec 9 21:10:02 linux kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000028
Dec 9 21:10:02 linux kernel: printing eip:
Dec 9 21:10:02 linux kernel: c0159fb2
Dec 9 21:10:02 linux kernel: *pde = 00000000
Dec 9 21:10:02 linux kernel: Oops: 0002 [#1]
Dec 9 21:10:02 linux kernel: Modules linked in: ipt_TCPMSS ipt_MASQUERADE vmnet vmmon snd_seq_dummy nls_cp850 nls_utf8 smbfs ipt_pkttype ipt_LOG ipt_limit speedstep_lib freq_table snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device button battery ac af_packet edd rfcomm ip6t_REJECT hidp l2cap hci_usb bluetooth ide_cd pcmcia firmware_class cdrom ipt_REJECT ipt_state iptable_mangle iptable_nat ohci1394 ieee1394 iptable_filter yenta_socket rsrc_nonstatic pcmcia_core e100 mii snd_atiixp ip6table_mangle snd_ac97_codec snd_ac97_bus snd_pcm snd_timer snd soundcore snd_page_alloc ip_conntrack ip_tables ehci_hcd generic ohci_hcd usbcore shpchp pci_hotplug ati_agp agpgart ip6table_filter ip6_tables ipv6 parport_pc lp parport ext3 jbd dm_mod xfs exportfs fan thermal processor atiixp ide_disk ide_core
Dec 9 21:10:02 linux kernel: CPU: 0
Dec 9 21:10:02 linux kernel: EIP: 0060:[<c0159fb2>] Tainted: PF U VLI
Dec 9 21:10:02 linux kernel: EFLAGS: 00214006 (2.6.13-15-default)
Dec 9 21:10:02 linux kernel: EIP is at fput+0x2/0x20
Dec 9 21:10:02 linux kernel: eax: 00000014 ebx: dffff080 ecx: dffff081 edx: 00000014
Dec 9 21:10:02 linux kernel: esi: c0f04024 edi: c0f04008 ebp: 5a8dfa5f esp: f3fe1f74
Dec 9 21:10:02 linux kernel: ds: 007b es: 007b ss: 0068
Dec 9 21:10:02 linux kernel: Process hidd (pid: 4141, threadinfo=f3fe0000 task=f3d7d530)
Dec 9 21:10:02 linux kernel: Stack: c016a155 00000000 00000000 00000002 bf9fec18 c016ace6 bf9fec18 00000000
Dec 9 21:10:02 linux kernel: d906e2a0 d906e2a0 c016a180 c0f04000 00000000 bf9fec08 00000064 b7ec6ff4
Dec 9 21:10:02 linux kernel: f3fe0000 c0102d1b bf9fec08 00000002 0000001a 00000064 b7ec6ff4 bf9feb48
Dec 9 21:10:02 linux kernel: Call Trace:
Dec 9 21:10:02 linux kernel: [<c016a155>] poll_freewait+0x35/0x60
Dec 9 21:10:02 linux kernel: [<c016ace6>] sys_poll+0x176/0x220
Dec 9 21:10:02 linux kernel: [<c016a180>] __pollwait+0x0/0xa0
Dec 9 21:10:02 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 9 21:10:02 linux kernel: Code: 00 85 c0 0f 84 76 ff ff ff 89 da a1 fc fa 41 c0 e8 14 a7 fe ff 31 db 89 d8 5b 5f c3 8d b6 00 00 00 00 8d bc 27 00 00 00 00 89 c2 <ff> 48 14 0f 94 c0 84 c0 75 04 c3 8d 76 00 89 d0 e9 09 00 00 00
Dec 9 21:10:13 linux kernel: XFS: possible memory allocation deadlock in kmem_alloc (mode:0x250)
Dec 9 21:19:38 linux kernel: XFS: possible memory allocation deadlock in kmem_alloc (mode:0x250)
The same effect occurs with a Samsung ATA drive, but in this case I managed to freeze the system totally. Sad. As far as I can see, the problem is mainly related to the ATI chipset or the motherboard. It may not be a hardware failure, because I have tried the devices in two Pundit-R machines so far and got the same error in both.

From /var/log/messages:
--------------------------------------------------
Dec 9 23:14:18 linux kernel: Unable to handle kernel paging request at virtual address 74408581
Dec 9 23:14:18 linux kernel: printing eip:
Dec 9 23:14:18 linux kernel: c02bbb43
Dec 9 23:14:18 linux kernel: *pde = 00000000
Dec 9 23:14:18 linux kernel: Oops: 0002 [#1]
Dec 9 23:14:18 linux kernel: Modules linked in: nls_cp850 nls_utf8 smbfs rfcomm hidp l2cap ipt_pkttype ipt_LOG ipt_limit speedstep_lib freq_table snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device button battery ac af_packet hci_usb bluetooth pcmcia firmware_class edd ide_cd cdrom ohci1394 ieee1394 yenta_socket rsrc_nonstatic pcmcia_core e100 mii snd_atiixp snd_ac97_codec ip6t_REJECT generic snd_ac97_bus ipt_REJECT snd_pcm snd_timer snd soundcore snd_page_alloc ipt_state ehci_hcd ohci_hcd usbcore ati_agp agpgart shpchp pci_hotplug iptable_mangle iptable_nat iptable_filter ip6table_mangle ip_conntrack ip_tables ip6table_filter ip6_tables ipv6 parport_pc lp parport dm_mod reiserfs fan thermal processor atiixp ide_disk ide_core
Dec 9 23:14:18 linux kernel: CPU: 0
Dec 9 23:14:18 linux kernel: EIP: 0060:[<c02bbb43>] Tainted: G U VLI
Dec 9 23:14:18 linux kernel: EFLAGS: 00210202 (2.6.13-15-default)
Dec 9 23:14:18 linux kernel: EIP is at tcp_transmit_skb+0x43/0x760
Dec 9 23:14:18 linux kernel: eax: 00000000 ebx: ffffff97 ecx: 744085f4 edx: 00000020
Dec 9 23:14:18 linux kernel: esi: dca14c04 edi: dca14c04 ebp: f340b380 esp: dc94dde4
Dec 9 23:14:18 linux kernel: ds: 007b es: 007b ss: 0068
Dec 9 23:14:18 linux kernel: Process ssh (pid: 6079, threadinfo=dc94c000 task=f3f44a40)
Dec 9 23:14:18 linux kernel: Stack: f340b3b8 00000020 e49c9864 c028b187 00200282 f340b380 00000180 c0288591
Dec 9 23:14:19 linux kernel: dca14c4c dca14c00 dca14c4c 00000000 dca14c00 00000000 c02b274b dc94deb4
Dec 9 23:14:19 linux kernel: 00000000 00000000 00a450f0 000012e8 dca14ee8 000005a8 00000001 00000000
Dec 9 23:14:19 linux kernel: Call Trace:
Dec 9 23:14:19 linux kernel: [<c028b187>] skb_copy_datagram_iovec+0x1a7/0x220
Dec 9 23:14:19 linux kernel: [<c0288591>] alloc_skb+0x31/0xd0
Dec 9 23:14:19 linux kernel: [<c02b274b>] tcp_recvmsg+0x2bb/0x7e0
Dec 9 23:14:19 linux kernel: [<c0287e31>] sock_common_recvmsg+0x31/0x50
Dec 9 23:14:19 linux kernel: [<c0284a1d>] sock_aio_read+0xdd/0x130
Dec 9 23:14:19 linux kernel: [<c0159006>] do_sync_read+0xb6/0x110
Dec 9 23:14:19 linux kernel: [<c016a63a>] do_select+0x30a/0x340
Dec 9 23:14:19 linux kernel: [<c012deb0>] autoremove_wake_function+0x0/0x30
Dec 9 23:14:19 linux kernel: [<c01591a1>] vfs_read+0x141/0x170
Dec 9 23:14:19 linux kernel: [<c015948c>] sys_read+0x3c/0x70
Dec 9 23:14:19 linux kernel: [<c0102d1b>] sysenter_past_esp+0x54/0x79
Dec 9 23:14:19 linux kernel: Code: 8d 42 38 8b 96 b8 01 00 00 89 04 24 89 54 24 04 8b 85 b8 00 00 00 66 83 78 0a 00 0f 84 f5 06 00 00 8b 86 78 03 00 00 8b 40 1c 85 <c0> 74 08 8d 45 10 e8 d2 bc e4 ff 8b 0c 24 f6 41 1c 02 0f 85 a5
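The oopses quoted in this report fault in different places: reiserfs_allocate_blocks_for_region, reiserfs_paste_into_item, fput and tcp_transmit_skb. To make that comparison easier across a long /var/log/messages, here is a minimal Python sketch that pulls the faulting symbol out of every "EIP is at" line. The function name faulting_symbols and the label "kernel" for symbols without a module suffix are my own choices for illustration, and the pattern only assumes the 2.6-era i386 oops format shown above.

```python
import re

# Matches the symbol part of lines such as
#   "... kernel: EIP is at fput+0x2/0x20"
#   "... kernel: EIP is at reiserfs_paste_into_item+0x29/0x1f0 [reiserfs]"
EIP_RE = re.compile(
    r"EIP is at (?P<symbol>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<module>\w+)\])?"
)


def faulting_symbols(lines):
    """Return (symbol, module) pairs, in order, for every 'EIP is at' entry.

    'module' is the name in square brackets, or the placeholder string
    "kernel" (an assumption of this sketch) when the oops names no module.
    """
    found = []
    for line in lines:
        # finditer also copes with logs whose line breaks were lost.
        for match in EIP_RE.finditer(line):
            found.append((match.group("symbol"), match.group("module") or "kernel"))
    return found
```

Passing an open log file to faulting_symbols() yields one pair per oops. Faults scattered over unrelated subsystems, as here, are often taken as a hint to look at memory or other hardware rather than at a single driver, although that is only a heuristic.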
Actually, the machine is the European version of the Pundit-R, so the mainboard is P4R8L2 and the chipset is RS350.
Chris, here comes a reiser problem in 10.0.

Please check whether the file system has been corrupted by booting into single user mode and running fsck.reiserfs --rebuilddb /dev/...
It seems that the two unsupported Hynix memory modules caused the errors (ATI chipset). With a Kingston module the machine passed the big-file scp test and seems to work. I am now testing further and trying to make it hang. Final words: use high-end RAM in a Pundit-R!
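For anyone repeating the big-file test after swapping memory, a local write-and-verify loop is one way to exercise the disk and RAM without the network in the path. This is only a sketch and not the scp procedure used in this report: the function name write_and_verify, the 1 MiB chunk size and the seed value are all hypothetical choices.

```python
import hashlib
import os

CHUNK = 1024 * 1024  # write in 1 MiB pieces (arbitrary choice)


def write_and_verify(path, total_bytes, seed=b"pundit-r"):
    """Write total_bytes of deterministic data to path, read it back,
    and return True if the SHA-1 of what was read matches what was written."""
    # Build one deterministic 1 MiB block by repeating a SHA-256 digest.
    block = (hashlib.sha256(seed).digest() * (CHUNK // 32 + 1))[:CHUNK]

    written = hashlib.sha1()
    remaining = total_bytes
    with open(path, "wb") as fh:
        while remaining > 0:
            piece = block[:min(CHUNK, remaining)]
            fh.write(piece)
            written.update(piece)
            remaining -= len(piece)
        fh.flush()
        os.fsync(fh.fileno())  # push the data to the device before reading

    read_back = hashlib.sha1()
    with open(path, "rb") as fh:
        for piece in iter(lambda: fh.read(CHUNK), b""):
            read_back.update(piece)

    return written.hexdigest() == read_back.hexdigest()
```

Calling it with a path on the suspect partition and a size above 2 GB, for example 3 * 1024**3, approximates the file sizes that triggered the crash. Note that the read-back may be served from the page cache rather than from the disk, so a mismatch points at memory at least as much as at the drive.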
Final words: the ASUS Pundit-R (RS350, P4R8L2) barebone does not work with two memory modules under Linux, but it does work with one. Sad but true. Maybe it is a chipset-related problem in the kernel driver.