Bug 154217

Summary: Kernel hangs with BUG error in /drivers/block/ll_rw_blk.c:3060
Product: [openSUSE] SUSE LINUX 10.0 Reporter: Ernst Verhaar <e.verhaar>
Component: KernelAssignee: E-mail List <kernel-maintainers>
Status: RESOLVED INVALID QA Contact: E-mail List <qa-bugs>
Severity: Critical    
Priority: P5 - None    
Version: Final   
Target Milestone: ---   
Hardware: i586   
OS: SuSE Linux 10.0   
Whiteboard:
Found By: Customer Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: Kernel BUG log

Description Ernst Verhaar 2006-02-28 21:11:55 UTC
System freezes, on TTY10 I can see an Kernel BUG error. The routing still works (machine is used as a gateway), I cannot log in anymore via SSH orr TTY's. Looks like a deadlock.

-----
Kernel BUG at drivers/block/ll_rw_blk.c:3060!
invalid operand: 0000 [#1]

process scsi_eh_1 pid:1368
-----

System Information:
3Ware Raid Controller 8006 SATA
2 * 200GB
Intel Xeon 2.0Ghz
Asus Mobo
OpenSuse Linux 10.0
Kernel 2.6.13-15.8-smp

No information is written to the log files. Probably because the last normal log is: Warning: 3w-xxxx ATA port Timeout

Could this problem be related to the 3Ware kernel modules?
Comment 1 Greg Kroah-Hartman 2006-03-02 03:21:48 UTC
What 3Ware kernel modules?  Did they come with the 10.0 kernel, or did
you get them from somewhere else?
Comment 2 Ernst Verhaar 2006-03-02 12:29:53 UTC
(In reply to comment #1)
> What 3Ware kernel modules?  Did they come with the 10.0 kernel, or did
> you get them from somewhere else?

Ok, I was kind of confused. Greg, you are correct. There are no kernel modules running for the 3Ware Escalade controller. I had a similar problem with kernel modules for an Areca Card. Therefore I assumed this also was a module. It is a device driver. 

From boot.msg
Inspecting /boot/System.map-2.6.13-15.8-smp
Loaded 24964 symbols from /boot/System.map-2.6.13-15.8-smp.
Symbols match kernel version 2.6.13.
No module symbols loaded - kernel modules not enabled.
....
....
<4>3ware Storage Controller device driver for Linux v1.26.02.001.



Comment 3 Ernst Verhaar 2006-03-02 13:52:01 UTC
Additional Information:

System just crashed again. I was transferring huge amounts of data to the raid array. This time no information visible. Screens wouldn't activate...

Comment 4 Ernst Verhaar 2006-03-02 21:26:43 UTC
And another crash. This time in kernel BUG at mm/rmap.c:493
I attached the logs below. But it now sounds as an hardware problem. 
Please confirm,

Kind regards,

Ernst
Comment 5 Ernst Verhaar 2006-03-02 21:27:22 UTC
Created attachment 71076 [details]
Kernel BUG log
Comment 6 Chris L Mason 2006-03-03 18:45:14 UTC
This is against SL10.0.  In order for us to debug things, we'll need a report against the 10.1 beta kernels.
Comment 7 Ernst Verhaar 2006-03-06 11:28:12 UTC
It seems that although new hard ware was used, there still was an error. With a KnoppixLive CD i got the same results. I am now trying to test the different hard ware for error.

My apologies.