|
Bugzilla – Full Text Bug Listing |
| Summary: | Mouse/Keyboard hang/pause in Beta4 on Sun W2100 dual opteron boxes | ||
|---|---|---|---|
| Product: | [openSUSE] SUSE LINUX 10.0 | Reporter: | Eric Whiting <whitet> |
| Component: | Kernel | Assignee: | Andreas Kleen <ak> |
| Status: | RESOLVED FIXED | QA Contact: | E-mail List <qa-bugs> |
| Severity: | Critical | ||
| Priority: | P5 - None | CC: | trenn, vojtech |
| Version: | Beta 4 | ||
| Target Milestone: | --- | ||
| Hardware: | x86-64 | ||
| OS: | All | ||
| Whiteboard: | |||
| Found By: | Other | Services Priority: | |
| Business Priority: | Blocker: | --- | |
| Marketing QA Status: | --- | IT Deployment: | --- |
| Attachments: |
hwinfo
hwinfo -- mistake -- this attachment is for a different bug. Don't disable interrupts while polling thermal |
||
|
Description
Eric Whiting
2005-09-06 17:59:01 UTC
Does
rmmod thermal
help? Also removing some other ACPI modules which use SMBus might help.
rmmod thermal worked -- lots of hangs with thermal loaded. Hangs went away after rmmod thermal. Current cmdline: cat /proc/cmdline root=/dev/sda1 selinux=0 vga=normal Pavel, can you suggest what to do here? Thermal seems to be spending too much time reading data, and not just in this case. I believe some fixes for that appeared on l-k ... There is ec_burst patch (or how is that option called) that should help; unfortunately it does something weird on other systems, so it can not be enabled by default. ec_burst is enabled by default. Does it help to increase thermal polling? echo 10 >/proc/acpi/thermal_zone/*/polling_frequency With SUSE 10B4 it even seems to be worse. After B4 install and booting the default kernel it was very hard to type. It was losing keystrokes about every 5-10s -> this is probably because we increased thermal polling from 5 to 2 seconds, please tell me a value that sovles your issue. Adding it is trivial but we are running out of time... This has also been declared as sysconfig variable now: /etc/sysconfig/powersave/thermal THERMAL_POLLING_FREQUENCY="" Add your values (in seconds) there and restart the powersave daemon. If I see this right you will always loose keystrokes on a thermal read? So maybe there is no other workaround for you than setting the value very high or just don't load the module? I set the interval to 10s and restarted powersave. Then the mouse/keyboard hung every 10s. (I did a 'watch date' in a window and then moved the window around on the screen continuously -- sure enough it froze/hung every 10s.) I don't think there is any value that will work... It is hard to type even with it set at 10s. (like right now I've been having to backspace and fix things several times just typing this) Perhaps we just need to add a black list (PCI or DMI) and disable thermal polling on this machine. Can you attach hwinfo output? Created attachment 49213 [details]
hwinfo
Does ec_burst=0 help? For what I know it should get worse, but you never know, it should be worth a try... I just updated the bios on this hardware to version 2.1. No change -- same problem. ec_burst=0 does not help -- didn't seem to make it much worse either. Box is still not usable until I do a rmmod thermal. Created attachment 49613 [details]
hwinfo -- mistake -- this attachment is for a different bug.
Created attachment 49756 [details]
Don't disable interrupts while polling thermal
DMI was difficult because x86-64 doesn't have DMI infrastructure right now (and
it's too late in the release to add it) and PCI backlist is not feasible.
But this patch should fix it. It simply doesn't
disable interrupts while reading thermal.
I think the reason it used a irqsave spinlock here
was that it used to be called from the timer interrupts. But these days ACPI
pushes it to a thread, so sleeping is ok. To be sure I check for in_interrupt
and error out if it happens.
I did a similar patch some time ago for the battery
reading in ec.c, this just extends it.
Eric, I will build you a test kernel with this.
Can you please test ftp://nozzle.suse.de/pub/people/ak/test2/kernel-smp* and report if it works now? Good news. Patched kernel seems to work fine. In text console when I hold a key down there is no longer the stop/start pause/hang as the letters appear on the screen. Key-repeat works smoothly. rpm -Uhv failed to setup the nvidia module (I had not downloaded the source). I changed xorg.conf to use nv instead of nvidia and tested under X a little bit. The mouse movement and keyboard behavior under X seemed ok. (I'm downloading the source now so I can run with the nvidia module loaded). Ok thanks. INcreasing severity to bring it onto the radar. Fixed now for RC3 Andi -- I just got my CD/DVD and installed 10.0. It appears that the problem fix did not make it to the GM release. I had to 'rmmod thermal' to get the box working properly. I'll leave it up to you to reopen it if you think it should be... |