Bug 299010

Summary: openSUSE 10.3 beta1 installation fails because ahci fails to detect drives on ATI SB600/700
Product: [openSUSE] openSUSE 10.3 Reporter: Ralf Vogt <ralf-peter>
Component: InstallationAssignee: Tejun Heo <teheo>
Status: RESOLVED FIXED QA Contact: Jiri Srain <jsrain>
Severity: Blocker    
Priority: P5 - None CC: aj, andreas.in.hk, forgotten_7pwnVrOCb1, forgotten_FOUTW3E5Ow, hare, henry.su, info, j.reitsma, jerbes, joachim.deguara, njglin
Version: Beta 1   
Target Milestone: ---   
Hardware: x86-64   
OS: openSUSE 10.3   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Bug Depends on:    
Bug Blocks: 302016    
Attachments: hwinfo and /var/log/boot.msg
Here my kernelmessage,devices,modules,i/o-ports
output of lspci -nn as requested
output of lspci -nn requested by Tejun Heo
disable-msi-on-rs690.patch
lspci output as requested (in bug 299690) by Tejun Heo

Description Ralf Vogt 2007-08-09 20:13:54 UTC
I tried to Install openSUSE 10.3 Beta1 with the DVD , and also with the KDE single CD .
After booting from CD / DVD  I choose "Installation" . After some sceconds comes a bluesceen " >>> Linuxrc v3.0.3(Kernel 2.6.22.1-16-default) <<< " there is a red window in , with the message " Could not find the openSUSE Installation Source.
Activating manual setup program."
When I chouse "OK" starts a new window :
"Chouse a keyboard map.YaST will offer additional Keyboard tables later."
I chouse "Deutsch" . A new window open : "Start Installation or System" I chouse "OK" Again I chous " Start Installation or Update" I chouse "CD-ROM"
But at least it comes always : No repository found" .

My System : Procesor : E6600 , Soundblaster Live , IDE HDD , SATA DVD Drive , Mainboard ASUS P5vdc-MX/C , 1 GB RAM .
Comment 1 Matej Horvath 2007-08-10 13:08:34 UTC
Please attach your yast logs (http://en.opensuse.org/Bugs/YaST).
Comment 2 Keith Goggin 2007-08-11 10:30:50 UTC
Created attachment 156912 [details]
hwinfo and /var/log/boot.msg
Comment 3 Keith Goggin 2007-08-11 10:41:34 UTC
Following my comments today about this bug on factory Klaus Kaempf <kkaempf@suse.de> requested I post the above attachment from my Alpha7 install 
Comment 4 Stephan Kulow 2007-08-11 11:07:56 UTC
I have the feeling that this is for Steffen
Comment 5 Ralf Vogt 2007-08-11 16:35:53 UTC
Hi, 
the "/var/log/YaST2" is empty .
Pity , I can not copy the "/var/log/boot.msg" ...
I have a flopy Disk an a USB Stick , but I can not reach them to save the boot.msg 
 . Perhaps you can give me some tips , how I can save the file ?  


PS: In the bluesceen " >>> Linuxrc v3.0.3(Kernel 2.6.22.1-16-default) <<< I choosed : Verify Installation CD-ROM/DVD => Insert Installations CD-ROM or DVD  =>  NO CD-ROM or DVD found .
My DVD Drive is the Samsung SH-S183L . At the mainboard are : VIA VT8251 Southbridge ; VIA P4M800 PRO Chipsatz .
With the openSUSE 10.3 Alpha7 , I do not have problems with the installation .
Comment 6 Andreas Jaeger 2007-08-12 07:59:48 UTC
Ok, this happens before YaST starts - Steffen has to give us more details on how to debug this.
Comment 7 Steffen Winterfeldt 2007-08-13 09:32:06 UTC
Looks much like an ahci driver problem.

Ralf, when you see the first error message, go to console 9
and have a look whether a cdrom device exists (/dev/sr0 & /sys/block/sr0).

If yes, try to mount it (e.g. 'mount -r /dev/sr0 /mnt'). Does it work?

Also, the linuxrc error log would be interesting. For this, boot with
linuxrclog=/mylog and attach the log file.

BTW, on console 4 are the kernel messages. Maybe you spot something there.
Comment 8 Ralf Vogt 2007-08-13 21:16:31 UTC
Created attachment 157194 [details]
Here my kernelmessage,devices,modules,i/o-ports
Comment 9 Ralf Vogt 2007-08-13 21:19:42 UTC
Thank you for your answer .
But I can not see the CD-ROM Drive .
Perhaps it looks not so niche , but it is the only way for me ( in the moment ) to show you the logfiles : as a screen-shot.
I mde a Attachment, with the screen-shots.
Comment 10 Tejun Heo 2007-08-14 01:19:46 UTC
Thanks, Ralf. It seems DVR-212 doesn't like PMP==15 on softreset which is surprising considering the drive is fairly new and PMP==15 has been in the spec for quite some while now but who in ATA land reads specs?  :-(

Can you please post the result of "hdparm -I /dev/sr0" from alpha6 installation?  Also, can you apply patches and run modified kernels from a6?
Comment 11 Tejun Heo 2007-08-14 02:23:58 UTC
Ah... I thought it was the drive but it's the controller.  I wonder why somebody sets PMP support bit when the controller pukes when the feature is actually used.  Will blacklist affected controllers.
Comment 12 Tejun Heo 2007-08-14 02:25:59 UTC
*** Bug 299439 has been marked as a duplicate of this bug. ***
Comment 13 andreas hoffmann 2007-08-14 05:12:56 UTC
Created attachment 157244 [details]
output of lspci -nn as requested

Here the output of lspci -nn as requested by Tejun Heo.
Comment 14 Tejun Heo 2007-08-14 05:19:08 UTC
*** Bug 299340 has been marked as a duplicate of this bug. ***
Comment 15 Tejun Heo 2007-08-14 05:19:27 UTC
*** Bug 299575 has been marked as a duplicate of this bug. ***
Comment 16 Tejun Heo 2007-08-14 05:21:14 UTC
*** Bug 299690 has been marked as a duplicate of this bug. ***
Comment 17 Tejun Heo 2007-08-14 05:45:54 UTC
Henry, till now only sb600 seems to have problem with SRST w/ PMP==15 but I'm not sure whether sb700 is free from this problem or just hasn't showed up in the bug report yet.  Can you please verify this?
Comment 18 Piotr Miszczak 2007-08-14 06:54:02 UTC
Created attachment 157266 [details]
output of lspci -nn requested by Tejun Heo
Comment 19 Tejun Heo 2007-08-14 06:59:42 UTC
Alright, patch committed for both sb600 and 700.  Beta 2 should be fine.  I'm not sure SB700 is affected by the problem but disabled PMP on it too just in case.  Will update later if proven otherwise.  Closing the bug for now.  Please re-open if the problem persists on b2.  If you don't want to wait for b2, please check out KOTD after two days.

I'll leave this bug open till collecting all the lspci's from duplicate bugs just in case.

Thanks.
Comment 20 Tejun Heo 2007-08-14 07:29:08 UTC
*** Bug 299947 has been marked as a duplicate of this bug. ***
Comment 21 Andreas Jaeger 2007-08-14 07:38:48 UTC
I'm uploading test kernels now to:

ftp://ftp.suse.com/pub/people/aj/10.3-Beta1-Kernel-Test

If you have a working Alpha7 or 10.2 installation, could you test that these new kernels fix the problem and report back? Thanks!
Comment 22 Andreas Jaeger 2007-08-14 08:36:34 UTC
*** Bug 299891 has been marked as a duplicate of this bug. ***
Comment 23 Bryan Perry 2007-08-14 16:39:18 UTC
Will a symlink to kernel-default-2.6.22.2-20070814064043.x86_64.rpm called kernel-default.rpm override the existing kernel-default-2.6.22.1-16.x86_64.rpm in /suse/x86_64 of my PXE install source, or do I need to remove kernel-default-2.6.22.1-16.x86_64.rpm all together?
Comment 24 Jogchum Reitsma 2007-08-14 17:39:09 UTC
Andreas asked me (in a comment on bug 299891) to add the output of lspci here:

00:00.0 Memory controller: nVidia Corporation CK804 Memory Controller (rev a3)
00:01.0 ISA bridge: nVidia Corporation CK804 ISA Bridge (rev a3)
00:01.1 SMBus: nVidia Corporation CK804 SMBus (rev a2)
00:02.0 USB Controller: nVidia Corporation CK804 USB Controller (rev a2)
00:02.1 USB Controller: nVidia Corporation CK804 USB Controller (rev a3)
00:04.0 Multimedia audio controller: nVidia Corporation CK804 AC'97 Audio Controller (rev a2)
00:06.0 IDE interface: nVidia Corporation CK804 IDE (rev a2)
00:07.0 IDE interface: nVidia Corporation CK804 Serial ATA Controller (rev a3)
00:08.0 IDE interface: nVidia Corporation CK804 Serial ATA Controller (rev a3)
00:09.0 PCI bridge: nVidia Corporation CK804 PCI Bridge (rev a2)
00:0a.0 Bridge: nVidia Corporation CK804 Ethernet Controller (rev a3)
00:0c.0 PCI bridge: nVidia Corporation CK804 PCIE Bridge (rev a3)
00:0d.0 PCI bridge: nVidia Corporation CK804 PCIE Bridge (rev a3)
00:0e.0 PCI bridge: nVidia Corporation CK804 PCIE Bridge (rev a3)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
00:19.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
00:19.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
00:19.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
00:19.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
01:06.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5788 Gigabit Ethernet (rev 03)
01:07.0 FireWire (IEEE 1394): VIA Technologies, Inc. IEEE 1394 Host Controller (rev 46)
01:08.0 Multimedia video controller: Internext Compression Inc iTVC16 (CX23416) MPEG-2 Encoder (rev 01)
04:00.0 VGA compatible controller: Matrox Graphics, Inc. Millenium P650 PCIe (rev 01)
Comment 25 Jogchum Reitsma 2007-08-14 17:42:36 UTC
Andreas asked me (in a comment on bug 299891) to add the output of lspci -nn here:


00:00.0 Memory controller [Class 0580]: nVidia Corporation CK804 Memory Controller [10de:005e] (rev a3)
00:01.0 ISA bridge [Class 0601]: nVidia Corporation CK804 ISA Bridge [10de:0051] (rev a3)
00:01.1 SMBus [Class 0c05]: nVidia Corporation CK804 SMBus [10de:0052] (rev a2)
00:02.0 USB Controller [Class 0c03]: nVidia Corporation CK804 USB Controller [10de:005a] (rev a2)
00:02.1 USB Controller [Class 0c03]: nVidia Corporation CK804 USB Controller [10de:005b] (rev a3)
00:04.0 Multimedia audio controller [Class 0401]: nVidia Corporation CK804 AC'97 Audio Controller [10de:0059] (rev a2)
00:06.0 IDE interface [Class 0101]: nVidia Corporation CK804 IDE [10de:0053] (rev a2)
00:07.0 IDE interface [Class 0101]: nVidia Corporation CK804 Serial ATA Controller [10de:0054] (rev a3)
00:08.0 IDE interface [Class 0101]: nVidia Corporation CK804 Serial ATA Controller [10de:0055] (rev a3)
00:09.0 PCI bridge [Class 0604]: nVidia Corporation CK804 PCI Bridge [10de:005c] (rev a2)
00:0a.0 Bridge [Class 0680]: nVidia Corporation CK804 Ethernet Controller [10de:0057] (rev a3)
00:0c.0 PCI bridge [Class 0604]: nVidia Corporation CK804 PCIE Bridge [10de:005d] (rev a3)
00:0d.0 PCI bridge [Class 0604]: nVidia Corporation CK804 PCIE Bridge [10de:005d] (rev a3)
00:0e.0 PCI bridge [Class 0604]: nVidia Corporation CK804 PCIE Bridge [10de:005d] (rev a3)
00:18.0 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration [1022:1100]
00:18.1 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map [1022:1101]
00:18.2 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller [1022:1102]
00:18.3 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control [1022:1103]
00:19.0 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration [1022:1100]
00:19.1 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map [1022:1101]
00:19.2 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller [1022:1102]
00:19.3 Host bridge [Class 0600]: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control [1022:1103]
01:06.0 Ethernet controller [Class 0200]: Broadcom Corporation NetXtreme BCM5788 Gigabit Ethernet [14e4:169c] (rev 03)
01:07.0 FireWire (IEEE 1394) [Class 0c00]: VIA Technologies, Inc. IEEE 1394 Host Controller [1106:3044] (rev 46)
01:08.0 Multimedia video controller [Class 0400]: Internext Compression Inc iTVC16 (CX23416) MPEG-2 Encoder [4444:0016] (rev 01)
04:00.0 VGA compatible controller [Class 0300]: Matrox Graphics, Inc. Millenium P650 PCIe [102b:2538] (rev 01)
Comment 26 Andreas Jaeger 2007-08-14 18:30:06 UTC
(In reply to comment #23 from Bryan Perry)
> Will a symlink to kernel-default-2.6.22.2-20070814064043.x86_64.rpm called
> kernel-default.rpm override the existing kernel-default-2.6.22.1-16.x86_64.rpm
> in /suse/x86_64 of my PXE install source, or do I need to remove
> kernel-default-2.6.22.1-16.x86_64.rpm all together?

the version number is in the rpm as well.

Just do:
rpm -ihv kernel-default.rpm

And you have both kernels installed and can choose at boot time which one to use.

Comment 27 Tejun Heo 2007-08-15 06:37:01 UTC
Jogchum's isn't ATI SB600/700.  Reopening 299891.
Comment 28 andreas hoffmann 2007-08-15 07:40:16 UTC
Tanks very much for the debug test kernels, the one I tried (kernel-debug-2.6.22.2-20070814064043.x86_64.rpm) works very fine on Alpha5 and on Beta1 too. I installed Alpha5 first then add the debug kernel then upgrade to beta1 and when booting with the debug-kernel it works.
Comment 29 Forgotten User FOUTW3E5Ow 2007-08-15 11:04:50 UTC
I also failed to install openSUSE10.3 Beta1 on board with SB600.
Here is my test result on board with SB600 using the debug kernels:
1. Install openSUSE10.3 Alpha7 i386 on SB700;
2. Install kernel-debug-2.6.22.2-20070814064043.i586.rpm
3. Install kernel-default-2.6.22.2-20070814064043.i586.rpm
4. Reboot

Result: Both of those two kernels can boot well on SB600.
Comment 30 Forgotten User FOUTW3E5Ow 2007-08-15 11:22:47 UTC
I also tried to boot these two test kernels on board with SB700,
but in vain. I don't know why these two kernels can NOT boot on
SB700, because all the SB700 related patches should have been included.

And I remember that one guy from Novell tell me that openSUSE10.3 Alpha7
have included all the SB700 patches, but Alpha7 can NOT been installed
on SB700 board either.

I need your help to solve the openSUSE10.3 installation problem on SB700.
I can provide more information on it in the near future.

Thanks
Shane
Comment 31 Juan Erbes 2007-08-15 11:49:33 UTC
(In reply to comment #21 from Andreas Jaeger)
> I'm uploading test kernels now to:
> 
> ftp://ftp.suse.com/pub/people/aj/10.3-Beta1-Kernel-Test
> 
> If you have a working Alpha7 or 10.2 installation, could you test that these
> new kernels fix the problem and report back? Thanks!
> 

Yes, the 32 bit version works with the ATI chipset (sb600/rd580)in OSS 10.3 alpha6 and beta1 with the boot parameter "pci=nomsi". Without this parameter, fails to load the ahci driver.
The 64 bits version, I could'nt test it because I could'nt install it because the related problem of 10.3 beta1.
Comment 32 Tejun Heo 2007-08-15 12:12:21 UTC
Created attachment 157634 [details]
disable-msi-on-rs690.patch

Okay, so msi is still broken on RS690.  This patch should fix it.  Can you please give it a shot?
Comment 33 Tejun Heo 2007-08-15 12:14:55 UTC
Henry, I somehow thought you could see private comments and was waiting for your response on comment 17.  I just made it public.  Please also verify that MSI doesn't work on RS690.

Thanks.
Comment 34 Jacek Glinkowski 2007-08-16 01:37:57 UTC
Created attachment 157810 [details]
lspci output as requested (in bug 299690) by Tejun Heo

Seems MSI is supported only on SMBus of SB600.  MSI is supported on other parts of my PC: Audio device, VGA compatible controller, PCI bridge.
Comment 35 henry su 2007-08-16 04:49:06 UTC
 Hi Tejun,
Sorry for the  delay reply.
SB700 chipset is affacted by this problem too, please disable PMP for it,
and MSI is disabled on RS690.
Thanks.
Comment 36 Tejun Heo 2007-08-16 05:32:15 UTC
Alright, committing patch to disable MSI on RS690 and closing.  Thanks.
Comment 37 Andreas Jaeger 2007-08-16 07:17:11 UTC
I'm uploading new kernels to 
ftp://ftp.suse.com/pub/people/aj/10.3-Beta1-Kernel-Test2
(note it's a 2 at the end) for those that like to test.
Comment 38 Tejun Heo 2007-08-16 11:00:41 UTC
JFYI, RS690 patch is submitted for 2.6.23 inclusion.  PMP blacklist patches are merged into pending PMP mainline merge tree.
Comment 39 Tejun Heo 2007-08-16 17:30:19 UTC
*** Bug 300997 has been marked as a duplicate of this bug. ***
Comment 40 Tristan Hoffmann 2007-08-17 09:02:59 UTC
I just tested the default-kernel from ftp://ftp.suse.com/pub/people/aj/10.3-Beta1-Kernel-Test2 and it works well on my SB600.
Comment 41 Forgotten User FOUTW3E5Ow 2007-08-21 02:07:31 UTC
the kernel-default-2.6.22.3-20070816053233.i586(comment #37) also can work well on SB700+RS690
Comment 42 Forgotten User FOUTW3E5Ow 2007-08-21 03:07:02 UTC
But the same default-2.6.22.3-20070816053233 kernel can NOT boot on
our another board(SB700+RX780), here are some boot messages when it hangs,
Could you please give us some suggestion if you have? thanks

pnp: 00:09: iomem range 0xfee00000-0xfee00fff has been reserved
pnp: 00:0a: iomem range 0xffb80000-0xffbfffff could not be reserved
pnp: 00:0a: iomem range 0xfff00000-0xffffffff could not be reserved
pnp: 00:0b: iomem range 0xe0000000-0xefffffff has been reserved
pnp: 00:0c: iomem range 0x0-0x9ffff could not be reserved
pnp: 00:0c: iomem range 0x0-0x0 could not be reserved
pnp: 00:0c: iomem range 0xe0000-0xfffff could not be reserved
pnp: 00:0c: iomem range 0x100000-0x3fbfffff could not be reserved
Time: hpet clocksource has been installed.
Switched to NOHz mode on CPU #0
Switched to NOHz mode on CPU #1
PCI: Bridge: 0000:00:02.0
  IO window: e000-efff
  MEM window: fea00000-feafffff
  PREFETCH window: d8000000-dfffffff
PCI: Bridge: 0000:00:0a.0
  IO window: disabled.
  MEM window: feb00000-febfffff
  PREFETCH window: disabled.
PCI: Bridge: 0000:00:14.4
  IO window: disabled.
  MEM window: disabled.
  PREFETCH window: disabled.
NET: Registered protocol family 2
IP route cache hash table entries: 32768 (order: 5, 131072 bytes)
TCP established hash table entries: 131072 (order: 8, 1572864 bytes)
TCP bind hash table entries: 65536 (order: 7, 524288 bytes)
TCP: Hash tables configured (established 131072 bind 65536)
TCP reno registered
Machine check exception polling timer started.
audit: initializing netlink socket (disabled)
audit:(1187691115.952:1): initialized
highmem bounce pool size: 64 pages
Total HugeTLB memory allocated, 0
VFS: Disk quotas dquot_6.5.1
Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
io scheduler noop registered
io scheduler anticipatory registered
io scheduler deadline registered
io scheduler cfq registered (default)

Comment 43 Tejun Heo 2007-08-21 03:16:17 UTC
Does 'pci=nomsi' make any difference?  Also, please file a separate bug report.
Comment 44 Forgotten User FOUTW3E5Ow 2007-08-21 04:09:41 UTC
Hi Tejun:
"acpi=off pci=nomsi" can solve this problem. I filed a new bugzilla
302016, please check it for more information. 
Only "acpi=off" or "pci=nomsi" can NOT solve this problem.

Are you able to disable RX780 MSI before the release of openSUSE10.3 Beta2 ?
So that we can test it. Thanks.