Bug 1212587 - Kernel 6.3.7-1.2 fails to activate Intel Thunderbolt PCIe Bridge
Summary: Kernel 6.3.7-1.2 fails to activate Intel Thunderbolt PCIe Bridge
Status: RESOLVED FIXED
Alias: None
Product: openSUSE Tumbleweed
Classification: openSUSE
Component: Kernel (show other bugs)
Version: Current
Hardware: Other Other
: P5 - None : Normal (vote)
Target Milestone: ---
Assignee: openSUSE Kernel Bugs
QA Contact: E-mail List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2023-06-21 12:18 UTC by Martin Doucha
Modified: 2023-07-12 11:44 UTC (History)
2 users (show)

See Also:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---


Attachments
Boot log on affected kernel (77.53 KB, text/plain)
2023-06-21 12:18 UTC, Martin Doucha
Details
lspci -vv from kernel 6.2.10-1.1 (56.08 KB, text/plain)
2023-06-21 12:19 UTC, Martin Doucha
Details
lspci -vv from kernel 6.3.7-1.2 (34.56 KB, text/plain)
2023-06-21 12:20 UTC, Martin Doucha
Details
lspci -vv from kernel 6.3.2-1.1 (56.08 KB, text/plain)
2023-06-21 12:51 UTC, Martin Doucha
Details
lspci -vv from kernel 6.3.8-4.1.ge044c8e (34.56 KB, text/plain)
2023-06-21 12:52 UTC, Martin Doucha
Details
lspci -vv from kernel 6.3.4-1.1 (34.56 KB, text/plain)
2023-06-21 13:24 UTC, Martin Doucha
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Martin Doucha 2023-06-21 12:18:59 UTC
Created attachment 867722 [details]
Boot log on affected kernel

After updating to the latest Tumbleweed kernel (v6.3.7-1.2), the USB devices connected through my Dell laptop USB-C docking station stopped working. If I reboot to an older kernel (6.2.10-1.1), they work fine.

Comparing lspci output from the two kernels shows that the newer kernel fails to activate the following devices:

03:00.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02) (prog-if 00 [Normal decode])
	Subsystem: Device 2222:1111
04:00.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02) (prog-if 00 [Normal decode])
	Subsystem: Device 2222:1111
04:01.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02) (prog-if 00 [Normal decode])
	Subsystem: Device 2222:1111
04:02.0 PCI bridge: Intel Corporation JHL6340 Thunderbolt 3 Bridge (C step) [Alpine Ridge 2C 2016] (rev 02) (prog-if 00 [Normal decode])
	Subsystem: Device 2222:1111
3b:00.0 USB controller: Intel Corporation JHL6340 Thunderbolt 3 USB 3.1 Controller (C step) [Alpine Ridge 2C 2016] (rev 02) (prog-if 30 [XHCI])
	Subsystem: Device 2222:1111
Comment 1 Martin Doucha 2023-06-21 12:19:49 UTC
Created attachment 867723 [details]
lspci -vv from kernel 6.2.10-1.1
Comment 2 Martin Doucha 2023-06-21 12:20:14 UTC
Created attachment 867724 [details]
lspci -vv from kernel 6.3.7-1.2
Comment 3 Takashi Iwai 2023-06-21 12:24:10 UTC
Could you check with the latest package in OBS Kernel:stable repo?  A TB fix patch from 6.4-rc7 is included there.

Also, please check whether the early 6.3.x kernel also suffers.  At least 6.3.2 is still found in TW history repo.
Comment 4 Martin Doucha 2023-06-21 12:51:43 UTC
Created attachment 867725 [details]
lspci -vv from kernel 6.3.2-1.1

Kernel v6.3.2-1.1: Not affected
Comment 5 Martin Doucha 2023-06-21 12:52:45 UTC
Created attachment 867726 [details]
lspci -vv from kernel 6.3.8-4.1.ge044c8e

Kernel v6.3.8-4.1.ge044c8e (Kernel:stable): Affected
Comment 6 Takashi Iwai 2023-06-21 13:01:52 UTC
Thanks.  At least it's a good news that 6.3.2 worked.

There are 6.3.4 and 6.3.6 kernels in the history repo.  Could you verify which one starts breaking (or not), too?
Comment 7 Martin Doucha 2023-06-21 13:24:36 UTC
Created attachment 867727 [details]
lspci -vv from kernel 6.3.4-1.1

Both kernels v6.3.4-1.1 and v6.3.6-1.1 are affected.
Comment 8 Takashi Iwai 2023-06-21 13:31:58 UTC
Thanks.  So the likely cause is the fix patch for AMD Thunderbolt, the upstreamc4af8e3fecd03b0aedcd38145955605cfebe7e3a
   thunderbolt: Clear registers properly when auto clear isn't in use
that landed in 6.3.3.

One more quick test: could you check with 6.4-rc7 kernel in OBS Kernel:HEAD?
If the problem is seen there, we'll need to report it to the upstream.
Comment 9 Takashi Iwai 2023-06-21 13:38:09 UTC
Meanwhile, I'm building a test kernel with the revert of the two recent TB patches (including the above mentioned one).  It's being built in OBS home:tiwai:bsc1212587 repo.  Please give it a try later once after the build finishes.
Comment 10 Martin Doucha 2023-06-21 14:30:53 UTC
Kernel 6.4-rc7 is fixed.

The test kernel in home:tiwai:bsc1212587 failed to build.
Comment 11 Takashi Iwai 2023-06-21 16:07:04 UTC
OK, at least, the bug will be fixed in the next week :)

But it's still interesting whether those patches do right or not.
I'm rebuilding the same repo, OBS home:tiwai:bsc1212587.  Let's cross fingers.

Also, another test build is found in OBS home:tiwai:bsc1212587-2 repo.  This syncs the all changes in drivers/thunderbolt/*.  Please try it later if you have time, too.
Comment 12 Martin Doucha 2023-06-23 09:35:37 UTC
(In reply to Takashi Iwai from comment #11)
> OK, at least, the bug will be fixed in the next week :)
> 
> But it's still interesting whether those patches do right or not.
> I'm rebuilding the same repo, OBS home:tiwai:bsc1212587.  Let's cross
> fingers.
> 
> Also, another test build is found in OBS home:tiwai:bsc1212587-2 repo.  This
> syncs the all changes in drivers/thunderbolt/*.  Please try it later if you
> have time, too.

Both kernels work correctly.
Comment 13 Takashi Iwai 2023-06-23 09:40:14 UTC
OK, thanks, it's good to know!  I guess we need to backport relevant ones to SLE15-SP5, too (which had already the patch that caused the problem).

As stable branch will be upgraded to 6.4 on the next Monday, I'll leave this.
If 6.4-rc8 comes up and the upgrade is delayed, I'll merge the backport instead.
Comment 14 Takashi Iwai 2023-07-12 11:44:25 UTC
TW is switching to 6.4.x.  Let's close.