Bug 1228093 - [amdgpu] Secondary monitor does not come up with 6.10
Summary: [amdgpu] Secondary monitor does not come up with 6.10
Status: IN_PROGRESS
Alias: None
Product: openSUSE Tumbleweed
Classification: openSUSE
Component: Kernel
Version: Current
Hardware: x86-64 Other
Priority: P5 - None, Severity: Normal
Target Milestone: ---
Assignee: Jiri Slaby
QA Contact: E-mail List
URL: https://gitlab.freedesktop.org/drm/am...
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-07-18 06:33 UTC by Jiri Slaby
Modified: 2024-07-19 18:43 UTC
CC List: 3 users

See Also:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---


Description Jiri Slaby 2024-07-18 06:33:07 UTC
I bisected the issue to:
commit 8b2cb32cf0c613fd937ebb49a331798985f50826
Author: Hersen Wu <hersenxs.wu@amd.com>
Date:   Mon Mar 11 18:18:34 2024 -0400

    drm/amd/display: FEC overhead should be checked once for mst slot nums

I am going to revert it in stable temporarily and report it upstream.
Comment 1 Jiri Slaby 2024-07-18 06:38:53 UTC
The monitor simply does not come up in wayland-plasma6 (it does on the console). The session behaves as if the monitor were there (windows open on it and the mouse cursor can move onto it), but the monitor stays in DPMS off.
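For anyone trying to confirm the same "present but DPMS off" state, here is a minimal sketch that prints what the kernel reports per connector. It assumes the usual DRM sysfs layout (`/sys/class/drm/card*-*` with `status`, `enabled` and `dpms` files). The function name `list_connectors` is made up for illustration, and connector names differ between machines.

```shell
#!/bin/sh
# Print connection status, enabled state and DPMS state of each DRM connector.
# The optional first argument overrides the sysfs base directory (assumption:
# the default /sys/class/drm is where the running kernel exposes connectors).
list_connectors() {
    base="${1:-/sys/class/drm}"
    for conn in "$base"/card*-*; do
        # Skip the unexpanded glob when no connector directory exists.
        [ -d "$conn" ] || continue
        printf '%s: status=%s enabled=%s dpms=%s\n' \
            "${conn##*/}" \
            "$(cat "$conn/status" 2>/dev/null)" \
            "$(cat "$conn/enabled" 2>/dev/null)" \
            "$(cat "$conn/dpms" 2>/dev/null)"
    done
}

# Usage: list_connectors
```

A connector that shows as connected while its `dpms` value is off would be consistent with the symptom described here. The compositor's own view may still differ from what sysfs reports.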

dmesg shows no difference in the [drm] messages.

Reverting the above commit on top of 6.10 makes it work again.
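A rough sketch of how such a test revert can be prepared in a kernel git checkout. The helper `revert_on_top` and the default branch name `revert-candidate` are hypothetical, chosen only for this example. Building and booting the resulting tree is not shown.

```shell
#!/bin/sh
# Create a scratch branch at a base ref and revert one commit on top of it.
#   $1 = base ref (e.g. a release tag)
#   $2 = commit to revert
#   $3 = scratch branch name (optional, hypothetical default)
revert_on_top() {
    base="$1"
    commit="$2"
    branch="${3:-revert-candidate}"
    git checkout -q -b "$branch" "$base" &&
    git revert --no-edit "$commit"
}

# Usage for this bug, from a kernel checkout:
#   revert_on_top v6.10 8b2cb32cf0c613fd937ebb49a331798985f50826
```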

git bisect log for reference:
> # bad: [0c3836482481200ead7b416ca80c68a29cfdaabd] Linux 6.10
> # good: [a38297e3fb012ddfa7ce0321a7e5a8daeb1872b6] Linux 6.9
> git bisect start 'v6.10' 'v6.9' '--' 'drivers/gpu/drm/amd/'
> # bad: [27e718ac8b8194d13eee5738c4d3fd247736186e] drm/amd/display: fix disable otg wa logic in DCN316
> git bisect bad 27e718ac8b8194d13eee5738c4d3fd247736186e
> # good: [20fd14460f45a01b9ec63aa7b12e6c3c66e54fa7] drm/amdgpu: Fix 'fw_name' buffer size to prevent truncations in amdgpu_mes_init_microcode
> git bisect good 20fd14460f45a01b9ec63aa7b12e6c3c66e54fa7
> # bad: [14f9db4271ef5c78ae87237af844f03fb192d139] drm/amd/display: Enable DTBCLK DTO earlier in the sequence
> git bisect bad 14f9db4271ef5c78ae87237af844f03fb192d139
> # good: [1c5c36530a573de1a4b647b7d8c36f3b298e60ed] drm/amd/display: Set DCN351 BB and IP the same as DCN35
> git bisect good 1c5c36530a573de1a4b647b7d8c36f3b298e60ed
> # good: [d045f4ad7700c271fa1278b78ef7722f833a8068] drm/amd/swsmu: Update smu v14.0.0 headers to be 14.0.1 compatible
> git bisect good d045f4ad7700c271fa1278b78ef7722f833a8068
> # good: [029faefb7302f1079173410697b0e14d2e56e19a] drm/amdgpu: implement IRQ_STATE_ENABLE for SDMA v4.4.2
> git bisect good 029faefb7302f1079173410697b0e14d2e56e19a
> # bad: [b7a1a0ef12b81957584fef7b61e2d5ec049c7209] drm/amd/amdgpu: add pipe1 hardware support
> git bisect bad b7a1a0ef12b81957584fef7b61e2d5ec049c7209
> # bad: [60df5628144b59d5876f8ceac624a7661c336665] drm/amd/display: handle invalid connector indices
> git bisect bad 60df5628144b59d5876f8ceac624a7661c336665
> # bad: [8b2cb32cf0c613fd937ebb49a331798985f50826] drm/amd/display: FEC overhead should be checked once for mst slot nums
> git bisect bad 8b2cb32cf0c613fd937ebb49a331798985f50826
> # first bad commit: [8b2cb32cf0c613fd937ebb49a331798985f50826] drm/amd/display: FEC overhead should be checked once for mst slot nums
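To re-run or continue this bisection, the quoted log above can be fed to `git bisect replay` once the leading "> " quoting is removed. A minimal sketch follows. The function name `unquote_bisect_log` and the file names are placeholders.

```shell
#!/bin/sh
# Remove the "> " quote prefix from each line of a saved bisect log so that
# git can parse it again. Lines without the prefix pass through unchanged.
unquote_bisect_log() {
    sed 's/^> //' "$1"
}

# Usage, from a kernel git checkout:
#   unquote_bisect_log quoted.log > bisect.log
#   git bisect replay bisect.log
```

The replayed log keeps the `drivers/gpu/drm/amd/` path limit from the original `git bisect start` line, so only commits touching that directory are considered.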
Comment 2 Jiri Slaby 2024-07-18 06:44:15 UTC
The external monitor is connected by an HDMI cable to a Lenovo dock, which is attached via Thunderbolt.

The card in question:
64:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Phoenix1 [1002:15bf] (rev dd) (prog-if 00 [VGA controller])
        Subsystem: Lenovo Device [17aa:50da]
        Flags: bus master, fast devsel, latency 0, IRQ 57, IOMMU group 16
        Memory at 2400000000 (64-bit, prefetchable) [size=256M]
        Memory at 78000000 (64-bit, prefetchable) [size=2M]
        I/O ports at 1000 [size=256]
        Memory at 78500000 (32-bit, non-prefetchable) [size=512K]
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
        Capabilities: [64] Express Legacy Endpoint, IntMsgNum 0
        Capabilities: [a0] MSI: Enable- Count=1/4 Maskable- 64bit+
        Capabilities: [c0] MSI-X: Enable+ Count=4 Masked-
        Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
        Capabilities: [270] Secondary PCI Express
        Capabilities: [2a0] Access Control Services
        Capabilities: [2b0] Address Translation Service (ATS)
        Capabilities: [2c0] Page Request Interface (PRI)
        Capabilities: [2d0] Process Address Space ID (PASID)
        Capabilities: [410] Physical Layer 16.0 GT/s <?>
        Capabilities: [450] Lane Margining at the Receiver
        Kernel driver in use: amdgpu
Comment 3 Daniel Schemp 2024-07-19 18:43:47 UTC
Hi, I was pointed to this ticket from a forum topic I opened. I hope I can help, but my issue is linked to kernel 6.9.3+ rather than 6.10, and it already shows up at boot.

Topic: https://forums.opensuse.org/t/system-crashes-when-second-daisy-chained-monitor-is-attached-with-amd-gpu-with-kernel-6-9-3/176886

System:
  Kernel: 6.9.7-1-default arch: x86_64 bits: 64 compiler: gcc v: 13.3.0
    clocksource: tsc avail: hpet,acpi_pm
    parameters: initrd=\opensuse-tumbleweed\6.9.7-1-default\initrd-78cac3084ea8018dc0df08f7fd3831a49a0967c4
    root=UUID=[REDACTED] splash=silent quiet
    security=apparmor mitigations=auto
    systemd.machine_id=[REDACTED]
  Desktop: KDE Plasma v: 6.1.2 tk: Qt v: N/A info: frameworks v: 6.3.0
    wm: kwin_x11 tools: avail: xscreensaver vt: 2 dm: SDDM Distro: openSUSE
    Tumbleweed 20240712
Graphics:
  Device-1: AMD Navi 22 [Radeon RX 6700/6700 XT/6750 XT / 6800M/6850M XT]
    vendor: XFX driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x
    process: TSMC n7 (7nm) built: 2020-22 pcie: gen: 4 speed: 16 GT/s
    lanes: 16 ports: active: DP-4 empty: DP-1, DP-2, DP-3, DP-5, HDMI-A-1,
    Writeback-1 bus-ID: 2d:00.0 chip-ID: 1002:73df class-ID: 0300
  Display: x11 server: X.Org v: 21.1.12 with: Xwayland v: 24.1.0
    compositor: kwin_x11 driver: X: loaded: modesetting unloaded: fbdev,vesa
    dri: radeonsi gpu: amdgpu display-ID: :0 screens: 1
  Screen-1: 0 s-res: 2560x1440 s-dpi: 96 s-size: 677x381mm (26.65x15.00")
    s-diag: 777mm (30.58")
  Monitor-1: DP-4 model: HP Z27u G3 serial: <filter> built: 2021
    res: 2560x1440 hz: 60 dpi: 109 gamma: 1.2 size: 597x336mm (23.5x13.23")
    diag: 685mm (27") ratio: 16:9 modes: max: 2560x1440 min: 720x400
  API: EGL v: 1.5 hw: drv: amd radeonsi platforms: device: 0 drv: radeonsi
    device: 1 drv: swrast surfaceless: drv: radeonsi x11: drv: radeonsi
    inactive: gbm,wayland
  API: OpenGL v: 4.6 compat-v: 4.5 vendor: amd mesa v: 24.1.3 glx-v: 1.4
    direct-render: yes renderer: AMD Radeon RX 6700 XT (radeonsi navi22 LLVM
    18.1.8 DRM 3.57 6.9.7-1-default) device-ID: 1002:73df memory: 11.72 GiB
    unified: no
  API: Vulkan v: 1.3.283 layers: 5 device: 0 type: discrete-gpu name: AMD
    Radeon RX 6700 XT (RADV NAVI22) driver: N/A device-ID: 1002:73df
    surfaces: xcb,xlib