Bug 1213334

Summary: AMDGPU stuttering: *ERROR* Error waiting for DMUB idle: status=3
Product: [openSUSE] openSUSE Tumbleweed Reporter: B <reiokorn>
Component: KernelAssignee: openSUSE Kernel Bugs <kernel-bugs>
Status: RESOLVED FIXED QA Contact: E-mail List <qa-bugs>
Severity: Normal    
Priority: P5 - None CC: reiokorn, tiwai
Version: Current   
Target Milestone: ---   
Hardware: x86-64   
OS: openSUSE Tumbleweed   
URL: https://gitlab.freedesktop.org/drm/amd/-/issues/2708
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---

Description B 2023-07-14 12:41:23 UTC
Stuttering/lag since update to Kernel 6.4.2-1-default
# VERSION="20230712"
AMD ATI Radeon RX 7900 XT

Error message according to journalctl:

---
Jul 14 14:30:11 opensuse-pc kernel: [drm:dc_dmub_setup_subvp_dmub_command [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
Jul 14 14:30:12 opensuse-pc kernel: [drm:dc_dmub_setup_subvp_dmub_command [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
Jul 14 14:30:16 opensuse-pc kernel: [drm:dc_dmub_setup_subvp_dmub_command [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
Jul 14 14:31:18 opensuse-pc kernel: [drm:dc_dmub_setup_subvp_dmub_command [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
---
Comment 1 B 2023-07-14 13:15:30 UTC
and this one:

Jul 14 15:10:50 opensuse-pc kernel: [drm:dc_dmub_srv_wait_idle [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
Jul 14 15:10:50 opensuse-pc kernel: [drm:dc_dmub_srv_wait_idle [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
Jul 14 15:10:50 opensuse-pc kernel: [drm:dc_dmub_srv_wait_idle [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3
Comment 2 Takashi Iwai 2023-07-14 13:38:03 UTC
Might be related with the kernel-firmware update.  Which version do you have?
You can try kernel-firmware-* in OBS Kernel:HEAD repo, if the version differs.
Comment 3 Takashi Iwai 2023-07-14 13:41:21 UTC
I suspected the upstream report:
  https://gitlab.freedesktop.org/drm/amd/-/issues/2666
Comment 4 B 2023-07-14 15:25:06 UTC
(In reply to Takashi Iwai from comment #2)
> Might be related with the kernel-firmware update.  Which version do you have?
> You can try kernel-firmware-* in OBS Kernel:HEAD repo, if the version
> differs.

kernel-firmware-amdgpu | 20230707-1.1

it's currently the same version as in kernel:head - afaik
Comment 5 B 2023-07-14 15:29:13 UTC
(In reply to Takashi Iwai from comment #3)
> I suspected the upstream report:
>   https://gitlab.freedesktop.org/drm/amd/-/issues/2666

cb@opensuse-pc:~> sudo journalctl _TRANSPORT=kernel  | grep -i 'loading.*DMUB'
Jul 14 18:50:19 opensuse-pc kernel: [drm] Loading DMUB firmware via PSP: version=0x07001900
Jul 14 18:50:20 opensuse-pc kernel: [drm] Loading DMUB firmware via PSP: version=0x05000C00
Comment 6 Takashi Iwai 2023-07-14 16:16:50 UTC
OK, then could you try to downgrade kernel-firmware-* packages?  The older packages are found in TW history repo:
  http://download.opensuse.org/history/
Comment 7 B 2023-07-14 18:11:02 UTC
(In reply to Takashi Iwai from comment #6)
> OK, then could you try to downgrade kernel-firmware-* packages?  The older
> packages are found in TW history repo:
>   http://download.opensuse.org/history/

OK. This fixes the issue as far as I'm aware. I've checked the journalctl in realtime and I don't get the error message any longer.

history-oss-20230710 
kernel-firmware-amdgpu | 20230620-1.1
Comment 8 B 2023-07-14 19:33:59 UTC
(In reply to B from comment #7)
> (In reply to Takashi Iwai from comment #6)
> > OK, then could you try to downgrade kernel-firmware-* packages?  The older
> > packages are found in TW history repo:
> >   http://download.opensuse.org/history/
> 
> OK. This fixes the issue as far as I'm aware. I've checked the journalctl in
> realtime and I don't get the error message any longer.
> 
> history-oss-20230710 
> kernel-firmware-amdgpu | 20230620-1.1

correction: it doesn't fix it. 

still

Jul 14 21:32:46 opensuse-pc kernel: [drm:dc_dmub_setup_subvp_dmub_command [amdgpu]] *ERROR* Error waiting for DMUB idle: status=3

on older firmware
Comment 9 Takashi Iwai 2023-07-15 06:17:44 UTC
Thanks for confirmation.  Then at best please report to the upstream.
I guess you can continue on the gitlab issue I mentioned in comment 3.
Comment 10 B 2023-07-15 10:52:27 UTC
(In reply to Takashi Iwai from comment #9)
> Thanks for confirmation.  Then at best please report to the upstream.
> I guess you can continue on the gitlab issue I mentioned in comment 3.

switching back to kernel 6.3.9-1-default does fix it though even with the newer firmware.
Comment 11 B 2023-07-15 23:31:37 UTC
(In reply to Takashi Iwai from comment #9)
> Thanks for confirmation.  Then at best please report to the upstream.
> I guess you can continue on the gitlab issue I mentioned in comment 3.

Looks to be exactly what is described here:
https://gitlab.freedesktop.org/drm/amd/-/issues/1887

"Went back to kernel 6.3.9 and I'm back to normal, no issues." 
I can confirm that.
Comment 12 B 2023-07-17 14:00:52 UTC
updated to TW # VERSION="20230716" today
which included a update to Kernel version 6.4.3-1-default 

So, far it fixed my problem with stuttering/lag. If it stays like that until tomorrow i will close this issue.
Comment 13 B 2023-07-17 19:16:07 UTC
(In reply to B from comment #12)
> updated to TW # VERSION="20230716" today
> which included a update to Kernel version 6.4.3-1-default 
> 
> So, far it fixed my problem with stuttering/lag. If it stays like that until
> tomorrow i will close this issue.

Not fixed. 
It took somehow longer to trigger though.
Comment 14 B 2023-07-17 19:25:41 UTC
someone mentioned a workaround

https://gitlab.freedesktop.org/drm/amd/-/issues/1887#note_2003355

adding kernel parameter:
drm.vblankoffdelay=0
Comment 15 B 2023-07-17 23:49:51 UTC
Created upstream report: https://gitlab.freedesktop.org/drm/amd/-/issues/2708
Comment 16 B 2023-09-16 13:52:19 UTC
(In reply to B from comment #15)
> Created upstream report: https://gitlab.freedesktop.org/drm/amd/-/issues/2708

kernel 6.5.2 fixed the issue for me