Bug 1073382

Summary: No sound and kernel crash with latest TW kernel-default-5.1.7-1.2
Product: [openSUSE] openSUSE Tumbleweed Reporter: Vadim Krevs <vkrevs>
Component: Kernel Assignee: E-mail List <kernel-maintainers>
Status: RESOLVED DUPLICATE QA Contact: E-mail List <qa-bugs>
Severity: Normal    
Priority: P5 - None CC: jcheung, jslaby, tiwai, vkrevs
Version: Current   
Target Milestone: ---   
Hardware: Other   
OS: Other   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: dmesg output after reboot with new kernel
/var/log/messages for first boot with 4.14.6-1.5
dmesg output after boot with 4.14.4
dmesg output after boot with 4.14.6 and working sound
/var/log/messages for boot with 5.1.7-1.2
dmesg for boot with 5.1.7-1.2
alsa-info.txt for boot with 5.1.7-1.2 (no sound)
/var/log/messages for boot with 5.1.7-1.2 after running Yast sound module
dmesg for boot with 5.1.7-1.2 after running Yast sound module
alsa-info.txt for boot with 5.1.7-1.2 after running Yast sound module (sound works)

Description Vadim Krevs 2017-12-18 20:23:40 UTC
Created attachment 753551 [details]
dmesg output after reboot with new kernel

Just updated to the latest Tumbleweed snapshot, which includes 4.14.6-1.5. After reboot there is no sound, the KDE sound icon in the task bar has a red minus in it, Yast Sound is unable to configure the sound card, and dmesg contains a kernel crash:

[  196.073287] ------------[ cut here ]------------
[  196.073292] WARNING: CPU: 3 PID: 4834 at ../kernel/memremap.c:98 memremap+0x122/0x170
[  196.073293] Modules linked in: snd_soc_skl(+) snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core snd_soc_sst_match snd_soc_core snd_compress snd_pcm_dmaengine snd_hda_core snd_pcm snd_timer snd soundcore rfcomm raw fuse af_packet dm_mod arc4 ses enclosure scsi_transport_sas hid_logitech_hidpp bnep rtsx_usb_sdmmc rtsx_usb_ms mmc_core memstick uvcvideo hid_logitech_dj videobuf2_vmalloc rtsx_usb videobuf2_memops videobuf2_v4l2 videobuf2_core usbhid videodev btusb btrtl ath10k_pci ath10k_core ath iTCO_wdt iTCO_vendor_support mac80211 cfg80211 r8169 hid_multitouch vboxpci(O) vboxnetadp(O) vboxnetflt(O) uas usb_storage vboxdrv(O) iscsi_ibft iscsi_boot_sysfs msr intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul nls_iso8859_1 hci_uart
[  196.073322]  crc32c_intel nls_cp437 cdc_ether btbcm ghash_clmulni_intel serdev usbnet pcbc dell_wmi dell_laptop btqca vfat r8152 dell_smbios btintel fat idma64 dcdbas wmi_bmof mii bluetooth tpm_crb aesni_intel processor_thermal_device dell_smm_hwmon aes_x86_64 joydev crypto_simd ecdh_generic glue_helper ucsi_acpi mei_me cryptd pcspkr i2c_i801 shpchp mei intel_lpss_pci int3403_thermal rfkill intel_soc_dts_iosf intel_pch_thermal thermal typec_ucsi typec tpm_tis tpm_tis_core wmi tpm battery pinctrl_sunrisepoint intel_hid pinctrl_intel int3402_thermal intel_lpss_acpi int340x_thermal_zone int3400_thermal intel_lpss acpi_thermal_rel sparse_keymap acpi_als kfifo_buf ac acpi_pad industrialio amdkfd amd_iommu_v2 i915 amdgpu serio_raw i2c_algo_bit drm_kms_helper ttm syscopyarea sysfillrect sysimgblt xhci_pci
[  196.073350]  fb_sys_fops sr_mod xhci_hcd cdrom drm usbcore video i2c_hid button sg efivarfs [last unloaded: soundcore]
[  196.073357] CPU: 3 PID: 4834 Comm: modprobe Tainted: G           O    4.14.6-1-default #1
[  196.073358] Hardware name: Dell Inc. Inspiron 5770/0XH3XD, BIOS 1.0.6 10/25/2017
[  196.073359] task: ffff9d4257834000 task.stack: ffffbe30427b0000
[  196.073362] RIP: 0010:memremap+0x122/0x170
[  196.073363] RSP: 0018:ffffbe30427b3b40 EFLAGS: 00010286
[  196.073364] RAX: 000000000000003e RBX: ffff9d42a7901200 RCX: 0000000000000000
[  196.073365] RDX: ffff9d42bf4d6440 RSI: ffff9d42bf4ce2d8 RDI: ffff9d42bf4ce2d8
[  196.073365] RBP: 0000000000000001 R08: 0000000000000001 R09: 00000000000004ac
[  196.073366] R10: ffffbe30427b3a48 R11: 0000000000000000 R12: 0000000000000000
[  196.073367] R13: ffffffffc0c67120 R14: 00000000c09a0009 R15: 0000000000000009
[  196.073368] FS:  00007f1dfe6ba0c0(0000) GS:ffff9d42bf4c0000(0000) knlGS:0000000000000000
[  196.073369] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  196.073370] CR2: 0000564124af0018 CR3: 000000046ba7a006 CR4: 00000000003606e0
[  196.073370] Call Trace:
[  196.073379]  skl_nhlt_init+0x7f/0xb0 [snd_soc_skl]
[  196.073383]  skl_probe+0x2a6/0x6e0 [snd_soc_skl]
[  196.073409]  local_pci_probe+0x3d/0x90
[  196.073412]  ? pci_match_device+0xde/0x110
[  196.073414]  pci_device_probe+0x125/0x190
[  196.073424]  driver_probe_device+0x2f6/0x440
[  196.073426]  __driver_attach+0x9d/0xe0
[  196.073428]  ? driver_probe_device+0x440/0x440
[  196.073430]  bus_for_each_dev+0x5c/0x90
[  196.073432]  bus_add_driver+0x1c2/0x260
[  196.073433]  ? 0xffffffffc0bde000
[  196.073435]  driver_register+0x57/0xc0
[  196.073436]  ? 0xffffffffc0bde000
[  196.073438]  do_one_initcall+0x4b/0x190
[  196.073441]  ? __vunmap+0x6d/0xb0
[  196.073444]  do_init_module+0x5b/0x1ea
[  196.073446]  load_module+0x168b/0x1e00
[  196.073450]  ? SYSC_finit_module+0xb7/0xd0
[  196.073451]  SYSC_finit_module+0xb7/0xd0
[  196.073455]  entry_SYSCALL_64_fastpath+0x1e/0xa9
[  196.073456] RIP: 0033:0x7f1dfdda6fd9
[  196.073457] RSP: 002b:00007fffbd31f228 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[  196.073458] RAX: ffffffffffffffda RBX: 0000564124aed2b0 RCX: 00007f1dfdda6fd9
[  196.073459] RDX: 0000000000000000 RSI: 000056412428bdbc RDI: 000000000000000e
[  196.073459] RBP: 000056412428bdbc R08: 0000000000000000 R09: 0000000000000000
[  196.073460] R10: 000000000000000e R11: 0000000000000246 R12: 0000000000040000
[  196.073461] R13: 0000564124aed250 R14: 0000000000000000 R15: 0000564124aed140
[  196.073462] Code: 48 83 c4 08 5b 5d 41 5c c3 80 3d 55 bc b7 00 00 75 b7 4c 89 e2 48 89 e6 48 c7 c7 28 c4 a5 bc c6 05 3f bc b7 00 01 e8 b9 4b f3 ff <0f> ff eb 9a 4c 89 e6 48 89 df e8 bf 25 ec ff 48 85 c0 74 99 48 
[  196.073488] ---[ end trace 3438ce4936846e80 ]---


$ cat /proc/cmdline 
BOOT_IMAGE=/boot/vmlinuz-4.14.6-1-default root=UUID=d58e8e85-bfdd-4a74-a694-5149ccd85110 quiet resume=/dev/sda3 splash=silent quiet showopts


$ rpm -qa  | grep kernel
kernel-syms-4.14.6-1.5.x86_64
kernel-default-4.14.4-1.6.x86_64
kernel-devel-4.14.6-1.5.noarch
kernel-devel-4.14.4-1.6.noarch
kernel-macros-4.14.6-1.5.noarch
kernel-default-devel-4.14.4-1.6.x86_64
kernel-firmware-20171204-1.1.noarch
kernel-default-devel-4.14.6-1.5.x86_64
kernel-docs-4.14.6-1.5.noarch
kernel-syms-4.14.4-1.6.x86_64
kernel-default-4.14.6-1.5.x86_64

lspci output is attached in bug 1068793.
Comment 1 Vadim Krevs 2017-12-18 20:25:33 UTC
Created attachment 753553 [details]
/var/log/messages for first boot with 4.14.6-1.5
Comment 2 Vadim Krevs 2017-12-18 20:35:24 UTC
Booting into previous 4.14.4-1.6 kernel resolves the issue (until the next kernel update?).
Comment 3 Takashi Iwai 2017-12-18 23:07:35 UTC
For some reason the HD-audio driver got unloaded and the ASoC HD-audio driver got loaded after that.  As there is no change in the relevant drivers between the two kernel versions, the problem must have been introduced somewhere else.

Could you give the dmesg output from the working kernel (4.14.4)?

I guess blacklisting snd-soc-skl should work around it.
After capturing the kernel messages from the working kernel as requested above, try adding the file /etc/modprobe.d/50-hda-blacklist.conf containing the following line:

  blacklist snd-soc-skl

and retest the 4.14.6 kernel.
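
For reference, the workaround above can be sketched as a small shell helper. This is only an illustration: the function name write_blacklist and its directory argument are hypothetical, added so the snippet can be tried on a scratch directory before touching /etc/modprobe.d.

```shell
# write_blacklist DIR
# Hypothetical helper: creates DIR/50-hda-blacklist.conf containing the
# single blacklist line suggested in this comment.
# On a real system DIR would be /etc/modprobe.d and root is required.
write_blacklist() {
    printf 'blacklist snd-soc-skl\n' > "$1/50-hda-blacklist.conf"
}
```

Run as root, `write_blacklist /etc/modprobe.d` followed by a reboot should apply it. Note that a blacklist entry only stops the module from being auto-loaded through its aliases; an explicit modprobe of the module still works.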
Comment 4 Vadim Krevs 2017-12-19 07:12:07 UTC
Created attachment 753611 [details]
dmesg output after boot with 4.14.4

dmesg output after boot with 4.14.4 attached
Comment 5 Vadim Krevs 2017-12-19 07:18:32 UTC
Blacklisting snd-soc-skl works around the issue in the 4.14.6 kernel. Thank you, Takashi.
Comment 6 Takashi Iwai 2017-12-19 07:24:55 UTC
Good to hear.
But it doesn't mean that the issue is really fixed.  The fact that another driver gets loaded implies that something went wrong beforehand.

Now looking at your original log, there are a few AMDGPU messages around 136, 143, 187 and 192 seconds:
[  136.769175] [drm] PCIE GART of 256M enabled (table at 0x000000F400040000).
[  136.777404] amdgpu: [powerplay] can't get the mac of 5
.....
[  143.844475] amdgpu: [powerplay] VI should always have 2 performance levels
[  143.886723] amdgpu 0000:01:00.0: GPU pci config reset
[  187.627361] [drm] PCIE GART of 256M enabled (table at 0x000000F400040000).
[  187.630772] amdgpu: [powerplay] can't get the mac of 5
.....
[  192.993036] amdgpu: [powerplay] VI should always have 2 performance levels
[  193.039854] amdgpu 0000:01:00.0: GPU pci config reset

Then USB-audio is deregistered and the kernel WARNING appears from snd-soc-skl:

[  195.275515] usbcore: deregistering interface driver snd-usb-audio
[  196.073281] memremap attempted on mixed range 0x0000000000000000 size: 0x0
[  196.073287] ------------[ cut here ]------------
[  196.073292] WARNING: CPU: 3 PID: 4834 at ../kernel/memremap.c:98 memremap+0x122/0x170

Do you know what happened on your machine?  Did you unplug something, or take any special action around these times?
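
A rough way to pull the same sequence of events out of a saved dmesg log is sketched below. The function name show_audio_timeline is made up for illustration, and the pattern only covers the three kinds of messages quoted in this comment (the GPU reset, the USB-audio deregistration and the memremap lines), so it may well miss other relevant messages.

```shell
# show_audio_timeline FILE
# Hypothetical helper: prints, in log order, the lines of a saved dmesg
# log that mention the events discussed in this comment.
show_audio_timeline() {
    grep -E 'GPU pci config reset|deregistering interface driver|memremap' "$1"
}
```

For example, `dmesg > dmesg.txt` followed by `show_audio_timeline dmesg.txt` prints the matching lines with their timestamps, which makes it easier to see what preceded the WARNING.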
Comment 7 Takashi Iwai 2017-12-19 07:36:13 UTC
Also, could you try to uninstall the VirtualBox modules?
They are out-of-tree modules, hence one of the suspects in such an obscure breakage.
Comment 8 Vadim Krevs 2017-12-19 09:07:59 UTC
Nothing I am aware of. Did an "init 3", then ran a "zypper dup" to install latest TW snapshot which included the new kernel, then "reboot" to reboot, and then, once booted with the new kernel, there was no sound. 

I'll try uninstalling the virtualbox driver and removing the module blacklist entry to see if that makes any difference once I get home this evening.
Comment 9 Vadim Krevs 2017-12-19 17:54:04 UTC
Hmm, this is strange. Came home, commented out the module blacklist entry, rebooted without uninstalling vbox drivers. And after reboot, sound works in 4.14.6 ... dmesg attached.

Feel free to close the bug.
Comment 10 Vadim Krevs 2017-12-19 17:54:29 UTC
Created attachment 753714 [details]
dmesg output after boot with 4.14.6 and working sound
Comment 11 Takashi Iwai 2017-12-19 19:33:25 UTC
OK, then could you try to re-install / re-enable virtualbox stuff again?
If this triggers the bug again, it's a breakage due to that.
Comment 12 Vadim Krevs 2017-12-20 08:33:08 UTC
I did not uninstall vbox drivers. That's the thing - just rebooting again resulted in working sound - without the need for the module blacklist entry.
Comment 13 Takashi Iwai 2017-12-20 08:43:18 UTC
Hrm, the bug already appears unusual, and it indicates some weird memory corruption or unbalanced refcount or such, i.e. the behavior isn't deterministic.
Please update the bug once you can reproduce the issue reliably.  Thanks.
Comment 14 Vadim Krevs 2017-12-20 09:03:51 UTC
(In reply to Takashi Iwai from comment #13)
> Hrm, the bug already appears unusual, and it indicates some weird memory
> corruption or unbalanced refcount or such, i.e. the behavior isn't
> deterministic.
> Please update the bug once you can reproduce the issue reliably.
> Thanks.

Will do.
Comment 15 Jiri Slaby 2018-02-08 14:42:43 UTC
Closing as invalid for the time being. You can still reopen it if you have something new.
Comment 16 Vadim Krevs 2019-06-22 14:50:56 UTC
Several days ago I upgraded the same laptop to the latest TW snapshot, which includes kernel 5.1.7-1.2.

Just updated to latest Tumbleweed snapshot that includes 4.14.6-1.5. After reboot, there is no sound, the KDE sound icon in the task bar has a red minus in it.

Takashi's workaround from comment #3 does not seem to have any effect.

If I go into Yast->Sound, select the unconfigured "Sunrise Point-LP HD Audio" device, and click Edit to perform Quick default configuration, then configuration fails, but the KDE sound icon in the task bar changes to normal (red minus is gone), and I now have sound - until next reboot.

$ cat /proc/cmdline
BOOT_IMAGE=/boot/vmlinuz-5.1.7-1-default root=UUID=d58e8e85-bfdd-4a74-a694-5149ccd85110 quiet resume=/dev/sda3 splash=silent quiet showopts

$ rpm -qa | grep kernel | sort
kernel-default-5.1.4-1.1.x86_64
kernel-default-5.1.7-1.2.x86_64
kernel-default-devel-5.1.4-1.1.x86_64
kernel-default-devel-5.1.7-1.2.x86_64
kernel-devel-5.1.4-1.1.noarch
kernel-devel-5.1.7-1.2.noarch
kernel-docs-5.1.7-1.2.noarch
kernel-firmware-20190514-1.3.noarch
kernel-macros-5.1.7-1.2.noarch
kernel-source-5.1.4-1.1.noarch
kernel-source-5.1.7-1.2.noarch
kernel-syms-5.1.4-1.1.x86_64
kernel-syms-5.1.7-1.2.x86_64
Comment 17 Vadim Krevs 2019-06-22 14:52:39 UTC
Attaching /var/log/messages, dmesg and alsa-info logs after boot when there is no sound, and after doing Yast->Sound->Edit after which there is sound.
Comment 18 Vadim Krevs 2019-06-22 14:53:53 UTC
Created attachment 808174 [details]
/var/log/messages for boot with 5.1.7-1.2
Comment 19 Vadim Krevs 2019-06-22 14:55:12 UTC
Created attachment 808175 [details]
dmesg for boot with 5.1.7-1.2
Comment 20 Vadim Krevs 2019-06-22 14:55:36 UTC
Created attachment 808176 [details]
alsa-info.txt for boot with 5.1.7-1.2 (no sound)
Comment 21 Vadim Krevs 2019-06-22 14:56:38 UTC
Created attachment 808177 [details]
/var/log/messages for boot with 5.1.7-1.2 after running Yast sound module
Comment 22 Vadim Krevs 2019-06-22 14:57:01 UTC
Created attachment 808178 [details]
dmesg for boot with 5.1.7-1.2 after running Yast sound module
Comment 23 Vadim Krevs 2019-06-22 14:57:29 UTC
Created attachment 808179 [details]
alsa-info.txt for boot with 5.1.7-1.2 after running Yast sound module (sound works)
Comment 24 Vadim Krevs 2019-06-22 14:58:59 UTC
Please ignore the second sentence in comment 16 (cut and paste error).
Comment 25 Vadim Krevs 2019-06-22 15:00:41 UTC
$ lspci
00:00.0 Host bridge: Intel Corporation Xeon E3-1200 v6/7th Gen Core Processor Host Bridge/DRAM Registers (rev 08)
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 620 (rev 07)
00:04.0 Signal processing controller: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem (rev 08)
00:14.0 USB controller: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller (rev 21)
00:14.2 Signal processing controller: Intel Corporation Sunrise Point-LP Thermal subsystem (rev 21)
00:15.0 Signal processing controller: Intel Corporation Sunrise Point-LP Serial IO I2C Controller #0 (rev 21)
00:16.0 Communication controller: Intel Corporation Sunrise Point-LP CSME HECI #1 (rev 21)
00:17.0 RAID bus controller: Intel Corporation 82801 Mobile SATA Controller [RAID mode] (rev 21)
00:1c.0 PCI bridge: Intel Corporation Sunrise Point-LP PCI Express Root Port #1 (rev f1)
00:1c.4 PCI bridge: Intel Corporation Sunrise Point-LP PCI Express Root Port #5 (rev f1)
00:1c.5 PCI bridge: Intel Corporation Sunrise Point-LP PCI Express Root Port #6 (rev f1)
00:1f.0 ISA bridge: Intel Corporation Sunrise Point LPC Controller/eSPI Controller (rev 21)
00:1f.2 Memory controller: Intel Corporation Sunrise Point-LP PMC (rev 21)
00:1f.3 Audio device: Intel Corporation Sunrise Point-LP HD Audio (rev 21)
00:1f.4 SMBus: Intel Corporation Sunrise Point-LP SMBus (rev 21)
01:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Topaz XT [Radeon R7 M260/M265 / M340/M360 / M440/M445] (rev c1)
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL810xE PCI Express Fast Ethernet controller (rev 07)
03:00.0 Network controller: Qualcomm Atheros QCA9377 802.11ac Wireless Network Adapter (rev 31)
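
Since the earlier discussion turned on which driver ends up bound to the audio device at 00:1f.3, a possible way to check that is sketched here. The helper name driver_in_use is hypothetical; it assumes the usual "Kernel driver in use:" line that lspci prints when called with -k.

```shell
# driver_in_use
# Hypothetical helper: reads `lspci -k` output on stdin and prints only
# the name(s) after "Kernel driver in use:".
driver_in_use() {
    sed -n 's/^[[:space:]]*Kernel driver in use: //p'
}
```

Typical use would be `lspci -k -s 00:1f.3 | driver_in_use`. If nothing is printed, no driver is currently bound to the device.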
Comment 26 Vadim Krevs 2019-07-08 20:27:14 UTC
Was browsing TW bug list filtered by "sound" and stumbled on "Bug 1116849 - yast2-sound: do not add sound modules to initrd". 

My laptop also had an /etc/modules-load.d/yast.conf containing "snd-hda-intel". Removed the file, and re-ran "dracut -f", rebooted - and sound works.

So it looks like /etc/modules-load.d/yast.conf with "snd-hda-intel" can result in more than one "quirk".
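
In case it helps others hitting the same thing, here is a small sketch for spotting such entries. The function name find_forced_sound_modules is made up, and the pattern simply assumes that the sound modules of interest start with "snd-" or "snd_".

```shell
# find_forced_sound_modules DIR
# Hypothetical helper: lists "file:line" for every entry in DIR/*.conf
# that force-loads a module whose name starts with snd- or snd_.
# /dev/null is passed as an extra file so grep always prints the file name.
find_forced_sound_modules() {
    grep -E '^[[:space:]]*snd[-_]' "$1"/*.conf /dev/null 2>/dev/null
}
```

On an affected system, `find_forced_sound_modules /etc/modules-load.d` would be expected to report the yast.conf entry mentioned above. After removing the file, the initrd still has to be rebuilt with "dracut -f" as described; listing the initrd contents with lsinitrd (shipped with dracut) may help confirm that the sound modules are no longer included.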

*** This bug has been marked as a duplicate of bug 1116849 ***