Bug 1223025 - CPU soft lockup when suspending
Summary: CPU soft lockup when suspending
Status: NEW
Alias: None
Product: openSUSE Tumbleweed
Classification: openSUSE
Component: Kernel (show other bugs)
Version: Current
Hardware: x86-64 openSUSE Tumbleweed
: P5 - None : Critical (vote)
Target Milestone: ---
Assignee: openSUSE Kernel Bugs
QA Contact: E-mail List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-04-18 07:39 UTC by Atri Bhattacharya
Modified: 2024-05-23 14:19 UTC (History)
2 users (show)

See Also:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---
tiwai: needinfo? (badshah400)


Attachments
Full journal log from problematic session (3.63 MB, text/x-log)
2024-04-18 07:39 UTC, Atri Bhattacharya
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Atri Bhattacharya 2024-04-18 07:39:15 UTC
Created attachment 874349 [details]
Full journal log from problematic session

Laptop went into CPU soft lockup look upon closing lid (expecting suspend) until battery died. Here is the relevant section of the log from `journalctl -b -1`:

/me goes to bed at the healthy, healthy time of 01:26:56 and the madness starts:

> Apr 18 01:26:56 systemd[1]: Reached target Sleep.
> Apr 18 01:26:56 systemd[1]: Starting System Suspend...
> Apr 18 01:26:56 systemd-sleep[4069]: INFO: Skip running /usr/lib/systemd/system-sleep/grub2.sleep for suspend
> Apr 18 01:26:56 systemd-sleep[4064]: Performing sleep operation 'suspend'...
> Apr 18 01:26:56 kernel: PM: suspend entry (s2idle)
> Apr 18 01:26:56 kernel: Filesystems sync: 0.017 seconds
> Apr 18 01:26:56 goa-daemon[3630]: g_socket_client_connect_to_host() failed: Error resolving “mail.ulg.ac.be”: Name or service not known (g-resolver-error-quark, 0)
> Apr 18 01:26:57 goa-daemon[3630]: g_socket_client_connect_to_host() failed: Error resolving “mail.ulg.ac.be”: Name or service not known (g-resolver-error-quark, 0)
> Apr 18 01:26:58 dbus-daemon[1409]: [system] Activating via systemd: service name='org.freedesktop.PackageKit' unit='packagekit.service' requested by ':1.96' (uid=1000 pid=3498 comm="/usr/bin/gnome-software --gapplication-service")
> Apr 18 01:27:20 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 26s! [systemd-sleep:4064]
> Apr 18 01:27:20 kernel: Modules linked in: uinput exfat ccm snd_seq_dummy snd_hrtimer snd_seq snd_seq_device rfcomm cmac algif_hash algif_skcipher af_alg af_packet uvcvideo videobuf2_vmalloc uvc videobuf2_memops videobuf2_v4l2 videodev nf_conntrack_netbios_ns nf_conntrack_broadcast videobuf2_common nft_fib_inet mc nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables bnep btusb btrtl btintel btbcm btmtk bluetooth ecdh_generic qrtr binfmt_misc snd_acp_legacy_mach snd_acp_mach snd_soc_nau8821 snd_acp3x_pdm_dma snd_soc_dmic snd_acp3x_rn snd_sof_amd_acp63 snd_sof_amd_vangogh snd_sof_amd_rembrandt snd_sof_amd_renoir snd_sof_amd_acp snd_sof_pci snd_sof_xtensa_dsp snd_ctl_led snd_sof snd_hda_codec_realtek snd_sof_utils snd_hda_codec_generic mt7921e snd_soc_core mt7921_common snd_hda_codec_hdmi mt792x_lib snd_compress nls_iso8859_1 snd_pcm_dmaengine mt76_connac_lib snd_pci_ps nls_cp437 snd_rpl_pci_acp6x mt76 snd_hda_intel
> Apr 18 01:27:20 kernel:  snd_acp_pci vfat fat snd_acp_legacy_common snd_intel_dspcfg snd_intel_sdw_acpi snd_pci_acp6x snd_hda_codec snd_pci_acp5x xfs mac80211 intel_rapl_msr snd_hda_core libarc4 snd_hwdep snd_rn_pci_acp3x intel_rapl_common snd_acp_config snd_pcm think_lmi snd_soc_acpi thinkpad_acpi edac_mce_amd pcspkr cfg80211 efi_pstore firmware_attributes_class wmi_bmof snd_timer snd_pci_acp3x k10temp i2c_piix4 ledtrig_audio thermal tiny_power_button platform_profile nxp_nci_i2c nxp_nci snd ac nci soundcore nfc rfkill i2c_scmi amd_pmc button joydev nvme_fabrics fuse loop dm_mod configfs nfnetlink dmi_sysfs ip_tables x_tables mmc_block amdgpu crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic gf128mul amdxcp i2c_algo_bit ghash_clmulni_intel drm_ttm_helper sha512_ssse3 ttm drm_exec gpu_sched sha256_ssse3 xhci_pci drm_suballoc_helper rtsx_pci_sdmmc sha1_ssse3 xhci_pci_renesas drm_buddy mmc_core nvme aesni_intel drm_display_helper xhci_hcd video crypto_simd cec ucsi_acpi cryptd nvme_core typec_ucsi usbcore roles
> Apr 18 01:27:20 kernel:  nvme_auth rc_core rtsx_pci sp5100_tco t10_pi typec battery wmi serio_raw btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq r8169 realtek mdio_devres libphy br_netfilter bridge stp llc msr kvm_amd ccp kvm irqbypass efivarfs
> Apr 18 01:27:20 kernel: CPU: 1 PID: 4064 Comm: systemd-sleep Not tainted 6.8.5-1-default #1 openSUSE Tumbleweed 633d4bc373b32029af03848d6d101f997828843f
> Apr 18 01:27:20 kernel: Hardware name: LENOVO 21A0CTO1WW/21A0CTO1WW, BIOS R1MET55W (1.25 ) 10/30/2023
> Apr 18 01:27:20 kernel: RIP: 0010:smp_call_function_many_cond+0x11f/0x4e0
> Apr 18 01:27:20 kernel: Code: 63 d0 e8 14 67 48 00 3b 05 2e 11 f1 01 73 25 48 63 d0 49 8b 37 48 03 34 d5 a0 5c 2c af 8b 56 08 83 e2 01 74 0a f3 90 8b 4e 08 <83> e1 01 75 f6 83 c0 01 eb c1 48 83 c4 40 5b 5d 41 5c 41 5d 41 5e
> Apr 18 01:27:20 kernel: RSP: 0018:ffffb992cfa2bc70 EFLAGS: 00000202
> Apr 18 01:27:20 kernel: RAX: 0000000000000008 RBX: 0000000000000001 RCX: 0000000000000011
> Apr 18 01:27:20 kernel: RDX: 0000000000000001 RSI: ffffa07e92041440 RDI: ffffa07880073848
> Apr 18 01:27:20 kernel: RBP: 0000000000000246 R08: ffffa07880073c50 R09: ffffa07880073c50
> Apr 18 01:27:20 kernel: R10: ffffffffb05ba598 R11: 0000000000000000 R12: ffffa07e91cbbbc0
> Apr 18 01:27:20 kernel: R13: 0000000000000001 R14: 0000000000000001 R15: ffffa07e91cbbbc0
> Apr 18 01:27:20 kernel: FS:  00007f7251605900(0000) GS:ffffa07e91c80000(0000) knlGS:0000000000000000
> Apr 18 01:27:20 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> Apr 18 01:27:20 kernel: CR2: 00007fafe6eccfe8 CR3: 0000000461cce000 CR4: 0000000000750ef0
> Apr 18 01:27:20 kernel: PKRU: 55555554
> Apr 18 01:27:20 kernel: Call Trace:
> Apr 18 01:27:20 kernel:  <IRQ>
> Apr 18 01:27:20 kernel:  ? watchdog_timer_fn+0x1e6/0x270
> Apr 18 01:27:20 kernel:  ? __pfx_watchdog_timer_fn+0x10/0x10
> Apr 18 01:27:20 kernel:  ? __hrtimer_run_queues+0x10f/0x2b0
> Apr 18 01:27:20 kernel:  ? hrtimer_interrupt+0xf8/0x230
> Apr 18 01:27:20 kernel:  ? __sysvec_apic_timer_interrupt+0x4d/0x140
> Apr 18 01:27:20 kernel:  ? sysvec_apic_timer_interrupt+0x6d/0x90
> Apr 18 01:27:20 kernel:  </IRQ>
> Apr 18 01:27:20 kernel:  <TASK>
> Apr 18 01:27:20 kernel:  ? asm_sysvec_apic_timer_interrupt+0x1a/0x20
> Apr 18 01:27:20 kernel:  ? smp_call_function_many_cond+0x11f/0x4e0
> Apr 18 01:27:20 kernel:  ? __pfx_do_sync_core+0x10/0x10
> Apr 18 01:27:20 kernel:  on_each_cpu_cond_mask+0x24/0x40
> Apr 18 01:27:20 kernel:  text_poke_bp_batch+0xbe/0x2f0
> Apr 18 01:27:20 kernel:  ? arch_jump_label_transform_queue+0x57/0x80
> Apr 18 01:27:20 kernel:  text_poke_finish+0x1f/0x30
> Apr 18 01:27:20 kernel:  arch_jump_label_transform_apply+0x1a/0x30
> Apr 18 01:27:20 kernel:  jump_label_update+0x9c/0x100
> Apr 18 01:27:20 kernel:  static_key_slow_inc_cpuslocked+0x51/0x90
> Apr 18 01:27:20 kernel:  static_key_slow_inc+0x1a/0x30
> Apr 18 01:27:20 kernel:  freeze_processes+0xae/0xc0
> Apr 18 01:27:20 kernel:  pm_suspend+0x1bd/0x590
> Apr 18 01:27:20 kernel:  state_store+0x6c/0xd0
> Apr 18 01:27:20 kernel:  kernfs_fop_write_iter+0x133/0x1d0
> Apr 18 01:27:20 kernel:  vfs_write+0x261/0x430
> Apr 18 01:27:20 kernel:  ksys_write+0x67/0xe0
> Apr 18 01:27:20 kernel:  do_syscall_64+0x83/0x170
> Apr 18 01:27:20 kernel:  ? exc_page_fault+0x71/0x160
> Apr 18 01:27:20 kernel:  entry_SYSCALL_64_after_hwframe+0x78/0x80
> Apr 18 01:27:20 kernel: RIP: 0033:0x7f72515069c4
> Apr 18 01:27:20 kernel: Code: 84 00 00 00 00 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 90 90 80 3d e5 6f 0f 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
> Apr 18 01:27:20 kernel: RSP: 002b:00007ffce66ccec8 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
> Apr 18 01:27:20 kernel: RAX: ffffffffffffffda RBX: 0000000000000004 RCX: 00007f72515069c4
> Apr 18 01:27:20 kernel: RDX: 0000000000000004 RSI: 000055ebc0f530c0 RDI: 0000000000000005
> Apr 18 01:27:20 kernel: RBP: 000055ebc0f530c0 R08: 0000000000000000 R09: 0000000000000001
> Apr 18 01:27:20 kernel: R10: 0000000000000003 R11: 0000000000000202 R12: 0000000000000004
> Apr 18 01:27:20 kernel: R13: 000055ebc0f4f2d0 R14: 00007f72515edee0 R15: 00000000fffffff7
> Apr 18 01:27:20 kernel:  </TASK>
> Apr 18 01:27:21 gsd-power[3460]: Failed to acquire idle monitor proxy: Timeout was reached
> Apr 18 01:27:21 gsd-power[3460]: Failed to acquire idle monitor proxy: Timeout was reached
> Apr 18 01:27:21 gsd-power[3460]: Failed to acquire idle monitor proxy: Timeout was reached
> Apr 18 01:27:21 gsd-power[3460]: Failed to acquire idle monitor proxy: Timeout was reached
> Apr 18 01:27:21 gsd-power[3460]: Error setting property 'PowerSaveMode' on interface org.gnome.Mutter.DisplayConfig: Timeout was reached (g-io-error-quark, 24)
> Apr 18 01:27:21 gnome-session[2977]: gnome-session-binary[2977]: GnomeDesktop-WARNING: Failed to acquire idle monitor proxy: Timeout was reached
> Apr 18 01:27:21 gnome-session-binary[2977]: GnomeDesktop-WARNING: Failed to acquire idle monitor proxy: Timeout was reached
> Apr 18 01:27:23 dbus-daemon[1409]: [system] Failed to activate service 'org.freedesktop.PackageKit': timed out (service_start_timeout=25000ms)
> Apr 18 01:27:26 kernel: lockdown_is_locked_down: 6 callbacks suppressed
> Apr 18 01:27:26 kernel: Lockdown: systemd-logind: hibernation is restricted; see man kernel_lockdown.7
> Apr 18 01:27:48 kernel: watchdog: BUG: soft lockup - CPU#1 stuck for 52s! [systemd-sleep:4064]

and on and on until 03:44:40 when apparently the battery dies and the logging stops.

openSUSE info
=============

* NAME="openSUSE Tumbleweed"
* VERSION="20240412"

Kernel info (uname -a)
======================

* Linux ABThinkPadP14s2 6.8.5-1-default #1 SMP PREEMPT_DYNAMIC Thu Apr 11 04:31:19 UTC 2024 (542f698) x86_64 x86_64 x86_64 GNU/Linux


Full journal log attached. Happy to help with any other info as needed.
Comment 1 Atri Bhattacharya 2024-04-18 07:41:01 UTC
Some supplementary info:

AMD Ryzen 5 PRO 5650U with Radeon Graphics
ThinkPad P14s Gen2
Comment 2 Takashi Iwai 2024-05-23 14:19:25 UTC
TW is moving to 6.9.x already.  Could you check the latest kernel in OBS Kernel:stable repo and see whether the problem persists?