Bug 1215287 - Kernel BUG: soft lockup on btrfs-cleaner if in connection with ssh
Status: RESOLVED DUPLICATE of bug 1214988
Alias: None
Product: openSUSE Distribution
Classification: openSUSE
Component: Kernel
Version: Leap 15.5
Hardware: x86-64 openSUSE Leap 15.5
Priority: P5 - None, Severity: Major
Target Milestone: ---
Assignee: openSUSE Kernel Bugs
QA Contact: E-mail List
Reported: 2023-09-13 08:22 UTC by Diego Ercolani
Modified: 2023-09-17 14:05 UTC



Description Diego Ercolani 2023-09-13 08:22:48 UTC
This bug is probably related to my previous bug 1214988.
It seems that when someone connects to my desktop via ssh, the kernel log starts to show soft lockup messages:

Message from syslogd@pc-diego at Sep 13 09:57:01 ...
 kernel:[35680.372663][    C0] watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [btrfs-cleaner:2874]

Message from syslogd@pc-diego at Sep 13 09:57:33 ...
 kernel:[35712.388800][    C7] watchdog: BUG: soft lockup - CPU#7 stuck for 26s! [btrfs-cleaner:2874]

Message from syslogd@pc-diego at Sep 13 09:58:05 ...
 kernel:[35744.404940][   C13] watchdog: BUG: soft lockup - CPU#13 stuck for 26s! [btrfs-cleaner:2874]

Message from syslogd@pc-diego at Sep 13 09:58:41 ...
 kernel:[35780.405099][   C13] watchdog: BUG: soft lockup - CPU#13 stuck for 26s! [btrfs-cleaner:2874]

Message from syslogd@pc-diego at Sep 13 09:59:21 ...
 kernel:[35820.381269][    C4] watchdog: BUG: soft lockup - CPU#4 stuck for 24s! [btrfs-cleaner:2874]

[35820.381269] watchdog: BUG: soft lockup - CPU#4 stuck for 24s! [btrfs-cleaner:2874]
[35820.381285] Modules linked in: rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache netfs rfcomm tcp_diag inet_diag af_packet 8021q garp mrp bridge vboxnetadp(OEN) stp vboxnetflt(OEN) llc iscsi_ibft iscsi_boot_sysfs qrtr(N) ns(N) vboxdrv(OEN) cmac algif_hash algif_skcipher af_alg bnep snd_seq snd_pcm_oss snd_mixer_oss dmi_sysfs binfmt_misc snd_usb_audio uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_usbmidi_lib videobuf2_common snd_rawmidi snd_seq_device btusb btrtl btbcm btintel btmtk bluetooth xfs ecdh_generic ext4 crc16 mbcache jbd2 nls_iso8859_1 nls_cp437 nvidia_drm(POEX) vfat fat nvidia_modeset(POEX) bcache nvidia_uvm(POEX) nvidia(POEX) snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio intel_rapl_msr intel_rapl_common snd_hda_intel snd_intel_dspcfg edac_mce_amd snd_intel_sdw_acpi eeepc_wmi(N) snd_hda_codec asus_wmi battery sparse_keymap snd_hda_core snd_hwdep kvm_amd platform_profile snd_pcm kvm r8169 rfkill realtek snd_timer
[35820.381375]  irqbypass mdio_devres video pcspkr wmi_bmof efi_pstore(N) k10temp i2c_piix4 drm_kms_helper syscopyarea libphy snd sysfillrect sysimgblt fb_sys_fops soundcore gpio_amdpt gpio_generic acpi_cpufreq zram nfsd nfs_acl lockd auth_rpcgss grace drm fuse sunrpc configfs ip_tables x_tables dm_cache_smq dm_cache dm_persistent_data dm_bio_prison dm_bufio raid1 md_mod hid_generic uas sd_mod usbhid t10_pi usb_storage crc32_pclmul crc64_rocksoft_generic sr_mod cdrom crc64_rocksoft crc64 ghash_clmulni_intel xhci_pci xhci_pci_renesas aesni_intel xhci_hcd ahci libahci crypto_simd cryptd ccp libata usbcore sp5100_tco(N) wmi button btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq dm_mirror dm_region_hash dm_log v4l2loopback(OEN) videodev mc sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod msr efivarfs
[35820.381530] Supported: No, Proprietary and Unsupported modules are loaded
[35820.381536] CPU: 4 PID: 2874 Comm: btrfs-cleaner Tainted: P           OEL X  N 5.14.21-150500.55.19-default #1 SLE15-SP5 a29285bac85e8baa0757c512c91f36229b4faaae
[35820.381548] Hardware name: System manufacturer System Product Name/PRIME B450M-A, BIOS 3002 03/10/2021
[35820.381555] RIP: 0010:alloc_extent_map+0xc/0x60 [btrfs]
[35820.381590] Code: 47 08 00 00 00 00 48 c7 47 20 00 00 00 00 48 89 47 10 48 89 47 18 e9 23 43 eb f6 0f 1f 00 0f 1f 44 00 00 48 8b 3d 3c eb 10 00 <be> 40 0d 00 00 e8 6a eb 75 f6 48 85 c0 74 36 48 8d 90 80 00 00 00
[35820.381602] RSP: 0018:ffffa3bac8cdbc08 EFLAGS: 00000246
[35820.381608] RAX: 0000000000000200 RBX: ffff98984d46aac0 RCX: 0000000000000000
[35820.381614] RDX: ffffffffffffffff RSI: ffffffffffffffff RDI: ffff9898407f8500
[35820.381620] RBP: 0000004f85884000 R08: 0000000000001000 R09: ffffa3bac8cdbc30
[35820.381625] R10: ffff989a65d17cf0 R11: ffff989aae9d1578 R12: 00000000000009f6
[35820.381631] R13: ffff98984d46ab00 R14: ffff98984d46ab08 R15: ffff98984d46aae0
[35820.381636] FS:  0000000000000000(0000) GS:ffff98a2feb00000(0000) knlGS:0000000000000000
[35820.381643] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[35820.381648] CR2: 00007fc7ca9f0000 CR3: 0000000ae9610000 CR4: 00000000003506e0
[35820.381654] Call Trace:
[35820.381659]  <TASK>
[35820.381663]  btrfs_get_extent+0xc9/0x870 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381698]  ? __x86_return_thunk+0x5/0x6
[35820.381705]  ? lock_extent_bits+0x4a/0xa0 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381739]  defrag_lookup_extent+0xcf/0x120 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381774]  btrfs_defrag_file+0x90a/0x1230 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381810]  ? __x86_return_thunk+0x5/0x6
[35820.381817]  ? __x86_return_thunk+0x5/0x6
[35820.381822]  ? btrfs_iget_path+0x67/0x700 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381851]  ? btrfs_get_root_ref+0x18d/0x310 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381878]  ? btrfs_run_defrag_inodes+0x27c/0x360 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381908]  ? __x86_return_thunk+0x5/0x6
[35820.381914]  btrfs_run_defrag_inodes+0x221/0x360 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381946]  cleaner_kthread+0xec/0x130 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.381974]  ? csum_one_extent_buffer+0x110/0x110 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b]
[35820.382001]  kthread+0x156/0x180
[35820.382007]  ? set_kthread_struct+0x50/0x50
[35820.382013]  ret_from_fork+0x22/0x30
[35820.382023]  </TASK>


It's very annoying. I'm trying to understand the correlation, and I'm starting to think that the problem may be related to the chipset or the CPU, but I don't know how to investigate it further.
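
One way to check the suspected correlation with ssh sessions might be to extract the soft lockup events from the kernel log and compare their timestamps with login times. The following is only a minimal Python sketch under stated assumptions: the function name `parse_lockups` is hypothetical, and the `SAMPLE` lines are copied from the log above. It handles both the syslogd form with the `[   C0]` CPU column and the plain dmesg form without it.

```python
import re

# Matches kernel watchdog lines such as:
#   [35680.372663][    C0] watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [btrfs-cleaner:2874]
# The "[    C0]" column is optional because plain dmesg output may omit it.
LOCKUP_RE = re.compile(
    r"\[\s*(?P<uptime>\d+\.\d+)\]"          # seconds since boot
    r"(?:\[\s*C\d+\])?\s*"                   # optional CPU column
    r"watchdog: BUG: soft lockup - CPU#(?P<cpu>\d+) "
    r"stuck for (?P<secs>\d+)s! "
    r"\[(?P<task>.+?):(?P<pid>\d+)\]"        # task name and PID
)


def parse_lockups(lines):
    """Return one dict per soft lockup message found in the given log lines."""
    events = []
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            events.append({
                "uptime": float(m.group("uptime")),
                "cpu": int(m.group("cpu")),
                "stuck_s": int(m.group("secs")),
                "task": m.group("task"),
                "pid": int(m.group("pid")),
            })
    return events


# Sample lines taken from this report (illustrative input only).
SAMPLE = [
    " kernel:[35680.372663][    C0] watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [btrfs-cleaner:2874]",
    " kernel:[35712.388800][    C7] watchdog: BUG: soft lockup - CPU#7 stuck for 26s! [btrfs-cleaner:2874]",
    "[35820.381269] watchdog: BUG: soft lockup - CPU#4 stuck for 24s! [btrfs-cleaner:2874]",
]

for event in parse_lockups(SAMPLE):
    print(event)
```

The resulting uptime values could then be compared by hand with the times at which ssh sessions were opened; the sketch itself does not read sshd logs.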
Comment 1 Takashi Iwai 2023-09-17 14:05:06 UTC
This must be a side effect of bug 1214988. The btrfs cleaner task stalled, leading to a lockup like this. Let's close this as a duplicate and concentrate there.

*** This bug has been marked as a duplicate of bug 1214988 ***