Bugzilla – Bug 1215287
Kernel BUG: soft lockup on btrfs-cleaner if in connection with ssh
Last modified: 2023-09-17 14:05:06 UTC
This bug is somehow related to my previous bug 1214988. It seem that when someone connect to my desktop via ssh kernel logs start to show kernel exceptions: Message from syslogd@pc-diego at Sep 13 09:57:01 ... kernel:[35680.372663][ C0] watchdog: BUG: soft lockup - CPU#0 stuck for 26s! [btrfs-cleaner:2874] Message from syslogd@pc-diego at Sep 13 09:57:33 ... kernel:[35712.388800][ C7] watchdog: BUG: soft lockup - CPU#7 stuck for 26s! [btrfs-cleaner:2874] Message from syslogd@pc-diego at Sep 13 09:58:05 ... kernel:[35744.404940][ C13] watchdog: BUG: soft lockup - CPU#13 stuck for 26s! [btrfs-cleaner:2874] Message from syslogd@pc-diego at Sep 13 09:58:41 ... kernel:[35780.405099][ C13] watchdog: BUG: soft lockup - CPU#13 stuck for 26s! [btrfs-cleaner:2874] Message from syslogd@pc-diego at Sep 13 09:59:21 ... kernel:[35820.381269][ C4] watchdog: BUG: soft lockup - CPU#4 stuck for 24s! [btrfs-cleaner:2874] [35820.381269] watchdog: BUG: soft lockup - CPU#4 stuck for 24s! [btrfs-cleaner:2874] [35820.381285] Modules linked in: rpcsec_gss_krb5 nfsv4 dns_resolver nfs fscache netfs rfcomm tcp_diag inet_diag af_packet 8021q garp mrp bridge vboxnetadp(OEN) stp vboxnetflt(OEN) llc iscsi_ibft iscsi_boot_sysfs qrtr(N) ns(N) vboxdrv(OEN) cmac algif_hash algif_skcipher af_alg bnep snd_seq snd_pcm_oss snd_mixer_oss dmi_sysfs binfmt_misc snd_usb_audio uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_usbmidi_lib videobuf2_common snd_rawmidi snd_seq_device btusb btrtl btbcm btintel btmtk bluetooth xfs ecdh_generic ext4 crc16 mbcache jbd2 nls_iso8859_1 nls_cp437 nvidia_drm(POEX) vfat fat nvidia_modeset(POEX) bcache nvidia_uvm(POEX) nvidia(POEX) snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio intel_rapl_msr intel_rapl_common snd_hda_intel snd_intel_dspcfg edac_mce_amd snd_intel_sdw_acpi eeepc_wmi(N) snd_hda_codec asus_wmi battery sparse_keymap snd_hda_core snd_hwdep kvm_amd platform_profile snd_pcm kvm r8169 rfkill realtek snd_timer [35820.381375] irqbypass mdio_devres video pcspkr wmi_bmof efi_pstore(N) k10temp i2c_piix4 drm_kms_helper syscopyarea libphy snd sysfillrect sysimgblt fb_sys_fops soundcore gpio_amdpt gpio_generic acpi_cpufreq zram nfsd nfs_acl lockd auth_rpcgss grace drm fuse sunrpc configfs ip_tables x_tables dm_cache_smq dm_cache dm_persistent_data dm_bio_prison dm_bufio raid1 md_mod hid_generic uas sd_mod usbhid t10_pi usb_storage crc32_pclmul crc64_rocksoft_generic sr_mod cdrom crc64_rocksoft crc64 ghash_clmulni_intel xhci_pci xhci_pci_renesas aesni_intel xhci_hcd ahci libahci crypto_simd cryptd ccp libata usbcore sp5100_tco(N) wmi button btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq dm_mirror dm_region_hash dm_log v4l2loopback(OEN) videodev mc sg dm_multipath dm_mod scsi_dh_rdac scsi_dh_emc scsi_dh_alua scsi_mod msr efivarfs [35820.381530] Supported: No, Proprietary and Unsupported modules are loaded [35820.381536] CPU: 4 PID: 2874 Comm: btrfs-cleaner Tainted: P OEL X N 5.14.21-150500.55.19-default #1 SLE15-SP5 a29285bac85e8baa0757c512c91f36229b4faaae [35820.381548] Hardware name: System manufacturer System Product Name/PRIME B450M-A, BIOS 3002 03/10/2021 [35820.381555] RIP: 0010:alloc_extent_map+0xc/0x60 [btrfs] [35820.381590] Code: 47 08 00 00 00 00 48 c7 47 20 00 00 00 00 48 89 47 10 48 89 47 18 e9 23 43 eb f6 0f 1f 00 0f 1f 44 00 00 48 8b 3d 3c eb 10 00 <be> 40 0d 00 00 e8 6a eb 75 f6 48 85 c0 74 36 48 8d 90 80 00 00 00 [35820.381602] RSP: 0018:ffffa3bac8cdbc08 EFLAGS: 00000246 [35820.381608] RAX: 0000000000000200 RBX: ffff98984d46aac0 RCX: 0000000000000000 [35820.381614] RDX: ffffffffffffffff RSI: ffffffffffffffff RDI: ffff9898407f8500 [35820.381620] RBP: 0000004f85884000 R08: 0000000000001000 R09: ffffa3bac8cdbc30 [35820.381625] R10: ffff989a65d17cf0 R11: ffff989aae9d1578 R12: 00000000000009f6 [35820.381631] R13: ffff98984d46ab00 R14: ffff98984d46ab08 R15: ffff98984d46aae0 [35820.381636] FS: 0000000000000000(0000) GS:ffff98a2feb00000(0000) knlGS:0000000000000000 [35820.381643] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [35820.381648] CR2: 00007fc7ca9f0000 CR3: 0000000ae9610000 CR4: 00000000003506e0 [35820.381654] Call Trace: [35820.381659] <TASK> [35820.381663] btrfs_get_extent+0xc9/0x870 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381698] ? __x86_return_thunk+0x5/0x6 [35820.381705] ? lock_extent_bits+0x4a/0xa0 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381739] defrag_lookup_extent+0xcf/0x120 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381774] btrfs_defrag_file+0x90a/0x1230 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381810] ? __x86_return_thunk+0x5/0x6 [35820.381817] ? __x86_return_thunk+0x5/0x6 [35820.381822] ? btrfs_iget_path+0x67/0x700 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381851] ? btrfs_get_root_ref+0x18d/0x310 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381878] ? btrfs_run_defrag_inodes+0x27c/0x360 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381908] ? __x86_return_thunk+0x5/0x6 [35820.381914] btrfs_run_defrag_inodes+0x221/0x360 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381946] cleaner_kthread+0xec/0x130 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.381974] ? csum_one_extent_buffer+0x110/0x110 [btrfs 0bea397a25b504ffddb94375cf1ffcaec85fa26b] [35820.382001] kthread+0x156/0x180 [35820.382007] ? set_kthread_struct+0x50/0x50 [35820.382013] ret_from_fork+0x22/0x30 [35820.382023] </TASK> It's very annoying, I'm trying to understand correlation, I start to think that the problem may be related to some problem in the chipset or th CPU... but I don't know how to understand
It must be the side effect of bug 1214988. The btrfs cleaner task stalled, leading to a lockup like this. Let's close as dup and concentrate there. *** This bug has been marked as a duplicate of bug 1214988 ***