Bug 1227720

Summary: infiniband bnxt_re0: Couldn't start port
Product: [openSUSE] openSUSE Distribution Reporter: Georg Pfuetzenreuter <georg.pfuetzenreuter>
Component: KernelAssignee: openSUSE Kernel Bugs <kernel-bugs>
Status: NEW --- QA Contact: E-mail List <qa-bugs>
Severity: Normal    
Priority: P5 - None CC: michael, suse-beta, tbogendoerfer, tiwai
Version: Leap 15.6   
Target Milestone: ---   
Hardware: x86-64   
OS: Other   
Whiteboard:
Found By: --- Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---
Attachments: journal falkor21.i.o.o

Description Georg Pfuetzenreuter 2024-07-12 17:19:19 UTC
Created attachment 876033 [details]
journal falkor21.i.o.o

Hi,

the following is sometimes observed during boot after upgrading to Leap 15.6:

```
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.0: QPLIB: bnxt_re_is_fw_stalled: FW STALL Detected. cmdq[0xe]=0x3 waited (71980 > 40000) msec active 1 
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.0 bnxt_re0: Failed to modify HW QP
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re0: Couldn't change QP1 state to INIT: -110
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re0: Couldn't start port
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.0 bnxt_re0: Failed to destroy HW QP
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: ------------[ cut here ]------------
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: WARNING: CPU: 127 PID: 7690 at ../drivers/infiniband/core/cq.c:322 ib_free_cq+0xf6/0x130 [ib_core]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: Modules linked in: rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace sunrpc fscache netfs af_packet 8021q garp mrp stp llc intel_rapl_msr amd_atl intel_rapl_common amd64_edac edac_mce_amd kvm_amd kvm irqbypass pcspkr acpi_cpufreq wmi_bmof nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_tables ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 bonding iptable_mangle dm_service_time tls iptable_raw iptable_security iscsi_ibft iscsi_boot_sysfs rfkill ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter bpfilter ipmi_ssif irdma ice gnss bnxt_re(+) ib_uverbs ib_core i40e acpi_ipmi ast bnxt_en i2c_piix4 k10temp i2c_algo_bit ptdma ipmi_si ipmi_devintf ipmi_msghandler joydev button dm_multipath dm_mod nls_iso8859_1 nls_cp437 vfat fat xfs fuse dmi_sysfs ip_tables x_tables hid_generic usbhid raid1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  md_mod lpfc nvmet_fc crc32_pclmul ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 aesni_intel crypto_simd cryptd nvmet nvme_keyring configfs nvme_fc ahci nvme_fabrics libahci nvme_core libata nvme_auth scsi_transport_fc sd_mod scsi_dh_emc xhci_pci scsi_dh_rdac xhci_pci_renesas scsi_dh_alua t10_pi xhci_hcd crc64_rocksoft_generic crc64_rocksoft sg usbcore scsi_mod ccp crc64 wmi btrfs blake2b_generic libcrc32c crc32c_intel xor raid6_pq softdog efivarfs
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: Supported: Yes
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: CPU: 127 PID: 7690 Comm: (udev-worker) Not tainted 6.4.0-150600.21-default #1 SLE15-SP6 164789ca04a9536e8c7d37bb2235c09c255f6d93
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: Hardware name: Happyware Super Server/H12DSi-NT6, BIOS 2.4 04/22/2022
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RIP: 0010:ib_free_cq+0xf6/0x130 [ib_core]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: Code: 89 de e8 8d 55 02 00 65 ff 0d 0e 44 c2 3e 75 80 e8 3f 2b bf eb e9 76 ff ff ff 83 f8 03 0f 84 4a ff ff ff 0f 0b e9 43 ff ff ff <0f> 0b 5b e9 62 9e 85 ec 0f 0b 5b e9 5a 9e 85 ec 80 3d 29 14 03 00
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RSP: 0018:ffffbd546da27888 EFLAGS: 00010202
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RAX: 0000000000000002 RBX: 0000000000000001 RCX: 0000000000000000
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RDX: 0000000000000000 RSI: ffff9b01fd9a3500 RDI: ffff9a848abe5800
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RBP: ffff9a8607600000 R08: 0000000000000000 R09: c0000000fff7ffff
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: R10: ffffbd546da27808 R11: ffffbd546da27570 R12: 00000000ffffff92
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: R13: ffff9a841e3958f8 R14: ffff9a841e395870 R15: ffff9a841e395000
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: FS:  00007f22bc8203c0(0000) GS:ffff9b01fd980000(0000) knlGS:0000000000000000
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: CR2: 00007fac4e708340 CR3: 0000010091de4005 CR4: 0000000000f70ee0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: PKRU: 55555554
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: Call Trace:
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  <TASK>
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __warn+0x7d/0x140
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? ib_free_cq+0xf6/0x130 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? report_bug+0xfb/0x1e0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? handle_bug+0x44/0x80
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? exc_invalid_op+0x13/0x60
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? asm_exc_invalid_op+0x16/0x20
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? ib_free_cq+0xf6/0x130 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ib_mad_init_device+0x282/0x830 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  add_client_context.part.27+0xe1/0x1d0 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  enable_device_and_get+0xcb/0x1e0 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ib_register_device+0x541/0x690 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? up+0x12/0x60
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __pfx_bnxt_re_srqn_handler+0x10/0x10 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? alloc_port_data.part.24+0x3f/0x110 [ib_core 9f7b873871c10d599cbeb9db2bacc00b07d4c2c3]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? srso_alias_return_thunk+0x5/0x7f
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __kmalloc+0x4d/0x130
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __pfx_bnxt_re_srqn_handler+0x10/0x10 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? bnxt_re_probe+0xc5a/0x1100 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? srso_alias_return_thunk+0x5/0x7f
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  bnxt_re_probe+0xc5a/0x1100 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __pfx_bnxt_re_probe+0x10/0x10 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  auxiliary_bus_probe+0x40/0x90
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? driver_sysfs_add+0x5b/0xc0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  really_probe+0x109/0x3c0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? pm_runtime_barrier+0x4f/0xa0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  __driver_probe_device+0x79/0x150
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  driver_probe_device+0x1f/0xa0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  __driver_attach+0x101/0x160
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __pfx___driver_attach+0x10/0x10
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  bus_for_each_dev+0x7a/0xd0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  bus_add_driver+0x199/0x230
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  driver_register+0x62/0x100
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  __auxiliary_driver_register+0x66/0xc0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __pfx_bnxt_re_mod_init+0x10/0x10 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  bnxt_re_mod_init+0x3a/0xff0 [bnxt_re 06ff08d1119d4b5be04b34c9e1f649eae038b452]
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  do_one_initcall+0x44/0x220
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? srso_alias_return_thunk+0x5/0x7f
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? kmalloc_trace+0x26/0x90
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  do_init_module+0x60/0x230
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  load_module+0x1eac/0x1fa0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __x86_indirect_jump_thunk_r11+0x1/0x20
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __do_sys_init_module+0xdf/0x1c0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? __do_sys_init_module+0x185/0x1c0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  __do_sys_init_module+0x185/0x1c0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  do_syscall_64+0x5b/0x80
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? syscall_exit_to_user_mode+0x1e/0x40
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? srso_alias_return_thunk+0x5/0x7f
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? do_syscall_64+0x67/0x80
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  ? exc_page_fault+0x69/0x150
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  entry_SYSCALL_64_after_hwframe+0x77/0xe1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RIP: 0033:0x7f22bc72fefa
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: Code: c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d f6 ee 0c 00 f7 d8 64 89 01 48
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RSP: 002b:00007fff2e24b3f8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RAX: ffffffffffffffda RBX: 00005650f7e6cad0 RCX: 00007f22bc72fefa
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RDX: 00007f22bcecbf93 RSI: 0000000000067a70 RDI: 00005650f80aa100
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: RBP: 00007f22bcecbf93 R08: 00005650f7ed8bfe R09: 00007f22bcecbfc8
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 00005650f80aa100
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: R13: 00005650f7f37be0 R14: 000000000000000c R15: 0000000000000007
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel:  </TASK>
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: ---[ end trace 0000000000000000 ]---
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.0 bnxt_re0: Free MW failed: 0xffffff92
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re0: Couldn't open port 1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re0: Device registered with IB successfully
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x6]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x7]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=0
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x8]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x9]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0xa]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid 2a07:de40:b27e:1201:0000:0000:0000:00f1 error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0xb]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid 2a07:de40:b27e:1201:0000:0000:0000:00f1 error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0xc]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0xd]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0xe]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fd4b:5292:d67e:1002:0000:0000:0000:00f1 error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0xf]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fd4b:5292:d67e:1002:0000:0000:0000:00f1 error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x10]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x11]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x12]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x13]=0x11 status 0x1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to add GID: 0xfffffff2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: add_roce_gid GID add failed port=1 index=2
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: __ib_cache_gid_add: unable to add gid fe80:0000:0000:0000:3eec:efff:fecb:981e error=-14
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1: QPLIB: cmdq[0x1a]=0x15 status 0x5
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to allocate HW AH for Shadow QP
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: bnxt_en 0000:01:00.1 bnxt_re1: Failed to create AH entry for ShadowQP
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: Couldn't create ib_mad QP1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: Couldn't open port 1
Jul 12 17:09:01 falkor21.infra.opensuse.org kernel: infiniband bnxt_re1: Device registered with IB successfully
```

Any ideas?

Full journal output (includes kernel log) is attached.

Best,
Georg
Comment 1 Georg Pfuetzenreuter 2024-07-12 20:01:58 UTC
Forgot the obligatory:

```
# rpm -qa kernel*
kernel-default-6.4.0-150600.21.2.x86_64
kernel-default-5.14.21-150500.55.65.1.x86_64
kernel-firmware-platform-20240201-150600.1.2.noarch
```
Comment 2 Michael Heimann 2024-07-16 23:37:56 UTC
I've had the same exact error message on my Supermicro H13SSL-NT with an onboard BCM957416.

Please also see this thread https://forum.proxmox.com/threads/broadcom-nics-down-after-pve-8-2-kernel-6-8.146185/page-2 which helped me find the solution: upgrade the firmware to 229 or newer.

A workaround is to blacklist bnxt_re.

I *think* that the newer firmware just disables infiniband per default and thus bnxt_re isn't loaded anymore but that is fine for me since I don't need it and this way I didn't need a manual blacklist in my systems.