[Bug 281714] mlx5 Kernel panic after implementation of mlx5en support for ipsec offload on vlan if

From: <bugzilla-noreply_at_freebsd.org>
Date: Wed, 25 Sep 2024 15:11:25 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=281714

            Bug ID: 281714
           Summary: mlx5 Kernel panic after implementation of mlx5en
                    support for ipsec offload on vlan if
           Product: Base System
           Version: 15.0-CURRENT
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: freebsd@mhka.no

Created attachment 253814
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=253814&action=edit
full backtrace

This kernel panic happens when initializing virtual functions (VFs) on mlx5
hardware, as of commit `e23731db48ef9c6568d4768b1f87d48514339faa` (22nd July)
and later. The kernel panic occurs every time iovctl is run to initialize
virtual functions on the ConnectX-6 Lx.

The issue does not appear prior to this commit, but does appear in this commit
and in future commits.

I’ve attached backtrace (bt.txt), as well as output from my kgdb digging
(gdb.txt).

It seems like there’s something funky with the lock in
sys/dev/mlx5/mlx5_core/mlx5_fs_core.c:1056.

Info about my system:
AMD64, sr-iov and ARI enabled, 10bit TAG enabled (issue also happens without),
ConnectX-6 Lx dualport NIC. I’ve also attached relevant pciconf output for the
ConnectX card. (mlx5_core0_pciconf.txt)

Reproduction steps:
Configure the ConnectX-6 Lx with sr-iov enabled and a handful of supported VF’s
Use a minimal iovctl config file (attached: mlx5_core0.minimal.conf)
(kernel panic)

-- 
You are receiving this mail because:
You are the assignee for the bug.