[Bug 272924] cxgbei drops connections during write, with 16+ sessions

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 03 Aug 2023 21:24:46 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=272924

            Bug ID: 272924
           Summary: cxgbei drops connections during write, with 16+
                    sessions
           Product: Base System
           Version: CURRENT
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: asomers@FreeBSD.org

On both FreeBSD 13.2 and 14.0 I can use cxgbei offload for my iSCSI sessions. 
There is no trouble with reads.  Writes work too, with up to 9 iSCSI sessions. 
But somewhere around 14 simultaneous sessions, overall bandwidth becomes
terrible (10s to 100s of MB/s) and the server's log is spammed with messages
like this:

WARNING: 172.32.10.78 (iqn.1994-09.org.freebsd:MYINITATOR.MYDOMAIN.net): no
ping reply (NOP-Out) after 5 seconds; dropping connection

and this:

2023-08-03T20:32:53.730407+00:00 MYSERVER.MYDOMAIN.net ctld[6453] 172.32.10.79
(iqn.1994-09.org.freebsd:MYINITIATOR.MYDOMAIN.net): error returned from CTL
iSCSI limits request: cfiscsi_ioctl_limits: icl_limits failed with error 6;
dropping connection


Meanwhile, the initiator's log is spammed with messages like these:

2023-08-03T20:33:57.756484+00:00 MYINITIATOR kernel: WARNING: 172.33.10.58
(iqn.2018-10.net.MYDOMAIN.MYSERVER:zd17): login timed out after 61 seconds;
reconnecting
2023-08-03T20:33:57.756486+00:00 slc-rb19b-ss kernel: WARNING:
MYSERVER.MYDOMAIN.net (iqn.2018-10.net.MYDOMAIN.MYSERVER:zd17): login timed out
after 61 seconds; reconnecting

Is there some kind of undocumented limit that I'm running into?  FWIW the limit
doesn't seem to be related to the number of connected sessions, just the number
of active sessions with traffic.  When the failures start, I think I have about
50 total outstanding commands, and around 2-3 GBps of traffic.

-- 
You are receiving this mail because:
You are the assignee for the bug.