9.2 ixgbe tx queue hang

Christopher Forgeron csforgeron at gmail.com
Fri Mar 21 15:01:09 UTC 2014


(Pardon me, for some reason Gmail sends the message during my cut-and-pastes
if I cr down too fast.)

First set of logs:

Mar 21 11:07:00 SAN0 kernel: before pklen=65542 actl=65542 csum=4116
Mar 21 11:07:00 SAN0 kernel: after mbcnt=33 pklen=65542 actl=65542
Mar 21 11:07:00 SAN0 kernel: before pklen=65542 actl=65542 csum=4116
Mar 21 11:07:00 SAN0 kernel: after mbcnt=33 pklen=65542 actl=65542
Mar 21 11:07:00 SAN0 kernel: before pklen=65542 actl=65542 csum=4116
Mar 21 11:07:00 SAN0 kernel: after mbcnt=33 pklen=65542 actl=65542
Mar 21 11:07:00 SAN0 kernel: before pklen=65542 actl=65542 csum=4116
Mar 21 11:07:00 SAN0 kernel: after mbcnt=33 pklen=65542 actl=65542
Mar 21 11:07:00 SAN0 kernel: before pklen=65542 actl=65542 csum=4116

Here are a few from later on:

Mar 21 11:10:09 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:10:09 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:10:09 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:10:09 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:10:09 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:10:09 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:10:09 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:10:09 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538

Mar 21 11:23:00 SAN0 kernel: after mbcnt=33 pklen=65546 actl=65546
Mar 21 11:23:01 SAN0 kernel: before pklen=65546 actl=65546 csum=4116
Mar 21 11:23:01 SAN0 kernel: after mbcnt=33 pklen=65546 actl=65546
Mar 21 11:23:03 SAN0 kernel: before pklen=65546 actl=65546 csum=4116
Mar 21 11:23:03 SAN0 kernel: after mbcnt=33 pklen=65546 actl=65546
Mar 21 11:23:04 SAN0 kernel: before pklen=65546 actl=65546 csum=4116
Mar 21 11:23:04 SAN0 kernel: after mbcnt=33 pklen=65546 actl=65546

Mar 21 11:41:25 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:41:25 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:41:25 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:41:25 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:41:25 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:41:25 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:41:25 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:41:25 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:41:26 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:41:26 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538
Mar 21 11:41:26 SAN0 kernel: before pklen=65538 actl=65538 csum=4116
Mar 21 11:41:26 SAN0 kernel: after mbcnt=33 pklen=65538 actl=65538

To be clear, I changed tp->t_tsomax to IP_MAXPACKET at around line 777 in
sys/netinet/tcp_output.c, like so:

                        if (len > IP_MAXPACKET - hdrlen) {
                                len = IP_MAXPACKET - hdrlen;
                                sendalot = 1;
                        }
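
For reference, the arithmetic behind that clamp and the logged values can be sketched in a few lines of standalone C. This is only an illustration, not kernel code: clamp_tso_len() and min_clusters() are hypothetical helper names, CLUSTER_BYTES assumes standard 2 KB mbuf clusters, and the cluster count assumes every cluster is completely full. Under those assumptions, each pklen in the logs (65538, 65542, 65546) is larger than 32 * 2048 = 65536 and so needs at least 33 clusters, which agrees with the mbcnt=33 reported.

```c
#include <assert.h>

#define IP_MAXPACKET  65535u  /* maximum IP packet size, as in <netinet/ip.h> */
#define CLUSTER_BYTES 2048u   /* assumption: standard 2 KB mbuf clusters */

/*
 * Hypothetical helper mirroring the clamp above: limit the payload so
 * that payload plus headers does not exceed IP_MAXPACKET, and flag that
 * more data remains to be sent.
 */
unsigned clamp_tso_len(unsigned len, unsigned hdrlen, int *sendalot)
{
    assert(hdrlen <= IP_MAXPACKET);
    if (len > IP_MAXPACKET - hdrlen) {
        len = IP_MAXPACKET - hdrlen;
        *sendalot = 1;
    }
    return len;
}

/*
 * Hypothetical helper: the minimum number of clusters needed to hold
 * pklen bytes, assuming each cluster is filled completely.
 */
unsigned min_clusters(unsigned pklen)
{
    return (pklen + CLUSTER_BYTES - 1) / CLUSTER_BYTES;
}
```

With an illustrative header length of 52 bytes, clamp_tso_len(70000, 52, &sendalot) returns 65483 and sets sendalot; the clamp only bounds payload plus those headers at 65535, so a logged pklen above 65535 presumably counts additional bytes beyond what hdrlen covers here.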

I notice there is more that is different between 9.1 and 10 for this file:
http://fxr.watson.org/fxr/diff/netinet/tcp_output.c?v=FREEBSD10;diffval=FREEBSD91;diffvar=v

I'm going to attempt inserting a 9.1 tcp_output.c and see if that makes any
difference.

Otherwise, I await further ideas from the list.

Thanks.

