ix(intel) vs mlxen(mellanox) 10Gb performance
Yonghyeon PYUN
pyunyh at gmail.com
Wed Aug 19 07:30:23 UTC 2015
On Tue, Aug 18, 2015 at 06:04:25PM -0400, Rick Macklem wrote:
> Hans Petter Selasky wrote:
> > On 08/18/15 14:53, Rick Macklem wrote:
> > > If this is just a test machine, maybe you could test with these lines (at
> > > about #880)
> > > in sys/netinet/tcp_output.c commented out? (It looks to me like this will
> > > disable TSO
> > > for almost all the NFS writes.)
> > > - around line #880 in sys/netinet/tcp_output.c:
> > > /*
> > > * In case there are too many small fragments
> > > * don't use TSO:
> > > */
> > > if (len <= max_len) {
> > > len = max_len;
> > > sendalot = 1;
> > > tso = 0;
> > > }
> > >
> > > This was added along with the other stuff that did the
> > > if_hw_tsomaxsegcount, etc and I
> > > never noticed it until now (not my patch).
> >
> > FYI:
> >
> > These lines are needed by other hardware, like the mlxen driver. If you
> > remove them mlxen will start doing m_defrag(). I believe if you set the
> > correct parameters in the "struct ifnet" for the TSO size/count limits
> > this problem will go away. If you print the "len" and "max_len" and also
> > the cases where TSO limits are reached, you'll see what parameter is
> > triggering it and needs to be increased.
> >
> Well, if the driver isn't setting if_hw_tsomaxsegcount correctly, then it
> is the driver that needs to be fixed.
> Having the above code block disable TSO for all of the NFS writes, including
> the ones that set if_hw_tsomaxsegcount correctly doesn't make sense to me.
> If the driver authors don't set these, the drivers do lots of m_defrag()
> calls. I have posted more than once to freebsd-net@ asking the driver authors
> to set these and some now have. (I can't do it, because I don't have the
> hardware to test it with.)
>
Thanks for reminder. I have generated a diff against HEAD.
https://people.freebsd.org/~yongari/tso.param.diff
The diff restores optimal TSO parameters which were lost in r271946
for drivers that relied on sane default values. I'll commit it
after some testing.
> I do think that most/all of them don't subtract 1 for the tcp/ip header and
> I don't think they should be expected to, since the driver isn't supposed to
> worry about the protocol at that level.
I agree.
> --> I think tcp_output() should subtract one from the if_hw_tsomaxsegcount
> provided by the driver to handle this, since it chooses to count mbufs
> (the while() loop at around line #825 in sys/netinet/tcp_output.c.)
> before it prepends the tcp/ip header mbuf.
>
> rick
>
> > --HPS
More information about the freebsd-stable
mailing list