bce(4) - com_no_buffers (Again)

Pyun YongHyeon pyunyh at gmail.com
Thu Sep 23 18:40:47 UTC 2010


On Thu, Sep 23, 2010 at 10:05:33AM -0500, Tom Judge wrote:
> On 09/13/2010 03:53 PM, Pyun YongHyeon wrote:
> > On Mon, Sep 13, 2010 at 03:38:41PM -0500, Tom Judge wrote:
> >   
> >> On 09/13/2010 02:33 PM, Pyun YongHyeon wrote:
> >>     
> >>> On Mon, Sep 13, 2010 at 02:07:58PM -0500, Tom Judge wrote:
> >>>   
> >>>       
> >>>>>> Without BCE_JUMBO_HDRSPLIT then we see no errors.  With it we see number
> >>>>>> of errors, however the rate seems to be reduced compaired to the
> >>>>>> previous version of the driver.        
> >>>>>>             
> >>>>> It seems there are issues in header splitting and it was disabled
> >>>>> by default. Header splitting reduces packet processing overhead in
> >>>>> upper layer so it's normal to see better performance with header
> >>>>> splitting.
> >>>>>           
> >>>> The reason that we have had header splitting enabled in the past is that
> >>>> historically there have been issues with memory fragmentation when using
> >>>> 8k jumbo frames (resulting in 9k mbuf's).
> >>>>         
> >>> Yes, if you use jumbo frames, header splitting would help to reduce
> >>> memory fragmentation as header splitting wouldn't allocate jumbo
> >>> clusters.
> >>>
> >>>       
> >> Under testing I have yet to see a memory fragmentation issue with this
> >> driver.  I follow up if/when I find a problem with this again.
> >>
> >>     
> So here we are again.  The system is locking up again because of 9k mbuf
> allocation failures.
> 
> tj at pidge '14:12:25' '~'
> > $ netstat -m
> 514/4781/5295 mbufs in use (current/cache/total)
> 0/2708/2708/25600 mbuf clusters in use (current/cache/total/max)
> 0/1750 mbuf+clusters out of packet secondary zone in use (current/cache)
> 0/2904/2904/12800 4k (page size) jumbo clusters in use
> (current/cache/total/max)
> 513/3274/3787/6400 9k jumbo clusters in use (current/cache/total/max)

Number of 9k clusters didn't reach to the limit.

> 0/0/0/3200 16k jumbo clusters in use (current/cache/total/max)
> 4745K/47693K/52438K bytes allocated to network (current/cache/total)
> 0/0/0 requests for mbufs denied (mbufs/clusters/mbuf+clusters)
> 0/2692655/0 requests for jumbo clusters denied (4k/9k/16k)

I see large denied value for 9k jumbo clusters but it could be
normal under hight network load. But it should not lock up the
controller. Note, under these conditions(cluster allocation
failure) driver would drop incoming frames which in turn will does
not pass received frames to upper stack. The end result could be
shown as locked up as upper stack does not receive frames. I think
you can check MAC statistics whether the driver is still running or
not.

> 0/0/0 sfbufs in use (current/peak/max)
> 0 requests for sfbufs denied
> 0 requests for sfbufs delayed
> 0 requests for I/O initiated by sendfile
> 0 calls to protocol drain routines
> 
> 
> >>>> I have a kernel with the following configuration in testing right now:
> >>>>
> >>>> * Flow control enabled.
> >>>> * Jumbo header splitting turned off.
> >>>>
> >>>>
> >>>> Is there any way that we can fix flow control with jumbo header
> >>>> splitting turned on?
> >>>>
> >>>>     
> >>>>         
> >>> Flow control has nothing to do with header splitting(i.e. flow
> >>> control is always enabled). 
> >>>
> >>>   
> >>>       
> >> Sorry let me rephrase that:
> >>
> >> Is there a way to fix the RX buffer shortage issues (when header
> >> splitting is turned on) so that they are guarded by flow control.  Maybe
> >> change the low watermark for flow control when its enabled?
> >>
> >>     
> > I'm not sure how much it would help but try changing RX low
> > watermark. Default value is 32 which seems to be reasonable value.
> > But it's only for 5709/5716 controllers and Linux seems to use
> > different default value.
> >   
> These are: NetXtreme II BCM5709 Gigabit Ethernet
> 
> So my next task is to turn the watermark related defines into sysctls
> and turn on header splitting so that I can try to tune them without
> having to reboot.
> 
> 
> 
> My next question is, is it possible to increase the size of the RX ring
> without switching to RSS?
> 

Yes but I doubt it would help in this case as you seem to suffer
from 9K jumbo frame allocation failure.


More information about the freebsd-net mailing list