ARP problem with 6.2-STABLE Intel PRO/1000 NIC, latest em driver
Jack Vogel
jfvogel at gmail.com
Mon Mar 5 16:41:04 UTC 2007
On 3/4/07, Duane Whitty <duane at dwlabs.ca> wrote:
> On Sun, Mar 04, 2007 at 11:30:07PM -0700, Mark Costlow wrote:
> > The Machine:
> >
> > I have a dual Xeon 5130 machine, Supermicro motherboard, with
> > the 82563EB NIC. From dmesg:
> >
> > CPU: Intel(R) Xeon(R) CPU 5130 @ 2.00GHz (2000.08-MHz 686-class CPU)
> > cpu0: <ACPI CPU> on acpi0
> > em0: <Intel(R) PRO/1000 Network Connection Version - 6.2.9> port 0x2000-0x201f mem 0xda000000-0xda01ffff irq 18 at device 0.0 on pci4
> >
> > The machine has 4G RAM and a 3ware 9000 series RAID controller with 2 drives.
> >
> > pciconf -l says:
> >
> > em0 at pci4:0:0: class=0x020000 card=0x000015d9 chip=0x10968086 rev=0x01 hdr=0x00
> > em1 at pci4:0:1: class=0x020000 card=0x000015d9 chip=0x10968086 rev=0x01 hdr=0x00
> >
> >
> > The symptom:
> >
> > The machine boots OK, but can only intermittently make netork connections.
> > Eventually determined that it seems to only see a few ARP packets, so
> > it's falling out of other machines' ARP tables, and is often unable to
> > see the replies to its own ARP requests. It does see SOME ARPs
> > though. When it is able to communicate with another machine, it
> > does not appear to drop any packets between them (e.g. I scp'd a 500M file
> > at 300Mbps to this machine).
> >
> > When I run "tcpdump -n arp" I see a few ARPs, but not many. In a 1-minute
> > period, I saw 3 ARP who-has/reply packets. On a different machine on
> > the same ethernet switch, I saw 225 who-has/reply packets in the same
> > 1-minute period.
> >
> > I've tried different cables, and a different switch. I started with
> > 6.2-RELEASE, and then went to 6.2-STABLE on 3/3/07 to get the latest
> > em driver fixes. I've used SMP and GENERIC kernels. I get the same
> > results in all cases.
> >
> > There are no firewall rules installed.
> >
> > I plugged in a USB ethernet adapter (realtek), and it works straight away.
> > "tcpdump -n arp" sees the same noise as other machines on that LAN.
> >
>
> Sounds like it could be bad hardware. Can you swap nics?
No he can't these are LOMs (on the motherboard).
> > I read through the recent threads on the em driver, but didn't see any
> > reported symptoms like this. Has anyone seen anything like this? Got
> > any hints for me? Am I doing something stupid? Did I leave out any
> > useful information about my configuration?
> >
>
> Maybe more of your dmesg might help as it could show interrrupt issues
> that perhaps others could help diagnose
Yes, agreed, this might be revealing.
Jack
More information about the freebsd-stable
mailing list