network lock manager (lockd) deadlocked in 'rpcrecv'
John Hein
jhein at timing.com
Thu Jul 9 04:27:51 UTC 2009
John Hein wrote at 15:51 -0600 on Jul 8, 2009:
> John Hein wrote at 15:31 -0600 on Jul 8, 2009:
> > I have a home directory on FreeBSD 7.2-stable (20090705), amd64.
> > It is serving up the directory over nfs (v3, tcp), and now
> > I'm seeing lots of 'lockd not responding' on Fedora 10 & 11 systems.
.
.
> Also in dmesg:
>
> NLM: failed to contact remote rpcbind, stat = 5, port = 28416
.
.
Here's some good information. This seems to happen when there are 2
or more Fedora systems trying to access locks via lockd.
Rebooting the Fedora box that has 'lockd not responding' frees
up the deadlocked freebsd lockd.
But _also_ disabling the firewall on the Fedora boxes helps, too.
This doesn't necessarily completely implicate or exonerate
lockd. But what should lockd do when the remote box asks
for a lock, but doesn't complete the RPC dialog? Is there
a way we can deal with this problem and not have lockd
deadlock?
More information about the freebsd-net
mailing list