NFS corruption in recent HEAD.

Rick Macklem rmacklem at uoguelph.ca
Mon Jan 23 01:07:09 UTC 2012


Pawel Jakub Dawidek wrote:
> On Sat, Nov 26, 2011 at 09:03:46PM -0500, Rick Macklem wrote:
> > Pawel Jakub Dawidek wrote:
> > > Hi.
> > >
> > > I'm booting machine over the network using new NFS client and I'm
> > > getting those warnings on boot:
> > >
> > > /etc/rc.subr: 666: Syntax error: "(" unexpected (expecting ";;")
> [...]
> > Oh, and maybe you could try reverting r227543 in the client
> > (assuming
> > the client is post-r227543). Maybe that file's vnode type isn't set
> > to
> > VREG early in the diskless booting and needs the ncl_flush() for
> > some
> > reason.
> >
> > I don't actually have a bug that needs r227543 to fix it. It just
> > seemed
> > incorrect to flush non-VREG files (particularily VDIR). As such,
> > reverting
> > it wouldn't be a big deal.
> 
> I haven't tried reverting anything yet, but I think I was able to
> reproduce this with old NFS client as well. The problem goes away when
> I
> comment out root mount point from /etc/fstab or remove mntudp from
> mount
> options. NFS root is mounted using TCP, AFAIK and it probably happens
> when startup scripts (rc.d/mountcritremote) remounts root with mntudp
> flag. The rc.subr warning starts to appear just after mountcritremote
> is
> called.
> 
I finally got around to poking at this and found there are 2 bugs in the
NFS clients (the new and old use essentially the same code).

1 - If the rsize/wsize is greater than NFS_MAXDGRAMDATA (which will be the
    case for a default TCP mount) and an I/O RPC is in progress when
    "mount -u -o udp" is done, the thread(s) will get stuck retrying the
    RPC forever, since it tries to do an I/O greater than NFS_MAXDGRAMDATA.
    - The only fix I can see for this is to not allow the switch to UDP for
      this case. I'm not sure if "udp" should be silently ignored or
      "mount -u" should fail with an error. I've asked this question in a
      separate email.
    patch for "fail" version at http://people.freebsd.org/~rmacklem/udpupdate.patch

2 - If the rsize/wsize is changed by a "mount -u", data in the buffer cache
    can be corrupted. I think this is what you saw, above. I think the problem
    that causes this one is that the code above the buffer cache uses mnt_stat.f_iosize
    (which changes when rsize/wsize changes) instead of v_bufobj.bo_bsize,
    which is set to the value of mnt_stat.f_iosize when the vnode is created.
    patch at http://people.freebsd.org/~rmacklem/bobsize.patch

Pawel, I don't know if it's convenient, but maybe you could test the patch for #2?
(If you apply the patch for #1, the mount update will just fail, so you'll never
 exercise #2.)

Also, if someone reading this is conversant with the buffer cache code, maybe they
could review patch for #2?

Thanks, rick
ps: The above patches are for the new NFS client, but essentially the same patches
    could be applied to the old one.


More information about the freebsd-fs mailing list