fix for remove for NFS through nullfs
Gleb Kurtsou
gleb.kurtsou at gmail.com
Mon Jul 19 07:41:05 UTC 2010
On (18/07/2010 22:40), Rick Macklem wrote:
> Mikolaj Golub submitted the attached patch that fixes a problem w.r.t.
> a nullfs mounted NFS mount point for remove. The problem is that,
> without this patch, NFS does not see that a file is still open
> (v_usecount > 1) when removed and removes it instead of silly renaming
> it. This patch increments the v_usecount of the lower level vnode
> during the remove call, so that silly rename works. kib@ has noted
> that this may be "racy" and result in silly rename happening when it
> isn't required but, imho, that is less of a problem than it never
> working. (I have tested it a bit for NFS and UFS and it seems to
> work for those file systems under a nullfs mount.)
>
> Why I am posting is that I am wondering if anyone knows of a file
> system type where this extra v_usecount on the vnode at the time of remove
> will/might cause problems?
It seems to be easily reproducible only with stacked filesystems. But
the problem is that v_usecount can be different from number of open
descriptors for the vnode, and it generally is.
IMHO using v_usecount is racy for all filesystems. Grep for vref and
VREF, it's used all over the place.
The issue was discussed some time ago already:
http://marc.info/?l=freebsd-hackers&m=125675165319186&w=4
I think the better solution would be to add a means of getting number of
opened file descriptors, but not misuse v_usecount, e.g. the patch looks
a pure hack for me.
Thanks,
Gleb.
>
> Thanks in advance for looking at this, rick
> --- submitted patch for nullfs ---
> --- fs/nullfs/null_vnops.c.sav 2010-07-18 19:33:00.000000000 -0400
> +++ fs/nullfs/null_vnops.c 2010-07-18 19:35:25.000000000 -0400
> @@ -499,6 +499,29 @@
> }
>
> /*
> + * Increasing refcount of lower vnode is needed at least for the case
> + * when lower FS is NFS to do sillyrename if the file is in use.
> + */
> +static int
> +null_remove(struct vop_remove_args *ap)
> +{
> + int retval;
> + struct vnode *lvp;
> + boolean_t vreleit;
> +
> + if (ap->a_vp->v_usecount > 1) {
> + lvp = NULLVPTOLOWERVP(ap->a_vp);
> + VREF(lvp);
> + vreleit = TRUE;
> + } else
> + vreleit = FALSE;
> + retval = null_bypass(&ap->a_gen);
> + if (vreleit)
> + vrele(lvp);
> + return (retval);
> +}
> +
> +/*
> * We handle this to eliminate null FS to lower FS
> * file moving. Don't know why we don't allow this,
> * possibly we should.
> @@ -809,6 +832,7 @@
> .vop_open = null_open,
> .vop_print = null_print,
> .vop_reclaim = null_reclaim,
> + .vop_remove = null_remove,
> .vop_rename = null_rename,
> .vop_setattr = null_setattr,
> .vop_strategy = VOP_EOPNOTSUPP,
> _______________________________________________
> freebsd-fs at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-unsubscribe at freebsd.org"
More information about the freebsd-fs
mailing list