readdir() -> d_type always zero on NFS v3 mounted filesystems

Rick Macklem rmacklem at uoguelph.ca
Thu Apr 29 14:39:19 UTC 2021


N.J. Mann wrote:
>I recently changed over from using svn to gitup to update /usr/ports
>on my local system and have been experiencing problems since.  At first
>either a kernel issue or a configuration issue.  I originally posted
>about the problem to the freebsd-ports mailing list:
>https://lists.freebsd.org/pipermail/freebsd-ports/2021-April/120929.html
>
>Since then I have dug deeper and come to the conclusion that it is not a
>problem with gitup.
>
>The issue I am seeing is that gitup is unable to delete files and
>directories, even complete ports, which have been removed from the
>repository.  gitup basically does the following:
>
>prune_tree(base_path)
>{
>if ((directory = opendir(base_path)) != NULL) {
>    while ((entry = readdir(directory)) != NULL) {
>        snprintf(full_path, sizeof(full_path), "%s/%s", base_path, entry->d_name);
>        if (entry->d_type == DT_DIR) {
>            prune_tree(full_path);
>        } else {
>            if ((remove(full_path) != 0) && (errno != ENOENT))
>                 err(EXIT_FAILURE, "prune_tree: cannot remove %s", full_path);
>            }
>        }
>        closedir(directory);
>        if (rmdir(base_path) != 0)
>            err(EXIT_FAILURE, "prune_tree: cannot remove %s", base_path);
>    }
>}

The code should check for d_type == DT_UNKNOWN and then do
stat() to find out the type, as Ronald noted.
--> The d_type is normally filled in if you use the "rdirplus"
      NFS mount option,  which might work around the issue.

rick

         closedir()


When gitup is run on either a UFS or ZFS file system this works.  However,
when I run it on a NFS v3 mounted filesystem it fails.  Liberal addition
of printf's shows that for non-NFS mounted filesystems d_type contains the
correct value for each file/directory, but for NFS mounted file systems it
is zero - other fields such as d_name and d_namlen are correct.  While
debugging this I added a call to stat() and that always returns the correct
values.

At this point I started digging in libc and quickly found that readdir()
is basically a wrapper around a system call.  Digging in the kernel I
quickly got out of my depth and hence my posting here.  I then wrote a
simple test programme which also shows the issue.  I have attached the
source for the test programme and the output from two runs, the first on
an NFS mounted file system and the second on a local UFS file system.

Before gitup started failing I had not seen any failures like this and that
was why I initially assumed this must be a gitup issue.  I now believe this
is either a kernel issue or a confuration issue.

My configuration is as follows:

file server:
  /exports/ports - ZFS file system for FreeBSD ports repo exported via NFS
  /remote/ports  - FreeBSD ports repo NFS (v3 rw,tcp) mounted from /export/ports
                   on file server
  /usr/ports     - symbolic link to /remote/ports
client machines:
  /remote/ports  - FreeBSD ports repo NFS (v3 ro,tcp) mounted from /export/ports
                   on file server
  /usr/ports     - symbolic link to /remote/ports

I run gitup in /remote/ports on the file server and then pkg, portmaster, &.c,
in /usr/ports on the file server and then the client machines.  All systems are
running 11-STABLE from about 12 days ago - I usually update every couple of weeks.

Any assistance, suggestions, patches, clue bats, will be gratefully accepted.


Regards,
        Nick.
--


More information about the freebsd-fs mailing list