cvs commit: src/sys/nfsclient nfs_bio.c nfs_vfsops.c nfsargs.h
nfsmount.h src/sys/sys buf.h bufobj.h src/sys/kern vfs_bio.c
Brian Fundakowski Feldman
green at freebsd.org
Sun Jun 12 12:25:17 GMT 2005
On Sun, Jun 12, 2005 at 06:30:49AM -0400, Daniel Eischen wrote:
> On Sun, 12 Jun 2005, Alfred Perlstein wrote:
>
> > * Brian Fundakowski Feldman <green at freebsd.org> [050612 01:26] wrote:
> > > On Sun, Jun 12, 2005 at 01:08:33AM -0700, Alfred Perlstein wrote:
> > > >
> > > > Seriously, have you tested what happens to a libc_r app that
> > > > opens an nfs file F_SYNC? My guess is that it's not pretty.
> > >
> > > This code path is related to O_NONBLOCK, not O_FSYNC. O_FSYNC is
> > > synonymous with the slow fallback path that large transactional block
> > > now takes, rather than deadlocking. O_NONBLOCK really means that
> > > whatever they do, they are required to check for EAGAIN.
> >
> > To make it perfectly clear.
> >
> > If an application linked against libc_r opens a file with O_FSYNC.
> > Libc_r will set O_NONBLOCK (it does so for each open(2))
> > A write on that descriptor will return EAGAIN (to libc_r)
> > Libc_r will then attempt to select(2) on this decriptor, which
> > will return "ready" (as do all select(2)'s on disk files)
> >
> > The question is:
> >
> > Will Libc_r then busy spin?
>
> Yes, for the most part.
>
> > If so, how many other apps might get screwed just sometimes (over
> > nfs) because only _half_ of this "solution" is implemented?
> >
> > Or is my thinking on this wrong?
>
> I think I agree with Alfred.
>
> If select() returns ready, then you should be able to write
> some part of your buffer. Also, anyone using a file descriptor
> in non-blocking mode should expect short writes and loop until
> the entire buffer has been written.
>
> >From my understanding, disk I/O has always returned ready, then
> blocked in the kernel if necessary. I think that if we are going
> to start honoring non-blocking mode for disk I/O (or NFS, whatever),
> it should be done fully. That means you allow short writes when
> select() returns ready, and select() doesn't return ready if no
> data can be written.
That's a good point. I wasn't more than vaguely aware of libc_r's
continued existance and usage of such things. Fixing select(3) to
match up would be easiest, right? Short writes are definitely not
allowed for a non-socket, though.
--
Brian Fundakowski Feldman \'[ FreeBSD ]''''''''''\
<> green at FreeBSD.org \ The Power to Serve! \
Opinions expressed are my own. \,,,,,,,,,,,,,,,,,,,,,,\
More information about the cvs-src
mailing list