From nobody Sat Apr 15 14:36:25 2023 X-Original-To: freebsd-current@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4PzG8s5Hj0z45RJx; Sat, 15 Apr 2023 14:36:29 +0000 (UTC) (envelope-from cy.schubert@cschubert.com) Received: from omta002.cacentral1.a.cloudfilter.net (omta002.cacentral1.a.cloudfilter.net [3.97.99.33]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "Client", Issuer "CA" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 4PzG8s3JFXz3CRn; Sat, 15 Apr 2023 14:36:29 +0000 (UTC) (envelope-from cy.schubert@cschubert.com) Authentication-Results: mx1.freebsd.org; none Received: from shw-obgw-4002a.ext.cloudfilter.net ([10.228.9.250]) by cmsmtp with ESMTP id nezCpDQRvjvm1nh0upINE6; Sat, 15 Apr 2023 14:36:28 +0000 Received: from spqr.komquats.com ([70.66.148.124]) by cmsmtp with ESMTPA id nh0spyKNyyAOenh0tpiHoL; Sat, 15 Apr 2023 14:36:28 +0000 X-Authority-Analysis: v=2.4 cv=e5oV9Il/ c=1 sm=1 tr=0 ts=643ab66c a=Cwc3rblV8FOMdVN/wOAqyQ==:117 a=Cwc3rblV8FOMdVN/wOAqyQ==:17 a=kj9zAlcOel0A:10 a=dKHAf1wccvYA:10 a=CjxXgO3LAAAA:8 a=6I5d2MoRAAAA:8 a=NEAV23lmAAAA:8 a=YxBL1-UpAAAA:8 a=EkcXrb_YAAAA:8 a=2iz5XMn2izFUwJidrLYA:9 a=CjuIK1q_8ugA:10 a=IjZwj45LgO3ly-622nXo:22 a=Ia-lj3WSrqcvXOmTRaiG:22 a=LK5xJRSDVpKd5WXXoEvA:22 Received: from slippy.cwsent.com (slippy [10.1.1.91]) by spqr.komquats.com (Postfix) with ESMTP id D86219FC; Sat, 15 Apr 2023 07:36:25 -0700 (PDT) Received: by slippy.cwsent.com (Postfix, from userid 1000) id 99388387; Sat, 15 Apr 2023 07:36:25 -0700 (PDT) X-Mailer: exmh version 2.9.0 11/07/2018 with nmh-1.8+dev Reply-to: Cy Schubert From: Cy Schubert X-os: FreeBSD X-Sender: cy@cwsent.com X-URL: http://www.cschubert.com/ To: FreeBSD User cc: Mark Millard , Charlie Li , Pawel Jakub Dawidek , Cy Schubert , Mateusz Guzik , dev-commits-src-main@freebsd.org, Current FreeBSD Subject: Re: git: 2a58b312b62f - main - zfs: merge openzfs/zfs@431083f75 In-reply-to: <20230415115452.08911bb7@thor.intern.walstatt.dynvpn.de> References: <20230413071032.18BFF31F@slippy.cwsent.com> <20230413063321.60344b1f@cschubert.com> <20230413135635.6B62F354@slippy.cwsent.com> <319a267e-3f76-3647-954a-02178c260cea@dawidek.net> <441db213-2abb-b37e-e5b3-481ed3e00f96@dawidek.net> <5ce72375-90db-6d30-9f3b-a741c320b1bf@freebsd.org> <99382FF7-765C-455F-A082-C47DB4D5E2C1@yahoo.com> <32cad878-726c-4562-0971-20d5049c28ad@freebsd.org> <20230415115452.08911bb7@thor.intern.walstatt.dynvpn.de> Comments: In-reply-to FreeBSD User message dated "Sat, 15 Apr 2023 11:54:25 +0200." List-Id: Discussions about the use of FreeBSD-current List-Archive: https://lists.freebsd.org/archives/freebsd-current List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-current@freebsd.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Date: Sat, 15 Apr 2023 07:36:25 -0700 Message-Id: <20230415143625.99388387@slippy.cwsent.com> X-CMAE-Envelope: MS4xfBC5KK+FGweuTH2Bwo9Fgf2E1NVmkxk65aZu3jql7QWlfdUg1ENrNMGw9W3tgL/GUyC/Jxt1LOXwkv3E1rGXb4Io5cACxlu9Fflm/TtpyUGqb/9AlIej gsw5JXntb7xu5FFJkBc5WmEVMnu1E0RndsKKXJ1ZRDFMZVMwht8SglG5uq+DGi3lpyBwT1f2yhvAuu70OobSKqprpKCL58kTcx4MDsWyRIj7hGg8qHH9FOsz NuqgKrU8ybYumtoifv66hL4kGBmP/xzVnchNamJf/KuRBGfLpTXp4NQNMCm+EZwXvbVZCA0AsYdnNCEtVGdCMuTbsWDEsFlRuBhD5ccXR8Eu5pyJey/8BJoP euTru4iUP/pANRevPFWG+wh7uOY0XDeedx7jfr03LJ0sWU8WYAA= X-Rspamd-Queue-Id: 4PzG8s3JFXz3CRn X-Spamd-Bar: ---- X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]; ASN(0.00)[asn:16509, ipnet:3.96.0.0/15, country:US] X-Rspamd-Pre-Result: action=no action; module=replies; Message is reply to one we originated X-ThisMailContainsUnwantedMimeParts: N In message <20230415115452.08911bb7@thor.intern.walstatt.dynvpn.de>, FreeBSD Us er writes: > Am Thu, 13 Apr 2023 22:18:04 -0700 > Mark Millard schrieb: > > > On Apr 13, 2023, at 21:44, Charlie Li wrote: > > > > > Mark Millard wrote: > > >> FYI: in my original report for a context that has never had > > >> block_cloning enabled, I reported BOTH missing files and > > >> file content corruption in the poudriere-devel bulk build > > >> testing. This predates: > > >> https://people.freebsd.org/~pjd/patches/brt_revert.patch > > >> but had the changes from: > > >> https://github.com/openzfs/zfs/pull/14739/files > > >> The files were missing from packages installed to be used > > >> during a port's build. No other types of examples of missing > > >> files happened. (But only 11 ports failed.) > > > I also don't have block_cloning enabled. "Missing files" prior to brt_rev > ert may actually > > > be present, but as the corruption also messes with the file(1) signature, > some tools like > > > ldconfig report them as missing. > > > > For reference, the specific messages that were not explicit > > null-byte complaints were (some shown with a little context): > > > > > > ===> py39-lxml-4.9.2 depends on shared library: libxml2.so - not found > > ===> Installing existing package /packages/All/libxml2-2.10.3_1.pkg > > [CA72_ZFS] Installing libxml2-2.10.3_1... > > [CA72_ZFS] Extracting libxml2-2.10.3_1: .......... done > > ===> py39-lxml-4.9.2 depends on shared library: libxml2.so - found > > (/usr/local/lib/libxml2.so) . . . > > [CA72_ZFS] Extracting libxslt-1.1.37: .......... done > > ===> py39-lxml-4.9.2 depends on shared library: libxslt.so - found > > (/usr/local/lib/libxslt.so) ===> Returning to build of py39-lxml-4.9.2 > > . . . > > ===> Configuring for py39-lxml-4.9.2 > > Building lxml version 4.9.2. > > Building with Cython 0.29.33. > > Error: Please make sure the libxml2 and libxslt development packages are in > stalled. > > > > > > [CA72_ZFS] Extracting libunistring-1.1: .......... done > > ===> libidn2-2.3.4 depends on shared library: libunistring.so - not found > > > > > > > [CA72_ZFS] Extracting gmp-6.2.1: .......... done > > ===> mpfr-4.2.0,1 depends on shared library: libgmp.so - not found > > > > > > ===> nettle-3.8.1 depends on shared library: libgmp.so - not found > > ===> Installing existing package /packages/All/gmp-6.2.1.pkg > > [CA72_ZFS] Installing gmp-6.2.1... > > the most recent version of gmp-6.2.1 is already installed > > ===> nettle-3.8.1 depends on shared library: libgmp.so - not found > > *** Error code 1 > > > > > > autom4te: error: need GNU m4 1.4 or later: /usr/local/bin/gm4 > > > > > > checking for GNU > > M4 that supports accurate traces... configure: error: no acceptable m4 coul > d be found in > > $PATH. GNU M4 1.4.6 or later is required; 1.4.16 or newer is recommended. > > GNU M4 1.4.15 uses a buggy replacement strstr on some systems. > > Glibc 2.9 - 2.12 and GNU M4 1.4.11 - 1.4.15 have another strstr bug. > > > > > > ld: error: /usr/local/lib/libblkid.a: unknown file type > > > > > > === > > Mark Millard > > marklmi at yahoo.com > > > > > > Hello > > whar is the recent status of fixing/mitigate this desatrous bug? Especially f > or those with the > new option enabled on ZFS pools. Any advice? > > In an act of precausion (or call it panic) I shutdown several servers to prev > ent irreversible > damages to databases and data storages. We face on one host with /usr/ports r > esiding on ZFS > always errors on the same files created while staging (using portmaster, leav > es the system > with noninstalled software, i.e. www/apache24 in our case). Deleting the work > folder doesn't > seem to change anything, even when starting a scrubbing of the entire pool (R > AIDZ1 pool) - > cause unknown, why it affects always the same files to be corrupted. Same wit > h deve/ruby-gems. > > Poudriere has been shutdown for the time being to avoid further issues. > > Are there any advies to proceed apart from conserving the boxes via shutdown? > > Thank you ;-) > oh > > > > -- > O. Hartmann With an up-to-date tree + pjd@'s "Fix data corruption when cloning embedded blocks. #14739" patch I didn't have any issues, except for email messages with corruption in my sent directory, nowhere else. I'm still investigating the email messages issue. IMO one is generally safe to run poudriere on the latest ZFS with the additional patch. My tests of the additional patch concluded that it resolved my last problems, except for the sent email problem I'm still investigating. I'm sure there's a simple explanation for it, i.e. the email thread was corrupted by the EXDEV regression which cannot be fixed by anything, even reverting to the previous ZFS -- the data in those files will remain damaged regardless. I cannot speak to the others who have had poudriere and other issues. I never had any problems with poudriere on top of the new ZFS. WRT reverting block_cloning pools to without, your only option is to backup your pool and recreate it without block_cloning. Then restore your data. -- Cheers, Cy Schubert FreeBSD UNIX: Web: https://FreeBSD.org NTP: Web: https://nwtime.org e^(i*pi)+1=0