From nobody Mon Feb 20 05:50:45 2023 X-Original-To: freebsd-arm@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4PKs3X2KnKz3ryfZ for ; Mon, 20 Feb 2023 05:51:04 +0000 (UTC) (envelope-from marklmi@yahoo.com) Received: from sonic308-8.consmr.mail.gq1.yahoo.com (sonic308-8.consmr.mail.gq1.yahoo.com [98.137.68.32]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4PKs3W6XQCz3qy2 for ; Mon, 20 Feb 2023 05:51:03 +0000 (UTC) (envelope-from marklmi@yahoo.com) Authentication-Results: mx1.freebsd.org; none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s2048; t=1676872261; bh=ILFKGihcB432UFw0tCVhHGAdz0MR5G/oTOJD1HpVYtU=; h=Subject:From:In-Reply-To:Date:Cc:References:To:From:Subject:Reply-To; b=dATacpgPt/i5XKmNdCR5h8F2lMcLMyUhYsPEZWXpxUVCx9DMUTyxjPOshpHY0lHj5ERaCShrPpT6ERgk76Bd/r6BVLDMM4ykklX3BCfh9PyygdX4YO5rpcdmdI85TMmzBDc9r7ZMrLuYuwMx81vVdFUWWp92UOMUir3oySb/Vp7KQ2ueBON7o/DMlSs1t/TqM2rVw27wLwYRVoi0GzXLdH4ZuDVVpkyB6KVqnyoMwDkUKlj1Ex9UPvXh3tI6H2Q5GgKUGM0t/jIm5xPefTpfhfhtSj+kn+N8azYL4K+wYdz/yFo8wRXFhZK5x5HGxyS7oZQ435sZbDBhcEiQNz2H/g== X-SONIC-DKIM-SIGN: v=1; a=rsa-sha256; c=relaxed/relaxed; d=yahoo.com; s=s2048; t=1676872261; bh=P+9SRqiFQEsfiHtP2fUOZ13xszqGX2NWuOS59Bvf2wa=; h=X-Sonic-MF:Subject:From:Date:To:From:Subject; b=MwJ2Ot+mdE4HildWgIYbDNT+7NQuWkShf5TXdUXCuU7L+cjOAQVG+BCuwSHBfOnhlFijn3TG+f7n3etfz8ljw9OvARbT+1OZ6CQphiHhL+qGu11anGP/bCpsb1b+MW4AoP0GnyMArT9ckiGjKM6a29TXiErPzJQIy8yNJ5g44Byl5YN7jn6XwuFwxg+xGxOegJZaAlq1QtEUXkQgUeFYeDSwVXfq0eYMICDWcLYIDcHnd4+p4cfQx4HjskTx8KYqWLwNhN64KR+7fPKLRNuteHPVHoLhawXy5QsiKxWfid7OQCErQXYOybSo/WrYNBvpNjaE5XIXPGhZIDCQEnWItw== X-YMail-OSG: lexE1OEVM1lvUN24_w1cf3zhTDM_ZC79vVeOdli9dr0QB48ubqgu9Yhh0dcmvGS qFtKhJOot359YuQdjGSh5ZMEf6KlKk_Khr4JGd_Nr7pMFLAr1ha78AD.F83fDZhela38DSx2vO5J gdNbBOmN2dmAyEl0EpCnHHORHUrYftCO_P8jsIGPubu6DDvEOLIf9fDRWOsrxxl6OvXSBZ7jHRlj Pvk6cSJTUPu9O5OVzQ6.oWag24VZhGmx4cJkW694Vni096aJu7NpwCgyFdBIJp7k1g431GJ5YEvc _KwLL41PXLN6UwUWxHEbyiYvgpeJvDb8KkY7MSjAfRl8kg6apPPxk_QhjQsmFzAUPPBZUWZ54QuK Z_DAUUrcA.44DsVoP9IKU0L6MB1d26h9gG_SaTR9StfAoG47XXWKf62iSovSJooWju293kskYXE_ KvKbZlKaDWgvxB5t0LaCL.vJJTt1Lj18LQ1T3B.bmHJN_mVj9Rc7IOxKyRPL8IVGmze9KL64Eh6L HJ6zAzOcwM1pDMzSh.AO91D0VQSMvoSBiQknjm1qJJGzlIUVbMHAFQDdsRvhAhG7Ai4RyMnSATQG uITODXnxumr_zHRZUIame.W0MPcQ2NSNVDzItxknRZiWIfbEOkhht.853YH2SNTiwCjRd2SjelCZ 2QBjx5uY9ss3zhQ0gNDXUFzAAU0.d0Ka3YD9.gdNY5c0Z0iQrJHzSPtXun9fz5nxF2k3lHrYL0ZA nlEDG0pehlp8IKJuFFM5FTYz_vZiu9MP.9s5.5oe.1NlObQVYNJSw7lLVfAqSJ5VKz0JOtRPyRm8 lSlq57B7yAAu2BGO4Lr0RTDzrd8Yj9zXzef70ZOeIgQeBtJ1J72lW.Amzvp8ubFRZiH5h6TFzzMd RGQ33cmxFguCMGihJshhzhkntpf1ewpWqzqm3zDl5e3x_8xoQqFIb0JyvdApliOtwd.c_M6zL0Ae fUbdnmj2C9VJe6IE76ULua8B2mbJfbkTYMFVCNBrKQ3EaGg1oMgGPijIVEQyWvN3CCiGJwP6i_1r xM6AIcbWxx3QrjYnO3QnZyzrPHI_U_7YatgJu9y22ZOxq9kvcc5brooHpJqYiGfrSc8B4PL.nOum HN7PDcvpy8A.4AEW6jUPn_OgOFgSeWINBOMVjasO9RBRwiluk3oRbDuWfT1Tyy9EoI1cpUyhBDYA Ns5HoStrG6Zamz7t32tg2ktT9XvL_u7nqcHWHFFNd4Uf60Gk8Do4TlYmaFCtGaWxERMlIVC8c.3J .twaSmQwx22egBVgKLUSltbUgY2gFjIwOJLPR.jmbMcd4zGgnSrkYs21V3tbtqteDkaGbJ4PiRj2 Wvyz0jt3ndEV9JGMhOTvoB78uGh0b5KM1bIRZGa3WuRVNHsMcJjf0l3FIdMSMcKRfWQmTLDoPzDT ost6cEgIzowJQpnj2eIP2AuD4JfJie_EA1YoZgt_vW7Ezgrvc.2fR3kDqoKUyyFnz.1cozATOfAx eKB8T9_E0UArFLfMGkF_oxsh5b4h.i_aktxJKb9dTkEtmVCJAhiLHSAKq68G68ZETiKQJiVsVyiX XlhFNxHUkpjPUJF6b59qCKyEPCPzboZjeEPQsRckaXxe9z5O13JV3H2SgFqtoJP.Mge65A7F.iYK P_392zMSb5CWnmG3CZKY.IF4y2Xj7UQ6kbC5vL07c3NNoJGAdAP62B4QxIKKlwlUiT75nKsC7k62 jpPnj1yPPhTp9BTlNkZoSlC6WzNT2kpjMFzhEVgtc4tSGAn.f.z3alkgNP5I4WpBYjh8ZLuOw9hM Mxc3BFim_.oV5FE9WrjlA.wDWG9Jak4MlduoCODhxWa57NpoqA7L3_XSNf6HhppoH9ncSfv_P.qm Nf1W5nw2E.bC3XkBEOXT9n7oSp4XmcUGB4Oy8pOj.KNKxbzU6UO1heX0Kd70W06ZvdU1OYPWlID4 RH04jkx_XL47sruQSlFIjbWKZpaI.GBYIpGHzBEf2VBZtMhTMGILYTgeRai8dBIq8k4cOjSMQK37 7xB1EM49cSDZG8c2q.9zgZ4JGHJ.DZ766UcLqh1.m3zP22xLkNSY3wprnOiiNH04otRIzO9zdXpi rBKbTs5ZXF3w1J0IvrTuPpaWV7fa3GPjDETLtvu0QEeL7RD4CTzvoUksewPJ6gXdlB47Hy9yvKSt YRLMlhKztesA_igs62yE9FHGMuAtUMN1Vk_SNxCAtqUPpsxzgV.VNgAvMGIAan8pF6Vdh_tSksy4 T3xkFfKjwWNpnyX._Dp3A2k5J2U8SdfLDFDoJ1p1q3Iva3dNwPhPpaNGQ0k1E7vCQItrceRiTSTL 4Gwk- X-Sonic-MF: Received: from sonic.gate.mail.ne1.yahoo.com by sonic308.consmr.mail.gq1.yahoo.com with HTTP; Mon, 20 Feb 2023 05:51:01 +0000 Received: by hermes--production-bf1-57c96c66f6-76kbw (Yahoo Inc. Hermes SMTP Server) with ESMTPA ID 1a078502384662df17a582e6b8f0bd1a; Mon, 20 Feb 2023 05:50:57 +0000 (UTC) Content-Type: text/plain; charset=us-ascii List-Id: Porting FreeBSD to ARM processors List-Archive: https://lists.freebsd.org/archives/freebsd-arm List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-arm@freebsd.org Mime-Version: 1.0 (Mac OS X Mail 16.0 \(3731.400.51.1.1\)) Subject: Re: fsck segfaults on rpi3 running 13-stable (and on 14-CURRENT analyzing the same file system that resulted from the 13-STABLE crash) From: Mark Millard In-Reply-To: <20230220044544.GB57936@www.zefox.net> Date: Sun, 19 Feb 2023 21:50:45 -0800 Cc: freebsd-arm@freebsd.org Content-Transfer-Encoding: quoted-printable Message-Id: <9CEF4E7A-2F13-454F-A04A-A6C5A80FD4B7@yahoo.com> References: <202302192054.31JKsq7w079295@chez.mckusick.com> <3DD8EEC2-6135-42A0-A80C-F195CAAC025E@yahoo.com> <20230219222328.GA55941@www.zefox.net> <2F5B20E9-AFF8-42F6-9E1F-50BBDF4E1B79@yahoo.com> <20230220044544.GB57936@www.zefox.net> To: bob prohaska X-Mailer: Apple Mail (2.3731.400.51.1.1) X-Rspamd-Queue-Id: 4PKs3W6XQCz3qy2 X-Spamd-Bar: ---- X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]; ASN(0.00)[asn:36647, ipnet:98.137.64.0/20, country:US] X-Rspamd-Pre-Result: action=no action; module=replies; Message is reply to one we originated X-ThisMailContainsUnwantedMimeParts: N On Feb 19, 2023, at 20:45, bob prohaska wrote: > On Sun, Feb 19, 2023 at 02:35:15PM -0800, Mark Millard wrote: >>=20 >> Kirk likely monitors the freebsd-fs list. >=20 > I didn't notice there was such a list 8-\ >=20 >> Kirk likely does not monitor the freebsd-arm list. >> None of us thought to switch to freebsd-fs at the >> time. The only part of your context that ended up >> to be arm specific was original buildworld crash. >> You definitely started in an appropriate place >> (freebsd-arm). After the crash, the rest was more >> general relative to platforms and more specific >> relative to file system handling (UFS support). >>=20 >> I do not see any reason for any of this exchange >> to go to any lists, given the current status. >=20 > Alas, the story's not over yet 8-( =20 >=20 > After getting the disk fsck'd and booting once more, > an attempt to buildworld using a fresh /usr/src > and empty /usr/obj crashed again, I'm confused. The original crash was reported to be on a RPi2B using a armv7 kernel, or so I thought. (The RPi3B was for later fsck_ffs activity for the media's UFS.) This new material indicates a RPi3B arm64 (aarch64) context for this buildworld failure. Is it the same media as for the prior buildworld failure? > in I think the > same way. This time some notes have been collected > at > http://www.zefox.net/~fbsd/rpi3/scsi_status_error/readme >=20 > To a casual glance, it looks like a hardware error. > But, the machine seems to work fine until it's running > buildworld, and then crashes during a relatively easy > part of buildworld. The initial error message is: >=20 > bob@pelorus:/usr/src % (da0:umass-sim0:0:0:0): READ(10). CDB: 28 00 43 = 29 d6 40 00 00 40 00=20 > (da0:umass-sim0:0:0:0): CAM status: SCSI Status Error > (da0:umass-sim0:0:0:0): SCSI status: Check Condition > (da0:umass-sim0:0:0:0): SCSI sense: MEDIUM ERROR asc:11,0 (Unrecovered = read error) > (da0:umass-sim0:0:0:0): Error 5, Unretryable error A description of "Media Error" from seagate is: Medium Error - Indicates the command terminated with a nonrecovered = error condition, probably caused by a flaw in the medium or an error in = the recorded data. To compare/contrast with other alternatives, see: https://www.seagate.com/support/kb/scsi-sense-key-chart-196259en/ A more extensive list with asc/ascq involved as well is at: https://en.wikipedia.org/wiki/Key_Code_Qualifier/ Allowing more comparison/contrast with other classifications. It indicates: 3 11 00 Medium Error - unrecovered read error (matching the reported text). > SCSI errors are not unknown, but they usually succeed on retry. > It's not obvious why this is treated as un-retryable.=20 Because that is what the "3 11 00" combination involved means. The drive is reporting that. It is not a FreeBSD driver choice of handling. (I'm not expert at drive internals, so I take it at face value.) > Are there any simple tests that might help decide what's wrong? > It's likely that re-running buildworld will reproduce the crash. See the https://en.wikipedia.org/wiki/Key_Code_Qualifier/ description material for some background information? > I've placed the results of smartctl -a at the end of the notes.=20 > The interpretation isn't self evident, hopefully someone else > can lend an eye. I'll try smartctl -t after a good night's sleep.=20 man smartctl reports: UNC: UNCorrectable Error in Data The 3 examples of: After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 00 ff ff ff 0f Error: UNC at LBA =3D 0x0fffffff =3D 268435455 indicate UNC. All 3 list the same LBA value. Error 4 occurred at disk power-on lifetime: 11121 hours (463 days + 9 = hours) Error 3 occurred at disk power-on lifetime: 11098 hours (462 days + 10 = hours) Error 2 occurred at disk power-on lifetime: 11096 hours (462 days + 8 = hours) So spread over a little over a day overall, with 2 and 3 spread over a couple of hours. It suggests to me that the drive is no longer usable. But I'm no expert. =3D=3D=3D Mark Millard marklmi at yahoo.com