From nobody Sat Feb 25 16:09:43 2023 X-Original-To: freebsd-arm@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4PPBY66bjPz3tZft for ; Sat, 25 Feb 2023 16:09:46 +0000 (UTC) (envelope-from mike@karels.net) Received: from mail2.karels.net (mail2.karels.net [3.19.118.201]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (2048 bits) client-digest SHA256) (Client CN "freebsd", Issuer "freebsd" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 4PPBY64LSVz3LMC; Sat, 25 Feb 2023 16:09:46 +0000 (UTC) (envelope-from mike@karels.net) Authentication-Results: mx1.freebsd.org; none Received: from mail2.karels.net (localhost [IPv6:0:0:0:0:0:0:0:1]) by mail2.karels.net (8.16.1/8.16.1) with ESMTP id 31PG9ijW064565; Sat, 25 Feb 2023 10:09:44 -0600 (CST) (envelope-from mike@karels.net) Received: from [10.0.2.130] ([73.62.165.147]) by mail2.karels.net with ESMTPSA id vAPgKcgy+mMz/AAAs/W3XQ (envelope-from ); Sat, 25 Feb 2023 10:09:44 -0600 From: Mike Karels To: Mark Millard Cc: Mitchell Horne , freebsd-arm Subject: Re: reboot broken on RPi4 on main [breaks at git: e6cf1a0826c9 - main - physmem: add ram0 pseudo-driver] Date: Sat, 25 Feb 2023 10:09:43 -0600 X-Mailer: MailMate (1.14r5937) Message-ID: <5E055A77-24DB-44D3-89F8-5113555283EA@karels.net> In-Reply-To: References: <9EA91B94-87E4-47B8-A4E2-60D556E64276@yahoo.com> <4FCD4987-6DAF-4889-B684-B6E464F41144@yahoo.com> List-Id: Porting FreeBSD to ARM processors List-Archive: https://lists.freebsd.org/archives/freebsd-arm List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-arm@freebsd.org MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable X-Rspamd-Queue-Id: 4PPBY64LSVz3LMC X-Spamd-Bar: ---- X-Spamd-Result: default: False [-4.00 / 15.00]; REPLY(-4.00)[]; ASN(0.00)[asn:16509, ipnet:3.16.0.0/14, country:US] X-Rspamd-Pre-Result: action=no action; module=replies; Message is reply to one we originated X-ThisMailContainsUnwantedMimeParts: N Top posting to make this easier to find: I verified that the current kernel can reboot if it was booted with hint.ram.0.disabled=3D=E2=80=9C1=E2=80=9D in /boot/device.hints. Mike On 25 Feb 2023, at 3:58, Mark Millard wrote: > [I probably should have also shown related output from > earlier in the boot log.] > > On Feb 25, 2023, at 01:23, Mark Millard wrote: > >> On Feb 24, 2023, at 22:16, Mark Millard wrote: >> >>> [The first main version to not reboot RPi4B's is: >>> git: e6cf1a0826c9 - main - physmem: add ram0 pseudo-driver Mitchell H= orne >>> ] >>> >>> On Feb 24, 2023, at 17:45, Mark Millard wrote: >>> >>>> On Feb 24, 2023, at 15:18, Mark Millard wrote: >>>> >>>>> On Feb 23, 2023, at 14:01, Mike Karels wrote: >>>>> >>>>>> Reboot (shutdown -r) is hanging on RPi4 on main as of today=E2=80=99= s snapshot. >>>>>> It hangs after printing the Uptime. The initial bootstrap and boo= t from >>>>>> power-up work. I haven=E2=80=99t tested a snapshot since Jan 1, s= o I=E2=80=99m not sure >>>>>> when it stopped working. Any ideas what might have broken it? I = can bisect >>>>>> the recent snapshots if nothing else. >>>>>> >>>>>> 13.2-BETA2 works, however. >>>>> >>>>> For reference . . . >>>>> >>>>> While it is a personal build instead of a snapshot, >>>>> I see the "shutdown -r now" hang problem for my build >>>>> based on: >>>>> >>>>> # ~/fbsd-based-on-what-commit.sh -C /usr/main-src/ >>>>> d04c86717c8c (HEAD -> main, freebsd/main, freebsd/HEAD) bsd.sys.mk:= Add NO_WSTRICT_PROTOTYPES like in kernel >>>>> branch: main >>>>> merge-base: d04c86717c8ca3aa1bd9d8927a37a1f5443925b5 >>>>> merge-base: CommitDate: 2023-02-19 07:02:12 +0000 >>>>> n261026 (--first-parent --count for merge-base) >>>>> >>>>> The context has a "B0T" 8 GiByte RPi4B. >>>> >>>> For the artifact builds available for arm64, testing >>>> the kernels: >>>> >>>> The last that has "shutdown -r now" working on the RPi4B I'm using: >>>> >>>> =E2=80=A2 git: 8b418c83d175 - main - cp: Adjust the sparse file tes= ts. Dag-Erling Sm=C3=B8rgrav >>>> ( https://artifact.ci.freebsd.org/snapshot/main/8b418c83d175fde3b1f6= 5210509ddcf2ac248d9f/arm64/aarch64/kernel.txz ) >>>> >>>> No arm64 artifacts after that until . . . >>>> >>>> The first that has "shutdown -r now" failing on the same RPi4B: >>>> >>>> =E2=80=A2 git: ded5f2954e1a - main - nfsd: Fix handling of the erro= r case for nfsvno_open Rick Macklem >>>> ( https://artifact.ci.freebsd.org/snapshot/main/ded5f2954e1a1bb77486= 46888938af767ee6257a/arm64/aarch64/kernel.txz ) >>>> >>>> So the range is limited to (including those end points): >>>> (All are 2023-Feb-08 artifact builds.) >>>> >>>> =E2=80=A2 git: 8b418c83d175 - main - cp: Adjust the sparse file tes= ts. Dag-Erling Sm=C3=B8rgrav >>>> =E2=80=A2 git: 87d405eab911 - main - iommu_gas: initialize start_ga= p as first node Doug Moore >>>> =E2=80=A2 git: 8c784bb8cf36 - main - lua: Update to 5.4.4 Warner Lo= sh >>>> =E2=80=A2 git: 5fff09660e06 - main - geli: split the initalization = of HMAC Gordon Tetlow >>>> =E2=80=A2 git: 81ad626541db - main - Merge llvm-project main llvmor= g-15-init-15358-g53dc0f10787 Dimitry Andric >>>> =E2=80=A2 git: 753f127f3ace - main - Merge llvm-project main llvmor= g-15-init-16436-g18a6ab5b8d1f Dimitry Andric >>>> =E2=80=A2 git: fcaf7f8644a9 - main - Merge llvm-project main llvmor= g-15-init-17485-ga3e38b4a206b Dimitry Andric >>>> =E2=80=A2 git: 972a253a57b6 - main - Merge llvm-project main llvmor= g-15-init-17826-g1f8ae9d7e7e4 Dimitry Andric >>>> =E2=80=A2 git: 61cfbce3347e - main - Merge llvm-project release/15.= x llvmorg-15.0.0-rc2-40-gfbd2950d8d0d Dimitry Andric >>>> =E2=80=A2 git: a4a491e2238b - main - Merge llvm-project release/15.= x llvmorg-15.0.0-9-g1c73596d3454 Dimitry Andric >>>> =E2=80=A2 git: 6246ae0b85d8 - main - Merge llvm-project release/15.= x llvmorg-15.0.2-10-gf3c5289e7846 Dimitry Andric >>>> =E2=80=A2 git: f3fd488f1e19 - main - Merge llvm-project release/15.= x llvmorg-15.0.6-0-g088f33605d8a Dimitry Andric >>>> =E2=80=A2 git: 50d7464c3fe6 - main - Merge llvm-project release/15.= x llvmorg-15.0.7-0-g8dfdcc7b7bf6 Dimitry Andric >>>> =E2=80=A2 git: 3264f6b88fce - main - Bump __FreeBSD_version for llv= m 15.0.7 merge Dimitry Andric >>>> =E2=80=A2 git: 89a072d11cd2 - main - Makefile.amd64: remove constru= ct that serves no purpose Warner Losh >>>> =E2=80=A2 git: ae1dca798e0f - main - e1000: fix I219 hang on reset = Kevin Bowling >>>> =E2=80=A2 git: 647f2d2bc0cb - main - e1000: bump driver version Kev= in Bowling >>>> =E2=80=A2 git: 48bfd3597654 - main - Add nproc(1) Mateusz Guzik >>>> =E2=80=A2 git: 1d03c3578d05 - main - arm: add an interrupt rman to = nexus Mitchell Horne >>> >>> My build of the above version works for "shutdown -r now". >>> >>>> =E2=80=A2 git: f9bdaab95ec4 - main - ofwbus: remove handling of res= ources from ofwbus Mitchell Horne >>> >>> My build of the above version works for "shutdown -r now". >>> It is the last version to do so. >>> >>>> =E2=80=A2 git: e6cf1a0826c9 - main - physmem: add ram0 pseudo-drive= r Mitchell Horne >>> >>> My build of the above version fails for "shutdown -r now": >>> hang-up just after Uptime: . . . message. >>> >>>> =E2=80=A2 git: fa3f6655421f - main - netmap: drop redundant if_mtu = assignment Vincenzo Maffione >>>> =E2=80=A2 git: ded5f2954e1a - main - nfsd: Fix handling of the erro= r case for nfsvno_open Rick Macklem >>>> >>>> . . . >>> >>> >> >> The following from boot -v looks somewhat odd: >> >> ram0: reserving memory region: 2000-7ef0000 >> ram0: reserving memory region: 7f10000-31c00000 >> ram0: reserving memory region: 331d7000-39c2a000 >> ram0: reserving memory region: 39c2b000-39c2e000 >> ram0: reserving memory region: 39c2f000-39c30000 >> ram0: reserving memory region: 39c32000-39c33000 >> ram0: reserving memory region: 39c37000-3b050000 >> ram0: reserving memory region: 3b060000-3b300000 >> ram0: reserving memory region: 40000000-fc000000 >> ram0: reserving excluded region: 0-1fff >> ram0: reserving excluded region: 7ef0000-7f0ffff >> ram0: reserving excluded region: 31c00000-331d6fff >> ram0: reserving excluded region: 39c2a000-39c2afff >> ram0: reserving excluded region: 39c2e000-39c2efff >> ram0: reserving excluded region: 39c30000-39c31fff >> ram0: reserving excluded region: 39c33000-39c36fff >> ram0: reserving excluded region: 3b050000-3b05ffff >> ram0: reserving excluded region: 3ee5c000-3ee5cfff >> ram0: reserving excluded region: 3ee5c000-3ee5cfff >> ram0: failed to reserve region >> ram0: reserving excluded region: fe100000-fe100fff >> >> Possible oddities (4 GiByte RPi4B example): >> >> Nothing covers any part of [3b300000,3ee5c000) . >> Exclusion [3ee5c000-3ee5cfff] is repeated. (Related to the above?) >> The "failed to reserve region" notice. (Related to repetition?) >> Nothing covers any part of [fc000000,fe100000) . >> Nothing covers any part of [fe101000,ffffffff] . >> >> (The 2 different styles of specifying high bounds >> reads oddly.) > > Related boot log material: > > Type Physical Virtual #Pages Attr > Reserved 000000000000 000000000000 00000002 WB > ConventionalMemory 000000002000 000000002000 00007eee WB > ACPIReclaimMemory 000007ef0000 000007ef0000 00000020 WB > ConventionalMemory 000007f10000 000007f10000 00029c43 WB > LoaderData 000031b53000 000031b53000 00000001 WB > LoaderCode 000031b54000 000031b54000 00004000 WB > LoaderData 000035b54000 000035b54000 00004000 WB > LoaderCode 000039b54000 000039b54000 000000ce WB > BootServicesData 000039c22000 000039c22000 00000008 WB > RuntimeServicesData 000039c2a000 000039c2a000 00000001 WB RUNTIME > BootServicesData 000039c2b000 000039c2b000 00000003 WB > RuntimeServicesData 000039c2e000 000039c2e000 00000001 WB RUNTIME > BootServicesData 000039c2f000 000039c2f000 00000001 WB > RuntimeServicesData 000039c30000 000039c30000 00000002 WB RUNTIME > BootServicesData 000039c32000 000039c32000 00000001 WB > RuntimeServicesData 000039c33000 000039c33000 00000004 WB RUNTIME > BootServicesData 000039c37000 000039c37000 00000009 WB > BootServicesCode 000039c40000 000039c40000 00001410 WB > RuntimeServicesCode 00003b050000 00003b050000 00000010 WB RUNTIME > BootServicesCode 00003b060000 00003b060000 000000a0 WB > BootServicesData 00003b100000 00003b100000 00000200 WB > Reserved 00003ee5c000 00003ee5c000 00000001 WB > BootServicesData 000040000000 000040000000 000bc000 WB > MemoryMappedIO 0000fe100000 0000fe100000 00000001 RUNTIME > Physical memory chunk(s): > 0x00002000 - 0x3b2fffff, 946 MB ( 242430 pages) > 0x40000000 - 0xfbffffff, 3008 MB ( 770048 pages) > Excluded memory regions: > 0x00000000 - 0x00001fff, 0 MB ( 2 pages) NoAlloc > 0x07ef0000 - 0x07f0ffff, 0 MB ( 32 pages) NoAlloc > 0x31c00000 - 0x331d6fff, 21 MB ( 5591 pages) NoAlloc > 0x39c2a000 - 0x39c2afff, 0 MB ( 1 pages) NoAlloc > 0x39c2e000 - 0x39c2efff, 0 MB ( 1 pages) NoAlloc > 0x39c30000 - 0x39c31fff, 0 MB ( 2 pages) NoAlloc > 0x39c33000 - 0x39c36fff, 0 MB ( 4 pages) NoAlloc > 0x3b050000 - 0x3b05ffff, 0 MB ( 16 pages) NoAlloc > 0x3ee5c000 - 0x3ee5cfff, 0 MB ( 1 pages) NoAlloc NoDump > 0x3ee5c000 - 0x3ee5cfff, 0 MB ( 1 pages) NoAlloc > 0xfe100000 - 0xfe100fff, 0 MB ( 1 pages) NoAlloc > > and: > > Physical memory chunk(s): > 0x00000000002000 - 0x00000007eeffff, 133095424 bytes (32494 pages) > 0x00000007f10000 - 0x00000031bfffff, 701431808 bytes (171248 pages) > 0x000000331d7000 - 0x00000039c29fff, 111489024 bytes (27219 pages) > 0x00000039c2b000 - 0x00000039c2dfff, 12288 bytes (3 pages) > 0x00000039c2f000 - 0x00000039c2ffff, 4096 bytes (1 pages) > 0x00000039c32000 - 0x00000039c32fff, 4096 bytes (1 pages) > 0x00000039c37000 - 0x0000003b04ffff, 21073920 bytes (5145 pages) > 0x0000003b060000 - 0x0000003b2fffff, 2752512 bytes (672 pages) > 0x00000040000000 - 0x000000f5e2ffff, 3051552768 bytes (745008 pages) > > So: Some of the oddities go back to some of this earlier > material. For example, somehow: > > Reserved 00003ee5c000 00003ee5c000 00000001 WB > > turned into: > > 0x3ee5c000 - 0x3ee5cfff, 0 MB ( 1 pages) NoAlloc NoDump > 0x3ee5c000 - 0x3ee5cfff, 0 MB ( 1 pages) NoAlloc > > > > =3D=3D=3D > Mark Millard > marklmi at yahoo.com