Re: qemu-user-static aarch64 lockup/race? (was Re: Python failure in poudriere on arm64 (via qemu-user-static cross compiling))

From: Warner Losh <imp_at_bsdimp.com>
Date: Sun, 28 Jan 2024 21:23:45 UTC
On Sun, Jan 28, 2024, 12:38 PM Guido Falsi <mad@madpilot.net> wrote:

> On 28/01/24 15:15, Guido Falsi wrote:
> > Hi all, again,
> >
> > I have some more findings about this, I'm top posting because the old
> > message is not really that much relevant anymore.
> >
> > I'm now running a machine with head (commit
> > b32d49cfbaa0437d08e65e7cd7c82c5951b1a852 Jan 25th), poudriere installed
> > in it, machine is amd64, with an arm64 jail, 14.0-RELEASE, installed
> > from official distribution binaries (https download method), with cross
> > tools.
> >
> > To make sure everything is aligned I rebuild everything: updated head,
> > rebuild cross tools in the jail, recompiled all ports for the host
> > architecture and force reinstalled them, especially qemu-user-static,
> > cleaned up all packages for the arm64 jail.
> >
> > If I missed something important please point it out.
> >
> > I have made some more tests and I'm getting python failures in poudriere
> > like the one described below from time to time (don't have hard stats
> > but feels like 50% chance). If I get past that it usually is able to
> > build all the not many packages, but locks up at:
> >
> > Creating repository in /tmp/packages:   0%
> >
>
> BTW, forgot to mention last time this worked without issue was around
> 20th December.
>

I think this is a bsd-user issue. There is a race somewhere in that code
that causes the hangs. I'd love a reproducible test case that is somewhat
smaller than python... there are bigger races with the newer stuff and I've
not had the time to chase it there either. 😞

Warner

Warner

Warner

-- 
> Guido Falsi <mad@madpilot.net>
>
>
>