[Bug 284743] System reproducably livelocks after a couple of hours in poudriere bulk -a

From: <bugzilla-noreply_at_freebsd.org>
Date: Mon, 17 Feb 2025 15:42:36 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=284743

Olivier Certner <olce@FreeBSD.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |olce@FreeBSD.org

--- Comment #2 from Olivier Certner <olce@FreeBSD.org> ---
Hi,

This is a shot in the dark, but it might be related to new vnlru problems I'm
observing on some full-ZFS machines when building multiple worlds at once,
since an upgrade to a recent stable/14 (commit
ab05a1cf321aca0fe632c1ab40f68630b477422c might have something to do with it,
but I have not thoroughly analyzed the situation yet).

Are you able to obtain backtraces of all processes during a livelock (`procstat
-a -kk`)?  Try keeping a console open with `top` already running: it lets you
kill some processes during the livelock without spawning new ones, which
apparently allows the system to run again (after a long while, in my
experiments).  Also, please post the output of `sysctl kern.maxfiles
kern.maxfilesperproc vfs.vnode` from before the livelock and, if you're able
to, from during it as well.
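
One possible way to have the sysctl values requested above on record is to log
them periodically, starting well before the livelock.  The following is only a
rough sketch: the `LOG`, `INTERVAL` and `SYSCTL` variables, the `snapshot`
function, the `run` argument and the log path are hypothetical names chosen for
illustration.  The loop spawns `date`, `sysctl` and `sleep` on every iteration,
so it may well stall once the livelock sets in; the last records written before
that point are the useful part.

```sh
#!/bin/sh
# Sketch: append timestamped sysctl snapshots to a log file.
# LOG, INTERVAL and SYSCTL are hypothetical knobs, not taken from the report.
LOG="${LOG:-/var/tmp/vnode-sysctl.log}"
INTERVAL="${INTERVAL:-60}"
SYSCTL="${SYSCTL:-sysctl}"

snapshot() {
    # One record: a UTC timestamp line followed by the requested values.
    {
        printf '=== %s ===\n' "$(date -u '+%Y-%m-%dT%H:%M:%SZ')"
        "$SYSCTL" kern.maxfiles kern.maxfilesperproc vfs.vnode 2>&1
    } >> "$LOG"
}

# Loop only when invoked with "run", so the function can also be used alone.
if [ "${1:-}" = "run" ]; then
    while :; do
        snapshot
        sleep "$INTERVAL"
    done
fi
```

Assuming the script is saved as `vnode-log.sh` (again a made-up name), it could
be started with `sh vnode-log.sh run &` before launching the build.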

Regards.
