Re: ... was killed: a thread waited too long to allocate a page

In reply to: Karl Pielorz : "Re: ... was killed: a thread waited too long to allocate a page"
Go to: [ bottom of page ] [ top of archives ] [ this month ]

From: Miroslav Lachman <000.fbsd_at_quip.cz>
Date: Thu, 01 Feb 2024 16:32:00 UTC

On 01/02/2024 15:47, Karl Pielorz wrote:

[..]

> I seem to have run into something similar. I recently upgraded a 12.4 
> box to 13.2p9. The box has 32G of RAM, and runs ZFS. We do a lot of 
> rsync work to it monthly - the first month we've done this with 13.2p9 
> we get a lot of processes killed, all with a similar (but not identical) 
> message, e.g.
> 
> pid 11103 (ssh), jid 0, uid 0, was killed: failed to reclaim memory
> pid 10972 (local-unbound), jid 0, uid 59, was killed: failed to reclaim 
> memory
> pid 3223 (snmpd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3243 (mountd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3251 (nfsd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 10996 (sshd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3257 (sendmail), jid 0, uid 0, was killed: failed to reclaim memory
> pid 8562 (csh), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3363 (smartd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 8558 (csh), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3179 (ntpd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 8555 (tcsh), jid 0, uid 1001, was killed: failed to reclaim memory
> pid 3260 (sendmail), jid 0, uid 25, was killed: failed to reclaim memory
> pid 2806 (devd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3156 (rpcbind), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3252 (nfsd), jid 0, uid 0, was killed: failed to reclaim memory
> pid 3377 (getty), jid 0, uid 0, was killed: failed to reclaim memory
> 
> This 'looks' like 'out of RAM' type situation - but at the time, top 
> showed:

I remember something similar on our machines after upgrade to 13.x about 
a year ago. But don't remember what steps we take to walk around this 
issue (if any). I also see this on my FreeBSD based desktop from time to 
time... "ad more memory" and limit the ARC in loader.conf is my way.

[..]

> I've not seen any bug reports along this line, in fact very little 
> coverage at all of the specific error.
> 
> My only thought is to set a sysctl to limit ZFS ARC usage, i.e. to leave 
> more free RAM floating around the system. During the rsync it was 
> 'swapping' occasionally (few K in, few K out) - but it never ran out of 
> swap that I saw - and it certainly didn't look like an complete out of 
> memory scenario/box (which is what it felt like with everything getting 
> killed).