From nobody Mon Sep 09 17:31:44 2024 X-Original-To: freebsd-fs@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id 4X2YmQ3n20z5TVR8 for ; Mon, 09 Sep 2024 17:31:50 +0000 (UTC) (envelope-from allanjude@freebsd.org) Received: from tor1-11.mx.scaleengine.net (tor1-11.mx.scaleengine.net [209.51.186.6]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 4X2YmP6CqZz46Nx for ; Mon, 9 Sep 2024 17:31:49 +0000 (UTC) (envelope-from allanjude@freebsd.org) Authentication-Results: mx1.freebsd.org; dkim=none; dmarc=fail reason="No valid SPF, No valid DKIM" header.from=freebsd.org (policy=none); spf=softfail (mx1.freebsd.org: 209.51.186.6 is neither permitted nor denied by domain of allanjude@freebsd.org) smtp.mailfrom=allanjude@freebsd.org Received: from [10.0.1.242] (unknown [38.242.13.190]) (Authenticated sender: allanjude.freebsd@scaleengine.com) by tor1-11.mx.scaleengine.net (Postfix) with ESMTPSA id 79C9136C97 for ; Mon, 9 Sep 2024 17:31:43 +0000 (UTC) DKIM-Filter: OpenDKIM Filter v2.10.3 tor1-11.mx.scaleengine.net 79C9136C97 Message-ID: Date: Mon, 9 Sep 2024 13:31:44 -0400 List-Id: Filesystems List-Archive: https://lists.freebsd.org/archives/freebsd-fs List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-fs@FreeBSD.org MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: Does a failed separate ZIL disk mean the entire zpool is lost? To: freebsd-fs@freebsd.org References: <535969cf-0b0b-48ca-a163-fc238f316bb7@gmx.at> Content-Language: en-CA From: Allan Jude In-Reply-To: <535969cf-0b0b-48ca-a163-fc238f316bb7@gmx.at> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spamd-Bar: - X-Spamd-Result: default: False [-1.63 / 15.00]; SUBJECT_ENDS_QUESTION(1.00)[]; NEURAL_HAM_SHORT(-1.00)[-1.000]; NEURAL_HAM_LONG(-0.90)[-0.896]; NEURAL_HAM_MEDIUM(-0.84)[-0.841]; ONCE_RECEIVED(0.10)[]; MIME_GOOD(-0.10)[text/plain]; DMARC_POLICY_SOFTFAIL(0.10)[freebsd.org : No valid SPF, No valid DKIM,none]; XM_UA_NO_VERSION(0.01)[]; FREEFALL_USER(0.00)[allanjude]; ASN(0.00)[asn:6939, ipnet:209.51.160.0/19, country:US]; TO_MATCH_ENVRCPT_ALL(0.00)[]; RCVD_COUNT_ONE(0.00)[1]; RCVD_TLS_ALL(0.00)[]; ARC_NA(0.00)[]; RCVD_VIA_SMTP_AUTH(0.00)[]; FROM_HAS_DN(0.00)[]; TO_DOM_EQ_FROM_DOM(0.00)[]; TO_DN_NONE(0.00)[]; R_SPF_SOFTFAIL(0.00)[~all:c]; FROM_EQ_ENVFROM(0.00)[]; RCPT_COUNT_ONE(0.00)[1]; PREVIOUSLY_DELIVERED(0.00)[freebsd-fs@freebsd.org]; R_DKIM_NA(0.00)[]; MLMMJ_DEST(0.00)[freebsd-fs@freebsd.org]; MID_RHS_MATCH_FROM(0.00)[]; MIME_TRACE(0.00)[0:+] X-Rspamd-Queue-Id: 4X2YmP6CqZz46Nx As the last person mentioned, you should be able to import with the -m flag, and only lose about 5 seconds worth of writes. The pool is already partially imported at boot by the other mechanisms, you might need to disable that to prevent the partial import at boot, so you can do the manual import. On 2024-09-09 12:20 p.m., infoomatic wrote: > did you use two mirrored ZIL devices? > > You can "zpool import -m", but you will probably be confronted with some > errors - you will probably lose the data the ZIL has not committed, but > most of your data in your pool should be there > > > On 09.09.24 17:51, andy thomas wrote: >> A server I look after had a 65TB ZFS RAIDz1 pool with 8 x 8TB hard disks >> plus one hot spare and separate ZFS intent log (ZIL) and L2ARC cache >> disks that used a pair of 256GB SSDs. This ran really well for 6 years >> until 2 weeks ago, when the main cooling system in the data centre where >> it was installed failed and the backup cooling system failed to start >> up. >> >> The upshot was the ZIL SSD went short-circuit across its power >> connector, shorting out the server's PSUs and shutting down the server. >> After replacing the failed SSD and verifying all the spinning hard disks >> and the cache SSD are undamaged, attempts to import the pool fail with >> the following message: >> >> NAME       SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ   FRAG    CAP DEDUP >> HEALTH  ALTROOT >> clustor2      -      -      -        -         -      - -      - >> UNAVAIL  - >> >> Does this mean the pool's contents are now lost and unrecoverable? >> >> Andy >> >