jeejqdgutghhpeephfhrvggvuefuffdrohhrghesfhgrshhtmhgrihhlrdhfmhdpnhgspg hrtghpthhtohepuddpmhhouggvpehsmhhtphhouhhtpdhrtghpthhtohepfhhrvggvsghs ugdqfhhssehfrhgvvggsshgurdhorhhg X-ME-Proxy: Feedback-ID: icedc46df:Fastmail Received: by mailuser.phl.internal (Postfix, from userid 501) id 25137B00068; Mon, 4 Nov 2024 12:31:37 -0500 (EST) X-Mailer: MessagingEngine.com Webmail Interface List-Id: Filesystems List-Archive: https://lists.freebsd.org/archives/freebsd-fs List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-fs@FreeBSD.org MIME-Version: 1.0 Date: Mon, 04 Nov 2024 17:28:42 +0000 From: "Dave Cottlehuber" To: freebsd-fs Message-Id: <3293802b-3785-4715-8a6b-0802afb6f908@app.fastmail.com> Subject: nvme device errors & zfs Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable What's the best way to see error counters or states on an nvme device? I have a typical mirrored nvme zpool, that reported enough errors in a burst last week, that 1 drive dropped off the bus [1]. After a reboot, it resilvered, I cleared the errors, and it seems fine according to repeated scrubs and a few days of use. I was unable to see any errors from the nvme drive itself, but as its (just) in warranty for 2 more weeks I'd like to know if I should return it. 
I installed ports `sysutils/nvme-cli` and didn't see anything
of note there either:

$ doas nvme smart-log /dev/nvme1
0xc0484e41: opc: 0x2 fuse: 0 cid 0 nsid:0xffffffff cmd2: 0 cmd3: 0
          : cdw10: 0x7f0002 cdw11: 0 cdw12: 0 cdw13: 0
          : cdw14: 0 cdw15: 0 len: 0x200 is_read: 0
<--- 0 cid: 0 status 0
Smart Log for NVME device:nvme1 namespace-id:ffffffff
critical_warning                    : 0
temperature                         : 39 C
available_spare                     : 100%
available_spare_threshold           : 10%
percentage_used                     : 3%
data_units_read                     : 121681067
data_units_written                  : 86619659
host_read_commands                  : 695211450
host_write_commands                 : 2187823697
controller_busy_time                : 2554
power_cycles                        : 48
power_on_hours                      : 6342
unsafe_shutdowns                    : 38
media_errors                        : 0
num_err_log_entries                 : 0
Warning Temperature Time            : 0
Critical Composite Temperature Time : 0
Temperature Sensor 1                : 39 C
Temperature Sensor 2                : 43 C
Thermal Management T1 Trans Count   : 0
Thermal Management T2 Trans Count   : 0
Thermal Management T1 Total Time    : 0
Thermal Management T2 Total Time    : 0

[1]: zpool status

  status: One or more devices are faulted in response to persistent errors.
          Sufficient replicas exist for the pool to continue functioning in a
          degraded state.
  action: Replace the faulted device, or use 'zpool clear' to mark the device
          repaired.
    scan: scrub repaired 0B in 00:17:59 with 0 errors on Thu Oct 31 16:24:36 2024
  config:

          NAME          STATE     READ WRITE CKSUM
          zroot         DEGRADED     0     0     0
            mirror-0    DEGRADED     0     0     0
              gpt/zfs0  ONLINE       0     0     0
              gpt/zfs1  FAULTED      0     0     0  too many errors

A+
Dave
———
O for a muse of fire, that would ascend the brightest heaven of invention!
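
For keeping an eye on these counters over the remaining warranty period, here is a minimal sketch (not part of the original report) that parses the text printed by `nvme smart-log` and reports any error counter that is nonzero. The function names and the choice of `critical_warning`, `media_errors` and `num_err_log_entries` as the counters worth flagging are assumptions made for illustration; it relies only on the `name : value` layout shown in the output above.

```python
import re

# Counters treated as error indicators in this sketch. Which fields matter
# is an assumption; the names are taken from the smart-log output above.
ERROR_COUNTERS = ("critical_warning", "media_errors", "num_err_log_entries")

# A field line looks like "name   : value", with whitespace on both sides of
# the colon. Debug lines such as "cid: 0" or "device:nvme1" do not match.
_FIELD = re.compile(r"^\s*(\S.*?)\s+:\s+(.*?)\s*$")


def parse_smart_log(text):
    """Return a dict mapping field name to its raw string value."""
    fields = {}
    for line in text.splitlines():
        match = _FIELD.match(line)
        if match:
            fields[match.group(1)] = match.group(2)
    return fields


def nonzero_error_counters(fields):
    """Return {name: count} for each watched counter that is not zero."""
    flagged = {}
    for name in ERROR_COUNTERS:
        # Values may carry a unit suffix ("39 C", "3%"); keep leading digits.
        match = re.match(r"\d+", fields.get(name, ""))
        if match and int(match.group()) != 0:
            flagged[name] = int(match.group())
    return flagged
```

Fed the smart-log text above with one field per line, `nonzero_error_counters(parse_smart_log(text))` returns an empty dict, since all three watched counters read 0. That matches the observation that the drive itself reports nothing, even though ZFS faulted it.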