Re: nvme device errors & zfs
- In reply to: Dave Cottlehuber: "Re: nvme device errors & zfs"
- Go to: [ bottom of page ] [ top of archives ] [ this month ]
Date: Tue, 05 Nov 2024 17:21:25 UTC
On Tue, Nov 5, 2024 at 4:20 AM Dave Cottlehuber <dch@freebsd.org> wrote: > On Tue, 5 Nov 2024, at 11:10, Tomek CEDRO wrote: > > Magician software can upgrade firmware and perform other checks, works > > on Windoze macOS and Android: > > that will be difficult, I don't have an nvme capable thing for any > of those. > nvmecontrol updates firmware just fine, though. > > Another idea is maybe disk overheats and resets itself to cool down? > > that is a great point, the mainboard comes with inbuilt heatsinks, but > when I assembled it, the 2nd nvme slot heatsink looked a lot less > bulky than the other one, I remember distinctly wondering if it would > cope. If it happens again I'll see if I can get a temp measurement > at the time of failure. > > I would hope temperature throttling would not be quite so brutal, to > remove itself from the bus entirely, but its a reasonable explanation. > What's supposed to happen is that the temperature climbs slowly enough that there's a chance for it to kick in. It might be thermals, but I'd expect at least some indications that it's thermal. Heat sinks are cheap enough, if it's really thermal. log page 2 has the temperature. How often does the reset happen? A firmware upgrade might solve that problem if they are older. There might just be a bug that causes the firmware to 'trap' and it takes several seconds for the SoC / controller to reboot. Warner > A+ > Dave > ——— > O for a muse of fire, that would ascend the brightest heaven of invention! >