[Bug 264141] nvme(4): Heavy load to SSD wedges 13.1 system: Controller in fatal status, resetting ... Resetting controller due to a timeout and possible hot unplug.

From: <bugzilla-noreply_at_freebsd.org>
Date: Sun, 22 May 2022 08:15:16 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=264141

--- Comment #9 from crb <crb@ChrisBowman.com> ---
running:
nvmecontrol logpage -p 1 nvme0
yields:

Error Information Log
=====================
No error entries found
root@eclipse:60> nvmecontrol logpage -p 0 nvme0
nvmecontrol: get log page request returned error
root@eclipse:61> nvmecontrol logpage -p 2 nvme0
SMART/Health Information Log
============================
Critical Warning State:         0x00
 Available spare:               0
 Temperature:                   0
 Device reliability:            0
 Read only:                     0
 Volatile memory backup:        0
Temperature:                    311 K, 37.85 C, 100.13 F
Available spare:                100
Available spare threshold:      10
Percentage used:                0
Data units (512,000 byte) read: 9999
Data units written:             1527715
Host read commands:             440839
Host write commands:            3023543
Controller busy time (minutes): 7
Power cycles:                   519
Power on hours:                 166
Unsafe shutdowns:               518
Media errors:                   0
No. error info log entries:     0
Warning Temp Composite Time:    0
Error Temp Composite Time:      0
Temperature 1 Transition Count: 0
Temperature 2 Transition Count: 0
Total Time For Temperature 1:   0
Total Time For Temperature 2:   0

So I bumped the case fans to full speed in the BIOS and

looping on 
nvmecontrol logpage -p 2 nvme0 | grep ^Temperature:

in one window while doing a "zpool scrub root" in another the temperature
doesn't budge:

Temperature:                    310 K, 36.85 C, 98.33 F

but the machine still wedges

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.