Re: nvme timeout issues with hardware and bhyve vm's

From: Pete Wright <pete_at_nomadlogic.org>
Date: Thu, 07 Dec 2023 23:49:29 UTC

On 12/7/23 3:16 PM, Craig Leres wrote:
> On 12/7/23 15:09, Tomoaki AOKI wrote:
>> If I myself encounter this kind of problem ON BARE METAL HARDWARE,
>> I would usually suspect
>>
>>   *Overheating caused hang of NVMe controller or PCI bridge on SSD, or
> 
> This would also be my first guess.
> 
> Five years ago I had an nmve in an intel nuc that would sometimes "go to 
> sleep", here's the thread
> 
> 
> https://lists.freebsd.org/pipermail/freebsd-hackers/2018-May/052783.html
> 
> @imp helpfully suggested running "nvmecontrol logpage -p 2 nvme0" which 
> showed mine was hot (60° C/140° F)! I adjusted the fan settings in the 
> bios and have never had an issue since.
> 

oh interesting, i'll run that next time it locks up.  the box is well 
ventilated, but that's not to say its not overheating.  right now its at:
Temperature:                    314 K, 40.85 C, 105.53 F

nvemecontrol doesn't list any errors or warnings though:
Media errors:                   0
No. error info log entries:     0
Warning Temp Composite Time:    0
Error Temp Composite Time:      0

thanks for the tip!
-pete

-- 
Pete Wright
pete@nomadlogic.org