From nobody Tue Jun 21 00:23:56 2022
Date: Mon, 20 Jun 2022 19:23:56 -0500
From: Larry Rosenman
To: Ultima
Cc: Freebsd current
Subject: Re: MCE: Does this look possibly like a slot issue?
List-Id: Discussions about the use of FreeBSD-current
List-Archive: https://lists.freebsd.org/archives/freebsd-current
Sender: owner-freebsd-current@freebsd.org
X-Sender: ler@lerctr.org
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="=_8fe087faf2cbfc1449967c53350e6d2c"

--=_8fe087faf2cbfc1449967c53350e6d2c
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=US-ASCII; format=flowed

I'm seeing them constantly:

root@freenas[~]# mcelog --dmi
Hardware event. This is not a software error.
MCE 0
CPU 22 BANK 8 TSC 20aab486464a
MISC ac29890200046444 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 44
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
WARNING: SMBIOS data is often unreliable. Take with a grain of salt!
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 1
CPU 22 BANK 8 TSC 296dfcc82582
MISC ac29890200041381 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 81
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 2
CPU 22 BANK 8 TSC 2a5604a6a070
MISC ac29890200044281
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory ECC error occurred during scrub
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 81
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 88000040000200cf MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
Hardware event. This is not a software error.
MCE 3
CPU 22 BANK 8 TSC 31e141418eb8
MISC ac29890200046a4a ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 4a
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 4
CPU 22 BANK 8 TSC 3a014afee106
MISC ac29890200046646 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 46
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 5
CPU 22 BANK 8 TSC 41d1dbef1a6a
MISC ac29890200046141 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 41
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 6
CPU 22 BANK 8 TSC 4a1b1ecef446
MISC ac29890200046a4a ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 4a
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 7
CPU 22 BANK 8 TSC 527bc27db776
MISC ac29890200040386 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 86
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE 8
CPU 22 BANK 8 TSC 5aa4ecdd795a
MISC ac29890200046646 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 46
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
root@freenas[~]#

and I replaced the DIMM yesterday :(

On 06/20/2022 7:19 pm, Ultima wrote:

> Hey Larry,
>
> It is possible it's the motherboard itself, but it's rare. The way I
> would determine this is to swap the DIMM module with another
> populated slot on the motherboard and see if the error migrated
> to the new slot or not. Also, this error doesn't necessarily mean
> there is a problem that needs to be addressed. If you have been
> running the system for many months and you see ECC errors a
> handful of times, it can probably be safely ignored.
>
> Best regards,
> Richard Gallamore
>
> On Mon, Jun 20, 2022 at 3:14 PM Larry Rosenman <ler@lerctr.org> wrote:
>
>> I've gotten a BUNCH of these on my TrueNAS server. I've replaced this
>> DIMM a couple of times, and still the MCEs continue.
>> Is it possible it's a motherboard slot issue?
>>
>> Hardware event. This is not a software error.
>> MCE 8
>> CPU 22 BANK 8 TSC 5aa4ecdd795a
>> MISC ac29890200046646 ADDR ee2f6e800
>> TIME 1655762472 Mon Jun 20 17:01:12 2022
>> MCG status:
>> Memory read ECC error
>> Memory corrected error count (CORE_ERR_CNT): 1
>> Memory transaction Tracker ID (RTId): 46
>> Memory DIMM ID of error: 0
>> Memory channel ID of error: 1
>> Memory ECC syndrome: ac298902
>> STATUS 8c0000400001009f MCGSTATUS 0
>> MCGCAP 1c09 APICID 34 SOCKETID 0
>> CPUID Vendor Intel Family 6 Model 44 Step 2
>> DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
>> Device Locator: P2-DIMM2C
>> Bank Locator: BANK14
>> Manufacturer: Hyundai
>> Serial Number: 40F3C20F
>> Asset Tag:
>> Part Number: HMT151R7BFR4C-H9
>>
>> --
>> Larry Rosenman http://www.lerctr.org/~ler
>> Phone: +1 214-642-9640 E-Mail: ler@lerctr.org
>> US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106

--
Larry Rosenman http://www.lerctr.org/~ler
Phone: +1 214-642-9640 E-Mail: ler@lerctr.org
US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106

--=_8fe087faf2cbfc1449967c53350e6d2c
Content-Transfer-Encoding: quoted-printable
Content-Type: text/html; charset=UTF-8

--=_8fe087faf2cbfc1449967c53350e6d2c--
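
For the slot-swap test Ultima describes, it may help to tally the mcelog records by slot and by module serial, so it is obvious whether the errors follow the DIMM or stay with the slot. What follows is a minimal Python sketch and is not part of the original thread. The function names (split_records, field, summarize, status_flags) are made up for illustration, the sample text is abridged from MCE 0 and MCE 2 in the output above, and the STATUS bit positions assume the architectural IA32_MCi_STATUS layout documented in the Intel SDM (VAL=63, OVER=62, UC=61, EN=60, MISCV=59, ADDRV=58, PCC=57).

```python
import re
from collections import Counter

# Abridged from MCE 0 and MCE 2 in the mcelog output quoted in this thread.
SAMPLE = """\
Hardware event. This is not a software error.
MCE 0
CPU 22 BANK 8 TSC 20aab486464a
MISC ac29890200046444 ADDR ee2f6e800
Memory read ECC error
Memory DIMM ID of error: 0
Memory channel ID of error: 1
STATUS 8c0000400001009f MCGSTATUS 0
Device Locator: P2-DIMM2C
Serial Number: 40F3C20F
Hardware event. This is not a software error.
MCE 2
CPU 22 BANK 8 TSC 2a5604a6a070
MISC ac29890200044281
Memory ECC error occurred during scrub
Memory DIMM ID of error: 0
Memory channel ID of error: 1
STATUS 88000040000200cf MCGSTATUS 0
"""


def split_records(text):
    """Split mcelog output into one chunk per 'Hardware event.' record."""
    chunks = re.split(r"^Hardware event\..*$", text, flags=re.MULTILINE)
    return [chunk for chunk in chunks if chunk.strip()]


def field(record, pattern):
    """Return the first capture group of pattern in record, or None."""
    match = re.search(pattern, record, flags=re.MULTILINE)
    return match.group(1) if match else None


def summarize(text):
    """Count records per (slot locator, module serial, channel, DIMM ID)."""
    tally = Counter()
    for record in split_records(text):
        key = (
            field(record, r"^Device Locator:[ \t]*(\S+)"),
            field(record, r"^Serial Number:[ \t]*(\S+)"),
            field(record, r"^Memory channel ID of error:[ \t]*(\d+)"),
            field(record, r"^Memory DIMM ID of error:[ \t]*(\d+)"),
        )
        tally[key] += 1
    return tally


def status_flags(status_hex):
    """Decode the top flag bits of a STATUS value (assumed SDM layout)."""
    value = int(status_hex, 16)
    bits = {"VAL": 63, "OVER": 62, "UC": 61, "EN": 60,
            "MISCV": 59, "ADDRV": 58, "PCC": 57}
    return {name: bool((value >> pos) & 1) for name, pos in bits.items()}


if __name__ == "__main__":
    for key, count in summarize(SAMPLE).items():
        print(count, key)
    for record in split_records(SAMPLE):
        status = field(record, r"^STATUS[ \t]+([0-9a-fA-F]+)")
        print(status, status_flags(status))
```

Run against the full `mcelog --dmi` output before and after the swap: if the Device Locator changes while the serial number stays the same, the module is the likely suspect; if the locator stays put while the serial number changes, that points toward the slot or channel. The SMBIOS caveat printed by mcelog applies to both fields. Under the assumed bit layout, both STATUS values in the sample decode with UC clear, and the scrub record decodes with ADDRV clear, which is consistent with that record printing no ADDR.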