Ominous smartd messages ....
William A. Mahaffey III
wam at hiwaay.net
Wed Aug 3 19:20:28 UTC 2016
On 08/03/16 11:45, Ken Moffat wrote:
> On Wed, Aug 03, 2016 at 07:37:50AM -0453, William A. Mahaffey III wrote:
>> On 08/03/16 07:24, Matthew Seaman wrote:
>>> On 08/03/16 04:41, William A. Mahaffey III wrote:
>>>> My question is: Are these messages benign, or am I in the market for
>>>> more hardware ? *ANY* more questions, please ask. TIA & have a good one.
>>> It doesn't look particularly good. Do you run the smartd selftests at
>>> all? Worth giving that a go --
>>>
>>> # smartctl -t long /dev/ada5
>>>
>>> This can be done while the disk is in use without upsetting anything.
>>> It will scan the disk for unreadable areas. The disk does have a number
>>> of spare sectors it can use instead of any broken ones, but it generally
>>> needs to see a failed write to the affected area to trigger the
>>> substitution mechanism. Once you run out of substitute sectors, the
>>> disk is basically toast and should be replaced. With modern drives,
>>> seeing that mechanism in use at all typically means the drive is on the
>>> downward spiral and you should plan on replacing it PDQ.
>>>
>>> Cheers,
>>>
>>> Matthew
>>>
>> Thanks, I kicked that off just now, it says it will take 169 min. to
>> complete, so we'll see. No more new messages overnight, maybe my luck is
>> good ....
>>
> I agree with Matthew, and since your custom script seemed to ignore
> all the items which I regard as relevant, after the long test has
> finished run smartctl -a on that drive and look at things like
> Reallocated Sector Count and Pending (if that is still present).
> plus any other error fields.
>
> ĸen
OK:
[root at devbox, /etc, 2:22:57pm] 373 % smartctl -a /dev/ada5
smartctl 6.4 2015-06-04 r4109 [FreeBSD 9.3-RELEASE-p33 amd64] (local build)
Copyright (C) 2002-15, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: HGST Travelstar 7K1000
Device Model: HGST HTS721010A9E630
Serial Number: JR10046P1E69HN
LU WWN Device Id: 5 000cca 7dcd41800
Firmware Version: JB0OA3J0
User Capacity: 1,000,204,886,016 bytes [1.00 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 7200 rpm
Form Factor: 2.5 inches
Device is: In smartctl database [for details use: -P show]
ATA Version is: ATA8-ACS T13/1699-D revision 6
SATA Version is: SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Wed Aug 3 14:23:03 2016 MCDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection:
Disabled.
Self-test execution status: ( 121) The previous self-test completed
having
the read element of the test
failed.
Total time to complete Offline
data collection: ( 45) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection
on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 169) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Error Recovery Control
supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED
WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000b 091 091 062 Pre-fail
Always - 3538944
2 Throughput_Performance 0x0005 100 100 040 Pre-fail
Offline - 0
3 Spin_Up_Time 0x0007 100 100 033 Pre-fail
Always - 2
4 Start_Stop_Count 0x0012 100 100 000 Old_age
Always - 5
5 Reallocated_Sector_Ct 0x0033 100 100 005 Pre-fail
Always - 0
7 Seek_Error_Rate 0x000b 100 100 067 Pre-fail
Always - 0
8 Seek_Time_Performance 0x0005 100 100 040 Pre-fail
Offline - 0
9 Power_On_Hours 0x0012 085 085 000 Old_age
Always - 6678
10 Spin_Retry_Count 0x0013 100 100 060 Pre-fail
Always - 0
12 Power_Cycle_Count 0x0032 100 100 000 Old_age
Always - 5
191 G-Sense_Error_Rate 0x000a 100 100 000 Old_age
Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age
Always - 2
193 Load_Cycle_Count 0x0012 034 034 000 Old_age
Always - 668237
194 Temperature_Celsius 0x0002 230 230 000 Old_age
Always - 26 (Min/Max 14/31)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age
Always - 0
197 Current_Pending_Sector 0x0022 100 100 000 Old_age
Always - 8
198 Offline_Uncorrectable 0x0008 100 100 000 Old_age
Offline - 0
199 UDMA_CRC_Error_Count 0x000a 200 200 000 Old_age
Always - 0
223 Load_Retry_Count 0x000a 100 100 000 Old_age
Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining
LifeTime(hours) LBA_of_first_error
# 1 Extended offline Completed: read failure 90% 6671
167940256
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
[root at devbox, /etc, 2:23:04pm] 374 %
Reallocated sectors zero (0) .... so where am I ? Thanks & TIA & have a
good one.
--
William A. Mahaffey III
----------------------------------------------------------------------
"The M1 Garand is without doubt the finest implement of war
ever devised by man."
-- Gen. George S. Patton Jr.
More information about the freebsd-questions
mailing list