[ILUG] disk failure (and advance warning)

John P. Looney valen at tuatha.org
Fri Oct 11 10:00:03 IST 2002


 I've a few boxes that I don't want to go down.

 But disks die, so likely they will at some stage.

 To avoid this, I'd love to be able to have something email me when the
S.M.A.R.T software detects disk problems. But, data like:


Attribute                    Flag     Value Worst Threshold Raw Value
(  1)Raw Read Error Rate     0x0e00   071   050   025       241820622
(  3)Spin Up Time            0x0200   071   070   000       0
(  4)Start Stop Count        0x3300   100   100   020       0
(  5)Reallocated Sector Ct   0x3300   100   100   036       0
(  7)Seek Error Rate         0x0f00   067   054   030       64899951
(  9)Power On Hours          0x3200   084   084   000       14183
( 10)Spin Retry Count        0x1300   100   100   097       0
( 12)Power Cycle Count       0x3300   100   100   020       144
(194)Temperature             0x2200   042   053   000       45
(195)Hardware ECC Recovered  0x1a00   071   059   000       235506100
(197)Current Pending Sector  0x1200   100   100   000       0
(198)Offline Uncorrectable   0x1000   100   100   000       0
(199)UDMA CRC Error Count    0x3e00   200   200   000       0
(200)Unknown Attribute       0x0000   100   100   000       0
(202)Unknown Attribute       0x3200   100   253   000       0
SMART Error Log:
SMART Error Logging Version: 1
No Errors Logged

 Is a little....useless, without context. Is there any software out there
for reading this in & deciding how close a disk is to failure, or are you
better off just running: 

barney:/home/john# smartctl  -c /dev/hda
Device: ST39111A  Supports ATA Version 4
Drive supports S.M.A.R.T. and is enabled
Check S.M.A.R.T. Passed.

 And grepping for "Passed" ?

John




More information about the ILUG mailing list