Errors in Flags and descriptions of Smart Data
Posted: 2012.06.15. 13:05
re
viewtopic.php?f=32&t=1524&st=0&sk=t&sd=a
After starting a long topic on this forum as above, followed by a further long topic elsewhere,
I believe there are defects in the Smart Data display for at least my OCZ SSD, and probably for all SSDs; perhaps the HDD display needs a review too.
Attribute no. 1 :- Raw Read Error Rate.
In the bottom right corner you list the Flags as
Error-Rate, Performance, Statistical, Critical.
I am advised by the OCZ Support manager that such errors are NOT critical but normal for any SSD. See post #14 at
http://www.ocztechnologyforum.com/forum ... inutes-Why
Attribute no. 195 :-
Both Hard Disk Sentinel 4.00 PRO and CrystalDiskInfo 4.3.0 describe this as
"On-the-fly ECC UNcorrectable Error Count"
HD Tune 2.55 Hard Disk Utility puts an opposite spin on this and calls it
"Hardware ECC Recovered"
The official OCZ TOOLBOX SMART READ DATA, working to Sandforce firmware specifications, calls this
"ECC On-the-fly Count Normalized Rate"
According to Sandforce this simply shows that Error Correcting Code detected an error that HAD A NEED for correcting, but it indicates neither success nor failure of the correction.
OCZ advise that correction failure is indicated by Attribute 187:
187: SSD Reported Uncorrectable Errors Uncorrectable RAISE errors reported to the host for all data access: 0
I have closely observed 3 instances, each lasting just over 4 minutes,
in which Macrium Reflect created a Partition Image backup file of the SSD, compressing 11 GB of Used Space into a 6.5 GB backup file,
and each time, as the 11 GB was read, both 1:Raw Data and 195:"uncorrectable" increased by 25,000,000+.
I am guessing that each unit is not an erroneous 4 kB file cluster but a 64 bit word on my 64 bit system,
so 11 GB are read back with 200 MB of error.
Two percent of my operating system uses corrupt Raw Data and yet suffers no BSOD
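That estimate can be reproduced with a short sketch. The 8-byte unit (one 64 bit word per counted error) is only the guess stated above, not something OCZ or Sandforce have confirmed, and GB and MB are taken as decimal units. The function names are made up for illustration.

```python
# Rough check of the "200 MB of error in 11 GB" estimate.
# ASSUMPTION: each counted error is one 64 bit (8-byte) word; this is a guess.
ERROR_COUNT = 25_000_000      # increase seen in attributes 1 and 195 per backup
BYTES_PER_ERROR = 8           # assumed unit size, not confirmed by the vendor
BYTES_READ = 11 * 10**9       # 11 GB read while the image was created


def error_bytes(count, unit=BYTES_PER_ERROR):
    """Total bytes affected if every counted error covers `unit` bytes."""
    return count * unit


def error_fraction(count, total_bytes, unit=BYTES_PER_ERROR):
    """Share of the data read that would have needed correction."""
    return error_bytes(count, unit) / total_bytes


if __name__ == "__main__":
    print(error_bytes(ERROR_COUNT) / 10**6, "MB flagged")
    print(round(100 * error_fraction(ERROR_COUNT, BYTES_READ), 1), "% of data read")
```

With these assumptions the result is 200 MB, a little under two percent of the 11 GB read. If each unit were really a 4 kB cluster the total would exceed the amount of data read, which is why the smaller unit seems the more plausible guess.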
I really do believe that every Raw Data error has always been corrected, as shown by 187: value 0.
I suggest you remove the word UNcorrectable
You could follow "HD Tune 2.55 Hard Disk Utility" and use the word Recovered,
BUT ONLY if attribute 187: has not increased
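The rule suggested here could look something like the sketch below. It is not how Hard Disk Sentinel or any other tool actually works; the function name and the fallback wording are hypothetical, and the only inputs are two readings of attribute 187.

```python
# Hypothetical sketch of the suggested labelling rule for attribute 195.
# ASSUMPTION: the monitoring tool keeps an earlier reading of attribute 187
# so it can tell whether the uncorrectable count has increased.

RECOVERED_LABEL = "Hardware ECC Recovered"          # wording used by HD Tune 2.55
NEUTRAL_LABEL = "ECC On-the-fly Count"              # wording close to OCZ Toolbox


def label_for_attribute_195(attr187_previous, attr187_current):
    """Use 'Recovered' only while attribute 187 has not increased."""
    if attr187_current > attr187_previous:
        # Some errors were reported to the host as uncorrectable,
        # so claiming everything was recovered would be wrong.
        return NEUTRAL_LABEL
    return RECOVERED_LABEL


if __name__ == "__main__":
    print(label_for_attribute_195(0, 0))
    print(label_for_attribute_195(0, 3))
```

Either way the word UNcorrectable never appears against attribute 195, since failures are reported by attribute 187.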
N.B. Although 1: and 195: raw data values are always zero at startup,
and 187: is always zero at every time in my experience,
I understand from OCZ... posts #16 and #17 that if an error occurs and 187: increments it will never be zeroed by a power down or even a secure erase.
HDS and C.D.I. and HDTune show Raw Data values jumping from 0 up to 25,000,000+ as 11 GB is read.
The official OCZ ToolBox gadget shows a "Normalized Rate" that starts at 100 when no errors occurred, and rises up to 109 after Raw Data reaches 25,000,000.
I believe Sandforce have stipulated some bit-pattern to determine this,
and assume the possibility that the Sandforce Marketing department decided against frightening users such as myself with big Raw Data values like 25,000,000+
This is the official Toolbox report which was created when HDS and the two other tools were reporting error counts of 25,000,000+
ModelNumber : OCZ-VERTEX2
Serial Number : OCZ-96FXFXCDVTA602Q9
WWN : 5-e8-3a-97 f8a9391b0
Revision: 10
Attributes List
1: SSD Raw Read Error Rate Normalized Rate: 109 total ECC and RAISE errors
5: SSD Retired Block Count Reserve blocks remaining: 100%
9: SSD Power-On Hours Total hours power on: 764
12: SSD Power Cycle Count Count of power on/off cycles: 403
171: SSD Program Fail Count Total number of Flash program operation failures: 1
172: SSD Erase Fail Count Total number of Flash erase operation failures: 0
174: SSD Unexpected power loss count Total number of unexpected power loss: 14
177: SSD Wear Range Delta Delta between most-worn and least-worn Flash blocks: 0
181: SSD Program Fail Count Total number of Flash program operation failures: 1
182: SSD Erase Fail Count Total number of Flash erase operation failures: 0
187: SSD Reported Uncorrectable Errors Uncorrectable RAISE errors reported to the host for all data access: 0
194: SSD Temperature Monitoring Current: 30 High: 30 Low: 30
195: SSD ECC On-the-fly Count Normalized Rate: 109
196: SSD Reallocation Event Count Total number of reallocated Flash blocks: 0
231: SSD Life Left Approximate SDD life Remaining: 100%
241: SSD Lifetime writes from host lifetime writes 64 GB
242: SSD Lifetime reads from host lifetime reads 512 GB
PLEASE NOTE :-
HDS starts up with Windows and right now shows the system has read 100 MB
HDS SMART is showing that both 1: and 195: have Value = 100 and Worst = 99 and data = 307AD4
OCZ Toolbox reports
1: SSD Raw Read Error Rate Normalized Rate: 100 total ECC and RAISE errors
195: SSD ECC On-the-fly Count Normalized Rate: 100
I am guessing the normalized rate will reach 101 after the "data", which they do not reveal, has gone higher than 3249BA
(HDS value climbed whilst I launched Toolbox and typed results)
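For comparison with the 25,000,000+ figures, the "data" values above can be converted to decimal. This sketch assumes they are plain hexadecimal counts, which is how they appear in HDS; the helper name is made up for illustration.

```python
# Convert the raw "data" values shown by HDS into decimal counts.
# ASSUMPTION: the values are plain hexadecimal integers.

def raw_to_int(hex_text):
    """Parse a hexadecimal raw value such as '307AD4'."""
    return int(hex_text, 16)


if __name__ == "__main__":
    first = raw_to_int("307AD4")    # value when HDS was first checked
    later = raw_to_int("3249BA")    # value after launching Toolbox
    print(first, later, later - first)
```

Both readings are a little over three million, well short of the 25,000,000 at which the Toolbox showed a normalized rate of 109.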
PLEASE DO NOT CONCEAL "data = 307AD4" etc.
I like all the data I can get,
my intention with this bug report is to ask that you refrain from saying that 25,000,000 errors were NOT corrected when 187:...0 shows they were corrected
Regards
Alan
Attribute no. 195 :- Everyone but OCZ seems to get this one wrong. As for HD Sentinel: NO, OCZ is not setting any critical flag. That threshold is programmed into HD Sentinel; it would be bad on a normal HDD, but it isn't on an SSD.
N.B. 187 - you are correct. The count runs from the 1st time the SSD was powered on, and it should NEVER be reset by a power down or even a Secure Erase.
It MAY be reset by a firmware update at some point, but up until now, none have required that level of "destructiveness".