Re: [9fans] ata drive capabilities



the google paper shows a 40% afr for the first 6 months after some
smart errors appear. (unfortunately they don't do numbers for
a simple smart status.)

Yes, and I rather mischaracterized the google paper's comments on SMART. A reread (I first read them a few months ago) shows the above. Further, the CMU paper even references the google study on the SMART subject:

``They find that [ ... ] the value of several SMART counters correlate highly with failures.''

So SMART appears a little less dumb. I'd say it meets the better-than-nothing criterion.

from my understanding of how google do things, losing a drive just
means they need to replace it. so it's cheaper to let drives fail.
on the other hand, we have our main filesystem raided on an aoe
appliance. suppose that one of those raids has two disks showing
a smart status of "will fail". in this case i want to know the elevated
risk and i will allocate a spare drive to replace at least one of the
drives.
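To put a rough number on that "elevated risk", here is a minimal sketch, not anything taken from the google paper. It assumes the 40% afr can be read as the probability that a flagged drive fails within a year, that the failure rate is constant over that year, and that the two flagged drives fail independently. All three are simplifying assumptions, and the function names are made up for illustration.

```python
def failure_probability(afr, years):
    """Probability that one drive fails within `years`.

    Assumes `afr` is the probability of failure within one year and
    that the hazard rate is constant (both are assumptions).
    """
    return 1.0 - (1.0 - afr) ** years


def raid_risk(afr, years, n_flagged):
    """Risk for `n_flagged` drives, assuming independent failures.

    Returns (p_single, p_at_least_one, p_all).
    """
    p = failure_probability(afr, years)
    p_at_least_one = 1.0 - (1.0 - p) ** n_flagged
    p_all = p ** n_flagged
    return p, p_at_least_one, p_all


if __name__ == "__main__":
    # 40% afr, a 6-month window, two drives flagged by smart
    p, one, both = raid_risk(0.40, 0.5, 2)
    print(f"single drive fails:   {p:.3f}")
    print(f"at least one fails:   {one:.3f}")
    print(f"both fail:            {both:.3f}")
```

Under those assumptions each flagged drive has roughly a 1-in-4 or 1-in-5 chance of failing within six months, which is the kind of figure that makes allocating a spare look cheap.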

i guess this is the long way of saying, it all depends on how painful
losing your data might be. if it's painful enough, even a poor tool
like smart is better than nothing.

I agree (plus I was just wrong about SMART at first), though I do think your example above is about preventing downtime, not so much data loss. (Even with no SMART at all, and even if all the disks come up corrupt, we're all backed up within some acceptable window, right?)


what a pity! it would have been so great to have had
an objective assessment of reliability by manufacturer.

Since the CMU thing found no difference between disk *types*, I wonder if it might be that there's little difference between manufacturers either -- instead the difference is in manufacturing, i.e., `vintage' & the like.

i've found it really quite hard to find useful data to
indicate how reliable a drive might be.


I think Fig. 2, Sec. 4.2 of the CMU paper relates to that: the `infant mortality' of manufactured mechanical parts isn't captured in MTTF. IDEMA is apparently going to solve this by replacing the single MTTF number (which I don't quite understand) with 4 different MTTF numbers, one for each `phase' of a disk's life.
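For what it's worth, here is a rough sketch of why a single MTTF can't show infant mortality. It assumes the usual exponential (constant failure rate) model, which is my assumption and not something the paper endorses, and the 1,000,000-hour figure is only an illustrative datasheet-style value.

```python
import math

HOURS_PER_YEAR = 8760  # 24 * 365


def afr_from_mttf(mttf_hours):
    """Annualized failure rate implied by a single MTTF.

    Assumes exponentially distributed lifetimes, i.e. a constant
    failure rate for the whole life of the disk (an assumption).
    """
    return 1.0 - math.exp(-HOURS_PER_YEAR / mttf_hours)


if __name__ == "__main__":
    # Illustrative MTTF only; not a figure quoted in this thread.
    mttf = 1_000_000
    print(f"MTTF {mttf} h -> AFR {afr_from_mttf(mttf):.2%}")
```

Because the model gives the same AFR for year 1 as for year 5, any early-life bump or late-life wear-out is invisible in it, which is presumably the motivation for one number per `phase'.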

--
Josh


