Updated on 03/08/2021
Due, in part, to the differing views and opinions on the use of hot spare disks voiced in response to our previous post, we’ve decided to add an update for clarification.
The Problematic Aspects of Using a Hot Spare Disk
In theory, using a hot spare disk with ZFS, Solaris FMA, or any other data storage environment is a good solution: it reacts automatically to damage in a Redundant Array of Independent Disks (RAID) array, and it does help minimize the time the array spends in a degraded state.
That being said, our goal in creating a RAID array is to keep operating, without losing data, in the event of a disk failure. Anything that increases the risk of data loss works against that goal. Let’s have a look at some of the problematic aspects of hot spare disks.
Hot Spare Disks Add Stress to Vulnerable Systems
The main problem with hot spare disks is that they trigger a rebuild (resilver) on a system that is still actively being used as a production server. This means that, while the resilvering process is taking place, the system is also still occupied with the usual production reads and writes.
Resilvering is a resource-intensive process, so when it is executed while the server is still in use, it has to compete with the production load. Because it runs as a low-priority task, the resilver can take a very long time to complete (even up to a few weeks). The result is a server working at its maximum achievable throughput for weeks, which can have dire consequences for the disks (especially HDDs).
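To make the duration of that load visible, here is a minimal sketch of ours (not part of any shipped tooling) that polls `zpool status` and prints the resilver progress line. The pool name `tank` and the polling interval are assumptions, and the exact output format varies between ZFS versions.

```python
# Minimal sketch, assuming an OpenZFS system and a pool named "tank" (hypothetical):
# poll `zpool status` and print the resilver progress line so an admin can see how
# long the rebuild is dragging on while the pool also serves production I/O.
import re
import subprocess
import time

POOL = "tank"        # hypothetical pool name
INTERVAL_S = 600     # check every 10 minutes (arbitrary choice)

def resilver_progress(pool):
    """Return the '% done' line of an in-flight resilver, or None if none is running."""
    status = subprocess.run(["zpool", "status", pool],
                            capture_output=True, text=True, check=True).stdout
    if "resilver in progress" not in status:
        return None
    # zpool prints a progress line such as "..., 45.67% done, 0 days 07:12:34 to go"
    match = re.search(r"^\s*(.*% done.*)$", status, flags=re.MULTILINE)
    return match.group(1).strip() if match else "resilver in progress"

if __name__ == "__main__":
    while True:
        progress = resilver_progress(POOL)
        if progress is None:
            print("no resilver running")
            break
        print(progress)
        time.sleep(INTERVAL_S)
```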
With decades of experience behind us, we’ve seen that the use of hot spare disks in complex enterprise systems increases the probability of additional disk failures, as the resilvering process puts more and more stress on the remaining disks and on the system itself.
Problems in Overall Hot Spare Disk Design
The next flaw of a hot spare disk is that it degrades over time. From the moment it is connected to the system, it keeps on working, and when the time eventually comes for it to replace a damaged disk, the hot spare itself may simply not be in good enough condition to do so.
Another problematic aspect of hot spare disks is that they take over automatically as soon as a disk failure is detected, while the corrupted disk may still be connected to the system. That disk can keep trying to reconnect and resume working even as the hot spare is taking over its role, adding yet more stress to the system. This is another factor that can affect the system’s overall performance and could potentially lead to data loss.
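As a rough illustration of how you might at least notice that such an automatic takeover has happened, the sketch below (ours, not from the article) parses the spares section of `zpool status`. The pool name `tank` is an assumption, and the section layout can differ between ZFS versions.

```python
# Minimal sketch, assuming an OpenZFS pool named "tank" (hypothetical): report which
# configured hot spares zpool lists as INUSE, so the operator notices the automatic
# takeover and can check whether the original, possibly flapping, disk is still attached.
import subprocess

POOL = "tank"  # hypothetical pool name

def spares_in_use(pool):
    """Return names of spare devices that `zpool status` reports as INUSE."""
    status = subprocess.run(["zpool", "status", pool],
                            capture_output=True, text=True, check=True).stdout
    in_use, in_spares = [], False
    for line in status.splitlines():
        stripped = line.strip()
        if stripped.startswith("spares"):
            in_spares = True          # the spares section follows this header
            continue
        if in_spares:
            if not stripped or stripped.startswith("errors:"):
                break                 # end of the spares section
            fields = stripped.split()
            if len(fields) >= 2 and fields[1] == "INUSE":
                in_use.append(fields[0])
    return in_use

if __name__ == "__main__":
    active = spares_in_use(POOL)
    print(("hot spare(s) in use: " + ", ".join(active)) if active else "no hot spare in use")
```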
Hot Spare Disks Create a Single Point of Failure
If you’re looking to create a system with no single point of failure, a hot spare disk will not provide you with much confidence given that the process of automatically replacing a failed disk has been known to occasionally fail, either partially or fully, and result in data loss.
Having spent decades providing customers with data storage solutions, we’ve seen plenty of cases where a hot spare disk was the cause of an entire server failure, and even of data loss. Automation here is risky because it can set off a domino effect, especially when the data storage infrastructure has been running for years and the hardware is worn out.
Our Solution
These problematic aspects of hot spare disks are why we advise against relying on them in complex data storage architectures and recommend other business continuity solutions instead, such as High Availability (HA) clusters, backups, and On- & Off-site Data Protection (ideally all of the above).
With the ZFS file system, it’s much easier to monitor the system and create a proper backup, which gives you the ability to retrieve data from a damaged disk and write it onto a new one. In addition, when using an HA cluster, you can manually switch production from the affected node to the second one so that you can perform maintenance on the affected node.
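For the backup side, a minimal sketch of the ZFS snapshot-and-send approach might look like the following. The dataset names `tank/data` and `backup/data` and the snapshot name are assumptions for illustration only.

```python
# Minimal sketch, assuming an OpenZFS setup: snapshot a dataset on the degraded pool
# and stream it into a separate backup pool before touching the failed disk.
# Dataset names ("tank/data", "backup/data") and the snapshot name are hypothetical.
import subprocess

SOURCE = "tank/data"      # hypothetical dataset on the degraded pool
TARGET = "backup/data"    # hypothetical dataset on a separate backup pool
SNAP = SOURCE + "@pre-replace"

# 1. Take a point-in-time snapshot of the dataset.
subprocess.run(["zfs", "snapshot", SNAP], check=True)

# 2. Stream the snapshot into the backup pool, mirroring
#    `zfs send tank/data@pre-replace | zfs receive -F backup/data`.
send = subprocess.Popen(["zfs", "send", SNAP], stdout=subprocess.PIPE)
subprocess.run(["zfs", "receive", "-F", TARGET], stdin=send.stdout, check=True)
send.stdout.close()
if send.wait() != 0:
    raise RuntimeError("zfs send failed")
```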
We’d advise following this procedure once the array reports a degraded state as a result of a disk failure:
- Move resources to the second node in your HA cluster if possible.
- Run a full data backup.
- Verify the backed-up data for consistency, and verify whether the data restore mechanism works.
- Identify the problem source, i.e., find the faulty hard disk. If possible, shut down the server and make sure the serial number of the hard disk matches the one reported by the event viewer or system logs.
- Replace the hard disk identified as bad with a new, unused one. If the replacement hard disk has already been used within another RAID array, make sure that any residual RAID metadata on it has been deleted via the original RAID controller.
- Start a rebuild of the system (for a ZFS pool, see the sketch below).
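For a ZFS pool specifically, steps 5 and 6 might look like the sketch below. The pool name and device paths are assumptions, and `zpool labelclear` only removes leftover ZFS labels; hardware RAID metadata still has to be cleared by the original controller, as noted above.

```python
# Minimal sketch of steps 5-6 on an OpenZFS pool: wipe any leftover ZFS label from the
# replacement disk, then start the resilver manually with `zpool replace`.
# The pool name and device paths are hypothetical.
import subprocess

POOL = "tank"                                    # hypothetical pool name
FAILED_DISK = "/dev/disk/by-id/ata-OLD_DISK"     # hypothetical failed device
NEW_DISK = "/dev/disk/by-id/ata-NEW_DISK"        # hypothetical replacement device

# Remove stale ZFS labels if the replacement disk was ever part of another pool.
# (-f forces the clear; double-check the device path before running this.)
subprocess.run(["zpool", "labelclear", "-f", NEW_DISK], check=True)

# Swap the failed device for the new one; ZFS starts resilvering onto NEW_DISK.
subprocess.run(["zpool", "replace", POOL, FAILED_DISK, NEW_DISK], check=True)

# The resilver runs in the background; `zpool status` shows its progress.
print(subprocess.run(["zpool", "status", POOL],
                     capture_output=True, text=True, check=True).stdout)
```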
So, using this approach, the rebuild consists of six steps. With a hot spare disk, your RAID array skips the first four significant steps and automatically runs steps 5 and 6. The rebuild is thus completed before you can perform those other critical steps, steps that could be the difference between your data being safe and being lost.
In the end, how you build a proper system is entirely up to you. However, we’d suggest not relying on hot spare disks in a ZFS RAID array because of the potential data loss they can cause.
Comments
matthew / 10/08/2016 10:58:59
Let me posit some reasons why a hot spare is a *really* good idea.
If a RAID array fails while no-one is at work (does your site remain manned 24×365, even in the event of a fire alarm or other security alert?), you’re running at risk of data loss until the situation is addressed.
SMART monitoring on the controller spots that the disk is failing *before* it goes offline and fails over to the hot spare with no risk of data loss and no downtime. I know it won’t catch all failures, but on the HP kit I’ve worked on, the majority of failures (media failures as opposed to electronics failures) have been spotted and fixed on the fly while the disk was still usable.
In over 20 years I have *once* had an array fail a second disk while recovering onto a hot spare. I have lost count of the number of times that a hot spare has saved the day…
I have worked on one customer’s system where the transactional traffic was so high that they would not restore from backup… the lost income from the restore time outweighed the costs of abandoning the data and starting again with an empty database… for them, hot spares provided a better financial risk than an outage to perform steps (1) and (2)
If your data is that critical, you should use some RAID that allows multiple devices to fail… and probably have more than one hot spare available.
So I’m afraid my real-world experience teaches me that your article is really not a good way to go for every business. There may be edge cases where a hot spare proves bad, but there are edge cases where not wearing a seatbelt in a car proved beneficial.
Luke / 11/10/2016 04:21:30
I also think a hot spare is a bad idea with RAID 5. Here is my reasoning…
Option A: RAID 5 without a hot spare (3 drives in total)
Option B: RAID 5 with a hot spare (4 drives in total)
In Option A: if one drive fails, you simply replace that drive ASAP and then the system rebuilds onto the replacement. Let’s say the whole rebuild takes 24hrs.
In Option B: if one drive fails, a rebuild immediately begins onto the hot spare. It will also take 24hrs. Once the rebuild is complete, you still need to swap out the broken drive, which will trigger another rebuild, this time from the drive that was the hot spare to the freshly inserted drive – this process will also take 24hrs.
So not only did you do the rebuild twice, it also took twice as much time, effectively doubling the amount of time your RAID array is in danger of losing another drive. With all the extra stress caused by doing the rebuild twice, it’s really walking on thin ice.
It almost feels like it’s better to have a spare drive ready, kept on a shelf, not as part of the system (and not assigned as a hot spare). As soon as a faulty drive is detected, start a rebuild onto that drive. Obviously, if you cannot be next to your system every day, then maybe a hot spare is a better option.
If you really insist on RAID 5, then maybe not having a hot spare is the safer option in this case. Unless I am missing something really obvious here.
Would love more feedback on this case.
** I know that in some cases you can mark the freshly rebuilt hot spare as your new drive and then simply add another drive as a hot spare. But I am not sure if this is the default behavior for RAID controllers. I think most RAID controllers usually just rebuild back from the hot spare onto a freshly slotted drive.
MJI / 01/11/2016 01:09:37
I don’t understand why Option B would trigger two rebuilds. If you have a hot spare, the NAS will start an automatic rebuild, at the end of which you have a restored 3-drive setup. When you replace the broken drive, you can add the new drive as a hot spare if you want; nothing forces you to turn it into a 4-drive NAS.
Scott in Texas / 28/12/2016 01:45:46
If you use real RAID, not motherboard RAID or software RAID, you need not move the drive, and if you DID move the drive from the hot-swap port to the primary port, it would not result in a rebuild – that is called “Disk Roaming”.
Bill / 13/11/2016 09:25:40
I agree with many of the comments here. Not having a spare drive, as a POLICY, is retarded.
If you can’t afford a spare drive straight away, then go without one for a while. But add one later!
A company that actually values its data will have an automated backup mechanism in place that matches the required RPO and RTO as defined by the IT strategy – and signed off by management/executive levels.
If a disk fails in a RAID array, replace it IMMEDIATELY. Having a hot-spare accomplishes this for you.
And since you have a backup strategy in place that already satisfies the RPO/RTO of the organization, there is no need to take another backup from a DEGRADED array. If the array is a parity RAID (heaven forbid you are using this on your primary storage) then the performance will be lower than normal and you are leaving your array in this state for longer – further increasing the likelihood that another drive will fail before the rebuild is done.
If you don’t have a backup of your primary data that meets the organization’s RPO/RTO, or your organization hasn’t even thought about these things, then obviously data integrity doesn’t matter and you basically just have a load of junk on disk – so why bother with the backup if the data is just junk anyway?
Oh, and don’t forget that as well as a proper backup mechanism, you also need monitoring/notifications of the array status – so you KNOW that a disk has failed and can organize the replacement immediately.
Scott in Texas / 28/12/2016 01:35:26
Struggling with your approach. I agree with the comments above about how a failure during a write of ALL your data to a backup set is just as risky as going straight to a rebuild. I also think it moronic of you to recommend data validation AFTER you have had a failure. I run validation EVERY NIGHT for a couple of hours a night, resulting in a full validation every week. Does it stress the drives? Yes, it does, and if any drives get flaky, S.M.A.R.T. will identify them and warn me… more importantly, I would rather validate the data BEFORE it becomes critical that it is validated, and risky to perform the validation.
I also run RAID 6, so that in the event of a failure, it would take two additional failures to lose data. So no, a 14-drive RAID 6 array is not truly a “backup”, but it is damned close… not to mention the problems backing up a 22TB array would present.
FWIW, I also run a REAL RAID controller card (3Ware 9650SE), not motherboard RAID and not software RAID… so yes, I sleep quite well at night having a hot-swap drive and skipping your first two steps.