The Hidden Risks of Using Hot Spare Hard Disks in Data Storage

Understanding Data Immutability with Open-E JovianDSS & Veeam

One of the key data safety characteristics is data immutability, meaning that once data is written, it cannot be changed, deleted, or modified for a set time. Historically, immutability was ...

Business Benefits of Using Open-E JovianDSS as a Data Storage Solution for Proxmox Virtual Environment

A single decision made by your hardware or hypervisor provider can unexpectedly restrict your business agility. A strategic approach involves adopting technologies that deliver both robust performance and operational flexibility, ...

Open-Experts – The Data Storage Podcast: Exploring Data Storage Trends for 2025 & Beyond

Welcome to the Open-Experts – The Data Storage Podcast, episode #5! This time, our new host, Topper Power, delves into the biggest trends shaping the world of data storage. From ...

Why a HOT SPARE Hard Disk Is a Bad Idea

Updated on 19/08/2025 Due, in part, to the different views and opinions regarding the usage of hot spare disks in our previous post, we’ve decided to add an update for ...

Find the Exact License for Your Storage Setup

This calculator helps you to find the exact license required for your storage setup with Open-E JovianDSS, based on your individual specification.

Enter the configuration of your choice into the calculator and generate a PDF report.

Try the Calculator

45 Comments

PassingBy /
18, 04 2017 02:25:52
In my experience, RAIDs consisting of disks from the same manufacturer are prone to have cluster failures.
The time frame can be days, maybe weeks. Spare (or RAID6) is a must. Do not wait until the end of the backup.
Actually I do not understand the full data backup statement at all. The RAID requirements and backup policies should be based on risk analysis, not some home grown ideas.
Confused /
12, 05 2017 07:55:28
This really doesn’t make any sense at all.
Most controllers rebuild using a very sequential process. You advise to back-up and check consistency first. How on earth is that less stressful to the now broken RAID? What do you think will happen if it finds a bad block on the RAID-set during the backup/verify now that it no longer has parity to correct it with? Pretty much the same as it would during the rebuild…
Mike Uchima /
13, 09 2018 12:26:55
I tend to agree with those who are questioning the logic here.
Running a full backup is probably going to stress the remaining disks just as much (if not more) than doing a RAID rebuild. A RAID rebuild is going to involve sequentially reading all of the other disks to reconstruct the contents of the failed one. A file level backup — while it will only require reading the parts of the disks which contain valid data — is going to involve more random seeking, stressing the head actuator assemblies and causing the drives to heat up more.
Another thing I question is how this article implies you’re not keeping your backups current! You shouldn’t be waiting until you’ve got a degraded array to do a full backup; if anything, all you should need to do is an incremental backup of anything that has changed since the last full backup, which is (hopefully) not a large amount of data. Steps 1 and 2 (full backup and verify) on a large array could take many hours, possibly even days; that’s an unacceptably long period of time where your degraded array could turn into a failed array, exposing you to downtime and data loss.
I think a much better approach would be a sensible backup regime combined with a RAID-6 (or raidz2) array. The double redundancy of RAID-6/raidz2 protects you against a second failure during the rebuild. In this scheme, whether you use a cold spare vs. hot spare, and whether you run an incremental backup prior to the rebuild, are judgement calls that I’m not going to take a strong stance on.
The only reasons I can think of to avoid hot spares are that keeping the drive powered up is causing additional wear on the spare drive itself and consuming additional power. If your RAID controller/software keeps the spare drive in a low-power (spun down) state until it is needed, then even these justifications go away.
zman /
26, 04 2019 11:42:46
This is actually pretty naive to think re-building an array will stress the disks. You know on a lot of servers disks run almost 24/7 for years w/o failing. So thinking that an enterprise grade will fail in a few hours is just silly. You should always already have a backup from couple of hours ago or even 15 minutes or so. You don’t run a back when a disk has failed. I have run backups on consumers grade external HDs for up to 48 hours non-stop and one has never failed.

Why a HOT SPARE Hard Disk Is a Bad Idea

The Problematic Aspects of Using a Hot Spare Disk

Hot Spare Disks Add Stress to Vulnerable Systems

Problems in Overall Hot Spare Disk Design

Hot Spare Disks Create a Single Point of Failure

Recommended Procedure in Case of a Disk Failure

Recommended Procedure in Case of a Disk Failure:

Janusz Bak

Chief Technology Officer

45 Comments

PassingBy /

Confused /

Mike Uchima /

zman /

Leave a Comment

Understanding Data Immutability with Open-E JovianDSS & Veeam

Business Benefits of Using Open-E JovianDSS as a Data Storage Solution for Proxmox Virtual Environment

Open-Experts – The Data Storage Podcast: Exploring Data Storage Trends for 2025 & Beyond

Why a HOT SPARE Hard Disk Is a Bad Idea

Want to Learn More?

3-in-1 Complete Data Storage Solution

Find the Exact License for Your Storage Setup

This calculator helps you to find the exact license required for your storage setup with Open-E JovianDSS, based on your individual specification.

Enter the configuration of your choice into the calculator and generate a PDF report.

Open-E Library

Manuals and Quick Starts

How-to Resources

Video Tutorials

Courses