Diary of an admin: Backup and Data Replication
Among many technologies that increase data storage protection, the term ‘replication’ is used very frequently.…
Read MoreThe capacity of storage systems grows all the time, providing more space and longer retention time for all sorts of data, produced and processed by business.
Accumulating data brings questions about its safety. Protecting from storage failure and introducing mechanisms preventing data loss are one of the matters to be carefully planned.
There are various techniques you can choose to protect your data and storage. Here, we will get a general concept of the following:
All of the above techniques are featured in Open-E software.
Data replication is a service responsible for copying/duplicating files and folders from a primary storage (production) to a secondary storage (replica). File level replication is asynchronous, so files are first written to the primary storage and then, based on a defined schedule/time interval changes, are collected and replicated to the secondary storage unit. Because data is copied with latency, primary storage failure can cause data loss if most recent modifications were not transferred to the secondary storage.
File replication can be used to restore the content of production storage unit after failure. In practice, if the primary storage is damaged or completely destroyed, with no ability to restore its content, data can be transferred from the secondary storage (replica) once the primary unit is replaced or repaired.
The whole replication process can use different methods of transferring data from the source (primary storage system) to the destination (secondary storage system). Data can be replicated locally within the system or via local area network (LAN) or wide area network (WAN).
Snapshots can capture state of data at a particular point-in-time. Some snapshots are made instantly and the content of them is immediately available for applications that require read-only data. This includes a data protection application like backup service, replication service or reporting applications, which can rely on read-only data repository.
The most common methods of taking snapshots offered by storage systems include:
Backup is a process of copying and archiving data. Information preserved by backup can be restored after a storage failure, or just referred back to historical information if necessary. It is important, that backup has at least one full copy of data repository which needs to be preserved. Further changes to the data might be captured by using different methods defined in the backup policy.
Types of backup:
Backup offers a variety of destinations to preserve data:
As a source of data, backup can use a snapshot taken prior to the procedure. This will preserve/archive data from a certain point-in-time, and will interfere less with normal storage operations and have less impact on the production environment.
One of the purposes for using data backup is a Disaster Recovery.
Backup allows storing a full copy of data in an off-site facility. That allows restoring all systems and data from scratch in the event of total system failure or in case servers are physically destroyed. Backups can also be used in less critical environments, for example to restore files removed intentionally or accidentally by individual users.
Certain methods of protecting your data have their pros and cons, some of them are more expensive to implement than others. Choosing a particular method of data protection usually depends on the budget, as well as what RTO (Recovery Time Objective) and RPO (Recovery Point Objective) required by a company and defined in its BC (Business Continuity) or DR (Disaster Recovery) plan.
You can browse our how-to resources to learn how each technique operate in Open-E products:
http://www.open-e.com/service-and-support/library/how-to-resources/
2 Comments
San Dieg
December 23, 12 2012 10:19:57Thanks, now i know what backup is.
Huck
January 19, 01 2013 06:17:29Amazingly well written article.