Using different types of storage snapshot technologies for data protection | TechTarget (2024)

Storage snapshots are commonly used to enhance data protection systems and dramatically shorten recovery time objectives (RTOs) and recovery point objectives (RPOs). Here's a look at the different types of snapshot technologies and the pros and cons of each.

Using different types of storage snapshot technologies for data protection | TechTarget (1)

By

  • Marc Staimer,Dragon Slayer Consulting

Published: 21 Dec 2009

Snapshot technologies are commonly used to enhance data protection systems and dramatically shorten recovery time objectives (RTOs) and recovery point objectives (RPOs). Here's a look at the different types of snapshot technologies and the pros and cons of each.

There are six general types of snapshot technologies (see table below):

  • Copy-on-write
  • Redirect-on-write
  • Clone or split-mirror
  • Copy-on-write (COW) with background copy
  • Incremental
  • Continuous data protection (CDP)

A quick guide to snapshot technologies

Snapshot technology
Copy-
on-write
Redirect-
on-write
Clone/
split mirror
COW
w/back-
ground
copy
Incremental CDP
Snapshot is tightly
coupled to original data
Yes Yes No Yes, until
background
copy finishes
Depends on how
original snapshot
is generated
No
Space efficient Yes Yes No No No Yes, versus multiple
point-in-time snapshots
Original data system
IO and CPU resource overhead
High Medium Low Low Low Low
Write overhead on orig. data copy High None None High High High
Protects against logical data errors
by rolling back to orig. copy
Yes Yes Yes Yes Yes Yes
Protects against physical media
failures of orig. copy
No No Yes After background
copy completes
Depends on
underlying
snapshot tech.
Yes

Copy-on-write snapshot

Copy-on-write requires storage capacity to be provisioned for snapshots, and then a snapshot of a volume has to be initiated using the reserved capacity. The copy-on-write snapshot stores only the metadata about where the original data is located, but doesn't copy the actual data at the initial creation. This makes snapshot creation virtually instantaneous, with little impact on the system taking the snapshot.

The snapshot then tracks the original volume paying attention to changed blocks as writes are performed. As the blocks change, the original data is copied into the reserved storage capacity set aside for the snapshot prior to the original data being overwritten. The original data blocks snapped are copied just once at the first write request. This process ensures snapshot data is consistent with the exact time the snapshot was taken, and it's why the process is called "copy-on-write."

Read requests to unchanged data are directed to the original volume. Read requests to changed data are directed to the copied blocks in the snapshot. Each snapshot contains metadata describing the data blocks that have changed since the snapshot was first created.

The major advantage of copy-on-write is that it's incredibly space efficient because the reserved snapshot storage only has to be large enough to capture the data that's changed. But the well-known downside to copy-on-write snapshot is that it will reduce performance on the original volume. That's because write requests to the original volume must wait to complete until the original data is "copied out" to the snapshot. One key aspect of copy-on-write is that each snapshot requires a valid original copy of the data.

Redirect-on-write (ROW) snapshot

Redirect-on-write is comparable to copy-on-write, but it eliminates the double write performance penalty. ROW also provides storage space-efficient snapshots like copy-on-write. What allows ROW to eliminate the write performance penalty is that the new writes to the original volume are redirected to the storage provisioned for snapshots. ROW redirection of new writes reduces the number of writes from two to one. So instead of writing one copy of the original data to the storage space plus a copy of the changed data required with COW, ROW writes only the changed data.

With redirect-on-write, the original copy contains the point-in-time snapshot data, and it's the changed data that ends up residing on the snapshot storage. There's some complexity when a snapshot is deleted. The deleted snapshot's data must be copied and made consistent back on the original volume. The complexity goes up exponentially as more snapshots are created, which complicates original data access, snapshot data and original volume data tracking, and snapshot deletion data reconciliation. Serious problems can occur when the original data set (upon which the snapshot is dependent) becomes fragmented.

Clone or split-mirror snapshot

A clone or split-mirror snapshot creates an identical copy of the data. The clone or split-mirror can be of a storage volume, file system or a logical unit number (LUN). The good thing about clones is that they're highly available. The bad thing is that because all of the data has to be copied, it can't be done instantaneously. A clone can be made instantaneously available by splitting a pre-existing synchronous volume mirror into two. However, when a split-mirror is used as a clone, the original volume has lost a synchronized mirror.

A very significant downside to this snapshot methodology is that each snapshot requires as much storage capacity as the original data. This can be expensive, especially if more than one snapshot clone is required to be kept live at any given time. One other downside is the impact to system performance because of the overhead of writing synchronously to the mirror copy.

Copy-on-write with background copy snapshot

Copy-on-write with background copy takes the COW instantaneous snapshot data and uses a background process to copy that data from its original location to the snapshot storage location. This creates a clone or mirror of the original data.

Copy-on-write with background copy attempts to take the best aspects of copy-on-write while minimizing its downsides. It's often described as a hybrid between COW and cloning.

Incremental snapshot

An incremental snapshot tracks changes made to the source data and snapshot data when the snapshot is generated. When an incremental snapshot is generated, the original snapshot data is updated or refreshed. There's a time stamp on the original snapshot data and on each subsequent incremental snapshot. The time stamp provides the capability to roll back to any point-in-time snapshot. Incremental snapshots allow you to get faster snapshots after the first one, and you use only nominally more storage space than the original data. This enables more frequent snapshots and longer retention of snapshots.

The downside to incremental snapshots is that they're dependent on the underlying baseline technology used in the first snapshot (copy-on-write, redirect-on-write, clone/split-mirror or copy-on-write with background copy). If cloned, the first snapshot will take a while; if COW, there will be a performance penalty on writes to the original data, etc.

Continuous data protection

continuous data protection was developed to provide zero data loss recovery point objectives (RPOs) and instantaneous recovery time objectives (RTOs). It's similar to synchronous data mirroring except that it eliminates the rolling disaster (a problem in the primary data is automatically a problem with the mirrored data long before human intervention can stop it) and protects against human errors, malware, accidental deletions and data corruption.

Continuous data protection is like incremental snapshots on steroids. It captures and copies any changes to the original data whenever they occur and time stamps them. It essentially creates an incremental snapshot for every moment in time, providing very fine-grain recoveries. Some CDP implementations are both time and event based (such as an application upgrade). A good way to think of CDP is as a journal of complete storage snapshots.

Continuous data protection is an excellent form of data protection for email, databases and applications that are based on databases. The ability to roll back to any point-in-time makes recoveries simple and fast. FalconStor's IPStor is an example of a storage system and/or virtualization appliance that provides CDP.

With more and more data to protect and often less time to do it, snapshots will play a bigger role in data protection and daily storage operations. Although the differences among snapshot technologies may seem subtle, how they operate in your environment could have a significant effect on the level of protection provided and how quickly recoveries can occur.

This article originally appeared in Storage magazine.

About the author:
Marc Staimer is the founder, senior analyst, and CDS of Dragon Slayer Consulting in Beaverton, OR. The consulting practice of 11 years has focused in the areas of strategic planning, product development, and market development. With over 28 years of marketing, sales and business experience in infrastructure, storage, server, software, and virtualization, he's considered one of the industry's leading experts. Marc can be reached at [emailprotected]

Next Steps

Plan ahead to avoid bare-metal restore frustration

Data growth continues to challenge IT professionals

VMware and virtual data backup and recovery technology tutorial

Dig Deeper on Data backup and recovery software

Using different types of storage snapshot technologies for data protection | TechTarget (2024)

FAQs

What is storage snapshot technology? ›

Storage Snapshot is an effective data protection feature that also supports data mining and data cloning use-cases. Most storage hardware and software vendors offer storage snapshot technology support, as it offers enables mission-critical data recovery and ransomware protection.

What are the different types of snapshots? ›

Snapshot types. There are two fundamental types of snapshot mechanism: copy-on-write and redirect-on-write. In copy-on-write, when an I/O request seeks to change a storage block, that block is copied first and retained by the snapshot to which it belongs.

What is snapshot in data protection? ›

A storage snapshot is a set of reference markers for data at a particular point in time. A snapshot acts like a detailed table of contents, providing the user with accessible copies of data that they can roll back to.

What is the difference between storage snapshot and backup? ›

The main difference between snapshots and backups is that the latter is isolated from the mainframe. That means snapshot copies are stored on the original network, unlike backup copies. Snapshots work well for frequent protection measured in minutes or hours while backups are used for regular protection.

What is snapshot and how does it work? ›

With Snapshot, we measure a variety of factors related to your driving, including things such as the time of day you drive, sudden changes in speed (hard brakes and rapid accelerations), the amount you drive, and, for customers using the mobile app in some states, how you're using your mobile phone while driving.

What is snapshot and how do you use? ›

Snapshot. Snapshots are an instantaneous "picture" of your server's file system at a certain period of time. This picture captures the entire file system as it was when the snapshot is taken. When a snapshot is used to restore the server, the server will revert to exactly how it was at the time of the snapshot.

What is an example of snapshot? ›

Snapshot Sentence Examples. It gave her a snapshot of their interests and personalities as well. The human subjects in portrait photography are already used to having their snapshot taken, but the subject having sensual pictures taken of herself might not be as experienced.

What is the best use for system snapshot? ›

Snapshots are most often used to roll back entire file-systems or pull specific files that were accidentally deleted or corrupted. Both tasks that would initially be thought of as something a backup would be used for, and they are both tasks snapshots can usually do better than backups.

What is a data snapshot? ›

What is a Data Snapshot and what are its uses? Data Snapshot allows you to capture and maintain historical data from a report. It periodically snapshots the data from a chosen report, and stores them in a table for historical report creation & analysis.

What type of storage is snapshot? ›

By default, the snapshot uses locally redundant standard storage. We recommend that you store your snapshots in standard storage instead of premium storage whatever the storage type of the parent disk or target disk.

What is the difference between disk and snapshot? ›

A snapshot is actually a copy of a specific disk at a specific point in time. That being the case, it applies to only one disk. If you have a VM with more than one managed disk attached, a single snapshot will only include the disk that you took the snapshot of.

What is the difference between EBS and snapshot? ›

A snapshot is of an EBS volume where you are able to save state and reboot with the same data at a certain point in time. An AMI is similar, but it's for the EC2 instances themselves. You cannot take a snapshot of a non-ebs backed instance, but you can create an AMI (system image) of one.

What is the difference between storage snapshot and clone? ›

A storage snapshot is taking that original picture, and then each subsequent picture is only of the difference from the previous one. So a clone is making a complete copy of something, while snapshots make one initial copy, then just make simple subsequent changes.

Top Articles
Latest Posts
Article information

Author: Nathanial Hackett

Last Updated:

Views: 5514

Rating: 4.1 / 5 (72 voted)

Reviews: 87% of readers found this page helpful

Author information

Name: Nathanial Hackett

Birthday: 1997-10-09

Address: Apt. 935 264 Abshire Canyon, South Nerissachester, NM 01800

Phone: +9752624861224

Job: Forward Technology Assistant

Hobby: Listening to music, Shopping, Vacation, Baton twirling, Flower arranging, Blacksmithing, Do it yourself

Introduction: My name is Nathanial Hackett, I am a lovely, curious, smiling, lively, thoughtful, courageous, lively person who loves writing and wants to share my knowledge and understanding with you.