Differences between snapshots and continuous data protection.
End-users following the meteoric me·te·or·ic
1. Of, relating to, or formed by a meteoroid.
2. Of or relating to the earth's atmosphere.
3. rise of media attention to continuous data protection (CDP CDP (cytidine diphosphate): see cytosine.
(1) (Certificate in Data Processing) An earlier award for the successful completion of an examination in hardware, software, systems analysis, programming, management and accounting, ) solutions have been enticed by the promise of instant data recovery and zero data loss, but one question remains to be answered: What is the difference between snapshots and CDP, and why use one versus the other?
On the surface, these two approaches to data protection share much in common. Upon closer examination, however, important distinctions emerge. Snapshots.
According to according to
1. As stated or indicated by; on the authority of: according to historians.
2. In keeping with: according to instructions.
3. the Storage Networking Industry Association An association of producers and consumers of storage networking products, whose goal is to further storage networking technology and applications. The Storage Networking Industry Association, or SNIA (SNIA (Storage Networking Industry Association, San Francisco, CA, www.snia.org) An organization devoted to the advancement of mission critical storage systems. Founded in 1997, its goal is to determine the standards that must be developed to allow hosts and storage systems to interact via ), snapshots are copies of data as it appeared at a specific point in time. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"
put differently , a frozen picture of a data set--an Oracle database or a home file system, for instance--at one specific point in time.
The many implementations of snapshot functionality fall into one of two general categories: full or differential copy.
A full copy snapshot replicates the data set in its entirety. Often implemented via a process called mirror splitting, a full copy snapshot of a 3 terabyte database consumes an additional three terabytes of disk space.
A differential snapshot manages only deltas that have occurred since the snapshot was last taken. It uses less disk space but requires more processing overhead. There are a number of differential snapshot technical implementations--Copy on Write (CoW), Redirect on Write (RoW), and Write Anywhere (WA).
The important differences between these various snapshot methods lie in storage cost and performance. Additionally, since full copy snapshots don't depend on the main data store they tend to be more resilient to certain types of corruption.
Snapshot Value Propositions
Snapshots can be created quickly, and used as a source for traditional backup and archiving. Since the window of downtime or application interruption is reduced to the time it takes the snapshot to be created, snapshots can reduce or eliminate backup windows.
Snapshots are stored on disk, which provides fast, direct access rather than the slower mechanical streaming access provided by tape. Restore from disk, sometimes with the benefit of differential maps that negate ne·gate
tr.v. ne·gat·ed, ne·gat·ing, ne·gates
1. To make ineffective or invalid; nullify.
2. To rule out; deny. See Synonyms at deny.
3. the need to copy the entire data set, is faster than tape. The time it takes to recover is often referred to as the recovery time objective (RTO (Recovery Time Objective) The amount of time a computer system or application can stop functioning before it is considered intolerable to the enterprise. It can be computed to be from seconds to days, depending on how critical the application is to the organization. ).
Multiple Recovery Points
Many times the effects of data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors. are delayed. A snapshot created at noon is of no use if the corruption occurred at 11:59 a.m. Therefore, it is important to have flexible and frequent recovery points, or recovery point objectives (RPO RPO Recruitment Process Outsourcing
RPO Recovery Point Objective (disaster recovery)
RPO Royal Philharmonic Orchestra
RPO Rochester Philharmonic Orchestra
RPO Representative Poetry Online
RPO Railway Post Office ). Customers sometimes implement rotating or revolving snapshots to achieve higher granularity of RPOs. However, these additional snapshots come at a significant additional cost.
Often the user wants to perform ancillary operations that are too costly or disruptive to run on the production system. These might include audit prior to archiving, reporting, application or upgrade testing, data mining, etc. Snapshots provide a copy of near live data for these operations.
Continuous Data Protection (CDP)
The key word in "Continuous Data Protection" is continuous. CDP provides a continuum of recovery points for a given data set--offering the ability to access ANY point in time, not just those specific times predetermined pre·de·ter·mine
v. pre·de·ter·mined, pre·de·ter·min·ing, pre·de·ter·mines
1. To determine, decide, or establish in advance: by a snapshot process. CDP allows the restoration of applications to just prior to specific events instead of restoring to a pre-determined point in time. The restoration point is selected and dynamically recreated after the event occurs.
CDP provides infinitely granular recovery points (RPO) and some implementations can provide near instant recovery times (RTO). This is because CDP protection is typically done on a write transaction by write transaction basis. Every modification of data is recorded and the recovery can occur to any point, down to the demarcation of individual write operations.
An important aspect of CDP protection is atomicity at·o·mic·i·ty
1. The state of being composed of atoms.
a. The number of atoms in a molecule.
b. Valence. . Copy operations on single point in time images lack atomicity across entire datasets by default. There are complex methods which can be utilized to attempt consistency, but in most cases, single point in time technologies cannot deliver atomicity. This means that the application must be impacted to create a recovery point. The application is typically quiesced in some way while a snapshot is taken. This process is complicated and failure-prone.
CDP-based restoration provides atomicity, offering a holistic data set which can be recovered whether or not an application was quiescent quiescent
at rest; latent; the G0 stage of the cell cycle. at the moment of recovery. This capability--often termed crash consistent recovery--is the same recovery modality modality /mo·dal·i·ty/ (mo-dal´i-te)
1. a method of application of, or the employment of, any therapeutic agent, especially a physical agent.
2. used in fault tolerant The ability to continue non-stop when a hardware failure occurs. A fault-tolerant system is designed from the ground up for reliability by building multiples of all critical components, such as CPUs, memories, disks and power supplies into the same computer. and high availability Also called "RAS" (reliability, availability, serviceability) or "fault resilient," it refers to a multiprocessing system that can quickly recover from a failure. There may be a minute or two of downtime while one system switches over to another, but processing will continue. systems.
Because atomicity in inherent to CDP, it enables very powerful and easy-to-use restoration processes.
Some CDP implementations allow their RPO options to devolve devolve v. when property is automatically transferred from one party to another by operation of law, without any act required of either past or present owner. The most common example is passing of title to the natural heir of a person upon his death. over time. Any-point-in-time recovery is provided for a time period set by policy, while later periods are recoverable only from specific stored recovery points. CDP systems with this capability can extend their recovery timeline while reducing their storage capacity needs.
CDP Value Propositions
Like snapshots, CDP eliminates the backup window. Unlike most snapshots, CDP can do so atomically with no application interaction or involvement. Because of the dynamic and non-destructive nature of CDP recovery, the user is able to audit the backup without jeopardizing the integrity of the content.
Unlike snapshots, CDP images are not single fixed points in time that must be made, managed and stored. CDP images are created at will. This allows for a much wider degree of backup operations to be preformed. With a CDP system, for instance, it is possible to run the midnight backup at 7 a.m. after having audited the condition and recoverability of the data as it existed at midnight, or just because it allows the day staff to perform the backup.
CDP systems fall into two categories regarding recovery. The first set offer virtualized recovery, which means near-instant restoration. Virtualized recovery allows for the data movement part of a recovery to be attended to after the application data is made available for use--after the customer systems are back online and operational. There is nothing faster from a recovery time objective standpoint than a CDP product that offers virtualized recovery.
The other possibility is known as directional recovery--also known as roll back or roll forward recovery. This recovery method uses redo To reverse an undo operation. See undo. or rollback logs of write operations to move the application data back in time. The recovery times in this system are dependent on how far back in time the restore operation is targeting, but typically is the same as or much faster than a snapshot restoration depending on the snapshots ability to use differential maps.
Multiple Recovery Point Objectives
CDP delivers infinite RPOs, while snapshot systems deliver individual RPOs. Some vendors recommend attempting to approximate the RPO capability of CDP by creating many snapshots. This is typically unattractive as each snapshot is another object that needs to be created (with application involvement), stored and managed (non-trivial and costly). In other words, snapshots require the user to predetermine pre·de·ter·mine
v. pre·de·ter·mined, pre·de·ter·min·ing, pre·de·ter·mines
1. To determine, decide, or establish in advance: potential restoration points before the failure event, and then to manage each predetermined point. CDP allows the user to dynamically recreate any previous point of the data, and to do so after the failure occurred--so the recovery point can be pinpointed to just before the failure event.
Additionally, most CDP solutions provide the ability to annotate annotate - annotation a timeline with specific events or special times in the history of the data, which gives the user the ability to restore to any specific point in time, as well as anything in between these marked times.
CDP provides atomic data sets for repurposing with far more flexibility than snapshots. The data image is created dynamically with CDP, can be recreated repeatedly, can be audited, saved and archived independent of the main line data store, and can be used as a single solution to deliver data to multiple re-purposing activities.
More than just media hype, CDP is now in regular use at mainstream data centers because it offers all the inherent benefits of snapshots, plus far more granular recovery points and much faster recovery times.
When zero downtime and zero data loss is the objective, CDP is the only real option.
Michael Rowan is founder and chief technology officer (CTO (Chief Technical Officer) The executive responsible for the technical direction of an organization. See CIO and salary survey. ) of Revivio, Inc.