Printer Friendly
The Free Library
5,677,343 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Understanding the new generation of data protection solutions.


A new wave of disk-based solutions has entered the market to address long-standing challenges in enterprise data protection. Because these solutions use disk, they are often lumped together, painting a confusing picture of the data protection landscape. Certainly, all data protection solutions have common basic requirements. Data will always need to be retained safely and securely; recovery should be possible with a reasonable amount of effort.

However, by examining key differentiating factors--including retention period, speed and cost--we can segment the data protection market into three areas: Staging solutions, designed for data kept briefly on the way to longer-term media; Backup/Restore solutions, designed for data kept in the medium term, when a restore will likely be required; and Archiving solutions, designed for data kept long term for legal or corporate requirements. Further, by breaking down your data protection requirements by segment, it becomes easier to set evaluation criteria and identify the right components for a complete solution.

Staging

The cost of Staging solutions tends to be much higher than traditional tape; however, this is a secondary concern to speeding backups. Because of their cost, Staging devices are often used to keep the most recent backup data, typically for a few days maximum. Most technologies in this category enable data to be moved to tape at any time without disturbing business processes. Disk-based solutions in the Staging category include ATA (1) (AT Attachment) The specification for IDE drives. See IDE.

(2) See analog telephone adapter.

ATA - Advanced Technology Attachment
 arrays, Virtual Tape Libraries A hard disk array that emulates a tape library. A virtual tape library (VTL) enables the storage medium to be switched from tapes to disks while continuing to use the existing tape backup software. See virtual tape system and storage virtualization. , Snapshot Technologies and Continuous Data Protection.

[FIGURE 1 OMITTED]

ATA Arrays

ATA Arrays are used as target data repositories See repository.  for backup operations. Most modern back-up software solutions from vendors like Veritas or EMC (1) (EMC Corporation, Hopkinton, MA, www.emc.com) The leading supplier of storage products for midrange computers and mainframes. Founded in 1979 by Richard J. Egan and Roger Marino, EMC has developed advanced storage and retrieval technologies for the world's largest companies.  Legato (Legato Systems, Inc., Mountain View, CA, www.legato.com) A leading provider of storage management and high-availability software founded in 1988 and acquired by EMC Corporation in 2003. Legato software, including Celestra data management (data mining, data migration, etc.  support backup-to-disk options, in addition to their more traditional backup-to-tape options. Backup to standard ATA Libraries tends to be extremely fast and easy to implement, but also extremely expensive. The cost per gigabyte for ATA array based storage in 2004 was between $3 and $15, whereas tape storage was one-sixth that cost.

VTL See virtual tape library.  Solutions

Virtual Tape Libraries are increasingly common; today no major tape library vendor remains without one in its product portfolio. VTL solutions add to the standard ATA array by providing an interface that mimics a tape library, allowing seamless integration An addition of a new application, routine or device that works smoothly with the existing system. It implies that the new feature or program can be installed and used without problems. Contrast with "transparent," which implies that there is no discernible change after installation.  into existing backup infrastructures with little or no process change. VTL solutions tend to run faster than even ATA arrays, as no host operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 or file system is required. Most VTL solutions today also allow users to manage an attached tape library, backing up or restoring virtual tape images to real physical tape. Since VTL solutions are intended to speed backups, it's essential to examine the backup environment in its entirety. Bottlenecks in application servers, media managers and software compression can often reduce throughput substantially; in some instances it may be sufficient or more cost-effective to use a traditional array.

Snapshot Technologies

Most modern storage systems support snapshot technologies in one form or another. Snapshot technologies make a quick point-in-time image of the block-level storage or file system to be kept available to restore data, even when that data then subsequently changes. Snapshot images can often be made end-user accessible, allowing users to locate previous versions of data or recover accidentally deleted files. As the data is in a read-only format, the backup of a snapshot to tape is extremely simple. Most snap-shot technologies tend to be reasonably efficient in their use of disk space, storing only changes to stored data rather than whole copies of changed files.

Continuous Data Protection

Continuous Data Protection (CDP CDP (cytidine diphosphate): see cytosine.


(1) (Certificate in Data Processing) An earlier award for the successful completion of an examination in hardware, software, systems analysis, programming, management and accounting,
) is an emerging technology designed to provide the same level of protection as snapshots but without the atomic point Atomic Point is a Tetris-variant arcade game released by Philko in 1990. Your task is to complete a horizontal row on the same level that 2 flashing diamonds appear, as can be seen in the screenshot of level 1.  in time. Using CDP, storage administrators can 'roll back' data storage systems to any previous point in time (within certain limits) rather than a predefined snapshot point. Questions remain regarding CDP technology's integration with heterogeneous application environments and the real tangible benefit it offers over traditional snapshots.

When evaluating Staging solutions, it's important to weigh the following criteria and consider:

* What level of performance does it offer in meeting enterprise backup windows?

* How easy is it to perform data recovery--for administrators and users?

* How well does it integrate with a disk library or tape library for longer-term retention?

* How well does it integrate with your existing backup software See backup program.

(tool, software) backup software - Software for doing a backup, often included as part of the operating system.

Backup software should provide ways to specify what files get backed up and to where.
 infrastructure?

Backup/Restore

Enterprises typically keep up to six months of data for restoration in the event of data loss. Backup and restore products are typically at the core of a data protection solution. Products in this segment tend to be a little slower than those in the staging segment; buying criteria is focused on reducing long-term data storage costs and enabling offsite data migration. Tape libraries have been entrenched en·trench   also in·trench
v. en·trenched, en·trench·ing, en·trench·es

v.tr.
1. To provide with a trench, especially for the purpose of fortifying or defending.

2.
 in enterprise back-up storage for many years. However, exponential growth Extremely fast growth. On a chart, the line curves up rather than being straight. Contrast with linear.  in enterprise data, shrinking back-up windows and high-profile incidents of tape loss in transit have exposed the limitations of tape. Further, studies show that one in three recoveries from tape fail--a rate of failure most organizations cannot tolerate. Disk-based products in the Backup/Restore category include Disk-Based Libraries and Capacity Optimized Storage.

Disk-Based Libraries

Disk-Based Libraries (DBL DBL Double
DBL Disability
DBL Down By Law (band)
DBL Database Language
DBL Drexel Burnham Lambert (defunct investment bank)
DBL Distance Between Lenses
DBL Don't Be Late
) are the biggest challenger to tape library solutions. These devices typically come in two formats--removable and fixed. Removable disk A disk or disk cartridge that is inserted into the drive for reading and writing and removed when not required. Using optical technologies, CDs and DVDs are the most common examples.  libraries have cartridges that allow for the physical removal of disks or disk cartridges A removable disk module that contains a single hard disk platter or a floppy disk. See cartridge.


Old and New Disk Cartridges
Older cartridges (right) came in a variety of sizes as large as 16" in diameter.
. However, the cartridges are often heavy fragile and difficult to move to a remote storage facility. Another issue with removable disk cartridges is that manual interaction and tracking is still involved; as with tape libraries, this remains the biggest source of potential recovery error. Fixed disk libraries rely on modern replication technologies to move data offsite for disaster recovery; they only allow for disk removal to replace defective disks. Well-designed DBL often provide both file system and virtual tape interfaces for maximum flexibility. Additionally, some disk libraries implement MAID (Massive Array of Independent Disks) technology to power disks up and down, lowering power costs over the lifetime of the system and extending the life of the disks within it.

Capacity Optimized Storage

Capacity Optimized Storage (COS) represents a new evolution in disk-based libraries. COS solutions not only allow for long-term disk-based storage, but also provide dramatic levels of compression (20:1 is typical), a feature that makes disk drives cost-competitive with tape for enterprise backup and restore. Some COS solutions also utilize the compression technology for wide-area replication, allowing low-cost site-to-site connections to be used for offsite data protection and obviating ob·vi·ate  
tr.v. ob·vi·at·ed, ob·vi·at·ing, ob·vi·ates
To anticipate and dispose of effectively; render unnecessary. See Synonyms at prevent.
 the need for manual processes around the movement of disk or tape

When evaluating Backup/Restore solutions, it's important to weigh the following criteria and consider:

* What is the cost of long-term data storage on disk vs. tape?

* What is the cost of ongoing maintenance?

* Does the solution provide mechanisms to check data integrity to ensure data recovery?

* How easy is it to move data offsite for disaster recovery protection?

* How well does the product integrate with standard backup management software?

Archiving

Archiving solutions are designed to retain data over the long term--typically more than six months and up to six years for most financial data. (Legally, some kinds of data may need to be retained for up to ninety-nine years). New disk-based products in this segment have difficulty competing with the sheer low cost of a shelved tape cartridge See cartridge. . However, a company may be liable for the reproduction of historical data and tape offers no certainty of data restorability or integrity. Modern optical storage systems, such as "jukeboxes," use DVD DVD: see digital versatile disc.
DVD
 in full digital video disc or digital versatile disc

Type of optical disc. The DVD represents the second generation of compact-disc (CD) technology.
 media to store data, but long-term storage can often be a challenge due to the sheer amount of media generated--more than 120 DVDs are required for every Terabyte archived--and locating the correct DVD for recovery can be difficult. New legislation such as Sarbanes-Oxley has prompted vendors to create solutions that meet stringent retention requirements. Disk-based solutions in the Archiving segment include Content Addressable Reachable. When something is addressable, it can be identified and manipulated independently of its surroundings. For example, screen pixels and RAM memory are addressable. Each of the screen's picture elements can be individually turned on and off, and each of the memory's bytes can be  Storage solutions.

Content Addressable Storage

Content Addressable Storage (CAS) solutions are disk-based storage solutions developed specifically for the long-term archiving and retrieval of data with a focus on ensuring the retrieved data is identical to that which was stored. Data passed to a CAS system are examined and then a 'hash' is created which uniquely identifies the data. The hash (a series of hexadecimal digits Noun 1. hexadecimal digit - a digit from 0 to 15 in hexadecimal notation
digit, figure - one of the elements that collectively form a system of numeration; "0 and 1 are digits"
) is generated by a 'hashing' algorithm (that processes the data to generate a hexadecimal See hex.

(mathematics) hexadecimal - (Or "hex") Base 16. A number representation using the digits 0-9, with their usual meaning, plus the letters A-F (or a-f) to represent hexadecimal digits with values of (decimal) 10 to 15.
 fingerprint of its contents). This algorithm is chosen to minimize the chances of any two pieces of non-identical data generating the same hash. Once the data are stored, the hash is then returned to the originator of the data to be used for retrieval--much like a coat check. The hashing Creating hash totals or hash tables. See hash total and hash table.

hashing - hash coding
 approach ensures that the same piece of data is never stored twice, as duplicate files can be identified though a comparison of hashes. While CAS systems work well, most require a separate, software-based control program that stores and retrieves data from the CAS system. Many programs exist for the most popular CAS solutions, including e-mail collection and archiving, Instant Messaging Exchanging text messages in real time between two or more people logged into a particular instant messaging (IM) service. Instant messaging is more interactive than e-mail because messages are sent immediately, whereas e-mail messages can be queued up in a mail server for seconds or  collection and archiving and file system front-ends for generic data storage.

When evaluating Archiving solutions, it's important to weight the following criteria and consider:

* What are your legal requirements for long-term corporate electronic record maintenance?

* What are the potential cost efficiencies you will realize in both storage and retrieval?

* What software support is available for archiving and retrieval of required data types?

* What solution support will be available over the lifespan of your archiving requirements?

Breaking down your data protection requirements by segment--Staging, Backup/Restore and Archiving--is an important first step in building a solution, making it much more straightforward to evaluate and select the appropriate products. In my next article, I will demonstrate how to use this approach in designing and implementing a complete data protection solution.

Geoff Barrall is CEO (1) (Chief Executive Officer) The highest individual in command of an organization. Typically the president of the company, the CEO reports to the Chairman of the Board.  of Trusted Data and serves on the board of advisors for Data Domain

www.trusteddata.com
COPYRIGHT 2005 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2005, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Disaster Recovery & Backup/Restore
Author:Barrall, Geoff
Publication:Computer Technology Review
Date:Jun 1, 2005
Words:1635
Previous Article:Recovery management: focusing on recovery-oriented data protection.(Disaster Recovery & Backup/Restore)
Next Article:Minimizing downtime with disk image restores.(Disaster Recovery & Backup/Restore)
Topics:



Related Articles
NTI DriveBackup! Introduced By NewTech Infosystems.(Product Announcement)
RELIATY INTROS FIRST DATA PROTECTION SOFTWARE FOUNDED ON NDMP.(Reliaty Backup based on Network Data Management Protocol)(Product Announcement)
Simplifying storage: how companies benefit with a backup appliance approach. (SAN).
Prepare for the worst: portable tape drives put the "recovery" in mobile disaster recovery.(Storage & Backup)
Assessing the foundation of long distance disaster recovery.(Regulatory Compliance)
CMS Products ships 100GB ABSplus ultra portable backup and restore system.(new backup software)
TCO analysis: where D2D fits--part 2.(Storage Networking)(Total Cost of Ownership)
Overcoming recovery barriers: rapid and reliable system and data recovery.(Data Protection)
The push for continuous data protection.(Special Section)
Personal disaster recovery software: an essential part of business disaster recovery plans.(Special Section)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles