Printer Friendly
The Free Library
14,536,885 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Backup storage appliances come of age.


Data is one of the corporation's most important assets, but it's also one of the corporation's biggest headaches. Data needs to be fully protected in the face of growing data volumes, the demand for shorter restore windows, and reduced storage expenditures. Believing that "two out of three ain't bad," many companies have sacrificed short restore windows for tape's large capacity and economical price point. Most know that backup to disk can enable much faster restores with higher reliability and easier administration, but in the past the cost has been too high.

Traditional backups have relied on tape because it is cost effective for fast backups and getting data offsite, but it is slow and unreliable for restores--the main reason backups happen in the first place. Other types of data protection do rely heavily on disk (replication, mirroring and snapshots are prime examples) but IT reserves these more expensive methods for critical data with high availability Also called "RAS" (reliability, availability, serviceability) or "fault resilient," it refers to a multiprocessing system that can quickly recover from a failure. There may be a minute or two of downtime while one system switches over to another, but processing will continue.  and rock-solid recovery requirements. Disk is also ideal for data warehouses and repositories, but having to house multiple backup versions makes heavy demands on disk-based backup. On top of cost issues, getting an integrated backup function to work across multiple applications, hosts and storage targets is extremely challenging.

More and more companies have been looking at backup appliances to let them use disk-based backup at a reduced price point, but these systems have lacked advanced data protection features. The challenge for backup product makers is to provide a disk-based long-term storage system that supports multiple backup applications, multiple versions of backups, allows restore to use disk's random access capability, has a comparable price point to tape, and ensures reliable and verifiable recoveries.

Backup and Recovery Requirements

Many disk manufacturers have triumphantly announced the death of tape, but tape sales remain healthy. It's easy to see why: tape technology advances have included better robotics robotics, science and technology of general purpose, programmable machine systems. Contrary to the popular fiction image of robots as ambulatory machines of human appearance capable of performing almost any task, most robotic systems are anchored to fixed positions , increasing capacity and performance on drives and media, and the ability to store multiple copies or versions at a low cost and on removable media In computer storage, removable media refers to storage media which can be removed from its reader device, conferring portability on the data it carries. A removable drive is a reader device for such media. . However, manual tape handling is awkward and prone to error, tape needs careful tuning to optimize streaming, and backup quality can be hard to determine until you go to recover--at which point you really hope the quality is there.

One of the main technical problems with tape drives is that they must be tuned to avoid what is called the shoeshine effect--starting, stopping and repositioning repositioning Laparoscopic surgery The changing of a Pt's position during a procedure to improve access or visualization of the operative field, which may be linked to complications, as it changes anatomic planes of operation. Cf Laparoscopic surgery.  the tape. To avoid this, it's much more efficient to stream tape drives by sending several concurrent backups. This process, called multiplexing multiplexing, in communication, technique whereby two or more independent messages, or information-bearing signals, are carried by a single common medium, or channel. , does help backup performance by minimizing the shoeshine effect. However, it takes extra time to read images and handle multiple incoming backup sources.

Disk storage counters these disadvantages by avoiding the shoeshine effect, handling incremental backups See backup types.

(operating system) incremental backup - A kind of backup that copies all files which have changed since the date of the previous backup. The first backup of a file system should include all files - a "full backup". Call this level 0.
 better, making off-site backup easier and more efficient, and providing more reliable recovery:

* Avoiding the shoeshine effect. Disk arrays do not need a steady stream of data, so there is no shoeshine effect even for small incremental backups.

* Simplifying and accelerating the backup process. Storage administrators like incremental backups because they help shrink backup windows, but tape restores are simpler and more effective with full backups See backup types. . Disk-based backup systems Noun 1. backup system - a computer system for making backups
ADP system, ADPS, automatic data processing system, computer system, computing system - a system of one or more computers and associated software with common storage
 allow administrators to schedule more incremental backups without worrying about performance penalties or risking restores.

* Making off-site backup copies A disk, tape or other machine readable copy of a data or program file. Making backup copies is a discipline most computer users learn the hard way-- after months of work is lost. See backup and LAN free backup.  easier and more efficient. Since it isn't multiplexing backup data from multiple backup clients on one tape, disk-to-tape copying efficiently organizes backup data by client, which speeds up the recovery process. Disk-to-tape copying also provides more flexibility than tape: tape-to-tape copying does not allow other backup or restore operations, but disk allows simultaneous access.

* Superior recovery. Disk-specific technologies like RAID make disk a more reliable medium than tape. One bad tape can cause an entire restore operation to fail, but RAID protection allows a restore to complete successfully even with a failed disk.

* Efficient single-file recoveries. According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 analyst firm Strategic Research, 87% of all restores are single-file recoveries, not full-system recoveries. Disk is a random access medium, and is ideal for single-file recovery.

In spite of these advantages, people are still backing up to tape. So what's the problem? Disk has lagged behind tape as a long-term storage medium because of cost. Consider the price of storing four weeks' worth of full and daily incremental backups. Here's the math: If you assume that incremental backup sizes are about 5% of the original data set, over a month's time the backups will consume five times the original data size. Given this capacity, using primary disk subsystems for backup storage A storage device used to hold copies of data for backup and recovery. In the IT world, tape drives and tape libraries have been the traditional backup storage medium; however, magneto-optic (MO) and other optical discs as well as regular magnetic disks are also used. See LAN free backup.  is cost prohibitive. Less expensive ATA-based disk arrays are cheaper, but at $10-$20/GB they're still much more expensive than $1-$5/GB for tape-based backups.

A disk-based storage system must avoid the disadvantages of tape and disk while keeping their advantages: disk's effectiveness, reliability and speed, with tape's capacity and price point. This means that is must provide high performance for both backup and restore operations, be cost-effective, and enable reliable, verifiable storage.

Advanced Disk-Based Backup Systems

Backup system best practices include compression, accelerated backups and restores, cost advantages versus tape, verifiable recoverability and highly resilient storage, and simple and seamless integration An addition of a new application, routine or device that works smoothly with the existing system. It implies that the new feature or program can be installed and used without problems. Contrast with "transparent," which implies that there is no discernible change after installation.  with backup software See backup program.

(tool, software) backup software - Software for doing a backup, often included as part of the operating system.

Backup software should provide ways to specify what files get backed up and to where.
.

Compression for Disk Price Parity

A key technology for fast and economical disk-based backup is compression. Compression creates a smaller and more cost-effective system by pooling redundancies and storing only unique data. Backup images contain a lot of redundant data, especially weekly full backups. Even incremental backups, which capture changed files only, usually capture unchanged data blocks as well. Global compression pools backup image redundancies and stores only unique data. By slicing data into sequences of variable lengths, global compression detects duplicate files, redundant patterns within and across files, and repeated patterns within blocks. By pooling and storing only unique data, the technology produces a highly efficient and simpler-to-manage backup environment. While most compression technologies offer around 2X compression, some more global approaches can reduce repetition of large segments across the history of storage, with order-of-magnitude smaller results.

High Performance for Backup and Restore

Performance should be comparable to high-end tape drives or network attached storage (NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
) systems, without tape's multiplexing operations. With tape drives, multiplexing has been used to compensate for slow clients, multiple slow networks, and small incremental backups. Backup appliance best practices yield multiple, concurrent backup streams sent at varying speeds, and must be easily scalable.

Administrators typically make a full backup on the weekend, along with daily incremental backups. If the company requires a full system restore later in the week, tape-based backup must chug (jargon) chug - To run slowly; to grind or grovel. "The disk is chugging like crazy."  through several days of incremental backups in order to update files modified since the previous full backup. Using random access disk, this hours-long process takes a tiny fraction of the time that's required when using tapes.

Highly Reliable Storage and Verifiable Recoverability

The backup system should be highly reliable and allow IT administrators to continuously confirm the integrity of the recovery copy. Through extensive data and metadata verification, the system can validate the reliability of the recovery copy. This ability is an improvement on RAID: RAID storage systems do not provide file system verification. It's quite possible to overwrite (1) A data entry mode that writes over existing characters on screen when new characters are typed in. Contrast with insert mode.

(2) To record new data on top of existing data such as when a disk record or file is updated.
 good data on a disk subsystem, and to return bad information. Unless IT carefully plans for software faults and disk errors or have automated file system verification in place, backup data stored on RAID may or may not be good when it comes time to restore.

A reliable backup system will provide advanced RAID features like mirrored configuration (RAID1+0), hot spares and automatic restoration after (perish TO PERISH. To come to an end; to cease to be; to die.
     2. What has never existed cannot be said to have perished.
     3. When two or more persons die by the same accident, as a shipwreck, no presumption arises that one perished before the
 the thought!) a disk failure. It should also be able to verify the entire backup image at backup time, have a highly tolerant append-only file system, and offer solid write protection after a power interruption.

* Verifying the integrity of the backup image. The simplest way to verify backup quality is to restore and check the entire backup image. However, this requires extra storage and processing power, and the initial check may not be enough--IT should periodically check the backup image to make sure it remains recoverable. Since this procedure takes considerable time and effort, it's unlikely that IT will regularly check the entire backup. And they shouldn't have to; the backup system should do it by running a continuous validation process in the background.

* Append-only file system. Conventional disk storage systems can make very fast random block changes, which nicely supports transactional data operations. However, backup files A file on a tape, removable disk or the fixed disk of another computer that is a copy kept for backup purposes. See backup types.  are large and sequential, which means that restores are much more demanding than other types of file requests. An append-only file system has greater fault tolerance See fault tolerant.

(architecture) fault tolerance - 1. The ability of a system or component to continue normal operation despite the presence of hardware or software faults. This often involves some degree of redundancy.

2.
 for large-scale restore movements because it allocates new blocks for writing data in only one direction. The system will not overwrite these blocks until it encounters a formal clean command.

* Verifying writes after power loss. If a drive suddenly loses power, it's often impossible to tell if it finished writing to disk--writes may look complete in the write back cache A disk or memory cache that supports the caching of writing. Data normally written to memory or to disk by the CPU is first written into the cache. During idle machine cycles, the data are written from the cache into memory or onto disk.  on the drive, but the data may not make it onto the actual platter One of the disks in a hard disk drive. Each platter provides a top and bottom recording surface. There may be only one or several platters in a drive with each platter having its own pair of read/write heads. See magnetic disk. . NVRAM (Non-Volatile RAM) May refer to dynamic RAM (DRAM) and static RAM (SRAM) chips that are backed up by a battery or to non-volatile chips such as flash memory. See non-volatile memory, dynamic RAM, static RAM and solid state disk.  technology allows the system to validate full writes. Another use for NVRAM's verified writes is to catch bad blocks. No matter how reliable the disk, bad blocks happen--errors in cache, firmware A category of memory chips that hold their content without electrical power. Firmware includes flash, ROM, PROM, EPROM and EEPROM technologies. When holding program instructions, firmware can be thought of as "hard software." See flash memory, ROM, PROM, EPROM, EEPROM and FOTA. , hardware and bugs are all causes. When drives are mirrored, eventually the bad blocks on one drive will yield an inconsistent mirror. The backup system should be able to detect a mismatch mismatch

1. in blood transfusions and transplantation immunology, an incompatibility between potential donor and recipient.

2. one or more nucleotides in one of the double strands in a nucleic acid molecule without complementary nucleotides in the same position on the other
, identify the bad blocks and resynchronize the mirrors.

For example, Data Domain's Restorer is an on-site disk-based restore repository that replaces ATA RAID Using ATA (IDE) drives in a high-performance and/or fault tolerant configuration. See RAID.  systems or NAS, and complements tape by relieving the pressure of being the primary backup and restore target. It offers high capacity with 20X Global Compression, accelerates backups and restores with cost advantages similar to tape, verifies recoverability, and operates with standard backup applications.

Backup storage appliances like the Restorer offer fast and extremely reliable recovery. By making the backup and recovery process faster, simpler, verifiable and more cost effective, sophisticated storage systems extend the benefits of backup software and vastly improve the data protection infrastructure.

Brian Biles is vice president of marketing at Data Domain Inc. (Palo Alto Palo Alto, city, California
Palo Alto (păl`ō ăl`tō), city (1990 pop. 55,900), Santa Clara co., W Calif.; inc. 1894. Although primarily residential, Palo Alto has aerospace, electronics, and advanced research industries.
, CA)

www.datadomain.com
COPYRIGHT 2003 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Security
Author:Biles, Brian
Publication:Computer Technology Review
Date:Nov 1, 2003
Words:1692
Previous Article:Data reacquisition vs. data restoration: a new model for business continuity.(Backup/Restore)
Next Article:Building iSCSI storage area networks.(Storage Management)



Related Articles
NETWORK APPLIANCE LAUNCHES NEARSTORE R100.(Product Announcement)
IBM Tivoli storage manager and NetApp NearStore R100 provide auto storage Mgnt with fast access.(Product Announcement)
Simplifying storage: how companies benefit with a backup appliance approach. (SAN).
Cradle-to-grave storage management now a reality: and not a moment too soon.
Do you backup data? Or do you have a data protection strategy?(Backup/Restore)
CryptoStor secures vital data.(Security)
SAN-based intelligence: the Holy Grail of storage management?(Storage Networking)(storage area networks)
How to control proliferating storage options: three tips for reining in your storage environment.(Storage Management)
New Holy Grail: information lifecycle management; Has it been found? Not yet.(Storage Management)(Editorial)
Decru bags CNL security deal.(CNL Financial Group uses Decru DataFort storage security appliances)(Brief Article)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles