Printer Friendly
The Free Library
14,505,205 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Tape pipeline for BC/DR planning: new considerations in backup and recovery. (Storage Networking).


Tape has long been an affordable backup medium. However, when it came to remote data replication In database management, the ability to keep distributed databases synchronized by routinely copying the entire database or subsets of the database to other servers in the network.

There are various replication methods.
, tape had severe limitations. With heavy latency (1) The time between initiating a request in the computer and receiving the answer. Data latency may refer to the time between a query and the results arriving at the screen or the time between initiating a transaction that modifies one or more databases and its completion.  impact over as little as a 10-mile distances, remote tape backup Using magnetic tape for storing duplicate copies of hard disk files. Users can add an internal or external tape drive to their desktop computers for backup purposes, and files are typically copied to the tapes using a backup utility that updates on a periodic schedule.  was not a viable option for business-continuity/disaster-recovery(BC/DR) planning--until the introduction of tape pipelining.

Tape pipelining virtually eliminates the impact of latency on the sustainable throughput The speed with which a computer processes data. It is a combination of internal processing speed, peripheral speeds (I/O) and the efficiency of the operating system and other system software all working together.

1.
 of tape backup over distance. With tape pipelining, the remote tape backup system appears local to the server and is able to sustain high performance over thousands of miles. This greatly enhanced performance over distance allows considerable flexibility to companies considering tape as part of their BC/DR strategy. Tape pipelining can enable a single central site to act as a backup facility for many geographically dispersed dis·perse  
v. dis·persed, dis·pers·ing, dis·pers·es

v.tr.
1.
a. To drive off or scatter in different directions: The police dispersed the crowd.

b.
 data centers. It can also enable backup from a central site to a remote site.

Companies can backup stored tape data across low-cost and readily available IP connections, using both newer native Fibre Channel tape drives as well as older SCSI SCSI
 in full Small Computer System Interface

Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB.
 drives. In addition, open systems servers such as UNIX UNIX

Operating system for digital computers, developed by Ken Thompson of Bell Laboratories in 1969. It was initially designed for a single user (the name was a pun on the earlier operating system Multics).
 and NT can be incorporated into remote tape strategies.

Many companies are using IP networks for remote disk mirroring which is typically synchronous Refers to events that are synchronized, or coordinated, in time. For example, the interval between transmitting A and B is the same as between B and C, and completing the current operation before the next one is started are considered synchronous operations. Contrast with asynchronous.  and therefore uses substantial network bandwidth during the day. Since tape backup is typically asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end.  and often done during nights and weekends, tape pipelining means that businesses can now use their existing IP networks for both disk mirroring and remote tape backup solutions.

The result is improved data protection and recoverability in the event of a disaster and a significant reduction in the total cost of ownership of a remote BC/DR solution.

With tape pipelining, a single location can act as the backup for numerous remote servers and centrally manage the backup/restore operation. Tape pipelining over IP allows this to be done over plentiful plen·ti·ful  
adj.
1. Existing in great quantity or ample supply.

2. Providing or producing an abundance: a plentiful harvest.
 yet inexpensive IP connections.

Selecting BC/DR Solutions

In planning BC/DR systems, businesses can select from a wide range of data-protection solutions (see Figure 1). The choices they make will be determined by the importance of the information they need to protect (i.e. its time sensitivity and business, value) and the amount of money they are willing to spend to protect it. These factors help determine the appropriate recovery lime objective (RTO (Recovery Time Objective) The amount of time a computer system or application can stop functioning before it is considered intolerable to the enterprise. It can be computed to be from seconds to days, depending on how critical the application is to the organization. ).

Industry research shows that a typical mid-sized to large enterprise might have 150 mission- and time-critical applications that require continuous availability; 300 or so applications that require data restoration within four to 24 hours; and 100 or so non-critical applications that can wait as long as several weeks for data restoration.

Levels of information protection: If a business cannot continue profitable operations without its data, it usually implements a remote hot site that completely duplicates critical data and applications and makes them continuously available. The cost of such a solution is justified by the cost to the business of even a few minutes of down time. At the other end of the spectrum is business data that may not need to be restored for days, weeks or even longer after a data center disruption disruption /dis·rup·tion/ (dis-rup´shun) a morphologic defect resulting from the extrinsic breakdown of, or interference with, a developmental process. . In fact, some business data requires no restoration at all but simply needs to be protected for legal and archival purposes.

The most cost-effective enterprise-wide BC/DR plan will probably involve a mixture of disk and tape technologies, from the most expensive for data that needs to be continuously available to less expensive technologies for data of less time sensitivity and importance. Figure 2 shows what such a plan might involve.

New factors and increased focus in BC/DR planning: BC/DR strategies are affected by the increasing amount of business data being pushed into the mission-critical category, requiring continuous availability. Businesses are also aware that the dangers to their mission-critical data are greater than ever before. Not so long ago businesses had to plan for human error, technical glitches and, of course, natural disasters. Post 9/11, businesses must also plan for terrorist attacks that, as a November 2002 report from Illuminata put it, could "take out more than just a couple of floors of one data center."

Indeed, government agencies and private disaster planners are urging businesses to establish BC/DR systems that separate primary and backup sites A backup site is a location where a business can easily relocate following a disaster, such as fire, flood, or terrorist threat. This is an integral part of the disaster recovery plan of a business.  by at least 200 miles, instead of the typical 60 miles. The SEC and Federal Reserve are calling for businesses that play a significant role in financial markets to establish fully redundant backup sites at least 300 miles offsite and be able to recover critical activities on the same business day of a disaster.

Tape Backup Market

The evolution toward networked storage, the heightened awareness of business-continuity and disaster-recovery planning and the need to reduce costs and improve efficiencies throughout IT, is stimulating the market for cost-effective tape and enterprise backup solutions.

IDC (September 2001) sees the tape market moving from solutions targeting the high end of the data storage market to lower-cost solutions that add more capacity than standalone stand·a·lone  
adj.
Self-contained and usually independently operating: a standalone computer terminal. 
 tape drives. Through 2003/2004, as data volumes increase 200%-300% per year, Meta Group (October 2001) projected that backup/restore windows will be reduced using intelligent storage, highly parallel tape solutions, and FC-based SANs.

Gartner Group (company) Gartner Group - One of the biggest IT industry research firms.

Address: Connecticut, USA.
 stated unequivocally (March 2001) that disk-based replication will not replace tape backup. Tape's low cost--it is about 10 to 60 times less expensive than disk storage--makes it a compelling economic argument for its inclusion in any high-availability storage infrastructure.

Tape Backup and Its Historical Limitations

Tape backup plays a big role in cost-effective BC/DR systems since it is significantly less expensive than disk-based replication and is perfectly appropriate for protecting many types of business information. However, historically there have been inherent performance problems with tape backup over distance that limited its role in BC/DR planning.

The primary reason for this limited role is the sequential nature of tape I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output.

I/O - Input/Output
 operations, which makes tape backup highly sensitive Adj. 1. highly sensitive - readily affected by various agents; "a highly sensitive explosive is easily exploded by a shock"; "a sensitive colloid is readily coagulated"  to the latency that accompanies distance. Unlike disk-based operations in which blocks can be written in parallel, tape I/O for a single block must complete before the next block can be written. Figure 3 illustrates the typical tape performance falloff fall·off  
n.
A reduction or decrease: a falloff in car sales.

Noun 1. falloff - a noticeable deterioration in performance or quality; "the team went into a slump"; "a gradual slack in
 with distance for several popular tape products. As the chart makes plain, remote tape backup systems simply don't have the throughput to move large amounts of data over distances of more than a few miles.

Tape Pipelining Removes Distance Limitations

With tape pipelining technology, tape backup can play a larger role in BC/DR plans, even at distances of thousands of miles or more.

What is tape pipelining? Tape pipelining is a data-streaming technique for remote tape applications. It maximizes the efficient flow of data by using the concepts of buffering and error recovery, and extending these capabilities over remote distances via emulation (architecture) emulation - When one system performs in exactly the same way as another, though perhaps not at the same speed. A typical example would be emulation of one computer by (a program running on) another. .

How does tape pipelining work? In tape pipelining, the server and tape controller can sit thousands of miles apart, but the two units believe they are sitting next to each other in the same data center. Both the server and the tape controller have an intermediate device such as a storage router See data mover.  nearby that emulates the distant server or tape controller.

The exchange of information between the server and tape controller is buffered, allowing write/confirm operations to take place in tandem Adv. 1. in tandem - one behind the other; "ride tandem on a bicycle built for two"; "riding horses down the path in tandem"
tandem
 using temporary memory. For error recovery purposes, operations (and associated data) in the queue Pronounced "Q." A temporary holding place for data. See queuing, message queue and print queue.

(programming) queue - A first-in first-out data structure used to sequence objects. Objects are added to the tail of the queue ("enqueued") and taken off the head ("dequeued").
 are retained until a successful completion status has been received from the controller.

Tape Pipelining Over IP

Tape pipelining over IP allows businesses to perform remote tape backups during those periods when bandwidth is essentially free.

For example, using existing IP networks between geographically separated facilities, companies can support their daily production network and real-time synchronous data Synchronous data

Information available at the same time. To test option-pricing models, the price of the option and of the underlying should be synchronous and reflect the same moment in the market.
 replication and disk mirroring applications by day, and perform scheduled, asynchronous tape backups during evenings and weekends, when excess bandwidth is available.

Studies show that up to 80% of the total costs for a remote replication solution over five years are network bandwidth costs. Re-using existing bandwidth rather than maintaining dedicated lines can dramatically lower overall cost of ownership.

Key benefits of tape Pipelining over IP:

Businesses can use existing storage resources and IP networks as part of their BC/DR solutions by having a central site act as the backup for a number of geographically dispersed facilities.

Businesses can streamline server backups into one remote tape backup system because open systems servers, such as UNIX and NT, can be incorporated into previously mainframe-only networks.

Businesses can re-use existing tape drives by incorporating both the newer native Fibre Channel tape drives as well as older SCSI drives into the remote tape backup system.

Businesses can reduce the number of people required by centrally managing tape backup/restore operations.

Businesses can use existing IP networks and bandwidth, particularly during evenings and weekends.

Remote Tape Backup--a Key Part of BC/DR Planning

With support for tape pipelining over IP networks, companies can perform remote backups using readily available, less expensive IP networks. It also allows highly distributed organizations to centrally manage the backup operation for its distributed facilities, improving data availability Refers to the degree to which data can be instantly accessed. The term is mostly associated with service levels that are set up either by the internal IT organization or that may be guaranteed by a third party datacenter or storage provider.  of its distributed, mission-critical data (see Figure 4).

For many companies, analysts recommend that BC/DR plans provide disk backup (data replication or data mirroring) to secondary backup centers hundreds of miles from primary sites for critical information that needs to be continuously available.

Analysts also advise that other types of information be available within hours of a manmade or natural disaster. The most cost effective way to meet both needs is to use the same network infrastructure and remote facilities for both synchronous disk backup (for business information essential to continuous operations) and for asynchronous tape backup (for less time-critical but still important data).

Tape pipelining makes it possible by eliminating the latency effects that traditionally have accompanied tape backup over distance. Devices that make tape pipelining possible use buffering, emulation and advanced compression techniques to make local and remote controllers and servers act as if they were right next to each other.

This not only makes it easier for more businesses to use disk and tape together to provide maximum coverage for information based on business requirements and budgetary restrictions, it also allows businesses to take better advantage of their IP bandwidth.

With tape pipelining, remote tape backup/restore comes of age. For businesses seeking to improve their data availability while at the same time controlling costs by using existing hardware, software and network bandwidth, remote tape backup over IP makes perfect sense.

[FIGURE 1 OMITTED]

[FIGURE 2 OMITTED]

[FIGURE 3 OMITTED]

Patty Barkley is storage networking marketing manager at CNT (Carbon NanoTube) See nanotube.  (Minneapolis, Minn.)

WWW WWW or W3: see World Wide Web.


(World Wide Web) The common host name for a Web server. The "www-dot" prefix on Web addresses is widely used to provide a recognizable way of identifying a Web site.
.cnt.com
COPYRIGHT 2003 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Barkley, Patty
Publication:Computer Technology Review
Geographic Code:1USA
Date:Jan 1, 2003
Words:1766
Previous Article:Disk imaging in Windows: increases user productivity and permits broader range of backup devices. (Storage Networking).
Next Article:Online backup and recovery services: throwing a lifeline to "stranded servers". (Storage Networking).
Topics:



Related Articles
"Tape Is Always Going To Be Important".(Company Operations)
For Business Preservation [ldots] Get It On Tape.(first of two articles)(Technology Information)
Watch your back: The mounting risks of unauthorized data access, theft and corruption in secondary storage. (SAN).
The evolving role of tape storage.
Where was your data when the lights went out? Mission-critical data and network apps' security, accessibility during the blackout of 2003.(Disaster...
Tape libraries: a different type of virus protection.(Security)
Tape storage an asset for high leveragability.(First In/First Out)
SAN-based data replication.(Storage Management)(Storage area networks)
Data protection: recovery with tape.(first in/first out)
Backup and disaster recovery techniques: a look back and forward.(first in/first out)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles