Printer Friendly
The Free Library
14,680,088 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Data replication considerations.


Data replication technology is finding its way into every aspect of the IT application infrastructure. Moving data between servers, a task that once meant physically carrying tapes between locations, can now be streamlined by transmitting data directly over a network. Enterprise, metro-area and wide-area networks Wide-area networks

Communication networks that are regional, nationwide, or worldwide in geographic area, with a minimum distance typical of that between major metropolitan areas. Smaller networks include metropolitan and local-area networks.
 (available at relatively low cost) offer the bandwidth and speed necessary to efficiently transfer data across the data center--or across the world--for disaster recovery, content distribution, hardware migration or system consolidation.

Data replication has revolutionized enterprise disaster preparedness pre·par·ed·ness  
n.
The state of being prepared, especially military readiness for combat.

Noun 1. preparedness - the state of having been made ready or prepared for use or action (especially military action); "putting them
. Organizations large and small rely increasingly on software applications for every facet of business activity. Any interruption of access to application data can be devastating dev·as·tate  
tr.v. dev·as·tat·ed, dev·as·tat·ing, dev·as·tates
1. To lay waste; destroy.

2. To overwhelm; confound; stun: was devastated by the rude remark.
. Using data replication technologies, organizations are now able to safeguard the IT infrastructure from disaster.

For IT staff involved in configuring replication, staying abreast of business process vulnerability to data loss and changes in replication technology is essential. This paper aims to provide an overview of data replication alternatives and a review of the considerations enterprises must weigh in determining the appropriate use of replication technology.

Data Replication Alternatives

The term data replication covers a broad spectrum of methods for creating duplicate copies of operational data, either locally or at a remote data center. Like other applications of data replication technology, disaster recovery planning requires comprehensive and universally applicable data copy functionality that offers high-performance movement of data across significant distances.

The storage infrastructure provides multiple platforms Refers to two or more operating environments, which typically include the CPU family and operating system. For example, if versions of a program run on Windows and the Macintosh, the software is said to support multiple platforms.  for hosting enterprise data replication processing. Whether operating from an application server, the controller of a high-end storage array, or from a network-resident dedicated device, the data replication application creates mirror images of primary system data on secondary volumes located anywhere on the network. The backup copies A disk, tape or other machine readable copy of a data or program file. Making backup copies is a discipline most computer users learn the hard way-- after months of work is lost. See backup and LAN free backup.  can then be made available to business applications if an outage out·age  
n.
1. A quantity or portion of something lacking after delivery or storage.

2. A temporary suspension of operation, especially of electric power.
 disrupts access to the primary source of data.

Server-Based Replication

Server-based data replication, referred to by some vendors as host-based or application-based, is installed on, and operates from, an application server platform. Often running on the same server as the business application, server-based solutions are frequently co-located with volume management Products--Volume Replicator See port replicator.

replicator - Any construct that acts to produce copies of itself; this could be a living organism, an idea (see meme), a program (see quine, worm, wabbit, fork bomb, and virus), a pattern in a cellular automaton (see life), or (speculatively) a robot or
 from VERITAS offers an example. The replication software intercepts application I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output.

I/O - Input/Output
 and duplicates write activity to a secondary volume. Replicated volumes can be mounted to either the application server or to a backup application server located elsewhere on the network. Although server-based solutions consume valuable CPU CPU
 in full central processing unit

Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit.
 and memory resources at the application server, they provide the storage administrator with significant flexibility when determining the topology topology, branch of mathematics, formerly known as analysis situs, that studies patterns of geometric figures involving position and relative position without regard to size.  of data replication. Any storage hardware supported by the application server can be configured to receive replicated data.

If the pool of application servers requiring replication is relatively homogenous homogenous - homogeneous , a single software product will likely satisfy all data copy needs. However, the more varied the pool of servers becomes, the more likely that multiple replication products will be required to accommodate all operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 software and hardware configurations. Storage administrators, tasked with managing these solutions, must learn the ins and outs ins and outs  
pl.n.
1. The intricate details of a situation, decision, or process.

2. The windings of a road or path.
 of each product and address the inevitable product compatibility issues.

The close integration between existing storage management applications and same-vendor replication software has definite advantages. Replication utilities will probably have a familiar look and feel, and product support will be integrated, speeding resolution of potential problems.

The most significant drawback DRAWBACK, com. law. An allowance made by the government to merchants on the reexportation of certain imported goods liable to duties, which, in some cases, consists of the whole; in others, of a part of the duties which had been paid upon the importation.  to server-based replication is limited scalability. Software must be installed and managed at each server with data volumes to be replicated, adding to replication costs and to the administration burden of storage managers. For small-scale configurations, however, server-based replication presents a cost-effective approach.

Controller-Based Replication

High-end storage arrays offer a wide range of proprietary storage management functionality that leverages the array controller as an operating environment In computing, an operating environment is the environment in which users run programs, whether in a command line interface, such as in MS-DOS or the Unix shell, or in a graphical user interface, such as in the Macintosh operating system. . Logical unit (LUN) management, performance monitoring and management, and bulk data movement are just some of the functions commonly built into the storage array. The tight integration between hardware and software allows vendor developers unmatched scope for optimizing the speed and functionality of storage management applications.

All vendors of enterprise-class storage arrays now offer data replication as a controller-based (sometimes called disk-based) application. Products include: SRDF SRDF Symmetrix Remote Data Facility
SRDF Symmetric Remote Data Facility
 from EMC (1) (EMC Corporation, Hopkinton, MA, www.emc.com) The leading supplier of storage products for midrange computers and mainframes. Founded in 1979 by Richard J. Egan and Roger Marino, EMC has developed advanced storage and retrieval technologies for the world's largest companies. , PPRC PPRC Peer-To-Peer Remote Copy
PPRC Pollution Prevention Resource Center
PPRC Physician Payment Review Commission
PPRC Pulp and Paperworker's Resource Council
PPRC Provisioning Preparedness Review Conference
 from IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) , Truecopy from Hitachi Data Systems See HDS. , and ContinuousAccess from HP.

Functionally, controller-based replication applications are identical to their server-based cousins. Each write I/O generated by a business application results in two downstream writes; one write to the primary storage volume and one to a volume defined on a secondary storage array. Controller-based replication products generally insist on like-to-like data copies, which means that the target storage array must be identical to the primary. Although understandable from the perspective of the storage vendor, who has high-performance promises to keep, for IT managers, the single-vendor storage solution invariably in·var·i·a·ble  
adj.
Not changing or subject to change; constant.



in·vari·a·bil
 results in a more costly replication configuration.

As with server-based replication solutions, deployed alongside volume management software from the same vendor, controller-based replication delivers single-vendor support and common look-and-feel benefits. And it is difficult to beat the performance of an application specifically optimized to the hardware platform. For mission-critical applications in high-end enterprise infrastructures, where cost is secondary to performance, controller-based replication is an obvious first choice.

SAN-Based Replication

Storage management applications are gradually finding their way into the intelligent storage networks being rapidly deployed across all levels of the enterprise. Whether running on a dedicated network appliance (1) A specialized device for use on a network. For example, Web servers, cache servers and file servers can be implemented as general-purpose computers with the appropriate software or as network appliances, which are computers dedicated to a single function and cannot do anything  or an intelligent network switch (technology that has been announced but not yet delivered) SAN-based storage management solutions, and data replication in particular, offer tremendous benefits to the storage administrator.

Hosting data replication in the network provides administrators with a huge productivity boost. Rather than manage replication at each application server and storage array, all replication is controlled from a central platform in the network. SAN-based replication is also application server and storage vendor agnostic ag·nos·tic  
n.
1.
a. One who believes that it is impossible to know whether there is a God.

b. One who is skeptical about the existence of God but does not profess true atheism.

2.
, offering significant cost advantages when purchasing and deploying a heterogeneous pool of storage devices.

SAN-based replication solutions are available in two distinct topological to·pol·o·gy  
n. pl. to·pol·o·gies
1. Topographic study of a given place, especially the history of a region as indicated by its topography.

2.
 flavors: in-band (also know as symmetric No difference in opposing modes. It typically refers to speed. For example, in symmetric operations, it takes the same time to compress and encrypt data as it does to decompress and decrypt it. Contrast with asymmetric.

(mathematics) symmetric - 1.
) and out-of-band (also known as asymmetric A difference between two opposing modes. It typically refers to a speed disparity. For example, in asymmetric operations, it takes longer to compress and encrypt data than to decompress and decrypt it. Contrast with symmetric. See asymmetric compression and public key cryptography. ).

In-Band: In-band replication solutions place the hardware running replication logic in the path of application I/O. When a host server initiates I/O, it passes through the replication appliance. Any write activity generates two downstream I/Os--one to the primary storage volume and one to the secondary. In-band replication solutions are non-invasive, having no need for a software footprint on the application server. The configuration does, however, add a "hop" in the data path, with implications for performance and scalability.

Out-of-Band: Out-of-band SAN-based replication requires a small software component, known as an agent, to be installed on the application server. The agent communicates with a meta-data server in the network to obtain replication control information. After determining the replication configuration, the agent issues the appropriate I/O to the primary and secondary storage volumes. Out-of-band replication solves the latency and scalability issues associated with in-band solutions by eliminating the extra "hop" in the application I/O path. However, the agent software consumes processor, memory, and I/O resources at the application server.

Replication Mode

Whether choosing server-based, controller-based, or SAN-based replication, storage managers and application architects must determine what mode of replication will best meet the needs of the application. Replication mode influences the probability of data loss in the event of a failure, and the impact of the replication task on application performance. And because vendors interpret replication mode in slightly different ways, understanding how each product functions--with regard to mode--is essential.

Point-In-Time

Point-in-time replication generates a snapshot image of a storage volume as it existed at the time the replication utility was run. Database agents are available to work with the application to get a good image in preparation for the replication. The snapshot is replicated to a secondary storage device to create a duplicate image of the primary volume at the remote location.

The impact on application performance from a point-in-time copy is negligible. Although a quiesce of application databases may be necessary, I/O activity between the application server and primary storage device is never impeded im·pede  
tr.v. im·ped·ed, im·ped·ing, im·pedes
To retard or obstruct the progress of. See Synonyms at hinder1.



[Latin imped
. However, in an outage event, point-in-time replication has the highest risk for loss of data. If access to the primary volume is interrupted, data written to this volume since the last snapshot copy was transferred to the secondary storage device will be lost. Storage administrators maintain some control over the amount of data loss incurred, and can increase the frequency of snapshots and make certain that snapshots are run at a transaction boundary, ensuring proper data integrity.

Asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end.  

Asynchronous replication significantly reduces the window for data loss. Each write I/O on the primary storage volume is immediately replicated to a secondary volume, creating a close to real-time duplicate copy at the remote site. The primary and secondary write I/Os are completely independent of each other, eliminating the possibility of a performance drag on Verb 1. drag on - last unnecessarily long
drag out

last, endure - persist for a specified period of time; "The bad weather lasted for three days"

2.
 the application. This independence introduces the possibility of lost in-flight I/Os during an outage event. In-flight I/Os (those writes successfully committed to the primary volume but not yet written to the secondary) are unlikely to occur conveniently at a transaction boundary. This means that storage administrators must address data integrity problems before the secondary volume data is available to applications.

Unique to asynchronous mirroring is the challenge of ordered writes--keeping the write I/Os in the order they were sent from the application. To ensure data consistency Data consistency summarizes the validity, accuracy, usability and integrity of related data between applications and across the IT enterprise. This ensures that each user observes a consistent view of the data, including visible changes made by the user's own transactions and  between the local and remote copies, storage administrators can leverage technologies that guarantee ordered writes. As long as the writes are sent in order to the remote location, the data is valid and most transactional applications will recover just as they would with synchronous Refers to events that are synchronized, or coordinated, in time. For example, the interval between transmitting A and B is the same as between B and C, and completing the current operation before the next one is started are considered synchronous operations. Contrast with asynchronous. .

Synchronous

Synchronous replication closes the window on data loss completely. Application write I/Os are simultaneously sent to both primary and secondary storage volumes. The application waits for both writes to be successfully completed before continuing. Should an outage occur, the secondary volume reflects the state of the primary volume at the time of the problem.

Of the available alternatives, synchronous replication has the greatest potential impact on application performance. Excessive latency during primary or secondary I/O will translate directly into longer I/O elapse e·lapse  
intr.v. e·lapsed, e·laps·ing, e·laps·es
To slip by; pass: Weeks elapsed before we could start renovating.

n.
 time for the application. Variations on Replication Mode In a bid to improve performance and reduce the risk of data loss, developers of replication software have subjected the basic modes of replication to numerous twists and tweaks. Two common examples include: buffering writes to the secondary replication volume during asynchronous replication to minimize data loss from in-flight I/O; and limiting in-flight I/O to a single write per application, the aim being to reduce the risk of data loss to a single uncommitted write.

Choosing a Data Replication Solution

Establishing the right combination of replication mode and platform requires a detailed understanding of the business application. A business impact analysis can determine how dependant the organization is on an application and how well the application tolerates data loss. The analysis will provide two key pieces of business continuity information: the recovery time objective (RTO (Recovery Time Objective) The amount of time a computer system or application can stop functioning before it is considered intolerable to the enterprise. It can be computed to be from seconds to days, depending on how critical the application is to the organization. ); and the recovery point objective (RPO RPO Recruitment Process Outsourcing
RPO Recovery Point Objective (disaster recovery)
RPO Royal Philharmonic Orchestra
RPO Rochester Philharmonic Orchestra
RPO Representative Poetry Online
RPO Railway Post Office
). The RTO indicates how quickly an application must be resurrected after a failure in order to avoid business losses, and is important in the design of failover mechanisms. Of more importance to considerations of data replication, the RPO indicates how much data an application can risk losing before the business begins to suffer losses.

Distance, Data Loss, and Application Performance

Choosing between the principal data replication modes--point-in-time, asynchronous, and synchronous--involves a trade-off between risk of data loss and application performance. Guaranteeing zero data loss imposes the greatest potential impact on application performance. Synchronous replication--the only mode that promises to eliminate data loss completely--telegraphs any latency, from either primary or secondary I/O, directly to the application. The result is slower application I/O response time.

Real-world data replication processes frequently require the geographic separation of primary and secondary volumes. As the distance between the replication platform and the secondary storage volume increases, so too does the round-trip latency of the secondary write. And because synchronous replication forces the application to wait for completion of both I/Os before continuing, the speed of the secondary write sets the I/O response time for the entire transaction. Many factors contribute to I/O latency including propagation delay The time it takes to transmit a signal from one place to another. Propagation delay is dependent solely on distance and two thirds the speed of light. Signals going through a wire or fiber generally travel at two thirds the speed of light. Contrast with nodal processing delay.  within the network, the requirements of the network connection protocol (for example, if every send requires acknowledgement, round-trip latency is effectively doubled), and equipment latency (the time it takes for a request to enter and exit each physical device encountered along the path to the remote secondary volume). Each of these factors can be tuned to optimize performance, but the physical distance an I/O must travel presents limits that can he difficult to overcome.

If application performance demands prove incompatible with synchronous replication, some risk of data loss must be anticipated. Asynchronous and point-in-time modes reduce the impact of replication on the application and change the decision making process to one of data timeliness and cost. Asynchronous mode See asynchronous and SCSI asynchronous mode.  generates a more current copy of the primary volume data, but requires considerable and continuous network bandwidth. Point-in-time copy allows flexible scheduling of the data transfer to optimize available network resources, but also introduces the greatest risk of data loss.

Application Server Overhead

Replication mode impacts performance at the microscopic level of individual I/O response times. The Replication platform, on the other hand, has just as much impact on the application at the macroscopic macroscopic /mac·ro·scop·ic/ (mak?ro-skop´ik) gross (2).

mac·ro·scop·ic or mac·ro·scop·i·cal
adj.
1. Large enough to be perceived or examined by the unaided eye.

2.
 level.

Server-based and SAN-based out-of-band replication solutions require software components on the application host that consume processor, memory, and I/O resources. This additional load on the host puts replication in competition with business applications for the limited resources of the server.

Running replication software on the application server also complicates capacity management. An expanding storage pool will increase the amount of server resources needed to manage data replication. Capacity planners must balance the demands of the business application and replication tasks and size the server platform accordingly. Unanticipated growth in demand from either task may require complex and time-consuming upgrades to the server, impacting end user access.

The Cost of Data Replication

Of the many factors that have a bearing on data replication costs, support for heterogeneous storage, licensing fees, and network bandwidth requirements Bandwidth requirements (communications)

The channel bandwidths needed to transmit various types of signals, using various processing schemes. Every signal observed in practice can be expressed as a sum (discrete or over a frequency continuum) of sinusoidal
 are the most significant.

Replication products that take a vendor agnostic approach to storage can significantly reduce costs by allowing less expensive storage hardware, and redeployed legacy devices, to be used as replication targets. Heterogeneous support also gives IT managers, charged with purchasing and deploying storage, greater leverage when negotiating with vendors. Healthy price competition between multiple vendors vying vy·ing  
v.
Present participle of vie.

vying vie
 for an account benefits the customer.

Although controller-based solutions lock IT managers into a single-vendor relationship, replicating between duplicate high-end storage arrays can provide tangential tan·gen·tial   also tan·gen·tal
adj.
1. Of, relating to, or moving along or in the direction of a tangent.

2. Merely touching or slightly connected.

3.
 benefits. Close integration between replication and storage platforms often reduces support costs, and gives IT administrators better access to support personnel. A single-vendor solution also reduces the finger pointing that often happens when problems occur with multi-vendor configurations.

Server-based replication solutions often provide the most economic model for small-scale replication tasks. In fact, replication functionality may already be available, at little or no additional cost, from the vendor providing storage management software for the application server. As the number of servers involved in replication grows, however, costs can quickly escalate es·ca·late  
v. es·ca·lat·ed, es·ca·lat·ing, es·ca·lates

v.tr.
To increase, enlarge, or intensify: escalated the hostilities in the Persian Gulf.

v.intr.
. Server-based replication requires licensing at each server, and often a duplicate server at the receiving end of the replication.

Keep in mind that vendor solutions that support data compression data compression

Process of reducing the amount of data needed for storage or transmission of a given piece of information (text, graphics, video, sound, etc.), typically by use of encoding techniques.
 can significantly offset bandwidth costs. However, even with very high compression ratios compression ratio

Degree to which the fuel mixture in an internal-combustion engine is compressed before ignition. It is defined as the volume of the combustion chamber with the piston farthest out divided by the volume with the piston in the full-compression position (
, there will always be some expense to transmit information across a network. Both synchronous and asynchronous modes require continuous use of network connections, with enough capacity to support peak write rates. Point-in-time replication, however, is less demanding of the network, and consequently less expensive. The transfer of snapshot copies to a remote location can be scheduled and prioritized to optimize available network bandwidth.

Conclusion

High-speed networks are making it possible to host near real-time copies of production data at what would once have been considered unimaginably distant data centers. Wide availability of high-bandwidth networks, at relatively low-cost, has opened up the market for replicated data within IT. Whether used for content distribution, system consolidation, data migration, or disaster recovery, replication technologies are now supplanting sup·plant  
tr.v. sup·plant·ed, sup·plant·ing, sup·plants
1. To usurp the place of, especially through intrigue or underhanded tactics.

2.
 magnetic tape for data movement.

Unlike tape-based data transfer, choosing replication is no longer the end of the decision making process. Myriad choices face the administrator and architect designing an effective data replication configuration. The type of replication (point-in-time, asynchronous, synchronous), platform for replication (server-based, SAN-based, or controller-based), and physical and operational capabilities of the hardware involved must all be carefully correlated to meet the needs of each business application.

In reality, production IT environments support a diverse array of business applications and will require a correspondingly diverse set of data replication solutions. Understanding the needs of each application, and the costs and limitations of each replication approach, is the first step to making an informed decision.

www.cnt.com

Patty Barkley is storage networking market manager at CNT (Minneapolis, MN)
COPYRIGHT 2003 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Backup/Restore
Author:Barkley, Patty
Publication:Computer Technology Review
Date:Sep 1, 2003
Words:2833
Previous Article:The rise of storage process automation.(Backup/Restore)
Next Article:Consolidate storage without losing control.(Security)



Related Articles
EMC launches world's first integrated ATA networked storage solution.
Tape pipeline for BC/DR planning: new considerations in backup and recovery. (Storage Networking).
Disk-based backup: is LAN-based or SAN-based the fairest mirror of them all?(Connectivity)
Automating the full data protection process: Integrity 1.4.5 from Avail Solutions.(Top Technology Showcase)
Rapid restores from data disasters.(Disaster Recovery)(Zetta Server)
Data protection strategies: are they too complex?(Storage Management)
Preparing for disaster with an effective business continuity strategy: overcoming potential dangers to your information infrastructure.(Disaster...
SAN-based data replication.(Storage Management)(Storage area networks)
TCO analysis: where D2D fits--part 2.(Storage Networking)(Total Cost of Ownership)
Intelligent ATA arrives to meet backup and recovery SLAs.(Storage Networking)(service level agreements)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles