Printer Friendly
The Free Library
14,537,061 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Addressing the challenges of data protection; key data must be 100% reliable, accessible and up-to-date.


Data loss, whether due to human or equipment error, or due to natural or artificial disaster, has business implications of enormous proportions. The need for key data to be 100% reliable, always accessible and fully up-to-date is clear for a growing number of enterprises. These conditions must be met at a cost that is affordable and without in any way hampering the operation of critical business applications.

Companies need to protect mission-critical data from unplanned events, such as a power outage Noun 1. power outage - equipment failure resulting when the supply of power fails; "the ice storm caused a power outage"
power failure

equipment failure, breakdown - a cessation of normal operation; "there was a power breakdown"
, natural disasters, etc., which could interrupt A signal that gets the attention of the CPU and is usually generated when I/O is required. For example, hardware interrupts are generated when a key is pressed or when the mouse is moved. Software interrupts are generated by a program requiring disk input or output.  business operations Business operations are those activities involved in the running of a business for the purpose of producing value for the stakeholders. Compare business processes. The outcome of business operations is the harvesting of value from assets . Without an adequate disaster recovery solution in place, lost data and prolonged pro·long  
tr.v. pro·longed, pro·long·ing, pro·longs
1. To lengthen in duration; protract.

2. To lengthen in extent.
 downtime The time during which a computer is not functioning due to hardware, operating system or application program failure.  could result in a loss of massive amounts of revenue and productivity, as well as customer trust and brand equity, which take years to build but just hours to destroy. With hourly downtime costs over $6M for some organizations (Gartner Study), the need for an effective disaster recovery solution is high-up on the strategic agendas of the CIOs of leading international companies.

Data drives much of the value created by the enterprises today and there are a number of reasons why enterprises worry about data protection in event of a disaster. Loss of critical data could have consequences such as:

* Bankruptcy bankruptcy, in law, settlement of the liabilities of a person or organization wholly or partially unable to meet financial obligations. The purposes are to distribute, through a court-appointed receiver, the bankrupt's assets equitably among creditors and, in most  

* Loss of revenue

* Loss in productivity

* Loss of customers

* Decreased customer loyalty

* Non-compliance of regulatory requirements Regulatory requirements are part of the process of drug discovery and drug development. Regulatory requirements describe what is necessary for a new drug to be approved for marketing in any particular country.  

* Legal implications

* Missing key audit requirements

* Loss of competitive advantage

* Bad public image

Data Protection Challenges

Protecting high-value data and delivering 24x7 data protection and business continuity is of paramount importance to organizations throughout the world. Unfortunately, those organizations that have embarked on this mission have found considerable challenges along the way: from the many infrastructure challenges associated with managing heterogeneous Not the same. Contrast with homogeneous.

heterogeneous - Composed of unrelated parts, different in kind.

Often used in the context of distributed systems that may be running different operating systems or network protocols (a heterogeneous network).
 platforms, applications and data, to the challenges presented by the limiting and costly technology options available today.

Infrastructure Challenges

IT infrastructures usually include a myriad of server, storage, and application platforms. In addition, data and applications often span across distributed or clustered servers and storage. Supporting and protecting these heterogeneous platforms is a complex issue. Furthermore, as not all data is of equal value to an organization, and as the value of data can change, determining how to most effectively protect this data is an ongoing problem.

Managing an end-to-end DR solution across an enterprise is currently an extremely complex challenge. Different storage platforms offer proprietary DR solutions, each with its own management challenges. Host-based solutions can impact server performance and require another layer of data management. In addition, many DR solutions today also require additional infrastructure (like protocol converters (networking) protocol converter - A device or program to translate between different protocols which serve similar functions (e.g. TCP and TP4).

Some call this a "gateway", though others use that term for other kinds of internetworking device.
) that in turn add yet another layer of complexity. And of course, organizations must deliver DR solutions without impacting the performance of key applications.

These many infrastructure challenges result in costly implementations that often do not address the complete DR needs of an organization.

Data Replication In database management, the ability to keep distributed databases synchronized by routinely copying the entire database or subsets of the database to other servers in the network.

There are various replication methods.
 Challenges

Enterprises need a disaster recovery solution that delivers a reliable up-to-date remote copy of its mission critical data but will not result in performance degradation DEGRADATION, punishment, ecclesiastical law. A censure by which a clergy man is deprived of his holy orders, which he had as a priest or deacon.  of the applications; it must be cost-effective and therefore must use minimal extra storage (an original and one copy should be enough), and must support the organization's specific (and dynamic) availability requirements.

Data replication methods, from synchronous Refers to events that are synchronized, or coordinated, in time. For example, the interval between transmitting A and B is the same as between B and C, and completing the current operation before the next one is started are considered synchronous operations. Contrast with asynchronous.  to asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end.  to point in time, have evolved over the years, in an attempt to address these dynamic needs of enterprises. Unfortunately, whereas each method offers advantages over the others, significant disadvantages are also present in all.

Synchronous replication addresses the very fundamental requirement for any effective disaster recovery solution of having an up-to-date remote copy of the data. With this replication method, every write transaction must be acknowledged from the remote site. This method ensures that an up to date copy of the primary site is maintained at a secondary site and that if a disaster occurs in the primary site, the secondary site will be consistent with the primary site. This works well for replication within a local SAN environment; however, extending this approach to transfer data over the WAN results in significant latency (1) The time between initiating a request in the computer and receiving the answer. Data latency may refer to the time between a query and the results arriving at the screen or the time between initiating a transaction that modifies one or more databases and its completion.  problems, high bandwidth costs and a dramatic degradation in the performance of critical business applications. This can have a highly disruptive disruptive /dis·rup·tive/ (-tiv)
1. bursting apart; rending.

2. causing confusion or disorder.
 effect on business operations.

[FIGURE 1 OMITTED]

With asynchronous replication every write transaction is acknowledged locally and then added to a queue Pronounced "Q." A temporary holding place for data. See queuing, message queue and print queue.

(programming) queue - A first-in first-out data structure used to sequence objects. Objects are added to the tail of the queue ("enqueued") and taken off the head ("dequeued").
 of writes waiting to be sent to the remote site. Although asynchronous replication does not reduce the bandwidth requirements Bandwidth requirements (communications)

The channel bandwidths needed to transmit various types of signals, using various processing schemes. Every signal observed in practice can be expressed as a sum (discrete or over a frequency continuum) of sinusoidal
 associated with synchronous replication, it does reduce the latency problems. Unfortunately, however, for "write intensive" applications performance will eventually deteriorate de·te·ri·o·rate
v.
1. To grow worse in function or condition.

2. To weaken or disintegrate.
 to that of synchronous replication.

Furthermore, with asynchronous replication, the copy at the secondary site is not necessarily up to date; as a result, in most disaster scenarios data will be lost. Another key drawback DRAWBACK, com. law. An allowance made by the government to merchants on the reexportation of certain imported goods liable to duties, which, in some cases, consists of the whole; in others, of a part of the duties which had been paid upon the importation.  of asynchronous replication is data inconsistency in·con·sis·ten·cy  
n. pl. in·con·sis·ten·cies
1. The state or quality of being inconsistent.

2. Something inconsistent: many inconsistencies in your proposal.
: in certain situations, even the most advanced solutions currently available are unable to maintain "write order fidelity" at the remote site and in the event of a disaster, no consistent copy will exist. Additionally, existing asynchronous solutions do not scale well and are either limited to one storage subsystem The part of a computer system that provides the storage. It includes the controller and disk drives. See storage system.  or one server.

With both synchronous and asynchronous replication, all modified data is transferred to the remote location. As a result, resource requirements The components of a system that are required by software or hardware. It refers to resources that have finite limits such as memory and disk. In a PC, it may also refer to the resources required to install a new peripheral device, namely IRQs, DMA channels, I/O addresses and memory , including storage and bandwidth, are high and costly. With Snapshot (1) A saved copy of memory including the contents of all memory bytes, hardware registers and status indicators. It is periodically taken in order to restore the system in the event of failure.

(2) A saved copy of a file before it is updated.
 replication, a consistent image of the changes made to the primary site (since the previous snapshot) is periodically transferred to the remote site, thus reducing the amount of transferred data. The advantages of this approach include lower bandwidth costs and minimal application degradation. However, in practice, existing solutions can be prohibitively pro·hib·i·tive   also pro·hib·i·to·ry
adj.
1. Prohibiting; forbidding: took prohibitive measures.

2.
 expensive due to the cost of excessive storage requirements--for instance, EMC's "Multi-hop" solution needs 4-5 times the primary site storage in order to create a single copy of the data at the remote site. In addition, snapshot replication provides limited protection in the event of a disaster; since the snapshot at the remote site will not be up to date, there could be significant data loss. Furthermore, existing solutions remain bandwidth intensive as they transfer data in an unreasonably large granularity The degree of modularity of a system. More granularity implies more flexibility in customizing a system, because there are more, smaller increments (granules) from which to choose. .

Enterprises are thus challenged with the fact that although each replication method addresses important issues, none of them is ideal for the dynamic requirements of the organization. It is clear that what is needed is a replication methodology that encompasses the advantages of the above methods but that eliminates the disadvantages; a replication methodology that can intelligently and dynamically select and utilize a replication method based on customer provided policies and on the point in time availability of network resources.

Existing Disaster Recovery Solutions

Commonly used solutions such as off-site back up tapes do not provide up to date protection of data nor do they enable rapid recovery. The need to use communication lines for hot replication to a remote disaster recovery site is thus clear.

Current solutions, including volume mirroring, host-based replication, storage-based replication and database replication, are either limited in functionality, only work with a selected platforms, are expensive to implement, or all of the above.

One company, San Jose-based Kashya, has developed a new technology that moves the intelligence for data protection into the network (both SAN and LAN (Local Area Network) A communications network that serves users within a confined geographical area. The "clients" are the user's workstations typically running Windows, although Mac and Linux clients are also used. ) and provides an intelligent universal solution for data protection for all of the storage and servers on the network.

Technology Options

There are several technology options available that offer some form of disaster recovery, and as in the case of the replication methods discussed above, each one was designed to address the deficiencies of the other.

Volume Mirroring creates an exact mirror of the original data and therefore demands extremely short distances. In addition, in order to reduce application degradation, it requires an extremely high-speed connection, resulting in high network costs. Furthermore, of course, it does not provide the distance required for effective disaster recovery.

In Host-Based Replication, the distance between the sites can be extended dramatically. However, since the replication software resides in each server, it takes valuable host cycles away from the application, possibly degrading TO DEGRADE, DEGRADING. To, sink or lower a person in the estimation of the public.
     2. As a man's character is of great importance to him, and it is his interest to retain the good opinion of all mankind, when he is a witness, he cannot be compelled to disclose
 application performance. This solution often requires a significant WAN bandwidth and also has a major impact on the local application performance. Furthermore, installation and setup See BIOS setup and install program.  of the replication software in each and every server can easily become a cumbersome cum·ber·some  
adj.
1. Difficult to handle because of weight or bulk. See Synonyms at heavy.

2. Troublesome or onerous.



cum
 and costly endeavor.

Storage-Based Replication offers a host-independent solution, offloading the host from replication responsibilities. Many storage vendors offer their own proprietary solution and therefore only support that specific storage platform. This limitation results in undesired management complexity and cost.

Data Base Replication is offered by many data base vendors as a way to protect data within its control. As a result, only a portion of an organization's data can be protected in this fashion and customers must use additional technologies to cover all other data types.

Which technology to use in order to deliver reliable disaster recovery is therefore a difficult challenge. With currently available options the decision always results in costly implementations that are not optimized to the organization's dynamic needs. Clearly what is needed is a next generation replication technology that delivers maximum data protection with no data loss, that is host and storage platform independent, that can understand data base linkages and dependencies, that will not increase management complexity, and that will adapt to an organization's changing needs--in a cost-effective manner.

The Reality of Data Protection Without Limits

Next-Generation Architecture A next generation networked based data protection platform architecture is based on an intelligent "Data Protection Appliance" that connects to the SAN and IP infrastructure and provides data protection for all the storage and servers attached to the network.

A next generation data protection appliance such as Kashya's achieves no data loss by making an "up to date" copy of the data available at the remote site while combining this with very short recovery time in the event of a disaster. The solution intelligently recognizes the differences between the local and WAN environments and utilizes unique algorithmic-based technologies to combine the best features of each of the three existing replication approaches, while avoiding their disadvantages. It achieves this by dynamically adapting the replication approach to changes in traffic conditions due to the output load from the host application and as data is transferred from the local environment to the WAN.

A powerful, highly differentiating feature of the new generation approach is its ability to establish flexible replication policies based not just on the widely used technical parameters (e.g., maximum "write lag limit" between the primary and remote sites) adopted by other replication solutions but on criteria directly linked to business performance. For instance, the frequency with which data from a specific application is replicated can be set to reflect the relative business risk and cost to the company of lost data and/or application downtime when compared to data generated by other applications.

In the event of a disaster in which the primary storage system is temporarily disabled, a data replication appliance ensures rapid recovery with full data consistency Data consistency summarizes the validity, accuracy, usability and integrity of related data between applications and across the IT enterprise. This ensures that each user observes a consistent view of the data, including visible changes made by the user's own transactions and  and no data loss. Effectively, Kashya's solution thus achieves the business continuity of a synchronous solution while, at the same time, it minimizes application degradation and the bandwidth and storage costs associated with any one individual replication approach. In addition, "Multiple Snapshot" techniques enable users to "roll back" to a snapshot of the data as at various points prior to the time of the disaster as an added precaution against the risk of data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors. .

A replication solution should support multiple host/multiple storage system environments and integrate fully with all existing local replication and management solutions, thus allowing companies to leverage their existing storage infrastructure.

Other Features to Look for in Data Replication

Universal Data Protection: Data protection for all open server and storage platforms on the network. It must remove the SAN distance limitation allowing DR site to be far apart. It should offer all possible replication policies in a single system (Snapshot, Asynchronous, and Synchronous Replication) without the need of edge connect or WDM (1) (Wavelength Division Multiplexing) A technology that uses multiple lasers and transmits several wavelengths of light (lambdas) simultaneously over a single optical fiber.  devices using the standard shared or dedicated IP infrastructure.

Autonomous Management: Make sure your replication system is capable of adjusting to WAN bandwidth and/or application demand changes dynamically while enforcing the established policies for specific applications. The user should be capable of establishing different policies for each Volume Group and to enforce the established policies automatically. The system allows different sets of policies dictated dic·tate  
v. dic·tat·ed, dic·tat·ing, dic·tates

v.tr.
1. To say or read aloud to be recorded or written by another: dictate a letter.

2.
a.
 by the business need and criticality of the application and data enabling multiple service levels through the same infrastructure. Kashya is capable of recovering from disasters or site or storage system failures through a rapid resynchronization feature minimizing down time.

Application Aware Compression: Look for unique agent technology that supports typical applications such as Oracle databases and, by detecting the nature of the application, is capable of optimizing the replication techniques while maintaining always-consistent remote copy.

Delta Differentials: The system should maintain the write order fidelity and tracks and transmits only the changed bytes as opposed to writing the complete block of data multiple times. This saves bandwidth and improves performance.

Hot Spot Compression: Operating in the Snapshot mode the system should track multiple WR requests against the same data blocks and only transmit the last WR while maintaining the consistency of the remote copy at all times.

Algorithmic Compression: Kashya has developed a proprietary data compression data compression

Process of reducing the amount of data needed for storage or transmission of a given piece of information (text, graphics, video, sound, etc.), typically by use of encoding techniques.
 algorithm that achieves superior compression with reduced CPU CPU
 in full central processing unit

Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit.
 requirements.

The above methods deliver unprecedented compression. Compression beyond 25x is possible depending on the application access patterns.

Organizations embarking on the implementation of an enterprise-class data protection plan are faced with difficult challenges that result from the limiting technologies available to them today. Developed from the ground up with a completely new approach, Kashya's unique technology is designed to deliver a new level of data protection at a fraction of the cost of existing solutions.

www.kashya.com

Mehran Hadipour is vice president, product marketing, at Kashya (San Jose San Jose, city, United States
San Jose (sănəzā`, săn hōzā`), city (1990 pop. 782,248), seat of Santa Clara co., W central Calif.; founded 1777, inc. 1850.
, CA)
COPYRIGHT 2004 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Backup/Restore
Author:Hadipour, Mehran
Publication:Computer Technology Review
Geographic Code:1USA
Date:Jan 1, 2004
Words:2302
Previous Article:Solid-state disks: moving from luxury to necessity.(Backup/Restore)
Next Article:Want to stop spam? Multiple techniques in unison is the answer.(Internet)
Topics:



Related Articles
Recovery for the rest of us: small and mid-size businesses need to be as diligent as large enterprises in implementing an emergency preparedness...
Backup storage appliances come of age.(Security)
Data grid disaster puts SMBs at most risk.(Disaster Recovery)(small to medium size enterprises)
Data protection: the #1 storage priority; There's no ILM process without it.(Data Protection)(Information Lifecycle Management)
Overcoming recovery barriers: rapid and reliable system and data recovery.(Data Protection)
Understanding the new generation of data protection solutions.(Disaster Recovery & Backup/Restore)
Beyond the data protection dilemma.(Disaster Recovery & Backup/Restore)
Personal disaster recovery software: an essential part of business disaster recovery plans.(Special Section)
Security and compliance: danger lurks for stored data.
Building practical data protection strategies.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles