Printer Friendly
The Free Library
14,529,525 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Data protection and disaster recovery of local and remote file servers.


Data management in today's enterprises becomes exponentially ex·po·nen·tial  
adj.
1. Of or relating to an exponent.

2. Mathematics
a. Containing, involving, or expressed as an exponent.

b.
 more complex as the organization itself grows. This is especially true for organizations that establish many remote locations because more data is generated and stored remotely, away from central data centers. Lacking IT professionals on site, most remote locations do not have the resources and knowledge required to manage and back up their data. IT managers have struggled with how to ensure that data is properly protected in remote locations as well as how to ensure that it is updated and accessible to the entire enterprise.

Traditionally, this task has been daunting daunt  
tr.v. daunt·ed, daunt·ing, daunts
To abate the courage of; discourage. See Synonyms at dismay.



[Middle English daunten, from Old French danter, from Latin
 and cumbersome because available technology generally fell far short of what was needed for efficient, centralized cen·tral·ize  
v. cen·tral·ized, cen·tral·iz·ing, cen·tral·iz·es

v.tr.
1. To draw into or toward a center; consolidate.

2.
 management of remote data. If an organization couldn't afford a private network between its locations, the time, bandwidth and extra security required for moving data over a public WAN made it an impractical, if not impossible, alternative. Now, however, new technologies are in place that makes this task easy, efficient, accurate and automated.

One aspect of data backup and management that has particularly benefited from the latest technologies is data replication. To this point, "data replication" has most often been used to refer to real-time replication for disaster recovery and business continuity. Real-time, or immediate, replication can be either synchronous (data is written to both storage devices at the same time, and the write operation isn't complete until both devices have completed writing) or asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end.  (data is written to both storage devices at once, although the write operation is considered complete when the primary storage device has completed writing).

[FIGURE 1 OMITTED]

Real-Time, Synchronous Data Synchronous data

Information available at the same time. To test option-pricing models, the price of the option and of the underlying should be synchronous and reflect the same moment in the market.
 Replication

Real-time synchronous replication is the most effective type of replication for high-availability clusters High-availability clusters (also known as HA Clusters or Failover Clusters) are computer clusters that are implemented primarily for the purpose of improving the availability of services which the cluster provides.  and server failover in a LAN (Local Area Network) A communications network that serves users within a confined geographical area. The "clients" are the user's workstations typically running Windows, although Mac and Linux clients are also used.  environment, since the data is completely identical at all times between the primary and secondary storage server. Data is replicated (or mirrored) between a single source storage server and a single target storage server, and often the realtime replication is tied into a cluster application. However, the characteristics of real-time replication that make it great for high-availability applications are the same characteristics that make it a poor choice for remote data protection. There is no scalability with this solution, as synchronous replication is almost always a one-to-one configuration. Real-time synchronous replication works well in high-bandwidth network environments, such as a Fibre Channel SAN or MAN, but when the primary and secondary storage servers are separated by more than a few miles, network latency See latency.  can cause serious performance problems for the storage application. Also, with data being written to both storage servers at the same time, there is no protection against data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors.  or file deletion File deletion is a way of removing a file from a computer's file system.

The reasons for deleting files are
  1. Freeing the disk space
  2. Removing duplicate or unnecessary data to avoid confusion
  3. Making sensitive information unavailable to others
, since the data will be corrupt on both systems at the same time. A traditional backup product is required in addition to synchronous replication to provide file-level data protection.

[ILLUSTRATION OMITTED]

Real-Time, Asynchronous Data Replication

Like its synchronous counterpart, real-time asynchronous replication is used to protect organizations from the loss or unavailability of a primary storage device. However, since the new or changed data does not need to be sent to the secondary device at the same time it is written to the primary storage, asynchronous replication is not bound by the same latency and distance constraints of synchronous replication. This allows for disaster recovery across a WAN, and works well for long distance disaster recovery, although not high-availability failover. Latency and security concerns limit clustered servers from being spread over a WAN, preventing long distance automated failover.

Real-time asynchronous replication can scale to support one-to-many (data distribution) and many-to-one (data consolidation) configurations, but support for remote offices is still problematic due to the 'always on' nature of real-time replication, which consumes network resources on a constant basis. Data corruption remains a problem for real-time asynchronous replication, where corrupt data is immediately replicated to all of the storage servers in the replication configuration.

Point-in-Time, Asynchronous Data Replication

The preferred solution for remote data management is point-in-time, or scheduled, asynchronous replication. Point-in-time, asynchronous replication offers an excellent compromise between system-level and file-level data protection, and provides the flexibility needed for remote office data protection and data management. Replication jobs are run on a scheduled basis, bringing the target storage server up-to-date at preset preset Cardiac pacing A parameter of a pacemaker that is programmed permanently when manufactured  intervals and based on corporate policies that can be tailored to each type of data that needs to be protected.

In this sense, point-in-time replication acts more like a backup than a mirror, except that it stores the copied data in its original format so it can be accessed immediately without the need for recovery and rebuilding. Remote offices any distance away can be protected since point-in-time replication does not suffer from distance limitations, and data consolidation and distribution can be scheduled as needed as needed prn. See prn order.  to most efficiently use limited network resources. (Figure 1.)

Remote Office Data Protection: A Recipe for Success

Effectively protecting remote storage servers and providing disaster recovery requires several elements: centralized management, efficient network utilization, flexible scheduling and job configuration, and robust security. Any replication solution or other data protection product that doesn't provide these features can leave remote data vulnerable to loss, corruption, or just not being available, resulting in lost time, money and productivity.

Centralized Management

Remote offices generally aren't large enough to have dedicated IT resources on site, so data protection needs to be automated and centrally managed. Replicating remote data to a central office storage server for archiving to tape eliminates the need for tape backup Using magnetic tape for storing duplicate copies of hard disk files. Users can add an internal or external tape drive to their desktop computers for backup purposes, and files are typically copied to the tapes using a backup utility that updates on a periodic schedule.  in the remote offices, improves data protection reliability and recoverability, and can save thousands of dollars in hardware and administration costs. Replication software solutions that offer web-based management consoles A terminal or workstation used to monitor and control a network. See Microsoft Management Console.  provide the easiest way to manage and monitor data protection. Since any workstation on the network with a browser can be used to access, manage and monitor replication jobs, local IT professionals can still manage data movement whether they're in the main office, or on rounds to the remote offices. Centralized management also enables IT administrators to automate data movement not only for data consolidation, but also automated data distribution to assure everyone is working from the same data.

Network Optimization

High speed, dedicated network connections between corporate offices are usually only, available between the main data centers of an enterprise since usage and maintenance fees for these lines are quite expensive. Smaller remote offices access the corporate network through low-speed lines, or over the Internet on a secured network pipe, and users and applications compete for network bandwidth. Network-optimized replication uses bandwidth throttling Bandwidth throttling is a method of ensuring a bandwidth intensive device, such as a server, will limit ("throttle") the quantity of data it transmits and/or accepts within a specified period of time. , byte-level differential data replication and data compression data compression

Process of reducing the amount of data needed for storage or transmission of a given piece of information (text, graphics, video, sound, etc.), typically by use of encoding techniques.
 to keep the amount of data transferred over the network to an absolute minimum. With byte-level differential replication only the differences between the original and replicated file need to be transferred, dramatically reducing the traffic on the network. Applying data compression algorithms to condense con·dense  
v. con·densed, con·dens·ing, con·dens·es

v.tr.
1. To reduce the volume or compass of.

2. To make more concise; abridge or shorten.

3. Physics
a.
 that data will continue to shrink the amount of data on the network. Bandwidth throttling allows an administrator to set the maximum percentage of bandwidth that replication will use for data transfer, and if the application supports it, bandwidth throttling can also be set based on the time of day and day of week that the replication takes place.

Flexible Scheduling

Not all data is created equal, so flexibility in scheduling data movement is vital for remote data management and protection. Automated scheduling for data replication lets an IT administrator "set it and forget it", so that once configured, all replication happens automatically without any manual intervention from IT staff. Data that has a very low tolerance for data loss can be kept in sync with up-to-the-minute replication. Often both of these types of data are located on the same storage server, so configuring directory or file-level data movement rather than file-system level replication will serve the needs of all levels of data. If network performance is the most important consideration, data transfers can be scheduled "after hours Adv. 1. after hours - not during regular hours; "he often worked after hours" " to eliminate impact to end users.

Robust Security

With increasing news coverage about security breaches compromising business data, replicating data across private and public networks has to be highly secure. The first line of defense for most businesses is the corporate firewall, which is fine for replicating data within a local LAN, and data replication performance can be improved by not using additional security. But when data moves beyond the firewall to the remote office additional layers of data protection are required. The minimum that most replication solutions offer is data encryption data encryption, the process of scrambling stored or transmitted information so that it is unintelligible until it is unscrambled by the intended recipient. Historically, data encryption has been used primarily to protect diplomatic and military secrets from foreign . Data is encrypted en·crypt  
tr.v. en·crypt·ed, en·crypt·ing, en·crypts
1. To put into code or cipher.

2. Computer Science
 on the server using any one of several encryption The reversible transformation of data from the original (the plaintext) to a difficult-to-interpret format (the ciphertext) as a mechanism for protecting its confidentiality, integrity and sometimes its authenticity. Encryption uses an encryption algorithm and one or more encryption keys.  schemas, and then sent over the network to its destination server, which decrypts the data before writing it to disk. However, encryption alone doesn't guarantee that sensitive data isn't being intercepted by the wrong system. To prevent rogue servers from intercepting data in transit, the servers themselves should be digitally authenticated au·then·ti·cate  
tr.v. au·then·ti·cat·ed, au·then·ti·cat·ing, au·then·ti·cates
To establish the authenticity of; prove genuine: a specialist who authenticated the antique samovar.
 before any data is moved between them. Digital certificates are managed by a central Certificate Authority server, and before any data is moved, the certificates on the source and destination server are validated by the Certificate Authority. Look for replication products that have this functionality built in to provide the additional security without additional management complexity. By adding extra layers of security for remote data protection, data can be moved over inexpensive Internet connections, rather than requiring an expensive private network link, providing a significant cost savings for businesses with many remote offices.

Summary

Point-in-time replication solutions, such as Adaptec Snap Enterprise Data Replicator See port replicator.

replicator - Any construct that acts to produce copies of itself; this could be a living organism, an idea (see meme), a program (see quine, worm, wabbit, fork bomb, and virus), a pattern in a cellular automaton (see life), or (speculatively) a robot or
 (EDR (Enhanced Data Rate) An acronym occasionally used to reflect an advancement in transmission or transfer speed. For example, Bluetooth 2.0 + EDR provides a dramatic increase in speed over the previous version. See Bluetooth. ) software, that encompass all of these features make remote data protection easy, efficient and cost-effective.

Julie Herd Goodman is product manager, NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
 application software, at Adaptec, Inc., Milpitas, CA

www.adaptec.com
COPYRIGHT 2005 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2005, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Storage Networking
Author:Goodman, Julie Herd
Publication:Computer Technology Review
Date:Aug 1, 2005
Words:1604
Previous Article:Is SATA ready for the enterprise?(Connectivity)(Serial Advanced Technology Attachment)
Next Article:Grid storage for grid computing.(Storage Networking)
Topics:



Related Articles
The emergence of e-vaulting: electronic vaulting is a compelling improvement on traditional in-house data backup and recovery functions.
Centralized file-cached storage protects against disaster: consolidated data is easier to protect, easier to manage, and much less expensive. But...
Simplifying disaster recovery solutions to protect your data.(Disaster Recovery)
Addressing the challenges of data protection; key data must be 100% reliable, accessible and up-to-date.(Backup/Restore)
Rapid restores from data disasters.(Disaster Recovery)(Zetta Server)
Online data backup & recovery takes hold: outsourcing better addresses data protection.(Disaster Recovery & Backup/Restore)
Assessing the foundation of long distance disaster recovery.(Regulatory Compliance)
Data protection: recovery with tape.(first in/first out)
Is SMB disaster recovery really within reach?(Disaster Recovery & Backup/Restore)(small and midsize businesses)
Will continuous data protection make tape-based backups obsolete?

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles