Printer Friendly
The Free Library
5,670,445 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Managing distributed data in the enterprise.


As mission-critical data spreads across businesses, managing and protecting remote data, i.e. everything stored outside the data center, poses unique challenges. Remote data can exist on a variety of network and computing platforms See platform. , can be located at sites without IT staff, and its protection can depend on nontechnical employees who need to remember to perform regular backups. Traditional remote data management is high cost, unreliable, and manually intensive in the face of these challenges. This article examines how to choose the best remote data solution for your organization.

Key Considerations for Managing Remote Data

Effective remote data management requires consideration of a variety of factors, including:

Central Policy-Based Control:

Efficient control requires the ability to implement central policies, set a rule once and have it work company-wide, rather than managing activities individually at different sites. However, many products claiming "central control," in fact, require Administrators to set policy through a unique connection to each remote node A terminal or computer located apart from the main network. It may refer to a branch office or a traveling user with a laptop. Access to the company LAN is typically made via an Internet or dial-up connection. See remote control software.  whenever business requirements change.

Wide Area Network (WAN) Network Bandwidth Utilization:

Because they move data between locations, remote data solutions must accommodate bandwidth restrictions. They should have features that enable efficient bandwidth use, such as byte-level data transfer, bandwidth throttling Bandwidth throttling is a method of ensuring a bandwidth intensive device, such as a server, will limit ("throttle") the quantity of data it transmits and/or accepts within a specified period of time. , multi-streaming, and compression. Also, the less overhead information a product adds to transmitted data, the better. Since remote connections may become impaired, companies need the ability to restart To resume computer operation after a planned or unplanned termination. See boot, warm boot and checkpoint/restart.  at the point of failure and re-route information flow to alternate networks.

Security and Data Integrity:

Effective solutions should authenticate (1) To verify (guarantee) the identity of a person or company. To ensure that the individual or organization is really who it says it is. See authentication and digital certificate.

(2) To verify (guarantee) that data has not been altered.
 all sending and receiving nodes prior to data transfer, encrypt See encryption.  data during transmission, use a single firewall port, and minimize firewall rules. 100% data integrity is also essential, especially for remote backup using tape; corrupted cor·rupt  
adj.
1. Marked by immorality and perversion; depraved.

2. Venal; dishonest: a corrupt mayor.

3.
 data on the tape is one of the primary causes of recovery failure.

Remote Process Automation and Application Interfacing:

Automating processes and remote application interface can eliminate manual intervention A procedure used in a lawsuit by which the court allows a third person who was not originally a party to the suit to become a party, by joining with either the plaintiff or the defendant.  at remote locations. The chosen solution must integrate with applications that have a native backup package and ensure integrity by accessing data through applications instead of at the database, filesystem, or disk levels. Additionally, remote solutions should automate To turn a set of manual steps into an operation that goes by itself. See automation.  any custom or script-based processes needed prior, during or after data transmission.

Heterogeneous System heterogeneous system
n.
A chemical system that contains various distinct and mechanically separable parts or phases, such as a suspension.
 Support:

Since most companies have a variety of computing platforms and applications, it is important to choose a solution that supports a heterogeneous environment Using hardware and system software from different vendors. Organizations often use computers, operating systems and databases from a variety of vendors. Contrast with homogeneous environment. .

Additional Requirements for Remote Data Backup

Remote office backup requires more than just writing data to tape. It must address data integrity and accuracy, automatic operation, offsite storage, and of course, restoration. The best solutions eliminate manual effort at remote sites.

Since restoration quality is based on the integrity of the original data, it is important to preserve integrity when files are stored for backup. To ensure this, the disk-backup mechanism must have options (skip, open file transfer, or create error log) for handling open files.

The Case for Archiving

An often-overlooked, but critical component of remote data management and protection is archiving. User files and emails are seldom re-opened after the first three days of creation/receipt. Statistics show that files that haven't been accessed for 90 days have a 90%+ probability of never being accessed again. But, since its impossible to predict what 10% will be needed, companies need to keep it all, consuming valuable storage resources. The cost-effective method is to move unused data to lower-cost secondary storage (archive), with easy retrieval capabilities, for long-term retention. Archiving also helps businesses ensure compliance to federal document regulations. Similar to remote backup, data integrity is essential to any archival solution.

The Central Policy/Consolidated Approach to Managing Remote Data

The most effective solution for remote data is allowing central IT staff to control remote data management and backup. The staff should be able to set data policies, automate processes to execute those policies on remote servers, and move data between remote servers and central systems. Centrally controlled automated au·to·mate  
v. au·to·mat·ed, au·to·mat·ing, au·to·mates

v.tr.
1. To convert to automatic operation: automate a factory.

2.
 processes can decrease backup costs by as much as 75%.

Individual remote backup and archiving processes are replaced with a consolidated process that moves remote data to a hub site for backup or archive. This requires moving the data over available networks in an efficient, secure, timely fashion and requires technology that can handle the issues associated with controlling and moving data among sites and network connections.

Disk-to-Disk Consolidated Backup

The use of disk-to-disk backup is increasing due to the falling cost of disk storage, the elimination of physical limits, relative unreliability of manual tape storage programs, and the need for ready access to data.

A best practices model of disk-to-disk backup for remote data moves the data to be backed up over a network to a different location. Disk-to-disk backup performed at the same site does not provide appropriate protection for site disaster events (fire, flood, etc.). For businesses with multiple locations, consolidating disk-to-disk backup delivers operational and cost efficiencies as well as enhanced data security.

It's common for remote data to be periodically analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 to determine what data has changed since the last backup. A copy of this changed data is moved to a central site to be stored on disk. Some technologies can minimize the size of data transfer by discerning dis·cern·ing  
adj.
Exhibiting keen insight and good judgment; perceptive.



dis·cerning·ly adv.
 and moving just the byte-level file modifications. It's best to store data at a central site as incremental Additional or increased growth, bulk, quantity, number, or value; enlarged.

Incremental cost is additional or increased cost of an item or service apart from its actual cost.
 data packets are reconstructed re·con·struct  
tr.v. re·con·struct·ed, re·con·struct·ing, re·con·structs
1. To construct again; rebuild.

2.
 to provide full, up-to-date copies of files which are instantly accessible if a remote file is lost or destroyed.

The second approach enables output from backups performed on remote servers to be stored on local disk. The resulting backup image is transferred to disk at the central site. This works well for applications with native backup or snapshot (1) A saved copy of memory including the contents of all memory bytes, hardware registers and status indicators. It is periodically taken in order to restore the system in the event of failure.

(2) A saved copy of a file before it is updated.
 features that can be utilized in the consolidated backup process.

These approaches can also be used together. For example, backing up user files may be best performed with changed data transfer, while backing up Microsoft Exchange Messaging and groupware software for Windows from Microsoft. Exchange Server is an Internet-compliant e-mail system that runs under Windows NT/2000 and Windows Server 2003. It can be accessed by Web browsers, the Exchange client, versions of Outlook and the earlier Windows Inbox.  data may be best performed using the consolidated backup image approach.

In either approach, the backup data on disk at the central location can be sent to tape if desired.

Consolidated Archive

Consolidated archive involves archiving data that meets corporate archival policy from remote systems to a central disk. Archival policy determines what data is to be archived, and when, as well as parameters, including last date accessed, file type, content, location, ownership, and file size. Another essential element is a mechanism by which data can be retrieved from the archive by end users, preferably pref·er·a·ble  
adj.
More desirable or worthy than another; preferred: Coffee is preferable to tea, I think.



pref
 without involving IT.

Central policy-based processes, such as consolidated backup and archive, can significantly lower costs, eliminate risk, improve data consistency Data consistency summarizes the validity, accuracy, usability and integrity of related data between applications and across the IT enterprise. This ensures that each user observes a consistent view of the data, including visible changes made by the user's own transactions and , and ensure better backup/retention policy compliance.

A Best Practices Guide to Managing Remote Data

Best practices for managing and protecting remote data involve both understanding and implementing technology that supports the automated processes. There are five primary steps toward implementing a company-wide solution:

* Identify and understand remote data and the network environment

* Select a remote data management solution

* Create a remote data management policy

* Implement the policy through centrally controlled automated processes

* Monitor and adjust as business conditions change

The questions below will help companies implement the first two steps of this process.

Assess the current system:

* Which data transfer applications are used? (e.g. FTP FTP
 in full file transfer protocol

Internet protocol that allows a computer to send files to or receive files from another computer. Like many Internet resources, FTP works by means of a client-server architecture; the user runs client software to connect to
, xcopy, robocopy, DFS/FRS, public folder In a graphical user interface (GUI), a simulated file folder that holds data, applications and other folders. Folders were introduced on the Xerox Star, then popularized on the Macintosh and later adapted to Windows and Unix. In Unix and Linux, as well as DOS and Windows 3.  replication In database management, the ability to keep distributed databases synchronized by routinely copying the entire database or subsets of the database to other servers in the network.

There are various replication methods.
)

* What level of manual intervention is required?

* What are the failure rates and costs?

* How easily does it adjust to new business requirements?

Determine data movement goals:

Knowing the company's data movement goals helps in developing cost-recovery models that justify necessary purchases. Examples include:

* Reduce backup failure rates/increase data protection

* Reduce meantime to remote office restore

* Automate data transfers to and from remote sites

Determine the types of data that need to be moved:

* How much data is stored remotely?

* What are its characteristics?

* Which characteristics must be maintained?

* What applications are running at the remote locations?

* Should the data transfer system integrate with particular applications in 'real time'?

* What data are users not currently backing up?

* What types of data need to be sent to the remote office? How compressible com·press·i·ble  
adj.
That can be compressed: compressible packing materials; a compressible box.



com·press
 is it?

Determine the data movement volume:

* What is the rate of data change on a day-to-day basis?

* How many sites need to be supported?

Assess your current network:

* What bandwidth is available to each location?

* How much bandwidth is required by the other applications currently using this bandwidth?

* Can traffic be segregated using quality of service (QoS) applications?

* Is the network traffic prone to bursts?

* How secure is the network?

Choose your solution:

Evaluate vendors and solutions against required remote data capabilities:

* Can the solution solve multiple remote data problems, such as backup, archive and distribution?

* Does the vendor have expertise with remote data application and integration? Can they help assess specific company requirements, data change and growth rates Growth Rates

The compounded annualized rate of growth of a company's revenues, earnings, dividends, or other figures.

Notes:
Remember, historically high growth rates don't always mean a high rate of growth looking into the future.
?

Summary

Managing remote data effectively requires dealing with network and platform variability, security, and data integrity, and implementing process automation. While many vendors offer remote data solutions, companies need to choose a solution that fits the needs identified in the best practices section of this article. With the latest remote data management and movement technologies, companies can cost-effectively solve the challenges of managing remote data with a single unified approach.

www.adaptec.com

By Adaptec
COPYRIGHT 2006 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2006, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Disaster Recovery & Backup/Restore
Publication:Computer Technology Review
Date:Sep 1, 2006
Words:1531
Previous Article:Playing Russian roulette with your business data: the importance of disaster recovery planning.(Disaster Recovery & Backup/Restore)
Next Article:Traditional backup software is no match for Exchange.(Disaster Recovery & Backup/Restore)



Related Articles
Storage Management Best Practices.(Industry Trend or Event)
INNOVATION Data Processing announces S/390 Linux rescuer.(disaster recovery software)
Simplifying storage: how companies benefit with a backup appliance approach. (SAN).
Tape pipeline for BC/DR planning: new considerations in backup and recovery. (Storage Networking).
Case study: ATPCO raises data protection to new heights with IBM Tivoli software.(Special feature: IBM 10th anniversary)
TCO analysis: where D2D fits--part 2.(Storage Networking)(Total Cost of Ownership)
TCO should include value as well as cost.(TCO: Disk Arrays)
Overcoming recovery barriers: rapid and reliable system and data recovery.(Data Protection)
Data storage sticker shock: the need to recalculate data storage TCO.(Business of Technology)(Total Cost of Ownership )
Personal disaster recovery software: an essential part of business disaster recovery plans.(Special Section)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles