Printer Friendly
The Free Library
5,665,456 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Centralized file-cached storage protects against disaster: consolidated data is easier to protect, easier to manage, and much less expensive. But there's a problem.


Here's a recipe for disaster: Take large and growing volumes of critical, file-based data, stir in widely distributed Adj. 1. widely distributed - growing or occurring in many parts of the world; "a cosmopolitan herb"; "cosmopolitan in distribution"
cosmopolitan

bionomics, environmental science, ecology - the branch of biology concerned with the relations between organisms
 networks and users, then add remote offices with poor or nonexistent non·ex·is·tence  
n.
1. The condition of not existing.

2. Something that does not exist.



non
 backup plans and minimal IT resources. The result: Valuable enterprise data is at serious, and continual, risk.

Most businesses would love to consolidate their data into protected data centers--consolidated data is easier to protect, easier to manage, and much less expensive. The problem is that remote users can't access their files over low bandwidth, high-latency wide area networks (WANs). So data and storage remain distributed throughout the enterprise, and protection of that data remains a formidable challenge.

The most commonly employed alternatives (centralized cen·tral·ize  
v. cen·tral·ized, cen·tral·iz·ing, cen·tral·iz·es

v.tr.
1. To draw into or toward a center; consolidate.

2.
 backup, thin clients, data replication, and client-based backups) work to a point, but each has significant drawbacks. New approaches built on file-caching technology are showing great promise in consolidating and protecting remote office data while responding to the user's need for remote file access.

Challenges

Most enterprises are well aware of the value of their companys' data. Government regulations, the threat of data loss, and unmanageable data all add to the pressure of protecting this critical asset.

Government regulations: Regulatory requirements Regulatory requirements are part of the process of drug discovery and drug development. Regulatory requirements describe what is necessary for a new drug to be approved for marketing in any particular country.  are quickly becoming 800-pound gorillas in the corporate world. For example, the Sarbanes-Oxley Act See SOX.  of 2002 includes general business oversight practices around data retention and corporate records. The government is serious about this: just this past July, government regulators invoked Sarbanes-Oxley in a large civil suit against executives at HealthSouth Corp.

Threat of data loss: The terrorist attacks on the Oklahoma City Oklahoma City (1990 pop. 444,719), state capital, and seat of Oklahoma co., central Okla., on the North Canadian River; inc. 1890. The state's largest city, it is an important livestock market, a wholesale, distribution, industrial, and financial center, and a farm  federal building and New York's Twin Towers demonstrated the potential for data loss on a large scale, and natural disasters put many regional offices at risk. Less dramatic attacks from hackers and viruses can be just as costly as physical disasters, and loss from employee actions (mistaken or malicious) is common and widespread.

Unmanageable data: Managing storage challenges even the most sophisticated enterprises. Storage area networks (SANs) enable storage administrators to consolidate block-level data into centrally managed storage arrays. However, 80% of enterprise data (Word, Excel, and PowerPoint documents; e-mail attachments A file that rides along with an e-mail message. The attached file can be of any type. E-mail programs make it easy to attach a file. For example, in Eudora, all you do is select Attach from the Message menu, browse through the folder hierarchy to find the file you want and then double ; graphics applications) is file-based, using file system protocols that resist management and do not operate well over WANs.

In response to these challenges, corporations have carefully protected the data they directly control. They've built and consolidated highly managed data centers with sophisticated data protection and disaster recovery solutions, placing them at corporate and large regional headquarters. These centralized data protection solutions include replication, mirroring, snapshots, tiered storage A data storage system made up of two or more types of storage based on their access speed. For example, magnetic disk and tape or magnetic disk and optical disc are widely used in a tiered storage system. See HSM.  and online backup Using the Web to store copies of data for backup. There are numerous providers on the Internet that charge for storage, and fees are typically based on capacity. Online backup services provide offsite backup, which is essential for disaster recovery. See backup types. .

Although central protection is critical, as much as 60% of a corporation's data resides outside its managed servers on remote networks, desktops, and mobile systems. Not only does a corporation's core data protection scheme core rarely extend to the distant remote offices at the network's edge, but even if it did, it couldn't handle the large volumes of unstructured files. As much as 75% of this "edge data" is unprotected, because it is either ineffectively backed tip or not backed up at all. This is a risky business practice, as edge data can be as critical to the company's survival as its more manageable centralized data.

Some enterprises throw up their collective hands in frustration and do little to protect edge data. Other organizations create corporate backup policies and issue them to all their branch offices, holding each office responsible for its own local backup procedures. The larger offices with IT support personnel may implement workable data protection solutions, but smaller offices without IT staff may make spotty spot·ty  
adj. spot·ti·er, spot·ti·est
1. Lacking consistency; uneven.

2. Having or marked with spots; spotted.



spot
 backups or none at all. Businesses that do take action to protect edge data generally either deploy branch office backup solutions, replicate data to their data centers, or deploy terminal servers to eliminate data stored at the edge. However, all of the solutions available for protecting edge data have significant drawbacks.

Branch office solutions including backup servers, backup software See backup program.

(tool, software) backup software - Software for doing a backup, often included as part of the operating system.

Backup software should provide ways to specify what files get backed up and to where.
, tape drives, and tape media are expensive and not always reliable, and add to the management complexity of a distributed IT infrastructure. The more common replication technologies are temperamental tem·per·a·men·tal  
adj.
1. Relating to or caused by temperament: our temperamental differences.

2. Excessively sensitive or irritable; moody.

3.
 and complex, demand never-ending capacity and bandwidth, and frequently fail over high-latency WANs. Overworked data center staff must also monitor daily replications and backups from dozens of remote offices to make sure they actually complete the backups. And terminal servers, which use the WAN to process and display requests between remote thin clients and centralized application servers, suffer from poor performance and lack scalability.

Taming the Edge with File-Caching Gateways

The most commonly used file system protocols (Windows CIFS (Common Internet File System) The file sharing protocol used in Windows. It evolved out of the SMB (Server Message Block) protocol in DOS, which is why the terms CIFS/SMB and SMB/CIFS are sometimes seen. The word "Internet" in the CIFS name has little relevance.  and Unix NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS.

NFS - Network File System
) were never designed to operate over high-latency or limited bandwidth network links. But if enterprises were able to keep all persistent data Data that exists from session to session. Persistent data are stored in a database on disk or tape. Contrast with transient data. See persistent name.  at the data center and still provide LAN-like access to data fur remote office users, corporations would have the best of both worlds--centralized data protection and management combined with high application service levels for their remote users.

The key is file-caching technology. Throughout most networks, 5% of network files account for more than 50% of I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output.

I/O - Input/Output
 activity. If just these active data sets stay immediately available to users, processing and transport time drop significantly. Since file caching relieves servers of at least half of its I/O hits, it can increase server throughput considerably. File caching can also relieve WANs of large amounts of data traffic, greatly improving traffic movement over low-bandwidth, high-latency WANs.

File caching works by keeping the most active files in local storage. For example, a remote office with 1TB of total data requires only 100GB of cache for its active working data set. By storing just the active working data set in a local cache, users in the remote office can quickly and transparently retrieve their data no matter where it resides--in their office or at corporate headquarters.

A typical deployment would involve a client-side caching appliance A self-contained cache server dedicated to Web caching. See Web cache and NetCache.


NetCache Appliance
A NetCache is an "appliance" because it plugs into the network and performs one function: Web caching.
 installed at each remote office and a server-side caching appliance at the data center. The client-side caching appliance responds to file system requests initiated by local clients and presents a cached view of centralized data. Because heavily accessed data already resides in cache, data requests receive a near-instant response. If the requested data is not in cache, the client-side appliance communicates with a corresponding server-side appliance to obtain the requested data from the centralized server or NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
 head-end in the data center. Using network and protocol optimization techniques, the request is quickly served to the client in the remote office.

By utilizing file caching, local clients can access their files, browse remote directories, and perform file system operations at LAN-like speeds. Global version management and locking mechanisms maintain data coherency co·her·en·cy  
n. pl. co·her·en·cies
Coherence.

Noun 1. coherency - the state of cohering or sticking together
coherence, cohesion, cohesiveness
 and ensure that remote clients retrieve the latest copy of a file from the cache. A central management agent offers remote management capabilities for each file-caching appliance through a Web-based interface. Data center administrators can centrally control and manage storage capacity. They can also optimize system performance by prepositioning files and directories into target caches or an external directory, and by setting policies to manage data transfers.

File-caching appliances such as Actona's ActaStor use caching, compression and network optimization techniques to run standard CIFS and NFS file protocols over low-bandwidth, high-latency WANs. The appliances seamlessly integrate into existing network and storage infrastructures, and require no software to be installed on client machines or file servers. By combining centralized storage with local file services, Actona's file-caching solution enables companies to consolidate servers and storage, and centralize cen·tral·ize  
v. cen·tral·ized, cen·tral·iz·ing, cen·tral·iz·es

v.tr.
1. To draw into or toward a center; consolidate.

2.
 backup and disaster recovery processes, while providing remote users with LAN-like performance.

File caching enables efficient storage consolidation across the WAN, driving storage, management, and backup operations out of remote offices and into the data center. Backup and restore operations are much faster and more reliable in a centralized environment. Centralized file-cached storage is cost-effective because it reduces storage provisioning across the extended enterprise and reduces the costs of remote office IT support. The results include lower management costs, higher resource utilization, more effective backup, and improved disaster recovery, and the most important benefit of all--the confidence that all data will be available whenever and wherever it's needed.

www.actoma.com

John Henze is vice president of marketing at Actona Technologies (Los Gatos Los Gatos (lôs gä`tōs, lŏs, găt`əs), city (1990 pop. 27,357), Santa Clara co., W Calif.; inc. 1887. It is an affluent residential community and health resort. , CA)
COPYRIGHT 2003 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Disaster Recovery
Author:Henze, John
Publication:Computer Technology Review
Date:Sep 1, 2003
Words:1357
Previous Article:Tape backup: the cornerstone of data protection.(Stub files: opening shots in continuing stories ...)
Next Article:Preparing for worst-case scenarios in the data center: lessons learned.(Disasters Recovery)



Related Articles
Virtual Tape Meets The 24-Hour Availability Challenge.(News Briefs)
Storage Management Best Practices.(Industry Trend or Event)
Protecting your digital sources.(Protecting Your Library's Digital Sources: The Essential Guide to Planning and Preservation (Book))(Book Review)
Data protection: recovery with tape.(first in/first out)
The network-centric file management appliance: overcoming the challenges of enterprise file services.(Storage Management)
Fighting SMB and nonprofit storage fires without getting burned.(Business of Technology)(Small and Medium sized Business)
Tape turning: protect against data loss.(first in/first out)
Data storage sticker shock: the need to recalculate data storage TCO.(Business of Technology)(Total Cost of Ownership )
Data protection and disaster recovery of local and remote file servers.(Storage Networking)
Maximizing availability and performance of your enterprise Microsoft Exchange environment with an advanced network-based solution.(Storage Management)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles