Printer Friendly
The Free Library
14,530,480 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Rapid restores from data disasters.


As an IT administrator, it is with a sense of utter panic you realize that the machine with critical data is unusable for a variety of reasons. What's more, you realize backing up your data was the easy part--restoring it under the gun is time consuming and tedious. In many instances, it can take a whole day to recover your data partially, and restore your server.

This article will examine the most frequent cause of data disasters like hardware failure, application corruption, and site disasters. It will also describe an architecture that is equally effective for protecting transactional applications--such as databases, mail, and files--using Zetta Server, a 64-bit operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 that can be installed on standard Intel servers.

Appliances based on Zetta Server provide advanced data protection, high-availability, and rapid restores for Windows, Unix, and MAC OSX See Mac OS X.  platforms--all managed from a single unified Web interface. The secret sauce is a combination of redundant hardware, unified file, block technology, sophisticated application aware (e.g., Exchange, SQL Server An earlier relational DBMS from Sybase and from Microsoft. Sybase introduced SQL Server in 1988 for various Unix versions. In that same year, with help from IBM, Sybase created an OS/2 version that Microsoft licensed and branded as Microsoft SQL Server. ) snapshot technology, as well as advanced replication that allows you to replicate both file and block data to a remote facility.

[GRAPHIC OMITTED]

Data Disasters

There are four major causes of data disasters:

* Hardware failure

* Admin/user error

* Application corruption

* Site disasters

Hardware Failure: Hardware failures can be mitigated by using redundant power supplies, RAID, mirrored memory, and an Active/Passive failover system.

Admin/User Error: Administrator error can arise in different ways. First, the traditional backup process is largely untested for restores. Administrator errors often show up only in the event of emergencies. There is no easy way to verify that backups have been successfully executed.

Application Corruption: Applications such as Microsoft Exchange Messaging and groupware software for Windows from Microsoft. Exchange Server is an Internet-compliant e-mail system that runs under Windows NT/2000 and Windows Server 2003. It can be accessed by Web browsers, the Exchange client, versions of Outlook and the earlier Windows Inbox. . SQL Server, and Oracle sometimes corrupt data. Almost every system or database administrator with any level of experience has spent hours wrestling with a corrupt database or a corrupt information store. This is difficult to deal with because backups themselves may be corrupt and there is virtually no way to deal with it other than to extract backup after backup from tape until the application recognizes a consistent backup. Application corruption can also cause significant data loss.

Site Disasters: Site disasters can be prompted by earthquakes, fires, floods, terrorism and sometimes as simply as the failure of air conditioning air conditioning, mechanical process for controlling the humidity, temperature, cleanliness, and circulation of air in buildings and rooms. Indoor air is conditioned and regulated to maintain the temperature-humidity ratio that is most comfortable and healthful.  in the server room. A site disaster is often disastrous and less than 40% of companies recover successfully from one. Even companies that use offsite backup have to deal with data loss, because backups are typically over a week old. Recreating servers from backups is a monumental task, especially if it involves databases.

Data Protection: It's All About Rapid Restores

Traditionally, backup has been one of the chores relegated to the junior system administrator. Most companies religiously back up to tape or tape libraries--a full backup See backup types.  once a week and incremental backups See backup types.

(operating system) incremental backup - A kind of backup that copies all files which have changed since the date of the previous backup. The first backup of a file system should include all files - a "full backup". Call this level 0.
 every day. In the event of a disaster (especially with data continuously generated by applications such as Exchange, SQL Server, and Oracle) current backup techniques are woefully woe·ful also wo·ful  
adj.
1. Affected by or full of woe; mournful.

2. Causing or involving woe.

3. Deplorably bad or wretched:
 inadequate for the following reasons.

First, a backup restored from tape is likely to be at least 24 hours old. Second, it is quite likely that the most recent backup is not usable. Third, it is not possible to determine whether Exchange data or databases are corrupt until the backup is restored and working correctly with the application. Most often, the application server is likely down for the better part of a day, while back-up administrators are figuring out which backup to use. After going through this painful process, in many cases, the backup that is successfully restored is often a week old.

Disk-Based Backup

There are a number of companies touting disk-based backup as the panacea Some antidote or remedy that completely solves a problem. Most so-called panaceas in this industry, if they survive at all, wind up sitting alongside and working with the products they were supposed to replace.  for these problems. Disk-based backups are great for decreasing the time to back up vast amounts of data, but have the same problems associated with backing up to tape, because it is very difficult to back up application-generated data continuously to disk.

Point-in-Time Copy or Snapshot Technology

Snapshot technologies allow companies to create point-in-time copies or online backups Using the Web to store copies of data for backup. There are numerous providers on the Internet that charge for storage, and fees are typically based on capacity. Online backup services provide offsite backup, which is essential for disaster recovery. See backup types.  as the system is running. In the event of data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors.  or data loss, users can roll the state back to the last snapshot or the last uncorrupted snapshot and restore. According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 Randy Kerns, an analyst with the Evaluator Group, "With snapshots, you do not have to take your application out of service."

Most users are used to traditional backup methodologies: Level 0 backups every week and Level 1 backups every day. Restoring from these backups involves first loading a Level 0 backup and then a Level 1 backup. Unlike traditional backups, snapshots have the following important features:

* Each snapshot is complete in itself. It is not necessary to load the original snapshot, and then add incremental Additional or increased growth, bulk, quantity, number, or value; enlarged.

Incremental cost is additional or increased cost of an item or service apart from its actual cost.
 snapshots to create a final image.

* Advanced snapshot algorithms The snapshot algorithm is an algorithm used in distributed systems for recording a consistent global state of an asynchronous system. It is also known as Chandy-Lamport Algorithm for the determination of consistent global states, after Leslie Lamport and K. Mani Chandy.  are incremental using copy-on-write technology. Copy-on-write works by intercepting calls to modify files or blocks and making a copy prior to the modification.

The benefits of good copy-on-write algorithms are:

* There is no necessity to make a copy of the data at the point a snapshot is taken. Instead, a copy of the pointers pointing to data is made. This greatly speeds the snapshot process since, during this time, there can be no access to data to achieve consistency and prevent data corruption.

* Copy-on-write ensures that only changed blocks are copied. Consequently, data is not unnecessarily duplicated.

* If a virus was to attack a system, it would automatically initiate a copy-on-write on any files it corrupts. Consequently, it is easy for a sys admin to go back to a previous snapshot without the virus corruption.

Point-in-Time Copy Technology

Zetta Systems' point-in-time copy technology, ZSnap, is a significant advancement over current snapshot or point-in-time technologies. These advancements include.

Low snapshot overhead: Most current copy-on-write snapshot technologies are fast for small volumes of data. As data sizes increase dramatically, the time required for a snapshot increases, often linearly, as the data increases. Zetta Systems' patented snapshot algorithm is constant-time, meaning that independent of the size of the data, the same number of pointers is copied. It takes less than 100 milliseconds to snapshot a Zetta Server SAN-NAS device, independent of the volume of data. In addition, the actual disk overhead at snapshot time is always constant at 1KB. Lightweight snapshots are key in decreasing the granularity of data protection from days to minutes.

Number of active snapshots: Most current technologies limit the number of snapshots to less than 255. Microsoft supports 64 snapshots per volume: Network Appliance (1) A specialized device for use on a network. For example, Web servers, cache servers and file servers can be implemented as general-purpose computers with the appropriate software or as network appliances, which are computers dedicated to a single function and cannot do anything  supports 255 snapshots per volume. Scaling beyond this number is difficult without the right algorithm and file system support. Zetta Server places no practical limits on the number of active snapshots. Unlimited snap shots are another important consideration in obtaining maximum data protection.

Unified block+file snapshots: Storage connected to a Zetta Server can be accessed as both a block (SAN) and a file (NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
) device. Zetta Server's snapshot technology allows users to snapshot both files as well as blocks using the same mechanism. Snapshotting files is very useful for recovering single files, while snapshotting blocks is essential for preserving the integrity of databases and the Exchange server information store.

Rapid Restoration: Many snapshot technologies do not facilitate rapid restores. Restores can take hours with even moderate amounts of data. Zetta Server has been designed so that any snapshot can be swapped with the active file system in less than 5 seconds, facilitating rapid restores.

What is Zetta Server?

Zetta Server includes a number of patent pending technologies resident on an enhanced version of the FreeBSD operating system, including:

ZFS (1) (Zettabyte File System) A distributed file system from Sun that was added to OpenSolaris in 2005 and Solaris 10 in 2006. With support for 128 bits, compared to 64 for NTFS and other contemporary file systems, storage capacity is unlimited for all practical : A high-performance, adaptive, caching, file system that is core in implementing ZSnap, a powerful, lightweight snapshot technology that works intuitively with blocks as well as files.

Zunified: A unified driver that implements block and file access to a common pool of storage. Users can access Zetta Server-based appliances as network attached storage. In this mode. Zetta Server supports both Windows and Unix clients with integration into Active Directory and NIS Niš or Nish (both: nēsh), city (1991 pop. 175,391), SE Serbia, on the Nišava River. An important railway and industrial center, it has industries that manufacture textiles, electronics, spirits, and locomotives. . Zetta Server-based appliances can also be accessed through a Fibre Channel connection which presents a block SCSI interface SCSI interface - SCSI adaptor  to clients. Multiple servers can be connected to Zetta Server via a Fibre Channel switch In a computer storage field, a Fibre Channel switch is a network switch compatible with Fibre Channel (FC) protocol. It allows the creation of a Fibre Channel fabric, that is currently the core component of most storage area networks. .

Zditto: Replication technology that replicates block as well as file data to a remote off-site server. ZDitto is ideal for disaster recovery from site disasters.

The Figure (on page 28) shows an ideal disaster-proof implementation using Zetta Server appliances. Each Zetta Appliance is configured with one gigabit Ethernet An Ethernet standard that transmits at 1 Gbps. Used mostly to connect high-end workstations and servers as well as for network backbones, Gigabit Ethernet transmits full duplex from point to point using switches and half duplex in a shared environment (CSMA/CD) using a hub.  card and one Qlogic Fibre Channel HBA (Host Bus Adapter) See host adapter. . Zetta appliances are connected to each other using a gigabit Ethernet for sensing whether the paired Zetta Server is active. Both Zetta Servers are connected to a high-end storage array either through a separate Fibre Channel connection or through SCSI SCSI
 in full Small Computer System Interface

Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB.
.

Data is replicated to a remote site, to a pair of Zetta Servers. In the event of a site disaster, data can be recovered quickly from the remote server.

Zetta Server-based appliances protect data from the primary causes of data loss; disk failures through RAID, hardware failures through automated fail-over, administrator errors and application corruption through highly granular snapshotting, and site failures through file + block replication. Zetta Server is a cost-effective approach to creating disaster-resistant file and block network storage.

Dr. Ganapathy Krishnan is founder and CEO (1) (Chief Executive Officer) The highest individual in command of an organization. Typically the president of the company, the CEO reports to the Chairman of the Board.  of Zetta Systems (Woodinville, WA)

www.zettasystems.com
COPYRIGHT 2004 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Disaster Recovery; Zetta Server
Author:Krishnan, Ganapathy
Publication:Computer Technology Review
Geographic Code:1USA
Date:Feb 1, 2004
Words:1558
Previous Article:The impact of regulatory compliance on storage: "the compliance landscape is a minefield."--Enterprise Storage Group.(Disaster Recovery)
Next Article:Lifecycle management drives data management's evolution from art to science.(Disaster Recovery)
Topics:



Related Articles
Global namespace: The Future of File System Management, Part 2. (Storage Networking).
Online backup and recovery services: throwing a lifeline to "stranded servers". (Storage Networking).
Recovery for the rest of us: small and mid-size businesses need to be as diligent as large enterprises in implementing an emergency preparedness...
Data grid disaster puts SMBs at most risk.(Disaster Recovery)(small to medium size enterprises)
Is regular backup enough? Join the continuous backup revolution.(Backup/Restore)
Implementing an IP SAN for disaster recovery: using iSCSI as an enabler.(Storage Networking)
Preparing for disaster with an effective business continuity strategy: overcoming potential dangers to your information infrastructure.(Disaster...
Overcoming recovery barriers: rapid and reliable system and data recovery.(Data Protection)
Protecting Microsoft Exchange Server in SMBs.(Disaster Recovery & Backup/Restore)(small and midsize businesses)
Disaster recovery for the masses: the role of OS-level server virtualization in disaster recovery.(Disaster Recovery & Backup/Restore)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles