Printer Friendly
The Free Library
14,530,480 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Comparing host-based D2D to VTLs for backup and restore: Part 1.


Tape technologies have been the dominant force in backup and recovery almost since the beginning of modern computing; however, the landscape is finally changing. With the arrival of next-generation ATA (1) (AT Attachment) The specification for IDE drives. See IDE.

(2) See analog telephone adapter.

ATA - Advanced Technology Attachment
 disk technology, newer, low-cost backup and recovery methods are being offered by vendors promising to solve many of the problems that IT administrators have faced with tape backups Using magnetic tape for storing duplicate copies of hard disk files. Users can add an internal or external tape drive to their desktop computers for backup purposes, and files are typically copied to the tapes using a backup utility that updates on a periodic schedule.  and restores. The market has produced a proliferation proliferation /pro·lif·er·a·tion/ (pro-lif?er-a´shun) the reproduction or multiplication of similar forms, especially of cells.prolif´erativeprolif´erous

pro·lif·er·a·tion
n.
 of disk-based offerings ranging from simply including disk arrays as a "middleman mid·dle·man  
n.
1. A trader who buys from producers and sells to retailers or consumers.

2. An intermediary; a go-between.
" in the backup path to complicated imaging and snapshot systems that store their content on disk. With such a wide field of offerings, it is useful to focus in on only a couple at a time and see how they compare. In this discussion, we will take a closer look at two of the most promising of these new backup technologies, namely host-based disk-to-disk (D2D (Disk-to-Disk) Typically refers to backing up data on disks rather than on tape. Disk-to-disk backup systems provide a very fast restore capability compared with tape backup. See D2D2T and virtual tape. ) and virtual tape libraries A hard disk array that emulates a tape library. A virtual tape library (VTL) enables the storage medium to be switched from tapes to disks while continuing to use the existing tape backup software. See virtual tape system and storage virtualization.  (VTL See virtual tape library. ).

At the outset, new host-based D2D backup strategies using ATA disk arrays seemed to offer the most promise, using low-cost disks and providing increased data transfer speeds for backup and recovery. As many IT administrators rushed to implement D2D, they discovered that there are a number of challenges with D2D that may not have been readily apparent, including integration issues, storage formats, file system size and performance problems, fragmentation concerns and others. IT administrators should be aware of these issues as they strive to reap the benefits of D2D in their backup and restore environments.

Another technology that is capturing the attention of IT administrators is that of virtual tape libraries (VTLs). VTL takes elements from both traditional tape backup and the newer ATA disk-based technology and combines them to provide a solution optimized for existing backup environments. A virtual tape library is essentially a disk-based library that emulates standard tape library and tape formats. Acting like a tape library in the environment, with the performance of modern ATA disk, this new VTL is truly the "best of both worlds".

Here, we will first take a look at the problem of traditional backup and restore from tape, another quick look at the advent of ATA technology, then delve deeper into these two new backup and restore technologies. We will investigate the benefits and challenges of each technology and present information that will help executives and IT decision makers determine which is the better technology to improve their backup environment.

The Nightmare of Traditional Tape Backup and Restore

Backup and restore from tape has been a source of consternation for IT administrators for years. One has only to make a quick search of the Internet for the topic "tape backup problems" to see the proliferation of articles and discussions about the challenges of traditional tape backups. After looking at some of the top entries, some of the problems described include:

* Tape failures during backup

* Failed notifications of incomplete backups

* Inability to locate an appropriate and current backup tape See tape backup.

* Cost of tape media

* Tape backup overruns into production time

* Inadequate restore speeds from tape

In spite of all of these difficulties, organizations have been left with few alternatives to traditional backup technologies until recently. To add to the list of common problems with tape backup and restore, IT administrators are now faced with a growing mountain of data that must be protected.

Improvements in computing hardware--like processor and memory speeds, lower cost hardware and software, and application development advances that improve computing capabilities--have led to the creation of more (and larger) data. Historically, data consisted primarily of text, word processor documents, spreadsheets and the like. The total amounts of data created in an organization were relatively small in size and were not very difficult to backup and restore using tape media. Today organizations use more complex applications to create images, databases, 3D geophysical ge·o·phys·ics  
n. (used with a sing. verb)
The physics of the earth and its environment, including the physics of fields such as meteorology, oceanography, and seismology.
 maps, movies and other sizeable data. The total quantity of this data within organizations has become rather daunting daunt  
tr.v. daunt·ed, daunt·ing, daunts
To abate the courage of; discourage. See Synonyms at dismay.



[Middle English daunten, from Old French danter, from Latin
 for IT administrators, and is increasingly difficult to manage in terms of tape backup and recovery following data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors.  or system failures.

Over the past few years, IT administrators have begun to look to disk-based solutions to solve the nightmares of tape backup and recovery. The most prevalent problem with implementing disk-based systems has been the cost of hard drives in comparison to tape media. In order to utilize a disk-based solution, IT administrators were often faced with doubling their total disk capacity to create an online backup Using the Web to store copies of data for backup. There are numerous providers on the Internet that charge for storage, and fees are typically based on capacity. Online backup services provide offsite backup, which is essential for disaster recovery. See backup types. , yet still having to further backup to tape for a complete data protection scenario. This approach would solve the problem of backup overruns and tape restore speeds, but still carried a cost that was typically insurmountable. These nightmares of traditional backup soon set the stage for new technology that could offer the data protection of tape, the performance of disk, at a cost that was still in that same ballpark as the tape technology.

ATA Increases Speed While Lowering Cost

Prior to about 2001, basically two types of hard drives were available, Small Computer System Interconnect (SCSI SCSI
 in full Small Computer System Interface

Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB.
) and Integrated Drive Electronics See IDE.

Integrated Drive Electronics - Advanced Technology Attachment
 (IDE). SCSI disks were typically used in server systems or those systems requiring higher transfer rates. Early SCSI interfaces SCSI interface - SCSI adaptor  were capable of only around 5MB/see transfer, while later versions of SCSI supported 80MB/see or more. Concurrent development has also been taking place over the years in the Fibre Channel arena, with FC drives now capable of 200MB/see transfer speeds. These drives are now used in many high-end storage arrays and to provide optimal data transfer performance when combined with 2Gbps FC network components. Recently, with the latest advances in Serial SCSI Running SCSI on Fibre Channel, SSA or FireWire. SCSI is a parallel bus, and the parallel signals must be converted to serial transmission to ride over different transport systems. See Fibre Channel, SSA, FireWire and SCSI. , this interface is capable of up to 300MB/see data transfer. While the performance of these SCSI and Fibre Channel drives is attractive, their cost has typically made them feasible for only primary PC, server and more expensive storage systems.

The IDE interface was later renamed Advanced Technology Attachment (storage, hardware, standard) Advanced Technology Attachment - (ATA, AT Attachment or "Integrated Drive Electronics", IDE) A disk drive interface standard based on the IBM PC ISA 16-bit bus but also used on other personal computers. . Maxtor held patents on Parallel ATA See PATA.  technology, and drove its development to the point that it was capable of 100MB data transfer speeds by the year 2003. According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 Maxtor, parallel ATA was reaching its limits at that speed. To get beyond 100MB/sec, a cooperative industry group including Maxtor, Intel, Seagate, APT, Dell and IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries)  began developing the Serial ATA See SATA.

Serial ATA - Serial Advanced Technology Attachment
 specification. Serial ATA is currently capable of 150MB/sec, and has resulted in the production of new Serial ATA drives and interfaces. In the near future, according to Seagate, Serial ATA will be capable of up to 600MB/sec data transfers, making it a very attractive, low cost, hard drive technology.

[GRAPHIC OMITTED]

ATA technology offers the highest performance-to-cost ratio of any other disk alternative available on the market today. This fact has now spurred the development of a number of ATA disk-based products to assist organizations in overcoming their tape backup and recovery problems.

Could D2D End Tape Backups?

Redundant Arrays of Independent Disks (storage, architecture) Redundant Arrays of Independent Disks - (RAID. Originally "Redundant Arrays of Inexpensive Disks") A project at the computer science department of the University of California at Berkeley, under the direction of Professor Katz, in conjunction with Professor  (RAID) aggregates the performance of multiple disks together. It is not only the speed of an individual disk that counts, but rather the speed of a disk group (i.e., the media transfer rate of an ATA 7200RPM hard disk is around 35MB/sec. With 10 disks in a group, you'd get ~350MB/sec). With the advances in data transfer rates being applied to lower cost hard drives, a number of vendors quickly began to devise products and methods for backup based on RAID arrays using ATA technology. The combined performance of disks in RAID arrays well outpaces that of writing to traditional tape, D2D became a new buzzword A term that refers to the latest technology or a term that sounds catchy. If not a flash in the pan, new technologies become mainstream. For example, Java was a hot buzzword in the 1990s, but should remain a major topic for decades.  in the market, with promises of a new panacea Some antidote or remedy that completely solves a problem. Most so-called panaceas in this industry, if they survive at all, wind up sitting alongside and working with the products they were supposed to replace.  for the data and backup headaches of IT administrators everywhere. Higher backup speeds, online backups and restores and additional data protection, without the difficulties of tape, looked promising.

Many vendors soon began to offer drive arrays, NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
 devices, SAN products, and other configurations built around these new, low-cost ATA drives The formal name for an IDE drive. See IDE. . Storage was now available for around a penny per MB, where it had previously been as high as ten cents Ten Cents has several meanings:
  • Ten Cents, a worth of a dime
  • Ten Cents, a fictional character in TUGS
 or more! IT administrators rushed to get their hands on this new technology. After all, D2D offered a number of benefits over traditional tape backup, including:

* Faster writes of backup data to disk

* Faster restores from disk

* Familiar PC Server hardware

* Familiar RAID storage technology

While cheaper RAID arrays were certainly attractive, IT administrators did not readily abandon their tape backup strategies. For existing IT environments of any size, moving from a tape backup infrastructure to a disk-based strategy was more difficult than it looked. Most enterprise organizations have a sizeable investment in their existing backup infrastructure, and have likely spent considerable effort to get their backups functioning as well as possible using existing tape technology.

Business leaders and IT administrators are hesitant to throw away such an investment, and are loathe to upset their existing backup processes just for the promise of increased performance. In order for D2D to be successful in replacing tape backups, it needs to be implemented in such a fashion so as not to interrupt the existing environment, and to somehow leverage the sizeable investment organizations have made in their backup infrastructure.

Host-Based D2D Offers Some Promise

Vendors of traditional host-based backup software See backup program.

(tool, software) backup software - Software for doing a backup, often included as part of the operating system.

Backup software should provide ways to specify what files get backed up and to where.
 and hardware were quick to recognize the potential of host-based D2D and moved to ensure that they did not lose their customers to new players in this arena. Leading backup application vendors like Veritas and Legato (Legato Systems, Inc., Mountain View, CA, www.legato.com) A leading provider of storage management and high-availability software founded in 1988 and acquired by EMC Corporation in 2003. Legato software, including Celestra data management (data mining, data migration, etc. , for example, began to offer the ability to point backups to RAID arrays instead of to tape libraries or tape drives. They also began to create vaulting vaulting

Gymnastics exercise in which the athlete leaps over a form that was originally intended to mimic a horse. At one time, the pommel horse was used in the vaulting exercise, with the pommels (handles) removed.
 applications that would allow the RAID arrays to be further backed up to tape in the traditional fashion, foreseeing that organizations might not easily forego their dependence on tapes.

These new offerings from the traditional backup application vendors seem at first glance to be exactly what IT administrators need, namely a method that will allow them to continue to use their existing backup software and still gain the advantages of the lower cost ATA RAID Using ATA (IDE) drives in a high-performance and/or fault tolerant configuration. See RAID.  arrays. Host-based D2D adds benefits above and beyond generic D2D to include:

* Integration with existing backup and restore applications

* Ability to execute multiple, simultaneous backup and/or restore operations using disk-based systems

* Ability to combine disk snapshots with backup and recovery operations Operations conducted to search for, locate, identify, rescue, and return personnel, sensitive equipment, or items critical to national security.

* Flexibility in combining disk-based staging with tape backups

* Ability to consolidate storage and backup data pools

The Challenges of Host-Based D2D

While the benefits of a host-based D2D solution are significant, upon closer inspection, host-based D2D solutions still have some inherent problems. Many challenges involv-ed in host-based D2D involve the integration of the various pieces necessary for the solution. Most of the application vendors do not sell RAID arrays, servers or the other necessary network hardware (and possibly software) to ensure compatibility among the various pieces.

When we look at the installation requirements for both host-based D2D and VTL, it becomes apparent that much more configuration is required of the D2D solution. The Table illustrates the steps required for each.

Host-based D2D has considerably more steps involved in the installation than a VTL system. As we can see above, the VTL system is designed to be plug and play. The D2D system being a collection of pieces from various vendors essentially brings all of the challenge of primary storage configuration into the backup arena. IT administrators should be aware of the potential installation requirements, know the limitations of the hardware, file systems, RAID types, etc., before deciding to implement this kind of solution.

Compatibility presents some challenge for IT administrators wishing to implement host-based D2D. The IT administrator must typically assemble a collection of devices from different vendors, configure each of these to operate as needed as needed prn. See prn order. , then connect the hardware and applications so that they will interoperate properly. Incompatibility The inability of a Husband and Wife to cohabit in a marital relationship.


incompatibility n. the state of a marriage in which the spouses no longer have the mutual desire to live together and/or stay married, and is thus a ground for divorce
 could potentially be a serious problem if the wrong devices or applications are coupled. Further exacerbating the problem is the issue of technical support, should any one of the pieces fail. This scenario could easily result in finger pointing among vendors, leaving the IT administrator to act as referee, while his data protection is in jeopardy.

Even though compatibility and integration provide the most significant problems with a host-based D2D solution, there are other problems inherent in the solution as well. The D2D capabilities of the backup/ restore software in conjunction with off-the-shelf RAID arrays rely on file systems like NTFS (NT File System) An optional file system for Windows NT, 2000 and XP operating systems. NTFS is the more advanced file system, compared to FAT32. It improves performance and is required in order to implement numerous security and administrative features in the OS.  and NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS.

NFS - Network File System
 that were created for use in PC server systems. These file systems were designed to manage many small files for many users. The problem, when these file systems are used for tape-style backups, is that they take a performance hit when performing the larger write operations typically executed by tape backup applications. Additionally, when combined with tape backup applications, the storage system's processor takes an additional, unnecessary performance hit, as the application must interact with the file system's file allocation tables (file system) File Allocation Table - (FAT) The component of an MS-DOS or Windows 95 file system which describes the files, directories, and free space on a hard disk or floppy disk.

A disk is divided into partitions.
 to determine a particular file's location.

As mentioned, general-purpose file systems used with most D2D RAID arrays are not optimized for tape backups. Three other problems exist with these general-purpose file systems relative to backup systems Noun 1. backup system - a computer system for making backups
ADP system, ADPS, automatic data processing system, computer system, computing system - a system of one or more computers and associated software with common storage
.

* Most general-purpose file systems are subject to significant file system fragmentation over time, just as typical servers are. Fragmentation of the file system increases over time and degrades the performance of the reads and writes on the system.

* General-purpose file systems like FAT, NFS, and NTFS are subject to file system, partition, and individual file size limitations. There is an additional limitation as to the total number of files allowed on a file system. These limitations can impose restrictions on the scalability of backup volumes used by host-based D2D applications. When the limitation is reached, organizations will incur additional expense and difficulty by having to move data, create new volumes, and reconfigure their backup applications.

* The other major problem with many D2D file systems is security of the data that resides on them. Hackers with malicious intent and viruses can compromise general-purpose file system security. Any D2D file system could be subject to these same security concerns, which could lead to a failure in the data protection scenario. These problems can be avoided when using a more advanced disk-based backup system called a Virtual Tape Library (VTL).
Virtual Tape Library                Host-Based D2D

Assumption: Customer has existing enterprise backup application (i.e.
Veritas or Legato) and was using a traditional tape library or tape
drives for backups prior to the purchase of new system.

1. VTL arrives fully configured to  1. RAID Array system arrives.
the customer's specifications,
including RAID arrays, hardware
and included software.
2. System is installed and plugged  2. IT administrator or vendor will
into existing Fibre Channel or      have to install hardware, hard
Ethernet network infrastructure.    drives, cabling and other physical
                                    hardware necessary for RAID system.
3. Backup is executed using         3. D2D system is directly connected
enterprise backup application.      (or via switch) to Host backup
Since VTL emulates previously used  application server that will utilize
tape library or backup, the backup  its storage volumes.
application sees VTL as existing
library. Changes to policies
rarely required.
4. Backup job is verified.          4. IT administrator decides on RAID
Installation and test complete.     types, configures RAID sets
                                    5. IT administrator installs file
                                    system required by host backup
                                    application (i.e. NTFS for Windows,
                                    NFS for Solaris) and partitions the
                                    bolumes created in the RAID sets.
                                    6. IT administrator configures
                                    security access, zones, etc.
                                    7. IT administrator configures host
                                    backup application to utilize RAID
                                    system as target for backup
                                    operations.
                                    8. New backup job is created and
                                    run.
                                    9. Backup job is verified.
                                    Installation and test are complete.

Table 1: Installation steps required for each solution


Part 2 of this article will appear in the next edition of CTR See click-through rate. .

Paul Feresten is vice president, marketing, SEPATON, Inc. (Southborough, MA)

www.sepaton.com
COPYRIGHT 2004 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Disaster Recovery & Backup/Restore; Virtual Tape Library; disk-to-disk
Author:Feresten, Paul
Publication:Computer Technology Review
Date:Sep 1, 2004
Words:2632
Previous Article:Sarbanes-Oxley: compliance meets technology.(Business of Technology)
Next Article:Catch the next storage wave of change: jump in and get ready to profit!(Storage Networking)
Topics:



Related Articles
ENVISION: A Clear Look At The Future.
IBM Tivoli storage manager and NetApp NearStore R100 provide auto storage Mgnt with fast access.(Product Announcement)
Speeding up the network: D2D backup lets VARs beat the Bottleneck Bugaboo. (Nectivity).(Buyers Guide)
Tape or disk: why not both?(Storage Management)(Industry Overview)
Getting disk into the backup process; adding benefits of disk while supporting existing processes.(Storage Networking)
Comparing host-based D2D to VTLs for backup and restore: Part 2.(Disaster Recovery & Backup/Restore)(Virtual Tape Libraries)
Tape-based WORM: the best choice for HIPAA-compliant storage.(Write-once read many; Health Information Portability and Accountability Act)
Tape turning: protect against data loss.(first in/first out)
Enabling tiered storage through tape virtualization: delivering more performance, reliability and efficiency at lower cost.(HSM: Special Section)
USAF looks beyond backup to data recovery.(Case Studies)(United States. Air Force)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles