Printer Friendly
The Free Library
14,505,983 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Challenges to Open File Backup.


Many of today's most critical applications such as email, web servers, and transaction servers can no longer be shut down for backup. They are constantly up and running ... 24x7. Yet, no data center manager would eliminate backup as a critical management task.

How, then, to address open file backup? The conventional approach is open file agents and managers that insured that all files were backed up intact and not skipped because they were open. Yet the complexity of applications and operating systems Operating systems can be categorized by technology, ownership, licensing, working state, usage, and by many other characteristics. In practice, many of these groupings may overlap.  has spiraled, and data loss is once again a major problem facing the data center.

Santa Barbara Santa Barbara (săn'tə bär`brə, –bərə), city (1990 pop. 85,571), seat of Santa Barbara co., S Calif., on the Pacific Ocean; inc. 1850. , CA-based Strategic Research Corporation reports that users could not recover a usable intact dataset from their backup in over 35% of the time when they tried a restore function. Many times, the files were successfully restored intact, but would not operate correctly since they do not match each other's timestamps. To address these problems, another enabling technology has been developed.

OPEN TRANSACTION MANAGER (OTM OTM

See: Out of the money.
)

This is an enabling technology that presents a stable, coherent, point in time snapshot alternative view of one or more volumes or physical drives to any backup application without impacting system performance. OTM permits a user to back up a server or workstation with all files and databases open and active. Popular databases like Oracle and Lotus Notes Messaging and groupware software from IBM Lotus that was introduced in 1989 for OS/2 and later expanded to Windows, Mac, Unix, NetWare, AS/400 and S/390. Notes provides e-mail, document sharing, workflow, group discussions and calendaring and scheduling. , SQL servers An earlier relational DBMS from Sybase and from Microsoft. Sybase introduced SQL Server in 1988 for various Unix versions. In that same year, with help from IBM, Sybase created an OS/2 version that Microsoft licensed and branded as Microsoft SQL Server. , and email servers can be in use and running.

The technology creates an alternate "virtual drive" or static copy of the drive to be backed up (See Fig). Software looks for a short period of inactivity, five seconds or so, where no writes are occurring to any of the volumes or drives that have been selected for backup. Once this "quiescent quiescent

at rest; latent; the G0 stage of the cell cycle.
 period" is obtained, OTM maps in a virtual drive letter for each volume selected for backup. The backup application can now access this static virtual volume instead of the original volume, which is changing during the backup.

When a write occurs on the original volume, OTM pauses it and copies the old corresponding data to its cache file A file of data on a local hard drive. When downloaded data is temporarily stored on the user's local disk or on a local network disk, it speeds up retrieval the next time the user wants that same data (Web page, graphic, etc.) from the Internet or other remote source. See Web cache and cache. , immediately sending the original write data to the hard drive. This keeps the hard drive real time current and safe at all times during the backup. Read requests from all applications, except the backup, are passed directly to the hard drive with no intervention. Read requests from the backup package are passed to the OTM filter driver that determines if the data is already in cache.

If that data is in cache, OTM passes the cached data to the backup package. If not, the data is passed directly from the hard drive. Since OTM only needs to preserve the original data, additional writes to the same sector are not cached and passed directly to the hard drive.

BANISH ban·ish  
tr.v. ban·ished, ban·ish·ing, ban·ish·es
1. To force to leave a country or place by official decree; exile.

2. To drive away; expel: We banished all our doubts and fears.
 THE BACKUP WINDOW

The mere fact that there is a "backup window" (that small slice in time when a backup can be performed without affecting system performance) is a major problem for most system administrators. All file-by-file backup applications have relied on secondary and tertiary agents and managers just to allow the backup of the open files while the system is live and in use. Yet these file-by-file applications combined with open file agents and managers are extremely CPU CPU
 in full central processing unit

Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit.
 and I/O intensive Refers to an application that reads and/or writes a large amount of data. The performance of such an application depends on the speed of the computer's peripheral devices and can cause a computer to become I/O bound. See I/O bound. . This can bring server performance down to its knees during your backup.

OTM makes all of the file, database, Ethernet, CPU, and I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output.

I/O - Input/Output
 subsystem resources available to users on a priority basis during the backup by throttling the I/O. This increased availability makes the time to complete the backup a secondary issue.

A faster backup would also minimize that time. OTM "sees" a True Image of the file server. All I/Os pass through OTM, even all of the OS's hidden system I/Os. Only OTM can safely give users priority over secondary processes like backup because of this.

Since servers can task switch many times a second, once True Throttling is implemented, no one should ever have to wait for their data. As backup applications have become more efficient at consuming all available resources, his background application can effectively shut down the more costly applications--users. OTM realigns these priorities according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 commonsense com·mon·sense  
adj.
Having or exhibiting native good judgment: "commonsense scholarship on the foibles and oversights of a genius" Times Literary Supplement.
 management guidelines. The backup also takes less time, since the timestamps no longer need to be reset after backup on the virtual OTM drive.

I/O PERFORMANCE SUPPORTS THE BACKUP FUNCTION

Since OTM sees all I/Os, OTM pauses the backup I/Os if there are outstanding user I/Os. Since the system will timeslice, no one will ever have to wait for his or her data. When the system is at 100% I/O utilization, then the backup will wait-not the user. The backup application itself will not be issuing more file requests to the OS until the current requests are fulfilled because the I/Os from the backup are being paused in a 100% I/O saturation situation. If the backup file A file on a tape, removable disk or the fixed disk of another computer that is a copy kept for backup purposes. See backup types.  requests were to continue, the OS itself could quickly overburden o·ver·bur·den  
tr.v. o·ver·bur·dened, o·ver·bur·den·ing, o·ver·bur·dens
1. To burden with too much weight; overload.

2. To subject to an excessive burden or strain; overtax.

n.
1.
 the CPU, the network cable, and the storage subsystem The part of a computer system that provides the storage. It includes the controller and disk drives. See storage system. . By preventing more backup file I/O requests, the OS, CPU, and the Ethernet are free to service all user requests and the backup resumes only after the user requests are satisfied.

OTM works at the block level and not the file level; all of its work is done within OTM and not the OS and the file system. This improves performance in read/write operations. Since write operations are typically less than 10% of reads, OTM's I/Os are generally invisible to the user or the backup.

For reads from user's volumes, the read request is passed directly to the hard drive with no delay or system impact. For reads from the backup application's alternate "virtual" volume, these blocks reads are passed through to the hard drive if the data has not changed since backup commencement. For block reads from the virtual volume that have changed, OTM will read only those blocks from the hard drive's cache file, resulting in slightly slower file access to the backup.

For writes either from the user or the backup application, the old data is first copied to the cache file before the write is passed to the hard drive. OTM speeds up this operation by utilizing the "lazy write Refers to the effect caused by using a write back cache. Data are written to the cache first and, later, during idle machine cycles or at some specified time, are written to disk if there is a disk cache or to memory if there is a CPU cache. " feature of the OS to write to the cache file. Additional performance can be achieved by placing OTM's cache file on a separate system. All subsequent writes to the same sectors will not be cached, but would be immediately written to the hard drive. Once volumes have completed their backups, OTM can be told to release them, thus allowing all subsequent reads and writes to be immediately passed to the hard drives without OTM.

Alan Welsh is the president of Columbia Data Products Columbia Data Products (CDP) introduced the MPC 1600 "Multi Personal Computer" in June 1982. It was an exact functional copy of the IBM PC model 5150 except for the BIOS which was clean roomed.  (Altamonte Springs Al·ta·monte Springs  

A city of east-central Florida, a residential suburb of Orlando. Population: 40,900.
, FL).

Backup Problems In the Data Center

Conventional open-file agents or managers face significant problems in the data center. Here are a few:

* THE TIMESTAMP PROBLEM (RELATIONAL INTEGRITY)

Most of today's applications have multiple associated files that are all updated together. When a backup is done while running an application, the backup copies file "A" to tape. Now an application subsequently updates file "A" and another associated file "B" that must match each other for the dataset to load. The backup now backs up file "B" to tape, which no longer matches file "A" on the tape. Everything looks good to the administrator until restoration is necessary. Convinced that the file restore must have gone awry a·wry  
adv.
1. In a position that is turned or twisted toward one side; askew.

2. Away from the correct course; amiss. See Synonyms at amiss.
, he tries again and again, but it just won't run the data after "restoring."

* DISAPPEARING BACKUP WINDOW PROBLEM

Most of today's most critical applications such as email, web-servers, and transaction-servers can no longer be shut down for backup and are running 24x7. This makes backup integrity a virtual impossibility when utilizing open file agents or managers.

* CONFIGURATION INDUCED LOSS

When using open file agents or managers, the only way to overcome this limitation is for the administrator to learn each applications' file structures and directories, then configure groups of all the associated files together in the open file agent or manager, and finally test to make sure that no files were missed in the grouping. Since many applications write these files in multiple directories, the chance for error is high.

* IMPROVED SYSTEM/BACKUP PERFORMANCE

Since open file agents and managers must operate at the file level, they can seriously degrade TO DEGRADE, DEGRADING. To, sink or lower a person in the estimation of the public.
     2. As a man's character is of great importance to him, and it is his interest to retain the good opinion of all mankind, when he is a witness, he cannot be compelled to disclose
 the server's performance by consuming huge amounts of CPU and I/O bandwidth. OTM can be operational without seriously impacting system performance as it acts at the disk sector level, bypassing the operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 and, thus, CPU overhead. Open file agents and managers consume vast system resources (1) In a computer system, system resources are the components that provide its inherent capabilities and contribute to its overall performance. System memory, cache memory, hard disk space, IRQs and DMA channels are examples.  like most backup software--users and vital processes are left out and can't run at the same time as the backup in this virtual "musical chair" performance game.

* THE "DAILY" UPGRADE

Dedicated open file managers and open-database agents must have intimate knowledge of the underlying database to work correctly. So anytime the database vendor makes a change, the open file agents vendor must make a corresponding change. So the customer must always be playing catch-up, installing new software, and then going through extensive testing to insure that it is working and configured properly. Some open file agent vendors are now grouping all files together, which improves the odds of a good backup. Unfortunately, if server activity is high during the first 60 seconds, all the grouping goes away and so does your dataset relational integrity.

* HIDDEN DATA LOSS

As more and more hidden structures in the operating system are suddenly "discovered," file-based open file agents and managers must constantly change to accommodate these new objects--and they may not be allowed by the underlying OS to properly protect them.
COPYRIGHT 1999 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1999, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Technology Information
Author:WELSH, ALAN
Publication:Computer Technology Review
Date:Oct 1, 1999
Words:1623
Previous Article:STORAGE AREA NETWORKS: REDI For The Next Generation.(Industry Trend or Event)
Next Article:The Importance Of Storage Domain management.(Technology Information)
Topics:



Related Articles
TAPE AUTOMATION AND THE MIGRATION FROM SCSI TO FIBRE CHANNEL.(Industry Trend or Event)
Breakthroughs In Enterprise Backup Solutions For NAS File Servers.(five solutions to back up and restore data on NAS filers)(Technology Information)
Tape and Backup Issues In Storage Area Networks.(Technology Information)
ST. BERNARD EXPANDS ITS OPEN FILE BACKUP TO PROVIDE OPTIMIZED DATA PROTECTION.(Open File Manager 8.0)(Product Announcement)
Open-file solutions: choosing the best package for your enterprise. (Storage Management).(Buyers Guide)
St. Bernard: Open File Manager 9.1 with disk-level technology and support for storage management.
St. Bernard announces Open File Manager 9.3.
TCO analysis: where D2D fits--part 1.(TCO: Disk Arrays)(Disk-To-Disk)
Comparing host-based D2D to VTLs for backup and restore: Part 1.(Disaster Recovery & Backup/Restore)(Virtual Tape Library)(disk-to-disk)
Policies, precautions and practices: some facts and practices for disaster recovery.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles