The emergence of e-vaulting: electronic vaulting is a compelling improvement on traditional in-house data backup and recovery functions.If small- and medium-sized businesses thought that data availability Refers to the degree to which data can be instantly accessed. The term is mostly associated with service levels that are set up either by the internal IT organization or that may be guaranteed by a third party datacenter or storage provider. was not crucial to their operational success before September 11, 2001, they do now. While these companies realized that their data was the lifeblood life·blood n. 1. Blood regarded as essential for life. 2. An indispensable or vital part: Capable workers are the lifeblood of the business. of their businesses, many believed that they just could not afford the costly staff, services, and capital to properly protect that data offsite. More often than not, they backed up to tape and prayed they never had to use those tapes to recover lost data. Even large enterprises have not implemented bulletproof Refers to extremely stable hardware and/or software that cannot be brought down no matter what unusual conditions arise. See industrial strength. bulletproof - Used of an algorithm or implementation considered extremely robust; lossage-resistant; capable of correctly backup and recovery procedures See: explosive ordnance disposal procedures. on a consistent basis. But in the wake of 9/11, businesses large and small want to ensure they can recover from a disaster. Larger enterprises are revisiting their backup procedures and expanding their infrastrures, smaller businesses are developing recovery plans first time. According to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. "Standard IT Processes: Getting to Yes at Intel," by Bill Whittington Bill Whittington (born September 11, 1949) is a former American racing driver from Lubbock, Texas who won the 1979 24 Hours of Le Mans together with his brother Don Whittington and Klaus Ludwig on a Porsche 935. , "a modern states that those who conduct backups are only two restores away from a pink slip." Events such as 9/11 have increased awareness that disaster are necessary, but the reality is that the primary loss within an organization--people--has nothing with headline-grabbing disasters. According to a recent published by CBL Cbl cobalamin. Technologies, approximately 15 percent unplanned downtime The time during which a computer is not functioning due to hardware, operating system or application program failure. occurs because of human error. Human error is perhaps the most common cause of data loss, but certainly not the only one that impacts a company's valuable data. How many times have employees deleted a file that they should not have? How many companies have filled a disk drive to 100 percent capacity, causing the server to crash and corrupting large chunks of the data on that disk? These are daily occurrences that data protection methods and schemes must take into account. Essentially, a company cannot afford not to protect its data. Hardware can be replaced but data is, at best, difficult to recover and, at worst, impossible to recover. Traditional Data Protection The traditional method for protecting computer data is to transfer a copy of the data from a hard disk to removable media In computer storage, removable media refers to storage media which can be removed from its reader device, conferring portability on the data it carries. A removable drive is a reader device for such media. such as a floppy disk or magnetic tape. Until recently, the low cost of tape-based storage relative to the cost of online hard disk storage made magnetic tape the most practical method for backing up data. Nonetheless, there have always been inherent problems with the use of magnetic tape for backup. Despite the high cost of available alternatives, those problems have been increasingly unacceptable as part of the risk of doing business. The inherent problems with the traditional magnetic tape backup Using magnetic tape for storing duplicate copies of hard disk files. Users can add an internal or external tape drive to their desktop computers for backup purposes, and files are typically copied to the tapes using a backup utility that updates on a periodic schedule. method include: * Slow speed of backup--Because tape uses a linear recording format, it takes more time to write backup data to it vs. disk. Traditional tape backup strategies including full, incremental Additional or increased growth, bulk, quantity, number, or value; enlarged. Incremental cost is additional or increased cost of an item or service apart from its actual cost. , and differential styles require significant bandwidth to meet short backup windows. * Inability to audit backup--With distributed systems Distributed systems (computers) A distributed system consists of a collection of autonomous computers linked by a computer network and equipped with distributed system software. , it is difficult to verify that tapes are being properly written to with each daily backup. * Data is not immediately offsite--Without manual intervention, backup tapes See tape backup. remain in the tape drive, leaving the data vulnerable to physical events. * Infrequent in·fre·quent adj. 1. Not occurring regularly; occasional or rare: an infrequent guest. 2. verification of backup data--Most people do not activate the option to "verify after write" on their tape drives because this adds 30 to 50 percent to the time required to complete the backup process. * Lack of data encryption--Most tape backup hardware or software does not encrypt See encryption. backup data, leaving it exposed to unauthorized access. * No 24x7 access to data for recovery--If tapes are removed from the drives to be sent offsite (often the recommended method), there can be a significant delay in those tapes returning for recovery purposes. * Changing tape formats--New generations of tape technology are introduced on a regular basis. When IT departments upgrade their tape technologies, few worry about recovering data from previous backups. Upgrades can make it difficult to satisfy corporate retention policies for longer periods of time. Tape is still a good option for long-term retention of data. It has excellent price performance characteristics and likely will continue to have a price advantage over disk for the foreseeble future. Therefore, for those applications requiring infrequent access and those that can afford to wait a longer period of time for "restores" tape will be there medium of choice. Electronic Vaulting vaulting Gymnastics exercise in which the athlete leaps over a form that was originally intended to mimic a horse. At one time, the pommel horse was used in the vaulting exercise, with the pommels (handles) removed. Emerges Electronic vaulting predicates the appearance of storage area networks (SANs) and network-attached storage See NAS. (NAS (1) See network access server. (2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular ) devices on the storage landscape. In fact, forms of electronic vaulting have been in use since the early days of computer networking
Computer networking is the engineering discipline concerned with communication between computer systems or devices. . Wide-area network architectures, as well as the TCP/I set of protocols that form the basis for the internet, have long provided a method to move data electronically from a local system to a remote one. Electronic vaulting is commonly defined as the movement of electronic data over private or public communication lines from a local (primary) computer storage device to a remote (secondary) storage device for the purpose of restoring the data in the event the primary copy is lost. Electronic vaulting also is referred to as "e-vaulting" and "online backup Using the Web to store copies of data for backup. There are numerous providers on the Internet that charge for storage, and fees are typically based on capacity. Online backup services provide offsite backup, which is essential for disaster recovery. See backup types. ." These terms imply the same process of moving electronic data from a local organized to manage its own internal data protection service. Market Size for Electronic Vaulting According to author Fred Moore's 2001 findings on the market penetration Noun 1. market penetration - the extent to which a product is recognized and bought by customers in a particular market penetration - the act of entering into or through something; "the penetration of upper management by women" of e-vaulting: The combined disk, and optical storage industry is approaching $50 billion in revenue, while the storage management software industry now exceeds $5 billion in annual revenue. The powerful dynamics of the storage industry, coupled with the abundance of bandwidth, are creating new and disruptive events that are reshaping the established methods for storing, retrieving, communicating, and managing data. E-vaulting represents a major transition away from traditional in-house data backup and recovery functions. The outsourced solution transitions the backup and recovery function of a company--non-core but critical functions--to an automated third-party service provider. As a result, the total market value for e-vaulting is the union of * traditional tape backup and archiving software products * traditional offsite tape storage or "tape vaulting" services Each time a company converts its data protection methodology to an e-vaulting solution, it is displacing the value propositions of both tape backup software and offsite storage with a compelling improvement on each function. As a result, analysts believe that both markets are taking a fundamental evolutionary step. The market size for BSP BSP Bromsulphalein, a dye used in the study of liver function. See also sulfobromophthalein clearance test. services is currently less than $100 million annually. Though the served market is small today, the addressed markets are large. For example, backup/recovery and digital archiving software was a $1.8 billion market in 2001, according to Gartner Inc., and offsite tape vaulting services comprised a $1.5 billion market in 2001, according to PRISM prism, in optics, a piece of translucent glass or crystal used to form a spectrum of light separated according to colors. Its cross section is usually triangular. International. In addition, the migration toward outsourced data protection solutions represents a transition away from direct-attached tape drives and their associated tape media because e-vaulting can be a direct substitution. Each of these markets represents a multi-billion-dollar IT spending practice. Into the Future The simple fact that information is increasingly "born digital" and, once rendered digitally, remains in that form, means that more and more information will be backed up, recovered, and stored on digital media. As a result, the enormous $8.5 billion global outsourcing (1) Contracting with outside consultants, software houses or service bureaus to perform systems analysis, programming and datacenter operations. Contrast with insourcing. See netsourcing, ASP, SSP and facilities management. market for paper records storage also will transition over time toward digital storage services. This transition won't happen overnight and it will be many years before any significant impact is felt in the legacy paper records storage business. Nonetheless, legislation such as the Health Insurance Portability and Accountability Act The Health Insurance Portability and Accountability Act (HIPAA) was enacted by the U.S. Congress in 1996. According to the Centers for Medicare and Medicaid Services (CMS) website, Title I of HIPAA protects health insurance coverage for workers and their families when (HIPAA (Health Insurance Portability & Accountability Act of 1996, Public Law 104-191) Also known as the "Kennedy-Kassebaum Act," this U.S. law protects employees' health insurance coverage when they change or lose their jobs (Title I) and provides standards for patient health, ), the Electronic Signatures Act, and the Gramm-Leach-Bliley Act The Gramm-Leach-Bliley Act, also known as the Gramm-Leach-Bliley Financial Services Modernization Act, Pub. L. No. 106-102, 113 Stat. 1338 (November 12, 1999), is an Act of the United States Congress which repealed the Glass-Steagall Act, opening up competition drive organizations to protect, backup, and archive data in ways that increasingly perpetuate per·pet·u·ate tr.v. per·pet·u·at·ed, per·pet·u·at·ing, per·pet·u·ates 1. To cause to continue indefinitely; make perpetual. 2. digital storage services. The transition to e-vaulting from traditional tape-based backup and offsite vaulting services has been predicted for some time, according to Gartner's Enterprise Storage Group and other respected storage analysts. However, the "primordial primordial /pri·mor·di·al/ (pri-mor´de-al) primitive. pri·mor·di·al adj. 1. Being or happening first in sequence of time; primary; original. 2. mix" for supporting this transition has only existed for a relatively short period of time. That mix of elements includes * cheap, reliable, widely adopted high-bandwidth network connections (DSL DSL in full Digital Subscriber Line Broadband digital communications connection that operates over standard copper telephone wires. It requires a DSL modem, which splits transmissions into two frequency bands: the lower frequencies for voice (ordinary or faster) * affordable, reliable, and massively scalable Redundant Array of Independent Disks (RAID) storage systems * new software technology engineered "from scratch" specifically to minimize the amount of data required to transmit a full backup See backup types. across a network and to maintain the history of those backups in a very small storage footprint at the remote location The software technology necessary to offer e-vaulting services through BSPs is fundamentally different than traditional tape-based backup and recovery solutions. To ensure the feasibility of automated data protection services, the following engineering challenges had to be addressed: Challenge 1: The development of communication techniques to handle large-volume data movements over public networks, with a high level of reliability needed for outsourcing the data backup and recovery function. Additional communication-related challenges included simultaneous multi-party processing and enhanced digital security. Challenge 2: Creation of a storage architecture with efficiencies that greatly exceed the limitations of historical data compression data compression Process of reducing the amount of data needed for storage or transmission of a given piece of information (text, graphics, video, sound, etc.), typically by use of encoding techniques. techniques. It was important to minimize the amount of data transferred across the network to effectively recreate a full backup and equally important to minimize the amount of data required to represent a comprehensive history of backup data. Challenge 3: Aggregation of all operating system operating system (OS) Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs. (OS) platforms to a common back-end platform. This change reduces the cost of configuration on the back-end. It also accommodates the recovery of data from one OS platform (OS of origin) to any supported OS platform. Another benefit of this common format is to mitigate the problems associated with tape media obsolescence ob·so·les·cent adj. 1. Being in the process of passing out of use or usefulness; becoming obsolete. 2. Biology Gradually disappearing; imperfectly or only slightly developed. . The result of the e-vaulting development effort over the past 10 years is a comprehensive solution that addresses the challenges described above. This solution fits between mirroring and long-term archiving on the data lifecycle spectrum. As shown in the data life cycle table, data protection, whether for a period of hours or up to one year, is the target market for the e-vaulting solution set. Mirror, Mirror on the Wall ... Some confuse mirroring with e-vaulting. This is understandable given that marketing literature and Web sites promoting these technologies claim similar features and benefits. Perhaps the most concise clarification is provided in "Mirroring Isn't Backup" by Rick Cook: Although data mirroring, especially remote mirroring, is an important tool in business continuity and disaster recovery, it is not a substitute for conventional backups. It offers no protection against several important causes of data loss, such as accidental deletion or virus infection. Anything that happens to the primary data happens to the secondary [mirrored] data. Mirroring technology provides different features and benefits for the user compared to e-vaulting. As shown in the accompanying table, mirroring solutions are complementary to, and not mutually exclusive Adj. 1. mutually exclusive - unable to be both true at the same time contradictory incompatible - not compatible; "incompatible personalities"; "incompatible colors" of, e-vaulting solutions. It is common to see companies using a local mirroring solution to provide real-time data Real-time data denotes information that is delivered immediately after collection. There is no delay in the timeliness of the information provided. Some uses of this term confuse it with the term dynamic data. protection for certain mission-critical applications that cannot afford more than a few minutes of downtime. And, that same company will use an e-vaulting solution to back up the mirrored copy of the data to a remote location for disaster recovery purposes. In addition, that company also may use evaulting as the principle backup method for protecting the data on the balance of its application servers. Business Case Assumptions Depending on the soft cost-savings that an organization can attribute to an e-vaulting solution, it is typical for a company to save significant dollars using e-vaulting while improving the level of service for the data protection function. The return on investment (ROI (Return On Investment) The monetary benefits derived from having spent money on developing or revising a system. In the IT world, there are more ways to compute ROI than Carter has liver pills (and for those of you who never heard of that expression, it means a lot). ) is illustrated in the accompanying table. The retention schedule for backup data is as follows: * Daily backups are saved for one cycle (one week--7 days). * Weekly backups are saved for one cycle (one month--5 weeks). * Monthly backups are saved for one cycle (one year--12 months). * Total number of backup sets/generations to support this schedule is 24. * Tapes used are DLT (Digital Linear Tape) A magnetic tape technology originally developed by Digital for its VAX line. The technology was later sold to Quantum, which makes it available to other manufacturers. DLT uses half-inch, single-hub cartridges similar to IBM's 3480/3490/3590 line. 4000 type with a 20GB capacity and an estimated drive cost of $3,000; individual tapes cost $50 apiece a·piece adv. To or for each one; each: There is enough bread for everyone to have two slices apiece. [Middle English a pece : a, a; see a . Backup administration involves * offsite administration * tape handling and labeling * backup monitoring and troubleshooting Troubleshooting is a form of problem solving. It is the systematic search for the source of a problem so that it can be solved. Troubleshooting is often a process of elimination - eliminating potential causes of a problem. * backup set up and scheduling * installation of software upgrades * labor rates estimated conservatively at $25 per hour including benefits In addition: * Offsite costs are calculated by having 20 of the 24 tapes offsite at any point in time. Daily pickup and delivery is estimated by taking 20 business days and multiplying by a per-visit charge of $15. (Storage charges are calculated based on a charge of $1 per tape per month.) * Emergency services emergency services Emergency care '…services …necessary to prevent death or serious impairment of health and, because of the danger to life or health, require the use of the most accessible hospital available and equipped to furnish those services' are required to bring a tape back from offsite to restore an entire disk drive or directory to the server. (It is estimated here that one emergency delivery is required per month.) * Restore time is required to carry out 10 "ad-hoc" restores per month to satisfy user requirements; eight hours are allocated (one day) per month to support this function. * Tape drives are continually being improved to enable the latest technology for speed and reliability, but they are mechanical devices and will fail over time. It is estimated that the DLT4000 will be replaced with new technology every two years. * Software license and maintenance costs are a constant per year for tape backup software. E-Vaulting Architecture There are normally three main components of an e-vaulting solution: agents, administration, and backend server. The software agent is installed on each system to be protected. The administration utility connects to the agents and allows for the scheduling of backups, defining backup parameters, monitoring backup activity and, when required, handling restore requests. The agents "wake up" at the scheduled backup time and transmit the "delta" changes to the remote server. The remote server accepts the changes, manages the retention life cycle, and serves up the data when a restore request is received. When getting started, the next step after the e-vaulting agent software is loaded and the backup parameters are defined is to perform a full backup. The client can choose one of the following backup methods to initialize To start anew, which typically involves clearing all or some part of memory or disk. or "seed" the e-vaulting process: * Do the first backup over the network (depending upon network speed and impact on production activity). * Send a full backup to a local tape device, which is then transported to the remote e-vaulting server for initialization in·i·tial·ize tr.v. in·i·tial·ized, in·i·tial·iz·ing, in·i·tial·iz·es Computer Science 1. To set (a starting value of a variable). 2. To prepare (a computer or a printer) for use; boot. 3. . * Run a full backup across the local network to a portable e-vaulting server, which is then transported to the remote site for initialization. Once the remote e-vaulting server is initialized, a full backup is normally never required again. Notifications are sent via e-mail--depending on agent settings--upon successful completion and/or exceptions. The process is completely automated from that point forward and virtually eliminates human intervention. The recovery procedure is easy to follow. A wizard within the administration utility guides the user through the retrieval process, prompting for date of backup, file/directory name, and other pertinent information. If the recovery is for a relatively small volume of data, it is generally performed immediately over the network connection. A complete recovery, on the other hand, is typically done from either tape or a portable e-vaulting server. In a complete recovery scenario, the customer restore request is handled at the remote e-vaulting server by competent staff who will initiate an export to tape or to a portable e-vaulting server. Either way, the restored data is delivered to the customer's site and recovered to the original or an alternate server. The portable server approach is much faster than using tape for larger volumes of data (greater than 40GB). The e-vaulting data flow illustrations on pages 29 and 30 describe the detailed flow of information from the client's server across the network and into the recovery-ready state at the remote e-vaulting server system. The delta processing technique results in minimal data travelling across the network to effectively recreate a full backup of the protected data. The empirical evidence from millions of backups performed since the mid-1990s is that 1 to 2 percent of the total uncompressed data is transmitted across the communication line each night. Thus, if the client's server has 100GB of storage designated for backup each night, then the amount of data transferred is typically 2GB or less. If that data were transmitted over a network with available bandwidth of 1.5MB/sec (offered by a T1--or DSL-type connection) then the backup window would be less than two hours on a typical night. The fact that most organizations have implemented highspeed Internet connections to support e-commerce and Web hosting Making a Web site available on the Internet. Many ISPs host a few personal Web pages for an individual at no additional cost above the monthly service fee, but the address is subordinate to the ISP; for example, www.friendlyisp.com/pat_smith. applications has been a boon Boon A general term that refers to a benefit or improvement for investors. This can include such things as increased dividends, a stock market rally and stock buybacks. Notes: for the e-vaulting business. Most of these Internet connections are under-utilized after prime office hours office hours, n.pl See business hours. and are usually fully available to carry backup data during the 6:00 p.m.--6:00 a.m. window. Companies that have implemented e-vaulting solutions experience the following benefits: * Shorter backup times--E-vaulting typically reduces the backup window from hours to minutes. * Central control--Through the administration utility, hundreds of remote systems can be managed from one or more locations. Log files can be reviewed, users receive exception notifications, and spot checks can be performed on any system to verify the backup process is operating with integrity. * Data is immediately offsite--One of the most important things learned from the 9/11 disaster is that data has to be remote from the primary site to ensure full protection. With e-vaulting, the data is offsite as soon as the backup is completed. * Security--E-vaulting solutions encrypt data at the agent before departure and the data remains encrypted en·crypt tr.v. en·crypt·ed, en·crypt·ing, en·crypts 1. To put into code or cipher. 2. Computer Science throughout its life cycle at the remote server. * 24x7 access to data for recovery--Most recoveries tend to be of the small ad-hoc variety. E-vaulting solutions provide instant access to data and that significantly improves the level of service an IT group can deliver to its users. * Labor productivity--The IT staff managing the traditional backup process now have more time to work on other tasks to support the business. * System independent format--IT departments avoid the cost and hassles of continuously upgrading tape equipment. They also can eliminate worries about legacy data when migrating from one operating system to another, making it easier to satisfy corporate retention policies. At the Core This article: * Discusses the need for effective data protection methods * Examines electronic vaulting technology and uses Key Questions to Ask Potential Vendors * What are the vendor's experience/qualifications specific to the industry? * How long it offered sound is the company? * How many clients does it have? * How many remote backup locations are there? * How much data can it handle? * What platforms does it support? * What are the backup procedures in place for the remove storage facility? * Can it service all your locations? In addition, customer should * make sure the vendor can handle all the data in the customer's backup window * ensure that a restore can be performed either online or via removable media * verify that the solution backs up all system states, security, and other operating system attributes for the files * make sure the vendor can send the older backup data to removable media for long-term retention * ensure that the solution allows for back up of open files and databases * verify that the solution can work over their existing network and/or internet connection and does require extensive network re-design * ensure that both backups and restores can be performed from a central management console A terminal or workstation used to monitor and control a network. See Microsoft Management Console. The Next Steps * Set up a simulation or test using an e-vaulting service. Or pick the distributed location causing the most trouble and give that site to the vendor on a paid trial basis. * Extrapolate extrapolate - extrapolation corporate communication requirements from the test. * Get a qoute for the company's full requirements and create the business case to move forward with a full solution to backup problems. * Assign a project manager in the company to organize the rollout of the solution on all distributed servers. Perform semi-annual or annual tests on a subset of company servers. Buyer Checklist What to Look for in an e-Vaulting Solution * automated and unattended backups * centralized cen·tral·ize v. cen·tral·ized, cen·tral·iz·ing, cen·tral·iz·es v.tr. 1. To draw into or toward a center; consolidate. 2. management of the backup and restore process from one or more locations * open file and open database backups * backups that take less time than a traditional full-tape backup * bundled software (1) Applications that are included with new hardware. For example, a new PC often comes with several applications, many of which may be light versions or full versions with no limit on usage. Others may be only 60- or 90-day trial packages. See bundle. licensing fees that do not incur additional fees * secure, dedicated computer facility that will house data * complete control of files and directories to be backed up, with file filtering capabilities * ad-hoc file restores around the clock via end-user or central administrator control * no special hardware requirements or changes to the network * ability to restore data either over the network or via the media of the customer's choice * ability to back up more than one operating system platform with the solution and an option to restore data from one platform to another * user-managed data retention schedule * several levels of data encryption data encryption, the process of scrambling stored or transmitted information so that it is unintelligible until it is unscrambled by the intended recipient. Historically, data encryption has been used primarily to protect diplomatic and military secrets from foreign * automatic restart To resume computer operation after a planned or unplanned termination. See boot, warm boot and checkpoint/restart. and resume capabilities for handling a variety of network conditions * exceptions and problems encountered reported to an administrator in electronic form automatically * detailed usage reporting available * ability to archive data for long-term retention without requiring a local tape drive Cost Considerations A detailed cost analysis will often reveal hidden expenses. When adding up costs, consider the following: * a declaration fee for full-system restores * ad-hoc restore fees * set-up and install costs * communication costs (if using an existing Internet connection, this should be free) * archive tape fees * usage fees (based on time, quantity, frequency, and other parameters) * cancellation fees for contract termination Defense procurement: the cessation or cancellation, in whole or in part, of work under a prime contract or a subcontract thereunder for the convenience of, or at the option of, the government, or due to failure of the contractor to perform in accordance with the terms of the contract (default). Data Protection Lingo Compression Technology: Virtually all backup software See backup program. (tool, software) backup software - Software for doing a backup, often included as part of the operating system. Backup software should provide ways to specify what files get backed up and to where. uses compression technology to reduce storage requirements. This technology can be software- or hardware-based and reduces the amount of data to be transmitted or stored by an average ratio of 2:1. Compression ratios compression ratio Degree to which the fuel mixture in an internal-combustion engine is compressed before ignition. It is defined as the volume of the combustion chamber with the piston farthest out divided by the volume with the piston in the full-compression position ( depend on the type of file being compressed; therefore, a company's actual experience could vary. On average, text files, word processing word processing, use of a computer program or a dedicated hardware and software package to write, edit, format, and print a document. Text is most commonly entered using a keyboard similar to a typewriter's, although handwritten input (see pen-based computer) and documents, and most databases experience the highest compression ratios. Full Backup Method: A full backup copies all files and directories that have been selected for backup. A user can select an entire disk volume or choose specific files and directories to back up. On the downside On the Downside is an EP by the San Diego, California band Counterfit, released by Alphabet Records in 2000. It was the band's first EP, recorded shortly after the members had relocated to San Diego from Fairfield County, Connecticut. , a full backup takes the longest time, and many files may be backed up unnecessarily because they may not have changed since the last backup. Incremental Backup See backup types. (operating system) incremental backup - A kind of backup that copies all files which have changed since the date of the previous backup. The first backup of a file system should include all files - a "full backup". Call this level 0. Method: An incremental backup copies only those files that have been modified or changed since the last backup. The obvious advantage with this method is that files that have not changed do not require back up, thus reducing the time required to do the backup. Using this option is just as simple as a full backup; however, there is a definite downside Downside The dollar amount by which the market or a stock has the potential to fall. Notes: You might hear someone say that the downside on stock XYZ is $10. What that means is that the stock could fall by this amount if things got bad. . The restoration process requires you to first restore from the first full backup and then to apply all incremental backups up to the required date of restore. This can be a tedious process. Typically, an incremental backup method requires a weekly full backup to help reduce the time to restore and reduce the potential for restore errors. Differential Backup See backup types. (operating system) differential backup - A kind of backup that copies all files that have changed since the last full backup. Each differential backup will include all files in previous differential backups since the full backup so to restore a version of Method: A differential backup copies all those files that have been modified or changed since the last full backup. This differs from the incremental that just copies the files changed since the last incremental backup. While it takes less time to back up than the full backup method, it requires more time than the incremental. This is because it must back up all the files that have changed since the last full backup and not just since the last incremental. On the positive side, the restoration process is simpler and quicker than the incremental. With this method, one needs only to apply the last full backup and the latest differential. Delta Backup See backup types. Method: In general, it is not practical to transfer full, differential, or incremental backups across a network. Network bandwidth is normally a bottleneck A lessening of throughput. It often refers to networks that are overloaded, which is caused by the inability of the hardware and transmission lines to support the traffic. It can also refer to a mismatch inside the computer where slower-speed peripheral buses and devices prevent the CPU that renders traditional backup methods obsolete. Most e-vaulting systems are designed to minimize the amount of data transferred across the network by using an enhancement on the incremental method called delta processing. Delta processing makes it possible to transfer just the changes that occurred since the previous backup. This procedure makes it practical to back up data to a remote locale (programming) locale - A geopolitical place or area, especially in the context of configuring an operating system or application program with its character sets, date and time formats, currency formats etc. Locales are significant for internationalisation and localisation. over relatively low-speed communication lines. Delta processing uses an algorithm that compares the image of each data file with the image of the same file from the last backup. This comparison generates a list of blocks within the data file that have changed. These blocks are compressed and encrypted and then transmitted across the network. To illustrate the difference between the methods, consider an example of a 100 MB data file containing customer information. If changes made to the customer data over the course of a day total 1MB, then approximately 1MB of data would be transmitted to the remote site to complete the backup. With the incremental or differential methods, the entire file of 100MB would be backed up. The "effective compression ratio" obtained by the delta method In statistics, the delta method is a method for deriving an approximate probability distribution for a function of an asymptotically normal statistical estimator from knowledge of the limiting variance of that estimator. can range from 50:1 to 1000:1. These large ratios make it feasible to back up large quantities of data over common network connections. Backup Timing: All backup methods require some form of "time stamp See timestamp. "to enable a proper recovery. This is important in database scenarios where updates may occur 24 hours a day, seven days a week. Some companies who run this type of operation actually take the system down for a small period of time to perform a backup. This ensures a clean copy of the database with no partial transactions. If the system cannot be taken down for a period of time then traditional backup methods do not provide a stable backup solution. For example, if a backup is taken off an open database then there is no guarantee which transactions made it into the backup and which did not. In such cases, special backup functions have been written into the database system that allow for backups to take place while updates are held in a queue. This backup is usually written to a temporary file, which is an image of the database. This image normally would be written to tape but also could be sent electronically to a remote location. Also, online databases may create journal files (lists of transactions) continuously, which allow for restoration using the last full backup plus all transactions up to the end of the journal file. There are other solutions to this problem that function at the operating system level. These solutions intercept intercept in mathematical terms the points at which a curve cuts the two axes of a graph. any write updates to open files during a backup and they store the previous data in a cache area. This allows the backup to read the data as of the point in time the backup started. Once the backup is finished the cache is released. The big benefit of these solutions is that mission-critical applications continue running 24 hours a day, seven days a week, and complete backups are obtained with integrity. Be Prepared In order to bid, vendors will want to know * recovery goals and backup window * detailed information on server hardware and operating system (Compile a list of servers to be backed up, including brand name, model number, nomenclature nomenclature /no·men·cla·ture/ (no´men-kla?cher) a classified system of names, as of anatomical structures, organisms, etc. binomial nomenclature , and estimated volume of data storage.) * existing communication facilities connecting locations--Internet, frame relay A high-speed packet switching protocol used in wide area networks (WANs). Providing a granular service of up to DS3 speed (45 Mbps), it has become popular for LAN to LAN connections across remote distances, and services are offered by most major carriers. , ATM, etc. * firewall management (How are special ports on a firewall opened to allow backup data to travel across the network?) * corporate retention schedule for computer data * how many people will be administering the backup/restore process and where they are located * location of alternate processing facility to be used in the event of a disaster
Mirroring Is Not Backup
Consideration Synchronous Asynchronous E-Vaulting
Protection Against Corruption No No Yes
Protection Against Deletion No No Yes
Cost of Solution High Medium-High Low
Distance Limitations Yes No No
Link Speed Required High Medium Low
Performance Impact High Medium Low
Failover Automatic Manual Requires
Restore
Multi-platform No No Yes
Return on Investment with E-Vaulting Solution
Traditional Tape
Current Cost Item Soft Cost Hard Cost E-Vaulting
Total tape inventory--30
tapes @ $50 per tape $1,500 $0
Backup administration--8
hr/wk/server @ $25 per hour $10,000 $0
Offsite storage--20 tapes
offsite x 12 months @ $320 per $3,840 $0
Estimated annual emergency
retrieval services @ $50
per month $600 $0
Restore activity--1 day per
month per server $2,400 $240
Replacement of tape drives
every 2 years $1,500 $0
Current backup software, updates
and maintenance per server $1,000 $0
Daily backup with 24 generations
of data for 20GB $8,400
(cost per GB/month = ($35x20GB)
x 12 months)
Total cost (soft and hard) $12,400 $8,440 $8,640
Grand Total (annual) $20,840 $8,640
References Brooks, Jeremy. "Protecting Your Data With an IT Continuity Plan." Itsecurity.com. Available at www.itsecurity.com/papers/cbl12.htm (accessed 25 November 2002). Cook, Rick. "Mirroring Isn't Backup." Available at http://searchstorage.techtarget.com/tip/1,289483,sid5_gci805340,00.html (accessed 25 November 2002). Moore, Fred. "Storage--Infinite Disruption" Horizon Information Strategies. 2001. Whittington, Bill. "Standard IT Processes: Getting to Yes at Intel." Storage Magazine. July 2002. Ray Ganong is Chief Technology Officer at EVault, a data backup and recovery provider. Ganong has worked in the data storage industry since 1989. He may be reached at Rayganong@cogeco.ca. |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion