Will continuous data protection make tape-based backups obsolete?Over the last several years an emerging data protection technology has come into the market, called continuous data protection or CDP CDP (cytidine diphosphate): see cytosine. (1) (Certificate in Data Processing) An earlier award for the successful completion of an examination in hardware, software, systems analysis, programming, management and accounting, . The term CDP has gotten heavy usage while the adoption of CDP has been relatively modest. CDP has the potential of freeing customers from the grip of traditional tape backup Using magnetic tape for storing duplicate copies of hard disk files. Users can add an internal or external tape drive to their desktop computers for backup purposes, and files are typically copied to the tapes using a backup utility that updates on a periodic schedule. technologies, but is CDP really the next wave of backup and recovery? The promise of CDP is that it can protect any data, anywhere in the data center and recover a consistent version of the data as it existed at a specific-point-in-time. This article explores CDP technology and offers guidance when choosing a data protection solution. Today's businesses are faced with an ever-increasing amount of data, threatening to undermine their existing storage management solutions. Creating a copy of yesterday's changed data and running it to tape is no longer adequate to support the real-time requirements of today's business. Critical data changes occur throughout the day, and to protect this data, customers are frequently turning to technologies such as CDP to offer improved recovery times. How Did CDP Get Started? Continuous data protection came into use about three years ago and since then, many different vendors soon touted their solution as CDP. This caused a lot of confusion in the industry, as each vendor claimed to have the only "true" CDP solution. To mitigate this, in January 2005 the CDP special interest group of the SNIA (Storage Networking Industry Association, San Francisco, CA, www.snia.org) An organization devoted to the advancement of mission critical storage systems. Founded in 1997, its goal is to determine the standards that must be developed to allow hosts and storage systems to interact via Data Management Forum was created. The mission of the CDP-SIG is to be the leading authority and resource on CDP solutions and to facilitate and promote CDP interoperability. A CDP product is one that will continuously monitor an object for changes and will preserve copies of all prior versions of the object. The user will have the ability to view and access these prior versions as required. The time to perform recovery changes from hours or days to seconds or minutes. The backup window is no longer a problem because there is no longer the concept of a backup window. There are two approaches to CDP object protection: file-system centric or block-based. Historically, the CDP file system approach started on the Windows environment (1) (upper case "W") Refers to computers running under a Microsoft Windows operating system. (2) (lower case "w") Also called a "windowing environment," it refers to any software that provides multiple windows on screen such as Windows, Mac, Motif and X Window. , where most applications utilized files to hold their data. Block based CDP started in the UNIX UNIX Operating system for digital computers, developed by Ken Thompson of Bell Laboratories in 1969. It was initially designed for a single user (the name was a pun on the earlier operating system Multics). (and now Linux) community, where database applications traditionally bypassed the file system and operated directly at the disk/block level. File-system CDP products are typically found in Microsoft Windows See Windows. (operating system) Microsoft Windows - Microsoft's proprietary window system and user interface software released in 1985 to run on top of MS-DOS. Widely criticised for being too slow (hence "Windoze", "Microsloth Windows") on the machines available then. environments, and usually offer a file-system or explorer interface for their configuration and recovery operations Operations conducted to search for, locate, identify, rescue, and return personnel, sensitive equipment, or items critical to national security. . Recovery is typically end-user driven and is either provided through an extension to the file system or as a shared file system based on NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS. NFS - Network File System or CIFS (Common Internet File System) The file sharing protocol used in Windows. It evolved out of the SMB (Server Message Block) protocol in DOS, which is why the terms CIFS/SMB and SMB/CIFS are sometimes seen. The word "Internet" in the CIFS name has little relevance. that can be mounted to a recovery server. Block-based CDP products started in intelligent arrays such as those from EMC (1) (EMC Corporation, Hopkinton, MA, www.emc.com) The leading supplier of storage products for midrange computers and mainframes. Founded in 1979 by Richard J. Egan and Roger Marino, EMC has developed advanced storage and retrieval technologies for the world's largest companies. and HDS (Hitachi Data Systems, Santa Clara, CA, www.hds.com) A leading provider of high-end storage hardware, software and services. Part of the Information Systems & Telecommunications Division of Hitachi Ltd. and soon moved to host-based or appliance-based platforms. These CDP products operate as a layered feature of the underlying storage infrastructure, and usually operate independent of the host's file system and volume manager. Recovery is typically storage or database administrator driven, is provided through capabilities outside of the platform being protected, and is managed by the CDP implementation. What About Recovery? There are two general principles that govern all recovery: the recovery point objective (RPO RPO Recruitment Process Outsourcing RPO Recovery Point Objective (disaster recovery) RPO Royal Philharmonic Orchestra RPO Rochester Philharmonic Orchestra RPO Representative Poetry Online RPO Railway Post Office ) and the recovery time objective (RTO (Recovery Time Objective) The amount of time a computer system or application can stop functioning before it is considered intolerable to the enterprise. It can be computed to be from seconds to days, depending on how critical the application is to the organization. ). The RPO defines how much data you are willing to lose when you recover data. For example, if you backup twice a day your RPO would be 12 hours, which is the maximum amount of data loss that could occur between backup images. The RTO defines how long it will take to recover your business processes from a data failure. This includes not only the data recovery, but restarting the servers or applications that depend on that data. These recovery considerations must also be applied to local and remote recovery strategies. A true CDP product protects every data change, and the RPO approaches zero. On the other hand, with the vast amount of data being recoverable, how you choose the recovery point effects your recovery time. Unless a CDP product includes some application level awareness, all that the administrator knows about a CDP image is the date and time it was taken. It may take many recovery iterations to narrow down to the correct image, and some CDP vendors provide additional tools to help. One way to determine if CDP is right for you is ask yourself a set of qualifying questions. For example, are you worried about meeting the business SLAs established by the CIO CIO: see American Federation of Labor and Congress of Industrial Organizations. (Chief Information Officer) The executive officer in charge of information processing in an organization. ? Perhaps you are you being asked to measure business impact of downtime or are looking to modify or improve your backup site A backup site is a location where a business can easily relocate following a disaster, such as fire, flood, or terrorist threat. This is an integral part of the disaster recovery plan of a business. and strategy? Do you have rapidly changing data that is critical to business operations, and are you worried about shrinking backup windows to protect that data? If you answered yes to one or more of these questions, you should be seriously investigating CDP technology. What About Tape Backup or Archiving? The need for an off-site archive is greater these days then ever. New mandates in North America and Europe are requiring companies to retain off-site copies of their business data in a form that can be quickly retrieved. While traditional tape backups can be used to meet this mandate, many companies are now investigating newer technologies such as CDP, content addressable Reachable. When something is addressable, it can be identified and manipulated independently of its surroundings. For example, screen pixels and RAM memory are addressable. Each of the screen's picture elements can be individually turned on and off, and each of the memory's bytes can be storage and virtual tape libraries to improve their RTO. They may also be replicating their data to an off-site DR site. Companies that have invested heavily in backup technology don't need to throw away their investment when they implement CDP. A CDP product compliments existing backup products by providing an additional on-line, near instantaneous recovery of recently changed information. A CDP product will present a copy of data from any point in time. An important use for this copy is as the source for a traditional backup system. For example, a simple process would present a copy of the data to the backup server which then backs it up, and then throws away the CDP copy. If the backup were to fail, it would be a simple matter to recreate the image and restart the backup. Additionally you can use CDP technology to provide a richer backup environment, for example auditing the data before the backup is performed. What To Look for in a CDP Product? There are several features a mature CDP product brings to the market. These include: * Support of heterogeneous storage and server environments. Today's customers are refusing to be locked into a single vendor for their storage and server solution. Users should select a CDP product that doesn't restrict them to only a subset of their possible storage and server environments. * Awareness of applications and their environments. Application recovery is becoming more complex and time consuming, users should chose a product that integrates application specifics into the CDP recovery process. * Non-invasive to the application or server that is being protected. A CDP product should attempt to minimize any impact to the applications I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output. I/O - Input/Output throughput or CPU CPU in full central processing unit Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit. load. This is best done by keeping the CDP footprint on the application server to a minimum, and moving any 'heavy-lifting' to an external server or appliance. * Out of band solution. An out of band CDP solution does not impede the flow of data from the server to storage. Hence a failure of the CDP product will not affect your applications access to storage. * Built on a scalable, reliable platform. If the CDP product is hosted on an appliance platform, the user should have the ability to add additional appliances that can scale their CDP capacity as the data protection needs grow. * Supports a federated Connected and treated as one. See federated database and federated directories. application environment. Many of today's complex applications (such as SAP R3) utilize servers and storage that span multiple hosts. Customers should choose a CDP product that supports these systems, as it provides the user with a consistent, federated image for recovery. * Supports business policies and SLAs. Companies assign different values to their different applications. A CDP product that is flexible in its support of differing protection and recovery policies can provide a better overall solution. * Can be extended by use of APIs/CLIs. Look for a CDP product that has functionality that can be easily extended by the customer to meet business needs. * Tightly integrated with business continuity technologies. A CPD CPD citrate phosphate dextrose; see anticoagulant citrate phosphate dextrose solution, under solution. Cephalopelvic disproportion (CPD) product that supports application clusters and remote replication provides a stronger solution then a CDP product that only provides a stand-alone solution. There are many resources available on the internet that discusses CDP technology, starting with the SNIA website. Additional resources include industry conferences such as Storage Networking World Storage Networking World (commonly called SNW) is a for data storage professionals in the United States. Sponsored by Computerworld and the Storage Networking Industry Association, SNW is held twice each year. , the Network Storage Conference, and the Gartner Storage Summit. Finally, take advantage of the wealth of information on CDP on various vendor websites such as Kashya. Gary Archer is the senior manager for product marketing at Kashya, Inc. (San Jose, CA) and is their representative on the SNIA CDP-SIG. www.kashya.com www.snia.org |
|
||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion