Disaster recovery for the masses: the role of OS-level server virtualization in disaster recovery.Disaster recovery is a hot topic again. Events like hurricane Katrina Disaster recovery planning is quite a complex undertaking. It includes considerations such evaluating each application and data source to determine both a Recovery Point Objective (RPO RPO Recruitment Process Outsourcing RPO Recovery Point Objective (disaster recovery) RPO Royal Philharmonic Orchestra RPO Rochester Philharmonic Orchestra RPO Representative Poetry Online RPO Railway Post Office ) and a Recovery Time Objective (RTO (Recovery Time Objective) The amount of time a computer system or application can stop functioning before it is considered intolerable to the enterprise. It can be computed to be from seconds to days, depending on how critical the application is to the organization. ). RPO is the amount of data that can be lost before it affects the organization, and RTO is the time it takes to recover and restart To resume computer operation after a planned or unplanned termination. See boot, warm boot and checkpoint/restart. the server and application. Together, these provide guidelines guidelines, n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks. for how current data must be and how fast it must be recovered. In actuality ac·tu·al·i·ty n. pl. ac·tu·al·i·ties 1. The state or fact of being actual; reality. See Synonyms at existence. 2. Actual conditions or facts. Often used in the plural. , most organizations discover that only a very few (if any) applications require an always available, 0 RPO/RTO rating. According to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. a Gartner survey in 2005, only 18 percent of businesses implement transaction level replication to activate 100 percent uptime on any applications. Clearly these mission critical, always available applications are only a subset of the total applications in these businesses and the overall total percentage of applications treated in this manner would be much smaller. Once the data has been classified, it becomes even more difficult to deal with as different solutions and extremely high cost software and infrastructure are involved in maintaining and managing all of the components. Faced with these complexities and costs, many businesses decide that comprehensive disaster recovery is too daunting daunt tr.v. daunt·ed, daunt·ing, daunts To abate the courage of; discourage. See Synonyms at dismay. [Middle English daunten, from Old French danter, from Latin , and instead opt for minimal disaster recovery planning and implementation that covers only the most critical of applications and data. Protecting the Business--Without Going Out Of Business Traditionally there are three approaches to disaster recovery: * Always Available configurations (RPO/RTO 0) which include expensive duplicate servers (often in different data centers) and replicated content. Traffic latency (1) The time between initiating a request in the computer and receiving the answer. Data latency may refer to the time between a query and the results arriving at the screen or the time between initiating a transaction that modifies one or more databases and its completion. and slowdown is often an issue with replication over distances, so many organizations that make this heavy investment opt for a Storage Area Network (SAN) connecting to a Wide Area Network (WAN) connection to serve this traffic, further increasing the costs. * Fast Recovery configurations (RPO/RTO 1-12 hours) which include standby hardware that has data replicated over an acceptable RPO interval and may be activated in the required RTO. * Backup Recovery configurations (RPO/RTO > 12 hours) are simply recovering a server from the last available backup copy A disk, tape or other machine readable copy of a data or program file. Making backup copies is a discipline most computer users learn the hard way-- after months of work is lost. See backup and LAN free backup. . The backup recovery option is by far the slowest for recovery and least efficient. Common disaster recovery solutions today tend to provide for the always available, full disaster recovery scenario. Most organizations don't require this type of availability, and if they do, it is on a very limited amount of servers or applications. That said, most IT organizations recognize that some level of disaster recovery planning and implementation is a requirement, even if their business doesn't happen to be located in Hurricane Alley. According to a Gartner study regarding unplanned downtime The time during which a computer is not functioning due to hardware, operating system or application program failure. , true disasters only make up a small component of environmental failures, which are in themselves a small subset of all unplanned downtime. All businesses are constantly at risk of outages and disaster recovery isn't just important for a catastrophic data center loss. Each server and application should be protected. The problem then becomes the options available for disaster recovery planning. Available options tend to be expensive and exceed the RPO and RTO goals of the organization for a given application or server. The search for a cost effective solution with acceptable recovery times becomes the focus for most organizations. Disaster Recovery with OS-level Virtualization An umbrella term for enhancing a computer's ability to do work. Following are the ways virtualization is used. Hardware Virtualization Partitioning the computer's memory into separate and isolated "virtual machines" simulates multiple machines within one physical computer. Operating system-level server virtualization (1) Running applications in separate, isolated partitions within a single server. The "virtual machine" method can run different operating systems simultaneously, whereas the "OS virtualization" method runs applications for only one operating system (see virtual machine and OS solutions (such as SWsoft's Virtuozzo) hold an answer. These solutions utilize a different architecture and offer lower overhead than typical virtualization technologies See VT. See also virtualization. , creating isolated and secure virtual servers on a standard Linux or Windows operating system operating system (OS) Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs. on a single physical server. For this reason, they are an ideal component of disaster recovery solutions, providing both the lowest cost and the highest density virtualized servers available. Many supporting technologies required for a disaster recovery solution work well in conjunction with OS-level virtualization. The virtualization software manages the virtual infrastructure and provides many flexible options and capabilities that support a disaster recovery environment and address the RTO component of the solution or plan. The next step is managing the data from the original server. There are many available solutions and capabilities that reflect different levels of RPOs. What are the basic components to a disaster recovery solution? * The originating server/application/data. * The technology to replicate or backup the application and data. * The ability to recover or failover the server. OS-level Virtualization and Always Available Configurations: OS-level virtualization can be a crucial part of an always available disaster recovery solution. The originating server, housed on a SAN, may use the SAN to replicate to a virtualized server. Once a server fails, the SAN will re-reroute all traffic to the still available virtualized server. While "always available" remains a subset of disaster recovery solutions only for mission critical applications, this approach can bring the recovery server cost component considerably lower. OS-level Virtualization and Fast Recovery Configurations: The fast recovery solutions are gaining the most attention now as companies determine that they do have the ability to take some data and time loss in their systems. OS-level server virtualization is really ideal for fast recovery solutions because the virtualized servers reside on top of a running operating system. The footprint for the virtualized server is small, so its recovery time is the amount of time to load the application and data into memory, with no time at all required for starting the OS. SMS (1) (Storage Management System) Software used to routinely back up and archive files. See HSM. (2) (Systems Management Server) Systems management software from Microsoft that runs on Windows NT Server. Central, a premium mobile solutions provider, has actually implemented a fast recovery solution based on this model. SMS Central has several locations with disparate servers. They made low cost replicated virtualized servers of each, with five virtualized servers running on a single physical server. One of the original application servers was a MySql server that utilizes the virtualization application's replication capabilities to maintain the data integrity over the prescribed RPO. In the case of an event, some of their more critical applications are kept operational to keep the data loaded in memory, while other less critical ones are not started, or only started when necessary. This enables simple management and the maintained level of adequate system resources (1) In a computer system, system resources are the components that provide its inherent capabilities and contribute to its overall performance. System memory, cache memory, hard disk space, IRQs and DMA channels are examples. on the recovery server. This disaster planning disaster planning - disaster recovery scenario also netted the company an 85 percent reduction in new server infrastructure costs, as an added benefit of virtualization. In a fast recovery server configuration, the operating system runs continuously with minimal cost. Since many virtualized servers can reside on the single server, redundant servers become extremely cost effective. If the virtualized servers are not operational, the amount of standby servers can far exceed the server resources that would be required to support an operating server. Replication technologies are becoming more accessible as applications themselves are trying to meet organizational requirements for disaster recovery. Databases are a typical example. Databases provide their own replication capabilities. Data replication intervals can be set according to the appropriate RPO objective and replicated to the virtualized instance of the server. Again, the virtualized server doesn't actually need to be running and using computing computing - computer resources, enabling many applications and data sets to be maintained on a single physical server. As for the components of the complete solution, the originating server is on the same network as the virtualized server. Either SAN or application replication is deployed to maintain the data integrity within the desired RPO, and finally the virtualized server can be activated very quickly to make the service and application available. The best OS-level server virtualization solutions also provide very extensive and complete network configuration capabilities that help with the complexities involved in creating and duplicating servers on a network. OS-level Virtualization and Backup Recovery Configurations: Most organizations don't even consider backup and recovery as a disaster recovery solution, but for applications and data of extremely low importance, backup and recovery may be the least expensive approach to guaranteeing the final range of low criticality servers. Basic virtualization technology ensures that a server can be configured con·fig·ure tr.v. con·fig·ured, con·fig·ur·ing, con·fig·ures To design, arrange, set up, or shape with a view to specific applications or uses: and deployed in seconds anywhere, on any physical server. A backup can then be restored from any media into the virtualized server. More to Think About.... A server may be "down" if it is unavailable due to high traffic levels or any number of complications. When evaluating server virtualization in the context of disaster recovery, IT departments should look for capabilities that make a regular application server more flexible and more able to deal with an impending im·pend intr.v. im·pend·ed, im·pend·ing, im·pends 1. To be about to occur: Her retirement is impending. 2. outage out·age n. 1. A quantity or portion of something lacking after delivery or storage. 2. A temporary suspension of operation, especially of electric power. or disaster. An example of this is zero-downtime migration. Virtualization technologies remove the server and application from the complexities of the underlying hardware. Advanced solutions enable servers to be moved from any two networked servers (no SAN required) with zero-downtime migration. It is important that any server, any application can be moved without interruption to users or service. The same goes for flexible resource management. Virtualized applications should not be hindered in any way in this area. Resources must be allowed to be added or reduced in real-time without service interruption, and overloaded o·ver·load tr.v. o·ver·load·ed, o·ver·load·ing, o·ver·loads To load too heavily. n. An excessive load. Adj. 1. applications must quickly and easily be provided with more resources. The events of 2005 forced a renewed focus on disaster planning. At the same time, the general mandate continues to be "get more with less" out of overburdened o·ver·bur·den tr.v. o·ver·bur·dened, o·ver·bur·den·ing, o·ver·bur·dens 1. To burden with too much weight; overload. 2. To subject to an excessive burden or strain; overtax. n. 1. IT resources. OS-level server virtualization plays an important, and financially attractive, role in the process of preparing for--and functioning through--disaster scenarios. Carla Safigan, Virtuozzo product manager, SWsoft (Herndon, VA). www.swsoft.com |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion