Preparing for worst-case scenarios in the data center: lessons learned.The recent Eastern Seaboard blackout, which cost New York City New York City: see New York, city. New York City City (pop., 2000: 8,008,278), southeastern New York, at the mouth of the Hudson River. The largest city in the U.S. businesses more than $1 billion (or $36 million per hour) was yet another wakeup for a nation that has lived through a series of catastrophic events in its recent history. Operational downtime The time during which a computer is not functioning due to hardware, operating system or application program failure. is costly to businesses and unacceptable to customers. Current heightened awareness of data center susceptibilities, coupled with today's demanding business environment, has spurred IT managers to begin addressing disaster recovery needs. There are those who were prepared when the power went out, such as the e-commerce subsidiary of a major airline. Shutting down the company's e-commerce Website, which has more than 30 million unique visitors A count of how many different people access a Web site. For example, if a user leaves and comes back to the site five times during the measurement period, that person is counted as one unique visitor, but would count as five "user sessions. a day, for even a few minutes could deprive the airline of millions of dollars in revenue. To ensure continuity of business operations Business operations are those activities involved in the running of a business for the purpose of producing value for the stakeholders. Compare business processes. The outcome of business operations is the harvesting of value from assets , the company's director of operations implemented an extensive disaster recovery setup nearly two years ago. Designing the Data Center: Shatterproof shat·ter·proof adj. Resistant to shattering: shatterproof goggles. Adj. 1. shatterproof - resistant to shattering or splintering; "shatterproof automobile windows" Infrastructures The airline's disaster recovery setup consists of two data centers, each running 20Tbytes of total disk space and 200 servers. The primary site uses VERITAS Volume Replicator See port replicator. replicator - Any construct that acts to produce copies of itself; this could be a living organism, an idea (see meme), a program (see quine, worm, wabbit, fork bomb, and virus), a pattern in a cellular automaton (see life), or (speculatively) a robot or host-based software (running on about eight different hosts) to synchronously mirror its data to a secondary site ten miles away. In addition, everything in both data centers is either clustered for failover, or hot clustered, using VERITAS Cluster Manager A Cluster manager is a small service on each node of the cluster and it does active and desactive available services. Cluster Manager should create a specific list of node auth, manage and repliace it to each node. . But what happens if the power goes out? To keep operations up and running, the two sites are located on three different power grids, with the primary site feeding into two separate grids and the secondary site on yet another. As we learned in the recent widespread blackout, separate power grids may not be enough. To provide further insurance, the primary site has two diesel generators and a number of different uninterrnptible power supply (UPS) systems. The secondary site has its own UPS system, as well as a natural gas backup system Noun 1. backup system - a computer system for making backups ADP system, ADPS, automatic data processing system, computer system, computing system - a system of one or more computers and associated software with common storage . Planning for Survival: The Disaster Recovery Plan Beyond the proper system infrastructure, it's also essential to have a plan. In fact, one-fourth of IT managers in a recent independent study of more than 850 IT professionals worldwide say that the very survival of their companies would be at risk if disaster were to strike and they did not have their disaster recovery plans in place. It is little wonder then that the study--which was conducted for VERITAS Software Veritas Software Corp. was an international software company that was founded in 1983 as Tolerant Systems, renamed Veritas Software Corp. in 1989, and merged with Symantec in 2005. It was headquartered in Mountain View, California. by Dynamic Markets, Ltd.--also found that one-third of U.S. respondents have been prompted by terrorism to create a disaster recovery plan. Disaster recovery plans should always be accessible. They should also provide comprehensive coverage of system resources (1) In a computer system, system resources are the components that provide its inherent capabilities and contribute to its overall performance. System memory, cache memory, hard disk space, IRQs and DMA channels are examples. , including the entire data center, all of a company's remote offices, and both desktop and laptop computing environments. Many U.S. organizations today invest heavily in protecting data at the corporate headquarters, while leaving remote offices less protected, or possibly not protected at all. Also, research found that 34% of U.S. companies do not test their plans. It is essential that corporations begin to examine, test and update their disaster recovery plans on a regular basis. Prepared data centers have every site completely backed up and running on a fully automated and redundant system, as well as a disaster recovery plan in place. With today's backup and recovery technologies easing burdens on busy administrators and limited budgets, critical data at all sites can be protected at all times, even the next time the lights go out. www.veritas.com Bob Maness is senior director of product marketing at VERITAS Software (Mountain View, CA) |
|
||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion