Impact of new regulations on the data protection requirements for the financial industry.When U.S. regulatory agencies regulatory agency Independent government commission charged by the legislature with setting and enforcing standards for specific industries in the private sector. The concept was invented by the U.S. proposed a mandate that critical financial institutions must have fully operational recovery sites 200-300 miles away from their primary data center as an important component toward strengthening the resilience resilience (r n of the U.S. financial markets, industry leaders voiced strong opposition. Their key concerns were the limitations of existing technology and the perceived excessively high costs for implementing effective long-distance replication. Although in the final regulations the regulatory agencies did not specify absolute mileage MILEAGE. A compensation allowed by law to officers, for their trouble and expenses in travelling on public business. 2. The mileage allowed to members of congress, is eight dollars for every twenty miles of estimated distance, by the most usual roads, from his requirements, they strongly recommended a goal of "sufficient geographically dispersed dis·perse v. dis·persed, dis·pers·ing, dis·pers·es v.tr. 1. a. To drive off or scatter in different directions: The police dispersed the crowd. b. " sites, and urged each organization to take immediate action toward developing and implementing a plan to reach that goal. The Landscape Data protection and disaster recovery have been important issues for financial institutions for many years. Recent events, however, have catapulted them into the forefront, causing financial and government organizations to re-evaluate business continuity policies, plans and regulations, with a focus on strengthening "the resilience of critical U.S. financial markets." In September 2002, government agencies--the Federal Reserve, the Securities and Exchange Commission (SEC), and the Office of the Comptroller of the Currency The Office of the Comptroller of the Currency (or OCC) was established by the National Currency Act of 1863 and serves to charter, regulate, and supervise all national banks and the federal branches and agencies of foreign banks in the United States. (OCC OCC See: Options Clearing Corporation OCC See Options Clearing Corporation (OCC). )--published jointly the "Draft Interagency in·ter·a·gen·cy adj. Involving or representing two or more agencies, especially government agencies. White Paper on Sound Practices to Strengthen the Resilience of the U.S. Financial System" in direct response to the terrorist attacks of September 11, 2001 (www.sec.gov/rules/concept/34-46432.htm). The white paper outlined "preliminary conclusions with respect to the factors affecting the resilience of critical markets and activities in the U.S. financial system; sound practices to strengthen financial system resilience; and an appropriate timetable for implementing these sound practices." The agencies solicited comments on the draft white paper, and received about ninety letters from leaders of financial firms, industry associations, technology companies, etc. (www.sec.gov/rules/concept/s73202.shtml). [FIGURE 1 OMITTED] Probably the most controversial aspect of this interagency white paper was the suggestion that those financial institutions that "play significant roles in critical financial markets" must have fully operational recovery sites located at least 200-300 miles away from the primary data center site. Many organizations complained that, at best, this demand would be cost-prohibitive due to the very high cost of bandwidth, of intermediary Intermediary See: Financial intermediary intermediary See financial intermediary. storage, and of related infrastructure required by traditional long distance replication solutions (i.e., which require multi-hop configurations). At worst, they contended, it would be technologically impossible, due to the limitations of conventional technologies. "The regulators are trying to do the right thing, but let's do it in a way where we don't impose excessively high costs or implement something that is technologically not possible to do," said John Carlson John Carlson (born June 3, 1959) is a popular American conservative talk radio host on KVI, a Seattle talk radio station owned by Fisher Communications. His show formerly aired during the afternoon drive time. , senior director of the Banking Information Secretariat Secretariat, 1970–89, thoroughbred race horse. Trained by Lucien Laurin and ridden by Ron Turcotte, Secretariat won the Kentucky Derby, Preakness, and Belmont Stakes to capture the Triple Crown in 1973. Secretariat (foaled 1970) U.S. (BITS), a Washington-based organization made up of the hundred largest financial services The examples and perspective in this article or section may not represent a worldwide view of the subject. Please [ improve this article] or discuss the issue on the talk page. firms in the U.S. The federal agencies published their final regulations in April 2003 (www.sec.gov/news/studies/34-47638.htm). In an apparent response to the feedback to the paper, they do not immediately require a specific distance between primary and recovery data center sites; however, they still strongly recommend enough distance between these sites to ensure that they do not share the same infrastructure components (e.g., water, electricity, telecommunications). This revised approach was intended to give financial institutions the ability to focus on recovery objectives, and the flexibility to "manage costs effectively and allow for technological improvements." The Challenge In order to address the critical issues highlighted by federal regulators, it is important to understand the challenges faced by financial institutions as they consider data protection strategies and solutions. The data that drives today's globally oriented o·ri·ent n. 1. Orient The countries of Asia, especially of eastern Asia. 2. a. The luster characteristic of a pearl of high quality. b. A pearl having exceptional luster. 3. financial businesses is stored on networks of interconnected computers and data storage devices. This data must be 100% available and always accessible and up-to-date, even in the face of local or regional disasters. These conditions must be met at a cost that is affordable and without hampering normal company operations in any way. Distance is the key to reducing the risk. Power outages This is a list of famous wide-scale power outages. 1965
To reduce the business risk of an unplanned event of this type, an enterprise must ensure that an up-to-date replica Earlier document exchange software from Farallon Communications, Inc. that converted a Windows or Mac document into a proprietary viewing format. The viewer could be distributed separately or embedded within the document itself, turning it into a single-document viewer. of its business-critical data is stored at a secondary location. In addition, any local condition that damages data at the primary site must not be able to harm the replica of that data. Synchronous Refers to events that are synchronized, or coordinated, in time. For example, the interval between transmitting A and B is the same as between B and C, and completing the current operation before the next one is started are considered synchronous operations. Contrast with asynchronous. replication, used so effectively to create perfect copies in local networks, performs poorly over longer distances, requires high bandwidth and can easily replicate rep·li·cate v. 1. To duplicate, copy, reproduce, or repeat. 2. To reproduce or make an exact copy or copies of genetic material, a cell, or an organism. n. A repetition of an experiment or a procedure. corrupted data (undetected) to the secondary site. A copy of critical company data at a secondary site is also a prerequisite pre·req·ui·site adj. Required or necessary as a prior condition: Competence is prerequisite to promotion. n. for continuity in business operations. For an organization that already has multiple geographically dispersed installations, one of the existing locations is a logical first choice for the secondary site. A disaster recovery solution must be able to "bridge" the distance. The system must support rapid failover Invoking a secondary system to take over when the primary system fails. Up-to-date copies of all required data and applications are maintained on the secondary system in order to respond immediately if the primary system becomes unusable. Also called "fallover." See replication. to the secondary site, and it must support transparent failback following correction of the problem at the primary site. Current replication solutions have been unable to adequately satisfy all of these requirements (some of which are seemingly contradictory) especially over longer distances. Those that provide true up-to-date replication (through synchronous replication) cannot support the longer distances without severely degrading TO DEGRADE, DEGRADING. To, sink or lower a person in the estimation of the public. 2. As a man's character is of great importance to him, and it is his interest to retain the good opinion of all mankind, when he is a witness, he cannot be compelled to disclose the performance of the host applications. Those that most effectively conserve bandwidth are not absolutely up-to-date. None offers rapid failover and failback to any point in time when corrupted data has damaged its replica, rendering it unusable for recovery. Financial organizations need a cost-effective solution that provides synchronous levels of protection with no distance limitations and with no application degradation. These organizations also need a solution that is flexible, providing the ability to adapt to changing business requirements. Furthermore, they require a solution that will integrate with their current infrastructure, so as to minimize disruption, leverage existing investments, and minimize costs. In summary, financial organizations need a solution that breaks the limitations of current technologies, enabling enterprise data protection--with no limits. A Solution One viable solution is an advanced enterprise-class disaster recovery subsystem A unit or device that is part of a larger system. For example, a disk subsystem is a part of a computer system. A bus is a part of the computer. A subsystem usually refers to hardware, but it may be used to describe software. . Developed from the ground up with a completely new approach, such a technology overcomes many of the technical limitations of existing data protection products. The architecture is based on an appliance that connects to the SAN and IP infrastructure and provides bi-directional data replication across any distance for heterogeneous storage and server environments. At its core are a number of advanced patent-pending technologies that unleash powerful data protection solutions at a fraction of the cost of existing products. The advanced architecture includes: * Cluster of independent, intelligent appliances in the primary and secondary sites, with excellent data handling capability, enabling the solutions to: * Guarantee a consistent replica of business-critical data in the event of any failure or disaster * Deploy enterprise-class out-of-band (i.e., outside the data path) replication, non-disruptively and non-invasively. Positioning at the junction between the SAN and the IP infrastructure enables such solutions to: * Deliver synchronous levels of data protection at any distance and without intermediate storage, satisfying the SEC recommendations at a very reasonable cost * Support heterogeneous server, SAN, and storage platforms * Eliminate the need for protocol converters (networking) protocol converter - A device or program to translate between different protocols which serve similar functions (e.g. TCP and TP4). Some call this a "gateway", though others use that term for other kinds of internetworking device. or special networking equipment and edge connect devices. Advanced algorithms, that: * Dramatically reduce the WAN bandwidth requirements Bandwidth requirements (communications) The channel bandwidths needed to transmit various types of signals, using various processing schemes. Every signal observed in practice can be expressed as a sum (discrete or over a frequency continuum) of sinusoidal , enabling much more up-to-date data protection at any distance--at a fraction of the cost of existing solutions * Automatically manage the replication process, with strict adherence to user-defined policies that are tied to user-specified business objectives * Use real-time information to enable the system to dynamically adapt its replication mode * Maintain a consistent image of data at the remote site at all times, from which core business applications can be brought online in just minutes. Synchronous Levels of Protection Over Long Distances Financial organizations are required to recover very quickly and with no loss of critical transactions following any disruption of operations. When federal regulators recommended a 200-300 mile separation between primary and secondary data centers, many organizations responded that it would be technically impossible (i.e., that synchronous replication over long distances would lead to unacceptable latency (1) The time between initiating a request in the computer and receiving the answer. Data latency may refer to the time between a query and the results arriving at the screen or the time between initiating a transaction that modifies one or more databases and its completion. and that quick recovery would be compromised), and/or that it would be cost-prohibitive. Although true of traditional data protection technologies, some solutions address those limitations. Every time a host application writes a transaction to the local disk storage subsystem The part of a computer system that provides the storage. It includes the controller and disk drives. See storage system. , a data protection appliance writes it in parallel to the local compatible appliance. One can use this synchronous connection together with its buffer to deliver synchronous, up-to-date levels of protection with no application degradation, no distance limitation, and without the need for additional storage. In the event of a primary site failure, the system enables failover by flushing its buffer to the secondary site, with absolutely no loss of data. The buffer is in the appliance, which can be located Fibre-Channel distances away; this copy of the data should not be in close proximity to the local/primary data for better protection against local disasters. This capability shatters today's distance/latency limitations, and enables completely up-to-date protection from regional disasters with no impact on application performance. Furthermore, the appliance enables rapid recovery by ensuring that applications can be brought on line in just minutes with the most up-to-date replica of the data located at the secondary site. The ability to recover quickly is maintained even in the case of data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors. or even a rolling disaster (see "Rapid Recovery From Data Corruption or Rolling Disasters" below). As a result, financial organizations are able to establish secondary sites in geographically dispersed locations (with no distance limitation), with up-to-date replicas of their business critical data, without introducing latency, in a cost-effective manner (without requiring costly intermediary storage). Intelligent Use of Bandwidth Synchronous replication over long distances can require large amounts of expensive bandwidth, and costly networking equipment such as DWDM (Dense WDM) The term given to wavelength division multiplexing (WDM) when significantly more channels were being added. Since WDM is increasingly more "dense" all the time, both terms are used synonymously. See WDM. DWDM - wavelength division multiplexing and protocol converters often making it cost-prohibitive. The appliance, however, uses standard IP interconnect (1) To attach one device to another. (2) A physical port (plug, socket) or wireless port (transmitter, receiver) used to attach one device to another. and intelligent bandwidth reduction technologies that deliver unprecedented reduction in bandwidth requirements. As a result, it delivers the best possible levels of protection from the available bandwidth, while using low-cost connectivity and achieving dramatically reduced WAN costs, particularly over long distances. Data reduction is achieved through a number of application-aware and storage-aware algorithmic techniques. These advanced algorithms, running on an optimized appliance conserve bandwidth to the extent not possible with traditional compression technologies. [FIGURE 2 OMITTED] Universal Data Protection Financial institutions have developed sophisticated, complex IT infrastructures that are deployed to address the full range of requirements of the organization. In the vast majority of cases, these infrastructures include a wide range of server, SAN and storage platforms, from a variety of vendors. As a result, selecting and deploying a data protection strategy is a complex (and costly) endeavor, since current technologies are often tied to the specific platform or vendor. In contrast to many of today's storage-based and server-based data protection technologies, the appliance must be network-based (i.e., located between the SAN and IP infrastructure). As a result, it can offer an end-to-end solution (jargon) end-to-end solution - (E2ES) A term that suggests that the supplier of an application program or system will provide all the hardware and/or software components and resouces to meet the customer's requirement and no other supplier need be involved. Compare: turn-key solution. for data replication across heterogeneous server and storage platforms, enabling a complete data protection solution for the entire enterprise. Accordingly, the storage systems at the primary and secondary sites do not have to be the same, offering the flexibility to deploy lower-cost storage or to leverage existing storage. Business-Driven Data Protection Financial institutions are faced with the challenge of protecting a wide spectrum of data and applications, ranging from business-critical core business systems, to important but less critical applications. It is important to have the flexibility of multiple replication modes to match the desired recovery objective of each application. Unfortunately, however, this often requires multiple replication solutions, as well as the overhead required to manually manage the different options. The appliance offers the full continuum of replication modes, from synchronous, to asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end. , to small aperture-snapshot, to point-in-time. The replication process is managed automatically, with strict adherence to user-defined policies that are tied to business objectives. The system supports multiple policies, and adapts its replication dynamically, based on the available bandwidth and the application workload, to achieve the stated business objectives for each application. This greatly simplifies data and disaster recovery management for complex and heterogeneous environments Using hardware and system software from different vendors. Organizations often use computers, operating systems and databases from a variety of vendors. Contrast with homogeneous environment. . Efficient, Non-Disruptive Testing A critical component of any business continuity plan includes the regular testing of disaster recovery plans. The April 2003 "Interagency Paper on Sound Practices to Strengthen the Resilience of the U.S. Financial System" states that one of three key business continuity objectives of special importance for all financial firms is "a high level of confidence, through ongoing use of robust testing, that critical internal and external continuity arrangements are effective and compatible." A solution should enable direct Read/Write access to data at the secondary site, without the need to first make an additional copy and without disrupting the replication process. As a result, disaster recovery plans can be tested at the secondary site on a regular basis, without requiring additional storage and without impacting ongoing replication. Other workloads that can be deployed in the same non-disruptive manner at the secondary site include development and backups. Rapid Recovery from Data Corruption and Rolling Disasters Many disasters and other failures can cause data corruption in the primary site; unfortunately, failures may go undetected for some time, and the corrupted data is replicated to the secondary site. In these situations, backups or past snapshots have to be restored, which often means a long delay in recovery time. Any worthwhile solution efficiently maintains a snapshot (1) A saved copy of memory including the contents of all memory bytes, hardware registers and status indicators. It is periodically taken in order to restore the system in the event of failure. (2) A saved copy of a file before it is updated. history to allow convenient rollback A DBMS feature that reverses the current transaction out of the database, returning the data to its former state. A rollback is performed when processing a transaction fails at some point, and it is necessary to start over. See two-phase commit. to any point-in-time, enabling quick recovery. It supports multiple transactional-consistent snapshots at the remote site, allowing reliable recovery in database environments. In addition, frequent, small-aperture snapshots (seconds apart) are utilized to minimize the risk of data loss due to data corruption. In the event of a rolling disaster, reassembly reassembly - segmentation of the data from the most-recent uncorrupted snapshot can be completed, dramatically reducing recovery. Summary In response to the events of September 11th, federal organizations have proposed a set of guidelines guidelines, n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks. and sound practices that are designed to "strengthen the resilience of the U.S. financial system." Again, one of the requirements included the need to establish backup sites A backup site is a location where a business can easily relocate following a disaster, such as fire, flood, or terrorist threat. This is an integral part of the disaster recovery plan of a business. 200-300 miles away from primary data centers. Many responses from industry leaders indicated that this requirement was not realistic--that it was technically impossible to maintain a no-data-loss environment at such distances, without requiring cost-prohibitive investments. Faced with technology and cost limitation, federal regulators dropped the specified mileage requirement, giving financial organizations the flexibility to "manage costs effectively and allow for technological improvements." Mehran Hadipour is vice president, product marketing, at Kashya (San Jose San Jose, city, United States San Jose (sănəzā`, săn hōzā`), city (1990 pop. 782,248), seat of Santa Clara co., W central Calif.; founded 1777, inc. 1850. , CA) www.kashya.com |
|
||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion