Where Will The Intelligence Reside In Storage Area Networks?System architects spend long hours pondering where functions belong in the big picture. After modeling designs to optimize cost, performance, and reliability metrics, recommendations are rendered and decisions are made. Yet, as the industry delivers the next inflection point Inflection Point An event that changes the way we think and act. -Andy Grove, Founder of Intel. Notes: For example, the fall of the Berlin Wall was an inflection point in global politics and the commercialization of the Internet was an inflection point in technology. in storage systems-the Storage Utility--the market will make its decisions based on what can be reliably deployed, managed, enhanced, and extended across heterogeneous systems heterogeneous system n. A chemical system that contains various distinct and mechanically separable parts or phases, such as a suspension. . If networks do not evolve gracefully, GIOs are given new "career opportunities." It looked like computing environments were on the verge On the Verge (or The Geography of Yearning) is a play written by Eric Overmyer. It makes extensive use of esoteric language and pop culture references from the late nineteenth century to 1955. of becoming homogeneous with a single OS (NT) running on a single hardware architecture (x86), but Solaris is strong in the Internet market; Linux disciples are multiplying and a new Unix system Noun 1. UNIX system - trademark for a powerful operating system UNIX, UNIX operating system operating system, OS - (computer science) software that controls the execution of computer programs and may provide various services , Project Monterey Project Monterey was an attempt to build a single Unix operating system that ran across a variety of 32-bit and 64-bit platforms, as well as supporting multi-processing. The core of the project was an attempt to create an enterprise-class UNIX for the IA-64, which at the time was , will debut this year. The demise of NetWare has long been predicted, but sales keep chugging along. Gigabit Fibre Channel, the SAN interconnect standard, will be replaced by 2Gbps fibre next year and 10Gbps is on the drawing boards, but 10 gigabit fibre may never see the light of day as a SAN Interconnect, since the industry heavyweights are backing InfiniBand with bandwidth up to 30Gbps in 2002. Platform heterogeneity het·er·o·ge·ne·i·ty n. The quality or state of being heterogeneous. heterogeneity the state of being heterogeneous. and looming infrastructure changes are major impediments IMPEDIMENTS, contracts. Legal objections to the making of a contract. Impediments which relate to the person are those of minority, want of reason, coverture, and the like; they are sometimes called disabilities. Vide Incapacity. 2. to the promised Storage Utility and primary determinants of where storage "intelligence" will reside in SANs. Traditionally, storage has been server-dependent with data management implemented at system level via file systems, device drivers, tools, and utilities. Distributed, server-dependent storage limits flexibility, resource sharing, and is costly to manage. Network Attached Storage has taken the market by storm with low cost and plug-and-play simplicity. If you need more storage, just plug-in a NAS (1) See network access server. (2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular appliance. If you need shared storage in a remote office where you can't afford a system manager, plug-in NAS. High-end NAS systems have an equally appealing feature-- self-contained data management. With RAID, tape, and software utilities, a NAS system will serve protected files all day long. NAS is effective if the network can handle the bandwidth demands and serving files satisfies application needs. The Storage Utility envisions the same plug-and-play simplicity and self-contained data management, but it offers more flexibility, data protection, and security without chewing-up client network bandwidth. A SAN is a specialized back-end network, enabling local or geographically dispersed servers to access shared storage resources and a prerequisite to the promised Storage Utility. Yet, to fully realize the Storage Utility vision, storage management functions need to migrate from heterogeneous servers to intelligent devices, providing consistent storage services to all SAN platforms. External RAID arrays are inherently OS independent. They map physical disks to logical volumes protected against disk failures and export their volumes to servers across shared interconnects. Today's RAID controllers have the processing power and memory to do a lot more, will evolve into SAN storage managers in the near-term, and potentially SAN data managers in the longterm. A storage manager allows access from a single system to a set of addressable Reachable. When something is addressable, it can be identified and manipulated independently of its surroundings. For example, screen pixels and RAM memory are addressable. Each of the screen's picture elements can be individually turned on and off, and each of the memory's bytes can be storage locations, whereas a data manager enables concurrent access The ability to gain admittance to a system or component by more than one user or process. For example, concurrent access to a computer means multiple users are interacting with the system simultaneously. from multiple systems to the same set of addressable storage locations. The former deals with block translation and the latter shares data through distributed file systems Software that keeps track of files stored across multiple networks. When the data are requested, it converts the file names into the physical location of the file so it can be found. . Storage management is migrating to controllers; four functions vital to the notion of a Storage Utility have already been delivered in RAID arrays: Storage Virtualization Treating storage as a single logical entity without regard to the hierarchy of physical media that may be involved or that may change. It enables the applications to read from and write to a single pool of storage rather then individual disks, tapes and optical devices. , Volume Mapping, Remote Mirroring, and Virtual Volumes. Longer term, back-end device sharing may evolve to back-end data sharing The ability to share the same data resource with multiple applications or users. It implies that the data are stored in one or more servers in the network and that there is some software locking mechanism that prevents the same set of data from being changed by two people at the same time. . NAS technology at a finer granularity of access. SAN and NAS technologies will coexist and may eventually merge-the only differences being the protocols they speak and whether they plug into front-end client networks or backend storage networks. Storage Virtualization Decentralized de·cen·tral·ize v. de·cen·tral·ized, de·cen·tral·iz·ing, de·cen·tral·iz·es v.tr. 1. To distribute the administrative functions or powers of (a central authority) among several local authorities. , server-dependent storage is difficult to protect and expensive to manage. Hence, the trends toward re-centralizing storage that enables a pool of storage to be shared by a group of servers. Storage virtualization allows disks to be aggregated, subdivided, and parceled Out to SAN nodes. As disks are added, the RAID controller transparently increments the usable capacity of the pool. Disks with varying capacities can be mixed and matched without wasting disk space. Drives are logically combined into RAID groups and assigned a RAID level such as striping Interleaving or multiplexing data to increase speed. See disk striping. striping - data striping or mirroring that provide the best optimization of cost, performance, and reliability for a specific application. The capacity of the RAID group can be allocated to a single large logical volume or subdivided into separately addressable volumes. Volumes can be expanded on the fly to meet changing application storage requirements or deleted and returned to the pool when they are no longer needed. Storage virtualization was once the domain of host-based software, but has migrated to array controllers where the disk pool can be effectively managed as a single entity. Volume Mapping Since RAID controllers present logical volumes to all attached nodes, SANs have to deal with disk ownership issues. Unix systems are generally well behaved Adj. 1. well behaved - (usually of children) someone who behaves in a manner that the speaker believes is correct; "a well-behaved child" well-behaved and only access volumes that have been explicitly mounted. Other OSs claim ownership of every volume they can see on their I/O channels by writing an identifying signature. If multiple servers claim ownership of the same disk, the second server to claim ownership can corrupt data written by the first. Clusters solve this problem with software that enforces storage access policies. In storage networks, an access control or mapping See O-R mapping. mechanism creates virtual private networks between nodes and volumes to prevent one SAN node from stepping on another node's data. Mapping volumes to nodes also creates a security barrier by restricting access to sensitive data. Three mapping strategies have been implemented. Filter drivers are used to restrict system access to specified volumes. The drivers use proprietary protocols, are OS-specific, and may need to be upgraded with new OS releases. Management complexity increases unless the filter drivers are integrated into a common management framework. The second approach uses "zoned" switches that map server ports to array ports. Zone or "hard mapping" is less flexible, requiring physical configuration changes to alter mapping schemes. Additionally, the volumes behind an array port cannot be assigned to servers in different zones. The third approach uses array controllers. Mapping physical disks to logical volumes is intrinsic to RAID controllers. Dynamically mapping volumes to SAN nodes is a natural extension of this technology. No overhead is imposed on the servers and I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output. I/O - Input/Output latency remains unchanged. Mapping tables can be replicated across dual active controllers for high availability Also called "RAS" (reliability, availability, serviceability) or "fault resilient," it refers to a multiprocessing system that can quickly recover from a failure. There may be a minute or two of downtime while one system switches over to another, but processing will continue. and integrated into the array's management utility. Mylex delivers a patented controller-based mapping feature called SANmapping. Mapping tables uniquely identify SAN nodes by their worldwide names and specify their access privileges to logical volumes. Remote Mirroring Studies show the hourly cost of downtime ranges from $100,000 in airline reservation systems The Airline Reservations System (ARS) was one of the earliest changes to improve efficincy. ARS eventually evolved into the Computer Reservations System (CRS), and then into Global Distribution System (GDS). to over $2 million for credit card sales. With electronic commerce gaining ground in every market segment, companies need to develop a continuous customer service strategy. Remote mirroring replicates data from online production servers to back-up servers at remote sites that can take over if a disaster strikes the primary site. Remote mirroring was initially implemented in host-based software systems. For each application write, the software issues a write I/O operation to the local disk pool and a second write I/O across a communication link to the remote site. The overhead on the production server is a function of the application's read/write mix and the efficiency of the software. Up to 5% of a server's CPU CPU in full central processing unit Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit. cycles are burned propagating writes to the back-up site. IBM (International Business Machines Corporation, Armonk, NY, www.ibm.com) The world's largest computer company. IBM's product lines include the S/390 mainframes (zSeries), AS/400 midrange business systems (iSeries), RS/6000 workstations and servers (pSeries), Intel-based servers (xSeries) and EMC (1) (EMC Corporation, Hopkinton, MA, www.emc.com) The leading supplier of storage products for midrange computers and mainframes. Founded in 1979 by Richard J. Egan and Roger Marino, EMC has developed advanced storage and retrieval technologies for the world's largest companies. were among the leaders in migrating remote mirroring from system software to RAID arrays and Compaq has delivered remote mirroring on arrays for x86 servers. This strategy has two obvious benefits: server overhead is eliminated and remote mirroring is host independent. An array controller can mirror writes issued from any Os, an important consideration as heterogeneous SANs are deployed (See Fig). Virtual Volumes Another feature driven by e-commerce and the exponential growth Extremely fast growth. On a chart, the line curves up rather than being straight. Contrast with linear. in databases is creating virtual volumes for read-only applications. Global electronic commerce requires around-the-clock service access. Continuous customer access conflicts with data back-up procedures that shut down a database before copying it to tape. Decision support applications also require a point-in-time copy. An early solution was N-way mirrors. RAID software was used to create a three-way mirror set and one of the mirrors was taken offline. A fault-tolerant two-way mirror two-way mirror n. See one-way mirror. remained on-line for application access and a copy of the data was available for read-only applications. Creating a three-way mirror with system software consumes server and channel resources and requires 50% more disk space. IBM, StorageTek, and others have designed innovative array controllers that create point-in-time virtual copies of on-line data nearly instantaneously. One approach uses a temporary storage area to save the "before state" of on-disk data. Prior to servicing write I/Os, the controller copies the "before-state" of disk blocks to a temporary storage area. A data structure directs read-only applications such as backup to the online volume for blocks that have not been updated since the virtual copy was created and to the temporary storage area for blocks that have been updated. Controller-based virtual volume technology eliminates server overhead, dramatically reduces channel and disk utilization, and is server independent. Shared Access And Shared Data Most SANs deployed this year will use the shared access model -- shared interconnect and controller resources with disks allocated to a single SAN node. The shared access model enables storage pooling to increase flexibility and lower costs. However, in some environments, applications on different SAN nodes require simultaneous access to the same stored files. Shared data clusters use a lock manager to synchronize See synchronization. file access. Nodes acquire locks on disk blocks before accessing them. As the number of cluster nodes increase, so do the lock-related messages and the node hosting the lock manager, limiting scalability. In the future, a distributed SAN file system may be implemented in an array controller to address this. A disk I/O may include an implied lock on accessed blocks . Since each array controller only needs to manage locks pertinent to the volumes it controls, the lock manager could be distributed across array controllers in the SAN, eliminating the hot node problem. Smarter Arrays A case can be made for host-based storage management in homogeneous environments where a single set of software products provide consistent function and behavior across SAN nodes and easily integrate into a common management framework, but most IS shops are heterogeneous today and for the foreseeable future. In this environment, self-managing arrays will reduce complexity, lower costs, and increase purchasing flexibility, eventually becoming the enabling technology behind the Storage Utility. Kevin Smitb is the senior marketing director RAID controllers at Mylex Corp. |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion