Designing SANs With Windows 2000.Alternate path software enhances robustness NT was designed for workgroup and departmental servers that typically host a single application and connect eight to ten disk drives. The I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output. I/O - Input/Output architecture was based on the assumptions that servers would implement SCSI SCSI in full Small Computer System Interface Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB. buses to connect disk storage and SCSI disks would remain single ported devices that connect to a single bus. As Windows 2000 moves into the enterprise, the assumptions change. Fibre Channel has become the industry standard interconnect with addresses for 127 devices on a single FC-AL (Fibre Channel-Arbitrated Loop) See Fibre Channel. FC-AL - Fibre Channel-Arbitrated Loop. loop and sixteen million on a fabric with cascaded switches. Unlike SCSI disks, Fibre Channel disks are dual-ported to enhance performance and fault tolerance See fault tolerant. (architecture) fault tolerance - 1. The ability of a system or component to continue normal operation despite the presence of hardware or software faults. This often involves some degree of redundancy. 2. , and can be accessed over a pair of redundant fibre links. Eventually, Windows 2000 will be enhanced to fully satisfy the requirements of enterprise computing Refers to information technology in the larger company. See enterprise data and enterprise networking. , but until it does software can be added to the I/O stack to better meet the continuous availability requirement of enterprise applications such as e-commerce. Windows 2000 uses the SCSI FCP (Fibre Channel Protocol) See Fibre Channel. FCP - Flat Concurrent Prolog. ["Design and Implementation of Flat Concurrent Prolog", C. Mierowsky, TR CS84-21 Weizmann Inst, Dec 1984]. (SCSI over Fibre Channel) to access disks across fibre topologies. Since Windows 2000 supports 128 SCSI targets, the channel can be mapped to emulate multiple SCSI buses. Each SCSI target is mapped to LUNs (Logical Unit Numbers) and LUNs are mapped to physical disk drives. LUNs are the logical units addressed by the OS, which identifies them by their Target ID and LUN identifier. When Windows 2000 boots, it scans its I/O channels looking for Looking for In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with. LUNs and assumes that each discovered LUN with a unique Target ID/LUN identifier is a unique logical disk. A disk object representing each logical disk is constructed and made available to the file system. At the enterprise level RAID array controllers generally are configured in fault tolerant The ability to continue non-stop when a hardware failure occurs. A fault-tolerant system is designed from the ground up for reliability by building multiples of all critical components, such as CPUs, memories, disks and power supplies into the same computer. dual active pairs to increase system reliability. Each controller has one or more fibre host ports for connecting to a loop or SAN fabric and one or more pairs of backend fibre storage ports for connecting a set of dual-ported fibre disks over redundant fibre loops. Each physical disk is mapped to a LUN and each LUN can be configured with an affinity to each controller host port. In this way, all of the LUNs can be accessed across all of the controller's host ports to enhance data accessibility. However, this creates problems for Windows 2000 since each LUN will have multiple Target ID (TID tid 3 times a day )/LUN identifier (LUN ID) addresses as illustrated in Figure 1. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke" put differently , since Windows 2000 assumes that I/O channels are independent and connect different disks, it will see the same LUN with multiple addresses and assume that it has discovered multiple unique LUNs. It will make these duplicated LUNs accessible by the file sys tem. Since the OS presumes it is dealing with multiple unique LUNs, it will not implement access controls to prevent one application from stepping on another application's data, and data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors. will result. Alternate Path Software A simple solution to the problem is a class of product called Alternate Path Software (APS) which filters duplicate LUNs and provides a level of indirection Not direct. Indirection provides a way of accessing instructions, routines and objects when their physical location is constantly changing. The initial routine points to some place, and, using hardware and/or software, that place points to some other place. between the OS and the data path used to access LUNs. APS provides multiple path management by examining LUN Serial Numbers (LSN LSN Learning and Skills Network (UK) LSN Log Sequence Number LSN Large Scale Networking LSN Legal Services Network (American Association of Retired Persons) LSN Logical Sector Numbers LSN Leukosialin ) to detect duplicate disk objects representing a single LUN. APS constructs a Disk Object Group for LUNs with the same serial numbers. One replicate is designated the Primary Disk Object and the other the Secondary Disk Object. (Fig 2). APS then filters file system access to the Disk Object Group by activating the Primary Disk Object and making it accessible to the file system (Fig 3). This provides a data path to the LUN using HBA (Host Bus Adapter) See host adapter. 0 and Controller 0 as well as any SAN networking elements, such as switches and hubs that implement the data path. If an active disk object becomes unavailable due to a path failure, APS automatically switches to the Secondary Disk Object that represents an alternate data path through the SAN to the LUN in the storage array (Fig 4). APS uses the SCSI Ping command (Test Unit Ready) to determine path availability. After a path failover, APS will ping the primary path at periodic intervals to determine if the path has been repaired. When the failed path has been restored to operational status, APS will automatically fail back to the original path. APS maintains the context with the file system during failover and fail back operations so path transitions are transparent and non-disruptive to the OS. Besides multiple path management to improve system availability, APS enables I/O load balancing to more efficiently utilize I/O resources and enhance system performance. Most APS products provide a load leveling mechanism so that each available data path is simultaneously active processing I/O operations. LUN count leveling is an economical form of load balancing appropriate for x86-based servers. With LUN count leveling, APS automatically distributes the LUNs evenly across the available data paths. Automatic load balancing reduces system management complexity and lowers costs. APS products are generally integrated with the OS reporting errors and events, such as path transitions to the OS error logs. System managers can use the log information to monitor the status of I/O paths and take remedial actions before SAN components fail. When paths fail, APS automatically alerts system administrators that the system is in a critical condition and vulnerable to losing access to the storage pool. Kevin Smith is the senior marketing director of external RAID controllers at Mylex Corp (Boulder, CO). |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion