SAN Community Ignores Legacy Systems At Its Peril.Bridging the data gap between open systems and mainframe storage It is estimated that over 70% of mission-critical data is stored on mainframes, yet the Storage Area Network (SAN) community has all but ignored legacy systems. Today's SANs equal open systems. No solutions exist that are specifically optimized to address interconnectivity between open systems storage and mainframe storage, despite the fact that mainframes are an essential business resource for the largest revenue producing companies in the world. E-commerce is often associated with Unix and Windows NT (Windows New Technology) A 32-bit operating system from Microsoft for Intel x86 CPUs. NT is the core technology in Windows 2000 and Windows XP (see Windows). Available in separate client and server versions, it includes built-in networking and preemptive multitasking. systems. What is often overlooked is that mainframes are an integral part of the e-commerce supply chain. Telephone companies, shipping companies, credit card companies, banks, etc. all have room after room of mainframes supporting e-commerce. Mainframes are involved in online transactions, from before a customer clicks on the BUY icon up until the product arrives at the customer's door. If "data" is synonymous with synonymous with adjective equivalent to, the same as, identical to, similar to, identified with, equal to, tantamount to, interchangeable with, one and the same as "business," then interconnecting open systems data storage and mainframe data storage is a requisite part of the new economy. The explosive growth of Internet-generated information has created a dramatically higher level of data storage requirements within large organizations. Experts believe that data storage is growing by over 100% annually. A key application driving this growth is data warehousing See data warehouse. data warehousing - data warehouse . According to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. Marc Farley, author of Building Storage Networks, data warehousing has been "the main impetus behind the desire to transfer data between mainframes and open systems servers." Interconnecting open systems and mainframe storage isn't easy. Their data structures, protocols, and fundamental architectures are very different. In the case of interoperability, both environments are double-edged swords: their strength can be their weakness. The nature of open systems, well, is that it's open. The premise of the open systems environment is that multiple vendors are building compatible products that interoperate, allowing customers to have a common set of applications and devices from competitive sources. The reality is that vendors develop products that may or may not have some degree of compatibility, depending on how much the market drives them to do so. It is staggering when you think about the scope of an open systems environment that includes chips, processors, boards, interface cards, communication protocols, cabling, operating systems Operating systems can be categorized by technology, ownership, licensing, working state, usage, and by many other characteristics. In practice, many of these groupings may overlap. , infrastructure devices, and applications with thousands of vendors developing, innovating, and delivering products to the market. It's no wonder that open systems can be unreliable and unstable. Mainframes, on the other hand, are closed and intentionally proprietary. As a result, the mainframe environment is extremely stable and reliable, as very few vendors control how everything works end-to-end. The mainframe world is able to achieve superior reliability and performance because operating systems, communication protocols, hardware, and system components are all tightly integrated. What is being done today to bridge the data gap between open systems and mainframe storage? In order to connect these two disparate worlds, companies have been forced to engineer around inefficiencies. There are technologies that allow data to be moved between open systems and mainframe storage, but they do not provide advanced functionality, intelligence, or management features and they are prohibitively slow. There are various ways that mainframe and open systems data can be accessed today. Over the standard IP network, data may be transferred by all standard means, including FTP FTP in full file transfer protocol Internet protocol that allows a computer to send files to or receive files from another computer. Like many Internet resources, FTP works by means of a client-server architecture; the user runs client software to connect to and NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS. NFS - Network File System . Additionally, there are several vendors that provide middleware Software that functions as a conversion or translation layer. It is also a consolidator and integrator. Custom-programmed middleware solutions have been developed for decades to enable one application to communicate with another that either runs on a different platform or comes from a to move and transform data between these two environments. There are also devices that support SCSI SCSI in full Small Computer System Interface Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB. , Fibre Channel (FC), and ESCON (Enterprise Systems CONnection) An IBM S/390 fiber-optic channel that transfers 17 Mbytes/sec over distances up to 60 km depending on connection type. ESCON allows peripheral devices to be located across large campuses and metropolitan areas. interfaces, but these products have no intelligence and provide only physical connectivity. Another solution is to utilize storage subsystems The part of a computer system that provides the storage. It includes the controller and disk drives. See storage system. that support SCSI, FC, and ESCON interfaces. Although the storage subsystem has software that performs the data transformation, the system running the software is not integrated into the storage subsystem and is actually a separate process. This causes latency and limits functionality. Additionally, storage subsystems are vendor dependent, which restricts customers' options. There are major problems with all of these solutions. Each one requires host-based software to perform its tasks. These host-based software solutions have proven to be slow and cumbersome, severely impacting overall business productivity. The hosts are production systems and any additional processing impacts their performance. Moving large volumes of data goes from being impractical to impossible. Additionally, most of these solutions provide no online presence on the data path. In order to share data, the mainframe needs to actively open a new data path. This is done on a point in time basis and not continuously. This means that the data is old as soon as it is stored. None of these approaches optimizes or enhances performance. Today, companies have moved from gigabytes of stored data to terabytes of stored data. Predications are that stored data will soon be entering the petabyte One quadrillion bytes (one trillion kilobytes). Also PB, Pbyte and P-byte. See peta, binary values and space/time. (unit) petabyte - 2^50 = 1,125,899,906,842,624 bytes = 1024 terabytes or roughly 10^15 bytes. 1024 petabytes is one exabyte. range. The current solutions do not have the ability to move and manage this amount of data. The result is a direct negative impact on an organization's revenue. The consequences could be severe. Finally, these solutions offer very little intelligence, scalability, or robustness. They may be adequate for raw file transfer, but the ability to move volumes of data, multiple simultaneous replications, online backups Using the Web to store copies of data for backup. There are numerous providers on the Internet that charge for storage, and fees are typically based on capacity. Online backup services provide offsite backup, which is essential for disaster recovery. See backup types. , etc. between platforms does not exist. Intelligent SAN Gateways Since data is the essential element for businesses, controlling the data is paramount. Open systems data and mainframe data have core differences and the only way to effectively connect these two worlds is to intelligently manipulate the data through a gateway. At the same time, a high performance solution is necessary since the amount of data is growing exponentially. A balance is needed since intelligence and performance are requisite. The problem is, whenever intelligence is injected into a process, latency is introduced. Performance without intelligence offers a limited solution. What is required is an intelligent SAN gateway that sits in the middle of the data path that not only interconnects mainframes and open systems storage on an interface level, but also on a data level. The intelligent SAN gateway should be vendor independent and support multiple platforms Refers to two or more operating environments, which typically include the CPU family and operating system. For example, if versions of a program run on Windows and the Macintosh, the software is said to support multiple platforms. . Let's first examine the bandwidth issues. There is a paradigm shift A dramatic change in methodology or practice. It often refers to a major change in thinking and planning, which ultimately changes the way projects are implemented. For example, accessing applications and data from the Web instead of from local servers is a paradigm shift. See paradigm. between open systems and mainframe architecture. Open systems uses a few fat pipes and mainframes use many thin pipes. On the open systems side, FC can move data at 1Gbit. ESCON only supports approximately 17MB of bandwidth, creating the bottleneck A lessening of throughput. It often refers to networks that are overloaded, which is caused by the inability of the hardware and transmission lines to support the traffic. It can also refer to a mismatch inside the computer where slower-speed peripheral buses and devices prevent the CPU . The intelligent SAN gateway would address this issue by aggregating ESCON channels to create fatter pipes. This solution would effectively take multiple ESCON channels and create a virtual channel that provides much higher bandwidth. The intelligent SAN gateway needs to be scalable with the ability to add additional ports and, therefore, provide as much bandwidth as required (see Fig). A new interconnect technology, FICON (FIber CONnector) An IBM mainframe channel introduced with its G5 servers in 1998. Based on the Fibre Channel standard, it boosts the transfer rate of ESCON's half-duplex 17MB/sec to a full-duplex 100MB/sec. (Fibre Connection) is becoming available for mainframe storage. While FICON provides key benefits, including increased bandwidth, major issues still exist that need to be resolved before it can be fully deployed. Existing mainframe systems cannot upgrade to FICON, so companies will need to put in bridges in order to connect ESCON and FICON. While new mainframes have FICON channels installed, at this time there are no control units that support this new interconnect technology. The intelligent SAN gateway needs to support the existing infrastructure and adapt to the new technology as it becomes available. Processing is another issue that needs to be addressed. The responsibility needs to be taken away from the host and storage subsystems and put on an outboard Not built in. Outboard devices are external to the main unit. Contrast with inboard. See offboard. system. The proposed intelligent SAN gateway would be capable of moving a great deal of data and act on it in real time. To achieve this without impacting performance, the intelligent SAN gateway will need to be equipped with a great deal of horsepower horsepower, unit of power in the English system of units. It is equal to 33,000 foot-pounds per minute or 550 foot-pounds per second or approximately 746 watts. . This could be accomplished with multiple RISC processors RISC processor [Reduced Instruction Set Computer], computer arithmetic-logic unit that uses a minimal instruction set, emphasizing the instructions used most often and optimizing them for the fastest possible execution. on a per port basis. The placement of the intelligent SAN gateway is also a crucial issue. The gateway needs to sit in the middle of the data path in order to intercept and interpret data as it transverses open systems and mainframe environments. Placing the gateway anywhere else would limit its ability to have the access and control of the data required to be effective. Intelligence needs to be provided so that the data can be transformed on the fly between mainframe and open systems storage. The platform itself would have embedded Inserted into. See embedded system. intelligence working with lightweight agents residing on various hosts and storage systems. Such a device should extend beyond basic file transfers and raw data conversion. The intelligent SAN gateway should perform advanced functions and provide volume transfers, volume remapping, online backups, point-in-time copy, snapshot copy, asynchronous Refers to events that are not synchronized, or coordinated, in time. The following are considered asynchronous operations. The interval between transmitting A and B is not the same as between B and C. The ability to initiate a transmission at either end. and synchronous mirroring, and other applications. There is an elephant in the room Not to be confused with White elephant. The elephant in the room (also elephant in the living room, elephant in the corner, elephant on the dinner table, elephant in the kitchen, horse in the corner, 400lb gorilla in the room, etc. and everyone is ignoring it. True heterogeneous enterprise SANs will never exist unless the issue of interconnecting mainframe storage is addressed. Basic connectivity and applications are grossly insufficient for today's requirements, let alone the future. The time has come for intelligent, robust, and powerful solutions that will solve major interoperability, performance, and scalability issues found in today's storage area networks. Tony Asaro is the director of business development at SANgate Systems, Inc. |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion