Has Ethernet's time come for storage systems?The convergence of all network systems onto a single infrastructure has been the "holy grail" of enterprise SAN managers--long sought after, but never attained. Fibre Channel and InfiniBand are firmly entrenched en·trench also in·trench v. en·trenched, en·trench·ing, en·trench·es v.tr. 1. To provide with a trench, especially for the purpose of fortifying or defending. 2. as storage and compute interconnect technologies, and the common wisdom is that the way forward is to bridge these independent systems into the IP/Ethernet network using expensive and performance-robbing gateways. But recent enhancements to Ethernet may challenge this view. Improvements in transmission latency--a key factor in network and application performance--mean Ethernet is now well suited to become the universal interconnect technology that can deliver the performance, low cost and interoperability that will drive the next-generation data center. This vision--enabled by the 10 Gigabit per second (Gbps) performance of Ethernet and its ubiquity Ubiquity See also Omnipresence. Burma-Shave their signs seen as “verses of the wayside throughout America.” [Am. Commerce and Folklore: Misc. for every other networking purpose in the data center--can deliver big benefits to enterprises. First off, the higher performance of 10-Gigabit Ethernet means support for more users and faster response. Additionally, the storage and compute sub-systems become services within an enterprise-wide Ethernet "cloud," giving network managers the flexibility to apportion ap·por·tion tr.v. ap·por·tioned, ap·por·tion·ing, ap·por·tions To divide and assign according to a plan; allot: "The tendency persists to apportion blame as suits the circumstances" storage to servers throughout the enterprise without regard for physical connectivity, and to give access to any user in the enterprise as long as they are connected to the network. If the reader is skeptical of this grand vision, it's understandable. Ethernet has been knocking on the high-performance interconnect door for a long time, but up to now has not met the needs of the application. High CPU CPU in full central processing unit Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit. processing requirements and a highly "chatty chat·ty adj. chat·ti·er, chat·ti·est 1. Inclined to chat; friendly and talkative. 2. Full of or in the style of light informal talk: a chatty letter. " Internet-strength transmission protocol have introduced a latency that has interfered with real-time performance needed by storage and compute interconnects. Latency's Impact on SAN Performance Latency--the fraction of a second it takes for the first bit of a packet to reach its destination from its source--is often not widely recognized as a key component in SAN performance, taking a backseat in many minds to bandwidth measurements. In fact, both are key figures and the suitability of Fibre Channel for SANs can be tied to its low latency Low latency allows human-unnoticeable delays between an input being processed and the corresponding output providing real time characteristics. This can be especially important for internet connections utilizing services such as online gaming and VOIP - VOIP is not as important as level as well as its bandwidth performance. Network latency See latency. calculations take into consideration the processing time required by network adapters to process and transmit (or receive) the packet, as well as the latency in the interconnect network, which includes cable length, load on the network and the processing latency of the interconnect switch. Latency becomes an issue in storage-dependent applications because the small delays in the network are compounded as data is assembled from various servers and storage devices to answer a query. Distributed databases, for example, give a vivid example of how latency can have a dramatic impact on performance. Distributed databases generate large numbers of short messages between nodes, which are used to communicate pending transactions and other important information to other nodes in the cluster. A database will typically gain exclusive access to a record by locking it before modifying the record. This prevents multiple databases from corrupting data by trying to simultaneously modify the record. To gain exclusive access, a database will send a lock request message to other databases in the cluster. Then, once the request is granted, the database will modify the record, and send another message unlocking the record for others to access it. A typical database query, such as "show me the last 12 months of sales of cars sold in the US by model, ranked by color", could generate hundreds-of-thousands--if not millions--of messages. Taxing each message with tens or hundreds of microseconds of interconnect latency would severely limit the practical ability to distribute and scale the enterprise database. In fact, it is estimated that without a low-latency interconnect, a distributed database application can effectively be limited to only four servers, curtailing the promise of scaling these systems using a large number of low-cost commodity servers. Scaling would then be limited to a small number of expensive servers blocking the paradigm-changing promise of these servers. Attacking Ethernet Latency at the End Points Squeezing the latency from Ethernet has meant efforts by both the standards bodies Following are some of the standards bodies defined in this database. For Windows users of CDE, look up Lessons/Review/Associations. For Web users of CDE's online HTML version, review the Lessons list at the bottom of the definition. Organization Covers ANSI U.S. and by vendors that have attacked both the network end points and the interconnect switch. New technology enhancements resulting from more than nine different technology standards initiatives are serving to comprehensively reshape Ethernet to better serve the data center as an interconnect. Two of these standards efforts, in particular, are designed to help reduce latency at the end points by lessening the impact of the TCP/IP TCP/IP in full Transmission Control Protocol/Internet Protocol Standard Internet communications protocols that allow digital computers to communicate over long distances. transmission protocol that sits atop Ethernet. Today's Ethernet adapter The Ethernet hardware required to attach to an Ethernet network. It typically resides on an expansion board, but is sometimes built into the motherboard. An Ethernet adapter is required in each client and server. See Ethernet and network adapter. cards have latency of about 30 microseconds, which includes the latency of moving data from the application to the wire. By comparison, InfiniBand adapter cards accomplish the same handoff in about 3 microseconds. With remote direct memory access (RDMA (Remote Direct Memory Access) A communications protocol that provides transmission of data from the memory of one computer to the memory of another without involving the CPU. InfiniBand, Virtual Interface (VI) and RDMA Over IP are all forms of RDMA. ), an addition to the IP standard, computers can directly place information onto another computer's memory with minimal demand on the memory bus and on the CPU, replacing TCP/IP copy requirements that add latency and consume CPU and memory resources. Another solution to the problem is a TCP offload engine TCP Offload Engine or TOE is a technology used in network interface cards to offload processing of the entire TCP/IP stack to the network controller. It is primarily used with high-speed network interfaces, such as gigabit Ethernet and 10 gigabit Ethernet, where processing (ToE), which is a specialized protocol processor on a network interface card that accelerates all TCP (1) (Transmission Control Protocol) The reliable transport protocol within the TCP/IP protocol suite. TCP ensures that all data arrive accurately and 100% intact at the other end. operations and offloads them from a host processor. New adapter cards with both RDMA and ToE hit the market in 2004, with vastly improved latency performance. Second- and third-generation cards, which are beginning to hit the market, offer measured latency performance that is equivalent to that of InfiniBand and Fibre Channel. The Last Bottleneck: The Switch Today, an Ethernet switch A device that connects clients and servers to each other in an Ethernet network. See switched Ethernet. introduces multiple microseconds of latency to the link. Compared to the end points, the importance of the switch might be understated, because by fixing the end points, you address more than two-thirds of the problem. However, interconnect networks are built using a hierarchy of switches expanding the impact of the switch latency. In a two-tier federation of switches, data would touch three switches between any endpoint, making the switch responsible for nearly half of the total end-to-end latency. The switch latency issue is being addressed by vendors who are starting to look anew at this market and release new switch chips that can serve as the heart of an interconnect. These switches are growing in their data capacity to remove congestion The condition of a network when there is not enough bandwidth to support the current traffic load. congestion - When the offered load of a data communication path exceeds the capacity. as a source of latency, and are implementing new flow control mechanisms that smooth the flow of packets through the system. Novel methods for implementing intelligent cut-through switching (networking) cut-through switching - The application of wormhole routing to packets in a packet switching system so that forwarding of a packet starts as soon as its destination is known, before the whole packet has arrived. Compare store and forward. dramatically reduce the delay through the switch. Use of these techniques has resulted in a drop in the latency of an Ethernet switch chip--the key component in interconnect latency--by a factor of 10 to around 200 nanoseconds. Putting this chip into the switch used to build the two-tier interconnect network described above results in a latency of less than a microsecond One millionth of a second. See space/time and ohnosecond. (unit) microsecond - One millionth (10^-6) of a second. , which, when added to an RDMA or ToE-supported NIC (1) (Network Interface Card) See network adapter. See also InterNIC. (2) (New Internet Computer) An earlier Linux-based computer from The New Internet Computer Company (NICC), Palo Alto, CA. means a total latency of less than 10 microseconds. This is a dramatic improvement over the last generation switches, and provides acceptable SAN interconnect performance. Convergence Realized With these advances, Ethernet is now on par with Fibre Channel and InfiniBand--but is that enough to cause the market to change? Normally, it would take more for a new technology to oust a well-performing incumbent, but history has shown that when Ethernet is equal to a task, other considerations of interoperability and operating efficiency take over and push decisions in its favor. While latency is the key to Ethernet's suitability for SAN interconnect, it is the realization of the long-held dream of network convergence and lower costs that will make it a success. Mike Zeile is vice president of marketing for Fulcrum fulcrum: see lever. Microsystems. He can be reached at mzeile@fulcrummicro.com. www.fulcrummicro.com Opening shots in continuing stories ... |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion