Enabling Technology For The Data Center OF The Future: DAFS And The InfiniBand Architecture.I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output. I/O - Input/Output is the lifeblood of computing. You can execute all the compute cycles you like, but if you can't get the relevant information into and out of the processor and to the end user on a timely basis, what's the point? This has always been true, but never so much as in the Network Economy. Researchers estimate that there are now about one million terabytes of online information, an amount that grows daily, with every Web page posted and every online transaction executed. If data isn't smoothly moving among storage, servers, applications, and users, then networks and users are just spinning their wheels. Avoiding I/O bottlenecks and getting data where it needs to be is the central challenge for the e-business data center of the future. The InfiniBand and Architecture and the Direct Access File System (DAFS (Direct Access File System) A high-performance file sharing protocol based on the VI memory-to-memory architecture. Designed for storage area networks (SANs), DAFS provides bulk data transfer directly between the application buffers of two machines without ) protocol are key to accomplishing this critical job. Getting Data Where It Needs To Be A variety of I/O technologies are used today. Ethernet and the TCP/IP TCP/IP in full Transmission Control Protocol/Internet Protocol Standard Internet communications protocols that allow digital computers to communicate over long distances. protocol drive LANs, WANs, and ultimately the Internet. In the data center we also see specialized networks, such as Fibre Channel storage area networks (SANs) and specialized cluster links. At the system level the PCI bus PCI bus - Peripheral Component Interconnect and the SCSI SCSI in full Small Computer System Interface Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB. bus are prevalent. The PCI bus occupies a special place, because it has typically been the foundation for all the others. Gigabit Ethernet An Ethernet standard that transmits at 1 Gbps. Used mostly to connect high-end workstations and servers as well as for network backbones, Gigabit Ethernet transmits full duplex from point to point using switches and half duplex in a shared environment (CSMA/CD) using a hub. , Fibre Channel, and SCSI links are usually implemented as PCI (1) (Payment Card Industry) See PCI DSS. (2) (Peripheral Component Interconnect) The most widely used I/O bus (peripheral bus). adapter cards or PCI components, so all of their traffic is also PCI traffic. But as performance, reliability, scalability, and density demands increase, shared buses have become increasingly challenged to meet the needs of the new data center. This is especially true as the expansion and I/O requirements of individual servers increasingly take a back seat to the requirements of connecting multiple servers and storage. The typical data center for Web applications or B2B (Business to Business) Refers to one business communicating with or selling to another. See B2B e-commerce, B2C and B2G. B2B - business to business exchanges, for example, has tens or hundreds of servers arranged in 19-inch racks, all requiring shared access to terabytes of file storage. The workload is defined by network requests coming in through routers, switches, firewalls, load balancers, caching appliances, and the like. Since file-sharing by multiple servers is a fundamental requirement of this environment, storage is usually aggregated into shared storage pools, accessed by the servers using a file access protocol such as NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS. NFS - Network File System . The result? A complex and sophisticated infrastructure that has exploded in importance in just the past few years. Consequently, the issues surrounding how individual servers connect to "the outside world" have been replaced with issues focused on how such large complexes of discrete units can be organically composed. The historical I/O solution uses Fast and Gigabit Ethernet networks running TCP/IP for the majority of inter-unit links, with limited numbers of specialized Fibre Channel SANs or proprietary clusters for specific back-end needs. But that's a lot of diversity for solving essentially one interconnect problem. Diversity leads to complexity, introducing more risk of configuration errors, requiring more skilled staff to install and manage, and souring the economic picture by lowering product volumes and raising costs. Even when carefully planned, the combined wiring for racks of servers, disks, and network equipment is ugly and error-prone. And even if you could solve the basic rack wiring mess, network data centers still have a number of critical scaling problems. Take latency and efficiency. The TCP/IP protocol suite has proven itself a wonderful foundation for communications of all kinds over essentially unreliable networks. But that very fact makes it inefficient. TCP/IP and TCP/IP-based protocols have a complex, layered design, with lots of inter-layer dependencies that can easily demand thousands of lines of code The statements and instructions that a programmer writes when creating a program. One line of this "source code" may generate one machine instruction or several depending on the programming language. A line of code in assembly language is typically turned into one machine instruction. and significant buffer memory to implement. Handling gigabit-class network traffic, servicing interrupts, moving data through long code-paths, and numerous kernel-to-application context switches are all expensive operations. Taken together, they yield long message latencies and use up a significant percentage of available processor power. InfiniBand Architecture Revealed The InfiniBand Architecture is a new I/O New I/O, usually called NIO, is a collection of Java programming language APIs that offer features for intensive I/O operations. It was introduced with the J2SE 1.4 release of Java by Sun Microsystems to complement an existing standard I/O. architecture for the data center, designed specifically to support a clean message-passing paradigm, multiple parallel channels, intelligent I/O Intelligent I/O - Intelligent Input/Output controllers, high-speed switches, and "just works" RAS (1) See network access server. (2) (Remote Access Service) A Windows NT/2000 Server feature that allows remote users access to the network from their Windows laptops or desktops via modem. See RRAS and network access server. (reliability, availability, serviceability (system, design, hardware, software) Reliability, Availability, Serviceability - (RAS) Three key attributes of a computing system design. See reliability, availability, and serviceability. ). It is "switched fabric" technology, designed to deliver a point to point, scalable interconnect infrastructure, as well as hot-plug flexibility and true wire-speed communications. In addition, its baked-in notions of link availability, fault isolation, multi-path I/O across redundant links, zoned/partitioned switches, and quality of-service make it ideal for fast-changing and fast-growing configurations. The InfiniBand Architecture spec creates three different performance classes of link, known as 1x, 4x, and 12x. The lx link will be the high- volume, low-cost InfiniBand Architecture implementation. Each lx link can transmit 2.5Gbps in each direction, or about 500MBps overall--on par with a very fast PCI bus, and well in excess of the latest SCSI (320MBps), Gigabit Ethernet (100MBps), and Fibre Channel (200MBps) designs. But the real kicker is that in addition to having more bandwidth on each link, it's quite straightforward to support multiple parallel links. And bandwidth isn't the only challenge to which the InfiniBand Architecture rises. It also surpasses other interconnects in terms of efficiency, availability, and flexibility. The InfiniBand Architecture transitions system I/O from a handful of shared bus expansion slots to a flexible switched network design, enabling efficient, fast I/O at a distance. No longer must each server in a rack devote precious space and cost to having its own hard disk, CD-ROM CD-ROM: see compact disc. CD-ROM in full compact disc read-only memory Type of computer storage medium that is read optically (e.g., by a laser). , and PCI expansion slots. A few processors, memory modules, and support chips can be packaged with a power supply, fans, and an InfiniBand Architecture HCA HCA, n.pr See acid, hydroxycitric. into much less pace than today's 1U servers--forming a compute module. The computer has been generalized into an InfiniBand Architecture fabric of compute, storage, network/router, and similar modules. This modular approach provides high purchase efficiency and excellent incremental scalability, making it a very attractive package for enterprises and service providers alike. Clustering will particularly benefit. Clusters need low-latency, high-bandwidth connections in order to support seamless load, balancing, rapid application failover, and parallel database optimizations. But while such links have been desirable, they have been expensive and proprietary. But with the InfiniBand Architecture, such speedy, cluster-class links will soon be part of virtually every server. DAFS DAFS is a file access protocol based on NFS, which is designed specifically to take advantage of the Direct (RDMA (Remote Direct Memory Access) A communications protocol that provides transmission of data from the memory of one computer to the memory of another without involving the CPU. InfiniBand, Virtual Interface (VI) and RDMA Over IP are all forms of RDMA. ) and asynchronous I/O Overlapping input and output with processing. Both the hardware and the software must be designed for this capability. The peripherals must be able to run independent of the CPU, and the software must be designed to manage it. features of InfiniBand and VI-compliant interconnects in high-performance clustered environments. DAFS is a lightweight protocol A communications protocol designed with less complexity in order to reduce overhead. For example, it uses fixed-length headers because they are faster to parse than variable-length headers. that enables applications to directly access transport resources. Consequently, a DAFS-enabled application can transfer data from its application buffers to the network transport, bypassing the operating system operating system (OS) Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs. while still preserving file semantics. This translates into high-performance "native" file I/O Input/Output operations such as open, close, read, write and append, all of which deal with standard disk or tape files. The term would be used to refer to regular file operations in contrast to low-level system I/O such as dealing with virtual memory pages or OS tables of contents. over the InfiniBand fabric, with significantly improved CPU CPU in full central processing unit Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit. utilization, greatly reduced system overhead (due to fewer data copies, context switches, and interrupts), and far less network protocol processing. Also, since it is designed specifically for 24x365 data center environments where clusters of application servers need low latency access to shared pools of file storage, DAFS provides data integrity and availability features such as consistent high-speed locking, graceful failover of clients and servers, fencing, and data recovery. DAFS is a perfect complement to the InfiniBand Architecture as a key technology for the data center of the future, enabling independently and massively scalable storage and compute power, high-performance file I/O, data sharing, resiliency to application server and file server failure, ease of management, and a heterogeneous, standards-based environment that promotes cost efficiency and vendor independence. The Data (enter of the Future starts Here The InfiniBand Architecture has built up steam, and real products are just now getting underway. With version 1.0 of the specification released in October 2000 companies have shipped prototypes and samples in near-record time. Servers, peripherals, switches, and other infrastructure are expected to market in late 2001 and early 2002, followed by IT appliances, cluster hubs, networking equipment, storage arrays, and Web switches. To take advantage of this fundamental change in data center designs, OEMs, IHVs, and ISVs began building last year and are continuing their development efforts. By developing prototypes, planning a market-entry strategy, and gaining the experience necessary to win in the next generation data center, more infrastucture players are following suit. In the Network Economy, effective I/O is the ticket to success and the InfiniBand Architecture is one of the keys that will make it possible. Jim Pappas is a co-chair of the DAFS Collaborative and director of technology initiatives at Intel (Hillsboro, OR). David Dale is a co-chair of the DAFS Collaborative and industry evangelist for Network Appliance (Waltham, MA). |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion