Printer Friendly
The Free Library
14,503,119 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Distributed File Systems For Storage Area networks.


This article discusses some fundamental aspects of Distributed File Systems (DFS (Distributed File System) An enhancement to Windows NT/2000 and 95/98 that allows files scattered across multiple servers to be treated as a single group. With Dfs, a network administrator can build a hierarchical file system that spans the organization's LANs and ) for Storage Area Networks (SANs). Many desirable features make a SAN DFS attractive to a production environment that processes huge volumes of bulk data and requires high bandwith. Throughout the discussion, the traditional server-attached storage model is compared against the distributed network-attached storage See NAS.  model. The remainder of the article will focus on issues of DFS configuration, data placement strategies, cross platform data sharing The ability to share the same data resource with multiple applications or users. It implies that the data are stored in one or more servers in the network and that there is some software locking mechanism that prevents the same set of data from being changed by two people at the same time. , fault tolerance See fault tolerant.

(architecture) fault tolerance - 1. The ability of a system or component to continue normal operation despite the presence of hardware or software faults. This often involves some degree of redundancy.

2.
, and performance. Finally, we will show that distributed file systems can provide superior performance over traditional server-attached storage shared file systems.

Various shared file systems have been in use for more than fifteen years. They started as simple systems using file server protocols such as the Network File System (NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS.

NFS - Network File System
). These products are still a very large part of today's shared storage solutions. With the advent of open standard technologies such as Fibre Channel and its ability to imbed im·bed  
v.
Variant of embed.


imbed
Verb

[-bedding, -bedded] same as embed

Verb 1.
 SCSI commands, it is possible to connect mass storage directly into a network at relatively low cost. Using compatible Fibre Channel Host Based (1) A system controlled by a central or main computer. A host-based system typically refers to a hierarchical communications system controlled by a central computer.

(2)
 Adapters (HBAs), any work-station can address the storage and access data to and from it, just as it would do to a directly connected SCSI SCSI
 in full Small Computer System Interface

Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB.
 storage device. The network, which includes HBAs, hubs, switches, and network-attached storage, is called a Storage Area Network (SAN). Fibre Channel is a high-speed (100MB/sec) channel that is capable of connection distances measured in kilometers. The Fibre Channel protocol uses addressable Reachable. When something is addressable, it can be identified and manipulated independently of its surroundings. For example, screen pixels and RAM memory are addressable. Each of the screen's picture elements can be individually turned on and off, and each of the memory's bytes can be  nodes in such a way that storage devices can be configured in a network fabric rather than point-to-point. This capability provides a storage network that is highly scalable, has high performance, and allows complete sharing of any connected device.

The distinct difference between using the conventional server-attached storage shared file system and a SAN is that any workstation connected to the network fabric can directly access the network-attached storage devices. The new connectivity obviates the need for a server and promotes a more distributed approach to managing data. This concept has started a new surge of distributed storage Storing data in multiple computers or in computers that are geographically dispersed. This was an early term for storage that evolved into SANs and storage virtualization. See SAN and storage virtualization.  product development and has been coined the "SAN solution."

Many vendors have committed resources to developing hardware products that comprise a SAN by producing a variety of Fibre Channel compatible hardware components. In and by themselves, they offer a high-speed access method to storage. What has been missing from the formula is the ability for software to leverage the distributed nature of the storage network. At the core of every operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 are file systems that abstract disk storage and allow multiple programs to reliably share this resource. Existing local file systems such as Microsoft's NTFS (NT File System) An optional file system for Windows NT, 2000 and XP operating systems. NTFS is the more advanced file system, compared to FAT32. It improves performance and is required in order to implement numerous security and administrative features in the OS.  1, SGI's XFS XFS X Font Server (Sun)
XFS Extended File System
XFS X-Fleet Sentinels (gaming clan)
XFS Extensions for Financial Services (software interface specification) 
, and Apple's HFS (Hierarchical File System) The file system used in the Macintosh. The first version, known as "Mac OS Standard," was introduced in 1985. HFS+, an enhanced version, came out in 1998 in preparation for the upcoming Mac OS X operating system.  assume that any visible storage, whether in a network or locally connected, is owned by the local workstation. If the file system accesses the storage without coordination with other workstations in the network and two or more inadvertently share access to the network-attached storage devices, data corruption Data corruption refers to errors in computer data that occur during transmission or retrieval, introducing unintended changes to the original data. Computer storage and transmission systems use a number of measures to provide data integrity, the lack of errors.  occurs. The early SAN devices were used only in a server-attached shared storage environment because of this problem and due to no readily available software solution. In the last three or four years however, a number of mostly third-party vendors have begun developing and offering distributed file system solutions that exploit the SAN distributed environment.

This article will discuss how a SAN File System (SAN FS) operates. We will discuss the difference between a typical server-attached shared storage model using NFS and a distributed network-attached storage model using Mountaingate's CentraVision File System (CVFS). Then, we will address important SAN FS features such as configuration, data placement strategies, cross platform data sharing, fault tolerance, and performance.

STORAGE MODELS

* The Server-Attached Shared Storage Model. In a typical data processing data processing or information processing, operations (e.g., handling, merging, sorting, and computing) performed upon data in accordance with strictly defined procedures, such as recording and summarizing the financial transactions of a  center today, there may be many different methods of sharing storage among multiple workstations. The most prevalent storage model is server-attached. This model uses one or more large workstations or file servers to locally attach the shared disk storage. The storage is then shared over some network topology See topology.  using protocols like NFS. The systems are inter-connected through network interfaces such as Ethernet, HiPPI, or ATM. Extensive effort has been made by numerous vendors to make the file server model a high performance solution. A problem exists in that the model must copy from the data storage to the server's memory before it can transmit the information to the requestor. The performance, therefore, is gated by the speed of the server's memory and processing power (Fig 1).

* The Distributed Network-Attached Storage Model. By making storage a part of the network rather than a part of the local server, a DFS can use a different access method. The storage model now changes from server-based to peer-to-peer between the workstation and the storage. The advantage to this model is that with proper system software the data can be transmitted directly from the storage to any application without using a file server as a middleman mid·dle·man  
n.
1. A trader who buys from producers and sells to retailers or consumers.

2. An intermediary; a go-between.
 (Fig 2).

When storage is placed on the network it is, in fact, a distributed object Distributed objects are software modules that are designed to work together, but reside either in multiple computers connected via a network or in different processes inside the same computer. . Many workstations can access the storage equally, which adds a new level of complexity. The file system software on each workstation must now somehow coordinate critical operations to the metadata. Metadata operations are related to the management of the allocation and name space of the data on the storage. For example, a user creates a file called myfile and writes 2,000 bytes to it. In this situation, two operations must be globally unique to the file system. First, the name space must be guaranteed to be unique. There cannot be two files called myfile in the same directory. Second, the allocation of data blocks must not collide with other space allocation. If two workstations simultaneously request a block of storage, then they must not receive the same block. These operations require some sort of locking, such that serialization se·ri·al·ize  
tr.v. se·ri·al·ized, se·ri·al·iz·ing, se·ri·al·iz·es
To write or publish in serial form.



se
 occurs. A DFS must, therefore, have an object that manages, at a minimum, the locking of metadata compo com·po  
n. pl. com·pos
Any of various combined substances, such as mortar or plaster, formed by mixing ingredients.



[Short for composition.]
 nents. These objects may be in themselves highly distributed components, but there must be some agreement of mutual exclusion (parallel, operating system) mutual exclusion - (Or "mutex", plural: "mutexes") A collection of techniques for sharing resources so that different uses do not conflict and cause unwanted interactions. One of the most commonly used techniques for mutual exclusion is the semaphore. .

* The Hybrid Network-Attached Storage Model. There are other ways of managing critical metadata operations. For example, a DFS can use a distributed lock manager A distributed lock manager (DLM) provides distributed applications with a means to synchronize their accesses to shared resources.

DLMs have been used as the foundation for several successful clustered file systems, in which the machines in a cluster can use each other's
 and metadata server using a communications network The transmission channels interconnecting all client and server stations as well as all supporting hardware and software. . This method requires both a SAN and a communications network to successfully operate the file system. It is possible to operate a communications network over Fibre Channel, but in essence, it must be considered a separate logical network from the SCSI interfaces to the network-attached storage. This model is referred to as a hybrid network-attached storage model. A hybrid model consists of a control path that uses a metadata server protocol and a data path (like Fibre Channel) that directly accesses the storage (Fig 3).

An example of a hybrid file May refer to a graphics file that contains vector graphics and bitmapped graphics. See metafile.  system s Mountaingate's CVFS. A metadata server using a TCP/IP TCP/IP
 in full Transmission Control Protocol/Internet Protocol

Standard Internet communications protocols that allow digital computers to communicate over long distances.
 network manages all metadata requests. All data operations are managed directly by the requesting application server using a Fibre Channel SCSI interface. Thus, the server-based model is used for controlling metadata operations and the distributed model is used for data. A minor drawback to this model is that the speed of the TCP/IP network and the capability of the meta-data server gate metadata operations. However, the volume of data transmitted between the application server and the metadata server are just a fraction of the total bandwidth available. When contrasted to the hundreds of megabytes per second (unit) megabytes per second - (MBps, MB/s) Millions of bytes per second. A unit of data rate. 1 MB/s = 1,000,000 bytes per second (not 1,048,576).  transmitted through the data path versus kilobytes of control data transmitted over the TCP/IP network, the model has been shown to scale very well.

CONFIGURATION

In server-attached storage models using protocols such as NFS, configuration is typically simple.A top-level directory is exported from the server and any application that has permission can access the file system. Details of the file system are managed on the server since NFS is really a network file system on top of a local file system. For a DFS, configuration tends to be more complex. It contains the same complexities that are found in managing a TCP/IP network. There are issues of storage identification (Storage Network Address versus IP address), connectivity (HBAs versus NICs), and topology (switches and hubs.)

In CVFS, there is a concept of Network Storage Pools first described by Soltis' Global File System (GFS See Google File System.

GFS - Grandfather, Father, Son
), termed Stripe Groups. Stripe groups are sets of storage devices grouped into a logical set. The complete set of stripe groups is then portrayed to the system and end-user as a single file system. A stripe group might be a stripe of JBOD (Just a Bunch Of Disks) A group of hard disks in a computer that are not set up as any type of RAID configuration. They are just a bunch of disks.

JBOD - Just a Bunch Of Disks
 (Just a Bunch Of Disks See JBOD.

(jargon, storage) Just a Bunch Of Disks - (JBOD, or "Just a Bunch of Drives") A storage subsystems using multiple independent disk drives, as opposed to one form of RAID or another.
) units, a RAID-5 disk array, or any other storage set that makes sense (Fig 4).

These groupings of storage can be tuned to optimize each pool so that it makes the best use of their bandwidth in the SAN. For example, eight Seagate Barracudas in a Fibre Channel chassis configured as a JBOD using Raid-0 can just about saturate sat·u·rate
v. Abbr. sat.
1. To imbue or impregnate thoroughly.

2. To soak, fill, or load to capacity.

3. To cause a substance to unite with the greatest possible amount of another substance.
 a single Fibre Channel. By placing this component on one channel and connecting it to a Fibre Channel Fabric A Fibre Channel fabric (or Fibre Channel switched fabric, FC-SW) is a switched fabric of Fibre Channel devices enabled by a Fibre Channel switch. Fabrics are normally subdivided by Fibre Channel zoning. Each fabric has a name server and provides other services.  Switch, it makes the best use of its potential bandwidth. It also allows the file system and associated applications the flexibility of allocating storage blocks to specific stripe groups based on their described characteristics.

Stripe groups enable the CVFS file system to support concurrent maintenance. Concurrent maintenance is the concept where a stripe group can be taken off-line for maintenance without severely impacting access to the rest of the file system. The importance of this feature grows as a function of the size of the file system. Downing a 5TB file system that is attached to many users can be quite disruptive. It is more efficient to down only the failed component and keep the rest of the file system running. CVFS supports the ability to up and down its stripe groups individually. It can also switch on or off specific read or write capabilities to a stripe group. This feature is used when a component of a group is failing and data must be drained from the failing group to another. The stripe group is marked write disabled so no further allocations or writes will be made to it. Then, the group is maintenance copied to other fully functional pools in the SAN.

DATA PLACEMENT

Using NFS, there are no mechanisms for an application to describe how the data is to be used. Allocation of data is controlled by the local file system on the file server. Creating separate mount points for each unique file system group on the server and placing the onus of decision upon the application to use the proper storage path typically solves this problem. While this solution is workable, it lacks flexibility. If something in the server-attached storage changes, then all the applications that share the storage must manually change their access method. For example, they would be required to change the path name of the mounted NFS storage. It would be more efficient if issues of storage placement were handled transparently in the distributed file system. If a storage requirement changes, then administration could be applied to the file system without notifying all of the applications to change their access method.

The CentraVision File System has a concept of data placement. Data placement is the ability to allocate storage onto different stripe groups using specific constraints. For example, a file system has a stripe group optimized for High Definition (HD) Video streaming See streaming video and video stream. . Another group is optimized for regular bookkeeping or text files. Data placement is achieved either by using a special command or an Application Program Interface (API) call to the file system. The application can request its high bandwidth file on the HD stripe group and let all the other files default to the regular stripe group. This separation of performance data from bookkeeping data allows optimal usage of the two stripe groups. It also can prevent excessive interleaving interleaving - sector interleave  of data blocks that can reduce performance due to channel contention and storage head motion (disk chatter). This feature is transparent to most applications. It does not require interaction if something on the file system changes. Essentially, changes only have to be admini stered to the file system and not to all of the participating share storage applications.

CROSS PLATFORM DATA SHARING

It is unusual that a data center has all of its machines running the same operating system. A typical film and video post-production house has SGI (SGI, Sunnyvale, CA, www.sgi.com) A manufacturer of workstations and servers, founded in 1982 by Jim Clark. The company was founded as Silicon Graphics, Inc., but changed to its acronym in 1999. , NT, and Apple systems. These systems are typically used for different specific functions in the workflow, but they all are working on some aspect of the same project. When data resides on server-attached shared storage, it is very common for projects to transfer important pieces of data to the local workstation's storage simply to save time. This data replication is very expensive and the time it takes to transfer the data increases the project's time to complete. It also adds a new class of problems if the shared storage version of data is updated, but the local workstation's copy is not.

Data sharing without data movement can improve production workflow times significantly. This is where a SAN environment can truly save money. Having the same data on the same storage being manipulated by a heterogeneous mix of workstations and applications is a necessity for a true SAN environment. Most distributed SAN file systems have or are planning to have cross-platform support. CVFS is supported on all SGI and Windows NT (Windows New Technology) A 32-bit operating system from Microsoft for Intel x86 CPUs. NT is the core technology in Windows 2000 and Windows XP (see Windows). Available in separate client and server versions, it includes built-in networking and preemptive multitasking.  workstations with Mac support on the horizon.

One of the more complex and expensive issues in the development of a DFS is allowing the file system to be used simultaneously by these different operating system platforms. Each platform has its unique features and idiosyncrasies. For example, on most Unix platforms, there is the concept of a User Identifier (operating system) user identifier - 1. (Or "uid", "user id") A number or name which is unique to a particular user of a computer or group of computers which share user information. The operating system uses the uid to represent the user in its data structures, e.g.  (UID (programming, database) uid -

1. user identifier.

2. unique identifier - of any sort, possibly following sense 1.

Compare with SKU for sense-development.
), a Group Identifier (operating system) group identifier - (gid) A unique number, between 0 an 32767, identifying a set of users under Unix. Gids are found in the /etc/passwd and /etc/group databases (or their NIS equivalents) and one is also associated with each file, indicating the group to which its  (GID 1. (operating system) gid - group identifier.
2. (filename extension) gid - global index.
), and permissions, or modes for owner, group, and all others. On Windows NT, there are access lists and privileges, It is very difficult to map these disparate models together. CVFS accomplishes this by having the ability to map NT and Apple users to a common Unix user/group model. As SAN products become more popular, consumers will expect a more complete solution to address security issues.

Another constraining issue is that most operating systems Operating systems can be categorized by technology, ownership, licensing, working state, usage, and by many other characteristics. In practice, many of these groupings may overlap.  differ in their support of 64-bit sizes in file systems. In large bulk-data processing applications, file sizes regularly can exceed 4GB. A majority of applications use the default POSIX (Portable Operating System Interface for UNIX) An IEEE 1003.1 standard that defines the language interface between application programs and the Unix operating system.  stat O system call interface but vendors describe 32-bit containers for its file size. CVFS maintains all of its file sizes in 64-bit containers and sometimes has to modify the system interface to support only 32-bit sizes. This problem must be addressed and solved by operating system vendors and the standards community. In the interim, some constraints will be placed on certain incompatible cross platform operations.

Heterogeneous support is one of the big advantages to the NFS server-attached storage model. The NFS protocol is consistent and is easily implemented on a number of operating system platforms. However, poor performance is the result if data and metadata must be abstracted into Remote Procedure Call (RPC (Remote Procedure Call) A programming interface that allows one program to use the services of another program in a remote machine. The calling program sends a message and data to the remote program, which is executed, and results are passed back to the calling ) protocols and small buffered data blocks. The cost is measured in high latency and low bandwidth. CVFS appears to each platform as a local file system. It does not use bridges or adapters to convert a native format into the format of the target operating system. It also uses the Direct Memory Access (DMA (1) (Digital Media Adapter) See digital media hub.

(2) (Document Management Alliance) A specification that provides a common interface for accessing and searching document databases.
) capabilities of each hardware component on each platform. This capability delivers high performance to all the different system platforms without compromise.

FAULT TOLERANCE

In a file server environment, if the server goes down, all the application servers accessing the shared storage are essentially down. If their critical data is on the shared storage, there is nothing that can be done until the file server is back on-line. In other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"
put differently
, the model has a single point of failure that is difficult to solve. Many vendors have produced a number of solutions to this problem, but most are expensive to implement. The predominant NFS solution is to use fail-over file servers and switched storage units.

Using a DFS, the single point of failure problem can be solved in many different ways. Each workstation in the DFS still has access to the storage even if other workstations have failed or are off-line. Storage networks by their nature can be highly fault tolerant The ability to continue non-stop when a hardware failure occurs. A fault-tolerant system is designed from the ground up for reliability by building multiples of all critical components, such as CPUs, memories, disks and power supplies into the same computer. . Of course, there are hardware planning issues to consider. For example, using dual channels to the SAN Fabric Switch or even using dual switches will reduce point of failure problems. By using mirrored storage arrays, even the data can be redundantly stored.

CVFS and other hybrid network-attached storage models inherit the file server's single point of failure problem because the metadata is usually served by one machine at a time. The salvation is that the metadata can be easily placed and shared in the SAN. If the metadata server fails on one machine, another machine can quickly become the metadata server and take over the file system's management. This solution can be achieved with no additional hardware costs since each workstation already has access to the storage.

An additional consideration to make a hybrid file system more fault tolerant is to have redundant communication networks. If the communication network is somehow partitioned due to a hardware failure, some portion of the SAN users may be excluded from metadata server access. If a metadata server fail-over occurs because a workstation cannot reach the active server due to a network partition, we experience the symptom coined the split brain. This problem can potentially be severe if two servers attempt to manage a single file system. By making the control network redundant, this problem is greatly reduced. CVFS also has additional measures in place to prevent this problem by verifying fail-over situations through the Fibre Channel storage network itself.

PERFORMANCE

Performance of a distributed file system in a SAN must first be measured by its ability to reduce the time spent to complete a project. Not having to move data from one platform to another saves time. Using optimal placement strategies to best exploit the capabilities of the storage saves time. Not having file server bottlenecks and having direct, high-speed connections to each workstation saves time. These reasons by themselves can justify the adoption of SAN DES technology. When vendors address NFS performance, they measure the number of transactions or operations a second that a NFS file server can produce. While this is a very useful measure of performance, it ignores the issue of data bandwidth to the application. An NFS file server may be able to execute thousands of transactions per second In a very generic sense, the term Transactions Per Second refers to the number of atomic actions performed by certain entity per second. In a more restrictied view, the term is usually used by DBMS vendor and user community to refer to the number of database transactions performed . However, to an individual application it may only be able to generate data transfer rates of a few megabytes per second. A SAN DFS can attain a high transaction rate by its very nature. Soltis treats this subject in comparing their SAN DFS to NFS. In addition to a high transaction rate, a SAN DFS can also sustain high data transfer rates to an individual application. The net effect is superior application performance and thus less time to complete a project.

The next measurement we must make of a distributed file system is that of single workstation performance. This type of performance is typically measured as the number of megabytes per second of real data payload that can be delivered to the application. Using a single 100MB/sec Fibre Channel connection and SCSI protocol, a DFS can deliver about 94MB/sec data-payload performance capability on that single channel. This assumes that the storage is capable of delivering that speed. Therefore, a single high-performance workstation, given the right IO conditions and storage speeds, should be able to sustain at least the channel bandwidth. With more than one workstation competing for the same 100MB/sec channel, performance bottlenecks can occur due to storage disk head motion and channel contention. These artifacts artifacts

see specimen artifacts.
 are hard to manage. What a DFS needs to manage is the overhead of the file system and to some extent the access distribution. The measurement for multiple workstations accessing a DFS is called the aggregate performance. The goal is to make the single channel aggregate performance as close to the best single workstation performance as possible.

As mentioned earlier, ganging storage devices together as a striped storage pool can greatly enhance performance. A Seagate Barracuda barracuda, slender, elongated fish of tropical seas. Barracudas have long snouts and projecting lower jaws armed with large, sharp-edged teeth. They are ferocious, striking at anything that gleams, and are considered excellent game fishes.  disk can achieve a read performance of around 10MB/sec. By placing eight Barracudas on the same Fibre Channel and striping Interleaving or multiplexing data to increase speed. See disk striping.

striping - data striping
 the IO across all the units, close to channel speed can be achieved. Conversely, a single Ciprico RAID-3 Array might best be placed on a channel by itself since it can saturate most of the Fibre Channel's bandwidth.

SINGLE WORKSTATION PERFORMANCE

Figure 5 shows individual performance of a Fibre Channel loop with eight Seagate Barracuda 9GB drives striped RAID-0. The stripe breadth is 128 blocks and each block is 4,096 bytes. The host is an SGI Challenge using the 6.2 IRIX A Unix-based operating system from SGI that is used in its computer systems from desktop to supercomputer. It is an enhanced version of Unix System V Release 4. IRIX integrates the X Window system with OpenGL, creating the first real time 3D X environment.  operating system running with a Prisa Fibre Channel HIO HIO Hole-In-One (chiropractic)
HIO Health Insuring Organization
HIO Harvard International Office
HIO Health Information Organization
HIO Horse Industry Organization
HIO Hogere Informatica-Opleiding
HIO High Income Opportunities
 host adapter Also called a "controller" or "host bus adapter," it is a device that connects one or more peripheral units to a computer. It is typically an expansion card that plugs into the bus. IDE and SCSI are examples of peripheral interfaces that call their controllers host adapters. See host. .

A server-attached storage comparison is not shown here since it is difficult to create a comparative environment. We defer to other papers written about NFS performance. For example, in a white paper written by Sun Microsystems, Inc. (SUN NFS 1995), they demonstrate NFS write through performances measured at around 7MB/sec. This is due to the file system's inherent necessity to run through a server and, in this case, its requirement to communicate to the workstation over 100 Base-TX Ethernet. Wood (Wood 1995) measured 12.5MB/sec sustained NFS bandwidth using a high performance OC3 (115Mbit) ATM channel.

In Fig 6, we show a test where we ran single CVFS performance runs on a PC NT. This machine consisted of a 200MHz (MegaHertZ) One million cycles per second. It is used to measure the transmission speed of electronic devices, including channels, buses and the computer's internal clock. A one-megahertz clock (1 MHz) means some number of bits (16, 32, 64, etc.  Pentium with 64MB memory and an Emulex 7000 Fibre Channel host adapter connected to eight Seagate Cheetah 9GB disks through a 16-port Brocade Silkworm silkworm, name for the larva of various species of moths, indigenous to Asia and Africa but now domesticated and raised for silk production throughout most of the temperate zone. The culture of silkworms is called sericulture.  Fibre Channel switch In a computer storage field, a Fibre Channel switch is a network switch compatible with Fibre Channel (FC) protocol. It allows the creation of a Fibre Channel fabric, that is currently the core component of most storage area networks. . The operating system was Windows NT 4.0, Service Pack 3.

Fig 7 shows the single workstation read performance of a SGI Octane. This machine has dual 195MHz CPUs and 256MB of memory. The adapter is an Adaptec XIO XIO St Marys, Ontario, Canada - St Marys / via Rail Service (Airport Code)
XIO Execute Input/Output
XIO Xdr Input Output
XIO All in One
XIO Extended Input Output
XIO Any Input Output
 Fibre Channel host adapter connected directly to a JBOD of eight 9GB Seagate Cheetah Drives. The file system was striped RAID-0 using 128, 4096-byte blocks per disk As can be seen, the Octane and Cheetah combination is capable of saturating the Fibre Channel at near the 4MB transfer size.

AGGREGATE PERFORMANCE

Now that we have investigated single workstation performance, focus returns to the more important aggregate performance issues. In the next experiment, we put the SGI Challenge, the SGI Octane, the NT PC, and one other NT PC onto a 16-port Brocade Silkworm Fabric Switch. The new PC had the same configuration as the other, except it was from a different manufacturer (Dell versus HP). We, then, connected two eight drive JBOD chassis consisting of Seagate 9GB Cheetah disk drives. Since the two JBOD chassis each have a Fibre Channel connected to them, the maximum theoretical bandwidth possible is 200MB/sec. In the Octane test, we saturated the channel at 94MB/sec, therefore, we can surmise that the potential aggregate bandwidth of the storage network is around 188MB/sec. We created a single file system of 16 disks on the two chassis. The file system was divided into four stripe groups (storage pools) of equal size. The fourdrive stripes were set at 128, 4096-byte blocks per disk. Using the CVFS data placement feature, we directed each workstation to a different stripe group. This helped eliminate the artifacts of head movement and disk cache interaction. The test was accomplished by reading four 2GB files simultaneously and measuring the individual performance of each workstation. All applications were synchronized to begin their tests at exactly the same time. The aggregate bandwidth was, then, summed up from the individual performance numbers.

The individual performance numbers were observed to be highly volatile and this graph is an average representation of a number of passes. However, the aggregate performance value remained very consistent through all of the iterations. Using 4MB transfer sizes, CVFS and the Fibre Channel hardware sustained 88.5% of theoretical and 94.1% of available bandwidth through the storage network. What is demonstrated in these numbers is that a very large portion of the potential bandwidth of the storage network can be exploited even when there is contention among multiple workstations.

We have described the basic differences between the conventional server-attached shared storage model, a distributed networkattached storage model, and a hybrid network-attached storage model. We presented our opinion of important SAN DFS features that make the model valuable to a production environment. We have shown that a distributed network-attached file system can be superior to the more conventional server-attached shared storage model. Due to the bottlenecks of a server-attached storage model, we can imply that, without distributed file systems, the benefit of a SAN is not fully realized. It is clear that by allowing a direct connection from each application server to the network-attached storage, we can see dramatic performance gains. These gains are realized in productivity to the end-user and, therefore, savings to the customer. It is also apparent that more development and cooperation is required by various operating system vendors to support cross-platform distributed file system products. It is possible that a new standard must emerge to help vendors drive towards DFS compatibility and interoperability for the same reasons that POSIX standards were made for end-user applications.

Brad Kline is the principal software architect at Mountaingate Imaging Systems Corporation (Reno, NV).
COPYRIGHT 2000 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2000, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Technology Information
Author:KLINE, BRAD
Publication:Computer Technology Review
Date:Jan 1, 2000
Words:4281
Previous Article:"Storage Is Becoming The IT Strategy Driver".(Company Operations)
Next Article:"Tape Is Always Going To Be Important".(Company Operations)
Topics:



Related Articles
SNIA Definitions.(Storage Networking Industry Association on storage area networks)(Technology Information)
The Evolution Of Network Storage.(Technology Information)
Global namespace: The future of distributed file server management, part 1. (Storage Networking).
Virtual storage and real confusion: a big disconnect between what vendors offer and what users want.(Industry Overview)
The new world of NAS virtualization: the next generation could fulfill NAS' promise. (Storage Networking).(network attached storage)
File sharing over the WAN. (Storage Networking).
Table for two: NAS and the general-purpose server; how network attached storage can maximize the general-purpose server's capabilities.(Storage...
High Performance Computing: past, present and future.(Storage Networking)
Wide area file sharing across the WAN.(Storage Networking)(wide area network)
NAS virtualization simplifies file storage management.(Storage Networking)(Network Attached Storage)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles