Printer Friendly
The Free Library
14,380,430 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

A survey of next generation storage system archrectures: beyond storage networking. (Storage Networking).


Architectural philosopher Ludwig Mies van der Rohe Van Der Ro·he  

See Ludwig Mies Van Der Rohe.
 once stated: "Architecture starts when you carefully put two bricks together." In the world of information storage, storage devices are the "bricks" and storage interconnect is the mortar. The focus of this article is not about building a better "brick" but rather how to "carefully" put storage bricks together, creating a more effective and elegant storage architecture.

Storage Device Technologies vs. Storage Architectures

Advances in storage architecture should not be equated with advances in storage device technology. Storage device technologies continue to evolve, especially with respect to capacity. Every reader of this article is acutely aware of how disk drive density has steadily increased even while cost per bit has dramatically decreased. It appears that this trend will continue into the future. For example, researchers involved with IBM's Millipede millipede (mĭl`əpēd'), elongated arthropod having many body segments and pairs of legs. Millipedes, sometimes termed thousand-legged worms, have two pairs of legs on each body segment except the first few and the last.  project predict they will be able to hold several feature films on a postage stamp postage stamp, government stamp affixed to mail to indicate payment of postage. The term includes stamps printed or embossed on postcards and envelopes as well as the adhesive labels.  size memory card by 2005.

Although advances in storage device technology to date have been impressive, organizing and managing large amounts of ever expanding data continues to vex storage administrators. The answer to this dilemma lies in how the data is systematically organized or, in other words Adv. 1. in other words - otherwise stated; "in other words, we are broke"
put differently
, in the storage system architecture. Like most technological innovation, storage architecture development passes through definite generations.

What Defines a Generation?

In his book The Innovator's Dilemma, Harvard professor Clayton Christiansen defines a "disruptive technology A new technology that has a serious impact on the status quo and changes the way people have been dealing with something, perhaps for decades. Music CDs all but wiped out the phonograph industry within a few years, and digital cameras are destined to eliminate the film industry. " as an innovation that eventually revolutionizes a market. Not to be confused with continual evolutionary changes of a technology, the advent of a disruptive technology marks the beginning of that market's next generation. Architectures, as systems of intertwined technologies, similarly go through revolutionary disruptions and generational changes. Computing architectures have moved from mainframe architectures through client-server architectures to Internet architectures.

In most cases, revolutionary trends, and hence generational changes, are much easier to spot in hindsight than during the early stages of the revolution. Christiansen documents that most enterprises based on prior technological generations miss the revolutionary change and fail to participate in the next generation. As evidence, note that few mainframe era computer manufacturers currently exist as independent entities.

Storage architectures have gone through similar generational changes, albeit at a slower pace.

First Generation Storage: Direct Attached Storage

First generation storage is typified as being tightly coupled See tight coupling.  to the computing system's processing unit via direct attached proprietary channels or standard buses such as SCSI SCSI
 in full Small Computer System Interface

Once common standard for connecting peripheral devices (disks, modems, printers, etc.) to small and medium-sized computers. SCSI has given way to faster standards, such as Firewire and USB.
 and ATA (1) (AT Attachment) The specification for IDE drives. See IDE.

(2) See analog telephone adapter.

ATA - Advanced Technology Attachment
. This is commonly referred to as a direct attached storage (DAS) configuration (Figure 1). These configurations were designed to focus first on performance and eventually on interoperability.

DAS configurations are primarily limited by the amount of storage that they can handle. Each channel or bus is limited to a relatively small number of devices, typically less than eight. The physical length of channel is also limited to, at most, a few meters. This means that any physical disaster affecting the processing unit likely affects the attached storage as well. Most distressing, from the storage manager's perspective, is the difficulty in reconfiguring the storage, most commonly to increase the amount of storage. Adding storage requires extensive periods of system downtime to physically attach the storage, configure it logically within the system and then reallocate Verb 1. reallocate - allocate, distribute, or apportion anew; "Congressional seats are reapportioned on the basis of census data"
reapportion

allocate, apportion - distribute according to a plan or set apart for a special purpose; "I am allocating a loaf of
 the new storage among the various applications.

Second Generation: Storage Networking

The innovation that defines the second generation of storage architectures involved decoupling Decoupling

The occurrence of returns on asset classes diverging from their normal pattern of correlation.

Notes:
Take for example stock and corporate bond returns, which normally rise and fall together.
 storage from processing units and interconnecting various storage devices via a dedicated storage network (Figure 2). One of the first companies to popularize pop·u·lar·ize  
tr.v. pop·u·lar·ized, pop·u·lar·iz·ing, pop·u·lar·iz·es
1. To make popular: A famous dancer popularized the new hairstyle.

2.
 storage networking was Vinca Vin·ca
n.
A genus of evergreens usually found in the Eastern hemisphere.



Vinca

plant genus of Apocynaceae family; contains cardiac glycoside; causes diarrhea; includes V. major (blue periwinkle), V.
 Corporation, starting in 1992. Jay Carlson, then president of Vinca, recalls: "Those were heady times, while most of the industry was struggling to properly terminate a SCSI bus, we were envisioning networks of distributed storage Storing data in multiple computers or in computers that are geographically dispersed. This was an early term for storage that evolved into SANs and storage virtualization. See SAN and storage virtualization.  modules executing 'SANware."'

Like most disruptive technologies, storage networking was initially viewed as an interesting approach applicable to particular niche markets. Storage networking is now accepted as the preferred storage architecture with a market size of over $100 billion. Storage networking comes primarily in two flavors, Storage Area Networking (SAN) and Network Attached Storage (NAS (1) See network access server.

(2) (Network Attached Storage) A specialized file server that connects to the network. A NAS device contains a slimmed-down operating system and a file system and processes only I/O requests by supporting the popular
). SANs typically make use of tightly coupled, block-based, low-latency Fibre Channel interconnect while NAS is typically a loosely coupled See loose coupling. , file-based, higher-latency Ethernet interconnect. The selection of storage networking flavor is guided by the application it serves. In general, high performance transaction processing Updating the appropriate database records as soon as a transaction (order, payment, etc.) is entered into the computer. It may also imply that confirmations are sent at the same time.

Transaction processing systems are the backbone of an organization because they update constantly.
 applications demand SAN configurations while lower performance, file access based applications do well with simpler to manage NAS configurations.

Is a New Generation Needed?

Storage networking addresses storage capacity issues satisfactorily but does not solve overall data management issues. Storage networks often require bolted-on external technologies to properly manage the storage. Management functions such as backup, archiving, cloning, replication, disaster recovery and snapshot copies must be acquired separately from third party vendors and suffer from lack of integration and inter-compatibility. The next generation of storage architectures should be able to address management functions as an inherent aspect of the storage architecture, not as an aftermarket add-on.

John Spiers John Spiers (born 1975) is an English melodeon, concertina and bandoneón player. He was born in Birmingham. His father was a Morris dancer. He attended John Mason School in Abingdon, and then went on to study genetics at King's College, Cambridge. , CTO (Chief Technical Officer) The executive responsible for the technical direction of an organization. See CIO and salary survey.  of LeftHand Networks, elaborates: "Current network storage strives to provide an always bigger, better and faster box but doesn't inherently address fundamental issues of manageability, including scalability, availability, replication and storage clustering." Jim Pownell, CTO of Inflection Systems, agrees: "Data storage customers continue to encounter a lot of pain managing data, particularly over the long term."

Next Generation Storage Initiatives

There are several interesting initiatives underway in the information storage industry. Some are focused on device technology, some on enhanced performance interconnect and others on better manageability. The following is a survey of some interesting initiatives and trends that portend por·tend  
tr.v. por·tend·ed, por·tend·ing, por·tends
1. To serve as an omen or a warning of; presage: black clouds that portend a storm.

2.
 future storage architecture developments.

Enhanced Backup Solutions Initiative (EBSI EBSI Ecole de Bibliothéconomie et des Sciences de l'Information (French: School of Biblio-economics and Information Sciences, University of Montreal, Canada) ). Storage industry futurist/visionary Michael Peterson has created a new organization, EBSI, to address advanced data protection issues. The EBSI's efforts will revolve around Verb 1. revolve around - center upon; "Her entire attention centered on her children"; "Our day revolved around our work"
center, center on, concentrate on, focus on, revolve about
 repairing what end-users have identified as "cracks in the armor" of current backup technology. According to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 Peterson, three primary backup issues top the concerns of end-users: meeting backup time schedules, confidence in the completeness of a backup routine, and the time it takes to restore data. Rather than a standards setting body, EBSI is intended to be a common ground between storage venders and endusers. Storage venders and end-users interested in participating in this initiative are directed to the EBSI website at www.enhancedbackup.com.

Network Unified Storage The capability of combining different data formats such as files, folders, contacts and e-mail messages into collections that can be viewed and manipulated as one group. See WinFS.  (NUS). NUS is being developed by LeftHand Networks of Boulder, Colorado The City of Boulder (, Mountain Time Zone) is a home rule municipality located in Boulder County, Colorado, United States. Boulder is the 11th most populous city in the State of Colorado, as well as the most populous city and the county . LeftHand Networks' Spiers believes that NUS is a revolutionary architecture because of its ability to effectively integrate both locally and remotely distributed standard storage. Spiers notes, "NUS is to storage as blades are to servers." NUS relies on a block-based protocol characterized as "iSCSI lite." This protocol removes the SCSI protocol's device specific handcuffs hand·cuff  
n.
A restraining device consisting of a pair of strong, connected hoops that can be tightened and locked about the wrists and used on one or both arms of a prisoner in custody; a manacle. Often used in the plural.

tr.v.
 and adds extensive management capabilities. Although currently proprietary to Left-Hand Networks, Spiers indicates that he is working with the Storage Networking Industry Association An association of producers and consumers of storage networking products, whose goal is to further storage networking technology and applications. The Storage Networking Industry Association, or SNIA  (SNIA (Storage Networking Industry Association, San Francisco, CA, www.snia.org) An organization devoted to the advancement of mission critical storage systems. Founded in 1997, its goal is to determine the standards that must be developed to allow hosts and storage systems to interact via ) to make the NUS protocol an open standard.

Direct Access File Systems (DAFS (Direct Access File System) A high-performance file sharing protocol based on the VI memory-to-memory architecture. Designed for storage area networks (SANs), DAFS provides bulk data transfer directly between the application buffers of two machines without ). The DAFS protocol is a new file-access protocol designed to take advantage of emerging RDMA (Remote Direct Memory Access) A communications protocol that provides transmission of data from the memory of one computer to the memory of another without involving the CPU. InfiniBand, Virtual Interface (VI) and RDMA Over IP are all forms of RDMA.  (remote direct memory access) interconnect technologies such as InfiniB and, VI and iWARP. DAFS is an optimized method of shared data access designed to bypass performance bottlenecks inherent in current network-attached storage See NAS.  and direct-attached storage Direct-attached storage (DAS) refers to a digital storage system directly attached to a server or workstation, without a storage network in between. It is a retronym, mainly used to differentiate non-networked storage from SAN and NAS.  architectures. It is optimized for high-throughput, low-latency communication, and for the requirements of local file-sharing architectures.

Currently, both network-attached storage and direct-attached storage require the operating system operating system (OS)

Software that controls the operation of a computer, directs the input and output of data, keeps track of files, and controls the processing of computer programs.
 to copy data from file systems buffers into application buffers. DAFS, in conjunction with its underlying specialized interconnect hardware, enables direct, memory-to-memory transfers between application servers and storage servers. The application server, in essence, treats the remote storage server semiconductor memory as though it were a mapped section of its own internal memory, with only minimal performance and latency degradations.

Object-Based Storage. Object-based storage (Figure 3) moves low-level storage functions into the storage device itself. This enhanced storage device is designated an Object-based Storage Device (OSD (1) (On-Screen Display) An on-screen control panel for adjusting monitors and TVs. The OSD is used for contrast, brightness, horizontal and vertical positioning and other monitor adjustments. ). An OSD is a network-attached storage device that presents an interface of arbitrarily named data objects of variable size rather than sequentially numbered fixed-size blocks. These objects are subsequently used to deal with data storage details, such as request scheduling and data layout. One or more specialized servers manage the associated metadata separately. By separating data storage, metadata storage and management, heavily accessed data channels can be optimized for high performance and distributed access.

Content Addressed Storage A storage technique from EMC for content that is in its final form (fixed content). CAS assigns an identifier to the files so they can be accessed no matter where they are located.  (CAS). A CAS is a storage system whose storage objects are identified by their contents, or by a part of their contents, rather than by their names or positions with a storage device.

Storage industry powerhouse EMC (1) (EMC Corporation, Hopkinton, MA, www.emc.com) The leading supplier of storage products for midrange computers and mainframes. Founded in 1979 by Richard J. Egan and Roger Marino, EMC has developed advanced storage and retrieval technologies for the world's largest companies.  provides a CAS-based networked storage solution designed exclusively for fixed content called Centera. With the Centera approach, applications access data objects based on a globally unique address that is derived from the object. That address, because it is unique to content, enables Centera to guarantee both the authenticity and integrity of the object. The application requires no knowledge of the physical or logical placement of content.

Caching Systems. Caching is a familiar method used in microprocessors to mitigate performance and capacity differences between frequently used memory locations and less used memory locations. Most modem processing units use a hierarchy of caches, starting with a small, high performance cache on the silicon of the processor itself. This on-chip cache is augmented by a larger, slightly slower performance cache on the substrate of the processor's package, and a larger, even slower, performance cache located on the processor's motherboard. Data held on local disk drives is often similarly cached in the motherboard's general semiconductor memory by the file system.

Storage architectures based on caching extends the caching hierarchy even further. Rather than static storage, the local hard disk drives of a client machine are treated as a cache for a LAN-attached storage server. The LAN-attached server may be, in turn, a cached subset of the complete enterprise dataset held, preferably, in an off-site storage facility. Each higher level of cache is constructed of respectively slower and, hence, less costly storage and interconnect technology.

Storage Messaging. Storage messaging proponent Carlson views the ultimate future storage architecture as an infinitely scalable message-based system. Storage messaging applies Oxford's C.A.R. Hoare's seminal "Communicating Sequential Processes (language, parallel) Communicating Sequential Processes - (CSP) A notation for concurrency based on synchronous message passing and selective communications designed by Anthony Hoare in 1978. It features cobegin and coend and was a precursor to occam. " concepts to storage. The result is the long anticipated convergence of data storage and data communication. Convergence is possible because any communication channel has an inherent storage capacity (bandwidth times latency) and, conversely, any storage device has inherent communications channel Also called a "circuit" or "line," it is a pathway over which data are transferred between remote devices. It may refer to the entire physical medium, such as a telephone line, optical fiber, coaxial cable or twisted wire pair, or, it may refer to one of several carrier frequencies  characteristics (capacity divided by access time).

Futurist George Gilder has coined the term "storewidth" to describe this convergence. He points out that while bandwidth is abundant, connectivity is virtually absent. "Storewidth" is the conversion of abundant bandwidth and heterogeneous petabytes into accessible information.

The foundation of storage messaging is the notion that any data item should be treated as a message. All data is always considered to be in transit from the point at which it is created to the point it is destroyed. In general, each message contains a source, a destination, metadata, as well as the data payload itself. After creation, the messages travel seamlessly from device to device, channel to channel, until they are eventually destroyed. These messages, then, essentially manage themselves.

Carlson further clarifies, "Storage networking addresses storage quantity issues; storage messaging addresses storage quality issues. Think of the way our brain works. The information in our brain is not a collection of static data. It is a collection of messages, each with the capability of linking to and associating with other messages. These interlinked, distributed messages then become a coherent thought."

A Completely Different Way of Looking at Storage

Perhaps what will truly revolutionize the storage market are not novel architectures but rather a completely new way of looking at data storage. EBSI's Peterson points out, "Storage is now about business values, not price per megabyte. The storage manager needs to turn network storage in to a business weapon. Companies should be generating profits from their storage; storage should be turned from a cost center into a profit center. It can be shown that an investment in storage produces, on average, a four times return on investment." It follows that storage is becoming an independent profit center of a company's most valuable tactical and strategic asset.

In many companies, the value of that company's intangible assets greatly exceeds that of its tangible assets. These intangible assets include the company's databases, business processes, customer goodwill, investor confidence and, most important, its employees. All of these assets are thoroughly enmeshed en·mesh   also im·mesh
tr.v. en·meshed, en·mesh·ing, en·mesh·es
To entangle, involve, or catch in or as if in a mesh. See Synonyms at catch.
 with the company's data.

Carlson agrees with this new view of data storage within an enterprise. "The value of a company's data needs to be recorded on its balance sheet and audited as any important financial asset is." Specifically, the storage manager should take on the characteristics of a business unit manager as opposed to an infrastructure manager.

The Future

If one were to ask ten storage systems visionaries about the future, one would likely receive ten very divergent opinions. Swedish Physicist Niels Bohr opined: "Predictions are hard to make, especially if they involve the future." The easiest prediction to make is that the amount of data under management will continue to grow while the cost per bit of storage media will continue to shrink. Managing storage will be an ever-increasing component of IT costs. Carlson also points out that the ratio of actual data with respect to metadata will continue to shrink. More and more of the data under management will be metadata. The good news is that the initiatives and architectures discussed in this article will substantially enhance the productivity of the individual data managers.

LeftHand Networks' Spiers predicts: "In the future, storage will be abstracted and virtualized to the point that complex management will be reduced to a simple point and click operation. Storage is only starting to catch up to where server management technology, such as clustering and consolidation, has been for some time."

Inflection Systems' Pownell also believes that storage will be much more virtualized in the future. "Future storage will be virtualized at a higher level, at a file level as opposed to today's server and device level. Any storage component should be able to go away and be replaced without affecting the data accessibility."

Alacritus CTO Roger Stager foresees, "Future innovations will move more data services into the SAN." Services such as backup, archiving, point-in-time copying and replication will be performed on SAN switches and edge devices rather than the host, as is currently common.

EBSI's Peterson points out that in a DAS environment, productivity can be measured as about 0.4TB per administrator. In a networked storage environment, productivity increases to about 3TB per administrator, or nearly an order of magnitude A change in quantity or volume as measured by the decimal point. For example, from tens to hundreds is one order of magnitude. Tens to thousands is two orders of magnitude; tens to millions is three orders of magnitude, etc.  increase. The promise of next generation architectures is a productivity increase to at least 50Th per administrator.

Perhaps renowned economist Peter F. Drucker understood technological advancement best when he said: "The best way to predict the future is to create it." The future, then, is really in our own hands. To the technologist lies the responsibility of inventing the architectures that will carry data storage and management to the next generation, to the data manager lies the responsibility to elevate the recognized value of the enterprise's data to its deserved level.

RELATED ARTICLE: Companies to Watch

The bulk of the practical attention to next generation storage architectures these days seems to be centered in small, venture capital back startup companies. While most are still operating in "black" mode, interesting companies to watch can be identified by keeping an eye on the movements of storage industry vision lenders. The following is a sampling of a few such companies:

LeftHand Networks

* Vision Leader: John Spiers, formerly of Maxtor and MiniScribe.

* LeftHand Networks has developed Network Unified Storage (NUS). NUS allows customers to build affordable, intelligent and highly manageable storage area networks that use their existing IP infrastructure.

Panasas

* Vision Leader: Garth Gibson formerly of Carnegie-Mellon.

* Panasas is creating a highly scalable and manageable storage network by combining innovative distributed storage software (distributed file system Software that keeps track of files stored across multiple networks. When the data are requested, it converts the file names into the physical location of the file so it can be found. ) with low-cost, industry-standard hardware. The result: multi-gigabyte throughput performance and a dramatically reduced total cost of ownership for storage systems.

Inflection Systems

* Vision Leaders: Jim Pownell, and Dave Therrien formerly of StorageNetworks & Highground Systems.

* Inflection Systems is developing an exciting new data storage system.

Alacritus

* Vision Leaders: Roger Stager and Don Trimmer trimmer

see resco nail trimmer, toenail scissors.
 formerly of Intelliguard.

* Alacritus is replacing tape libraries with the latest disk subsystem technology.

www.yottasys.com

Al Mudrow is CTO of Yotta Systems (Orem, Utah)
COPYRIGHT 2003 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2003, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Mudrow, Al
Publication:Computer Technology Review
Date:Apr 1, 2003
Words:2745
Previous Article:Bumping the glass ceiling: government regs, terrorism, email and tight budgets place new demands on storage and bandwidth. (Storage Networking).
Next Article:BMC pulls the plug, competitors pick up the pieces. (Storage Networking).



Related Articles
The Evolution Of Network Storage.(Technology Information)
SAN Appliances: Enabling Media Rich Internet Storage Networks.(Technology Information)
Fujitsu combines with EMC to deliver integrated PRIMEPOWER and Symmetrix DMX solutions.
Milk and honey: reaching the promised land of heterogeneous storage management. (Storage Networking).
Storage Networking market analysis. (Stub Files).
Network emulation tools in IP networks: test system infrastructure before deployment. (Connectivity).
Transparent capacity management.(Storage Management)
Hardware enables next-gen storage: next generation storage software will require specialized hardware.(Storage Management)
There's a "great white" inside every SAN: and this man-eater's name is complexity.(SAN Trends)(storage area network)
Clustered network storage.(first in/first out)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles