Printer Friendly
The Free Library
14,717,777 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

12 questions for Phil Schwan.


Phil Schwan, CEO (1) (Chief Executive Officer) The highest individual in command of an organization. Typically the president of the company, the CEO reports to the Chairman of the Board.  of Cluster File Systems Cluster File Systems, Inc. (CFS) is the company that originally developed the Lustre distributed file system. CFS was a privately held company with offices in the United States, Canada, and China.

CFS was founded in 2001 by Dr. Peter Braam.
 (CFS CFS
abbr.
chronic fatigue syndrome


CFS,
n.pr See syndrome, chronic fatigue.

CFS Chronic fatigue syndrome, see there
) and developer of the Lustre lustre

In mineralogy, the appearance of a mineral surface in terms of its light-reflecting qualities. Lustre depends on a mineral's refractivity (see refraction), transparency, and structure.
 Open Source Cluster File System spoke with CTR See click-through rate.  about the state of his company and the adoption and positioning of the Lustre File System--a high-performance cluster file system developed for managing very large volumes, with very high bandwidths. Many of the world's largest supercomputers have been using Lustre in production for more than a year.

What is the DNA DNA: see nucleic acid.
DNA
 or deoxyribonucleic acid

One of two types of nucleic acid (the other is RNA); a complex organic compound found in all living cells and many viruses. It is the chemical substance of genes.
 behind Lustre?

The Lustre project was started in 1999 by CFS's founder, Dr. Peter Braam, when we were together at Carnegie-Mellon University. Peter began working on an answer to the Department of Energy's Scalable/Global/Secure file system challenge. The goal was to solve the next ten years of cluster file system problems, for systems of tens of thousands of nodes and many hundreds of petabytes.

Lustre's architecture was developed to address these massive storage needs with the expectation that if we could solve the hardest problems of the biggest clusters, then scaling down to the more typical systems would be possible. Working with the early-adopter customers and partners, we've had systems in production for about a year and a half, handling HPC (Handheld PC) A palmtop computer that weighs less than one pound and runs specialized versions of popular applications. Microsoft coined the term for its Windows CE operating system, which is an abbreviated version of Windows. See Pocket PC.  workloads for government and private industries.

Would you give us an overview of the Lustre architecture?

There are three types of nodes in a Lustre installation. Metadata servers manage the namespace A collection of names for a particular purpose. Typically, each name is unique. For example, tables in a relational database must all have unique names. A Windows workgroup that uses the original NetBIOS naming system requires a different "made-up" name for each computer and printer in . The names of the files are stored here, but the actual file data is split up and distributed over one or more Object Storage Servers. Today's largest Lustre installations can have several hundred of these commodity servers. You can scale the bandwidth and capacity of your clusters very linearly by adding more, because they operate completely in parallel. Many shared file systems have a single pool of disk shared directly between the entire cluster. That's an approach that really doesn't scale, so we give each server its own pool of disk to manage on its own.

The third type of node is the client node. These speak a Lustre object protocol to the metadata and object servers, and provide a standard file system interface to applications. Today, those native clients run only on Linux, but you could use Samba samba

Ballroom dance of Brazilian origin, popularized in the U.S. and Europe in the 1940s. Danced to music in ⁴⁄₄ time with a syncopated rhythm, the dance is characterized by simple forward and backward steps and tilting, rocking body movements.
 or NFS (Network File System) The file sharing protocol in a Unix network. This de facto Unix standard, which is widely known as a "distributed file system," was developed by Sun. See file sharing protocol and WebNFS.

NFS - Network File System
 to re-export, so that you have access to your Lustre file system from whatever other OS you may happen to have.

How is Lustre sold and distributed today and what is the licensing model?

We saw that there was no other really good open source cluster file system for the Linux community, so Cluster File Systems was founded to create, distribute, and support Lustre as an open source product. Today, CFS distributes Lustre in two ways. For the first year of any new release of Lustre, like our recent Lustre 1.4 release, the software is available to our paying customers and our commercial partners under the GNU General Public License A software license from the Free Software Foundation (FSF) that ensures every user receives the essential freedoms that define "free" software, which is free of restrictions (see free software).  (GPL See GNU General Public License.

1. GPL - General Purpose Language.
2. GPL - ["A Sample Management Application Program in a Graphical Data-driven Programming language", A.L. Davis et al, Digest of Papers, Compcon Spring 81, Feb 1981, pp. 162-167].
). After a year, often sooner, we make that new version available on the Web to anyone, under the GPL.

We stay in business by licensing and supporting the Lustre software directly and through partners such HP, Cray, Sun, Linux Networx, Verari and similar companies. Directly, we've fostered some exceptional relationships around very high-end deployments for which there are very few choices other than Lustre. For example, the largest supercomputer supercomputer, a state-of-the-art, extremely powerful computer capable of manipulating massive amounts of data in a relatively short time. Supercomputers are very expensive and are employed for specialized scientific and engineering applications that must handle very  in the world. Lawrence Livermore's IBM-based BlueGene/L system, will use Lustre. We're also working very closely with Cray on the new Red Storm supercomputer, which will be installed in 2005 at Sandia National Laboratories Sandia National Laboratories, which is managed and operated by the Sandia Corporation (a wholly owned subsidiary of Lockheed Martin Corporation), is a major United States Department of Energy research and development national laboratory with two locations, one in Albuquerque, New .

HPC was clearly the birthplace of Lustre, but how far off are you from being adopted in commercial enterprise environments?

We already have some strong acceptance within the industries that resemble our engineering and scientific customers. These are industries such as oil and gas, pharmaceuticals, digital media, manufacturing and automotive, and I expect this part of our business to continue growing rapidly. Transaction processing Updating the appropriate database records as soon as a transaction (order, payment, etc.) is entered into the computer. It may also imply that confirmations are sent at the same time.

Transaction processing systems are the backbone of an organization because they update constantly.
 environments, such as those found on Wall Street or in the insurance industry, are still a few years off, as they tend to be very conservative and require additional features. While these are on our roadmap, they're not finished yet.

Your partners like HP, Sun, and others sell in both the HPC and commercial markets. Can we assume that while they are pursuing the engineering and scientific markets today, they will move more into the enterprise with Lustre at some point downstream?

Yes, that's true of many of our partners. It doesn't do them any good to push Lustre where it's not yet ready. We are, however, having many discussions with partners and enterprise customers about where they want Lustre to be in the future.

How far down the development path have you been able to go? How far along is Lustre today?

Lustre is ready for prime time. It is extremely usable in production for a large number of applications on both large and small clusters. Organizations in many industries are using it very successfully with the help of CFS and our partners. We've had the fortune of being able to develop a very healthy, respectable customer base without the need of a great marketing effort. While there is still a lot more to accomplish, and our roadmap is long, Lustre is a reality today--and our customers are proving that.

What is the largest Lustre system today? What are the upper bounds of system size and file system size?

The largest Lustre system in production today is a 4,000-CPU cluster at Lawrence Livermore National Laboratory Lawrence Livermore National Laboratory: see Lawrence Berkeley National Laboratory.

(body) Lawrence Livermore National Laboratory - (LLNL) A research organaisatin operated by the University of California under a contract with the US Department of Energy.
. With Lustre 1.4, there should be little problem installing 3,000 or 4,000 nodes right out of the box. The Red Storm computer will be substantially larger and it will be online in 2005, so a 10,000-node installation is not far out.

With respect to file system size, Lustre was architected to handle 64-bit file systems with thousands of petabytes, but we didn't really expect people to do that for some time. We are finding that people are deploying petabytes now, and we are lifting all the limits.

Is there life for Lustre beyond Linux and other operating systems Operating systems can be categorized by technology, ownership, licensing, working state, usage, and by many other characteristics. In practice, many of these groupings may overlap. ?

Absolutely. In 2005, we will release a Lustre client for Apple's Mac OS X. As for other systems like AIX (Advanced Interactive eXecutive) IBM's Unix-based operating system which runs on its Intellistation workstations and pSeries, p5, iSeries and i5 server families. , Solaris, and Windows, we are considering all of them, but we need to see what customers want. It seems quite likely that we will do a Windows port at some point in the near future.

How did you decide to offer Lustre as open source and what is your company's profit model?

We started the Lustre project and we founded Cluster File Systems with the goal of providing a good open source cluster file system for Linux. With that in mind, we began by doing contract development of Lustre, developing prototypes or stabilizing the features that customers wanted most. Sometimes these were for clusters that were immediately available, like at Lawrence Livermore Lawrence Livermore may refer to:
  • Larry Livermore musician, record producer and music journalist.
  • Lawrence Livermore National Laboratory.
 and Pacific Northwest and NCSA (1) (National Center for Supercomputing Applications, Urbana-Champaign, IL, www.ncsa.uiuc.edu) A high-performance computing facility located at the University of Illinois at Urbana-Champaign. , and sometimes these were very forward-looking prototypes of things they wanted in years to come.

CFS is in the middle of shifting quite substantially to a model based on services and licensing. Well more than half of our revenue now comes from services such as enterprise support, training and licensing, both through partners and directly to customers.

What is your corporate growth strategy and how fast do you want to grow?

If we had been asked that question when we started, we would have said we would become a ten-person niche supercomputing company. Today we have about 45 engineers developing and supporting Lustre. I don't expect the same kind of explosion of growth that we've had in our first three years, but I do expect that we'll continue to grow slowly. We would prefer to work through experienced partners and fulfill a role of architecture development, productization and high-level support, rather than interfacing with each individual user directly. We don't expect to be a 500-person company.

What do you foresee for the company over the next five years?

We might be very happy to stay small, but our roadmap is as good or better than that of any company in the storage industry. We will be setting the standard in cost per terabyte, reliability, performance, features, and scalability. We will probably end up competing in most major areas of storage deployment.

In five years, the Years, The

the seven decades of Eleanor Pargiter’s life. [Br. Lit.: Benét, 1109]

See : Time
 system will be quite mature, but people will still need to know that if something happens they can pick up the phone and call us. They will want speak directly to the experts who designed and implemented the Lustre file system, the people who know it best.

So the buck stops with you?

Ultimately, that's true. Whether you buy Lustre support from Cluster File Systems or from one of our industry partners, the buck does stop here.

www.clusterfs.com
COPYRIGHT 2004 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Storage Clustering; Cluster File Systems
Publication:Computer Technology Review
Article Type:Interview
Geographic Code:1USA
Date:Dec 1, 2004
Words:1468
Previous Article:Clustered storage: improved utility for production computer clusters.(Storage Clustering)
Next Article:Looking back.(Calendar)
Topics:



Related Articles
Without A Revolutionary File Management Solution There Can Be No True SAN.(Industry Trend or Event)
Shared Data Clusters: Achieving Application Scalability And Availability With A SAN.(Technology Information)
Shared SAN Cluster Configurations Get A Firm Foundation.(VERITAS SANPoint Foundation Suite HA)(Product Information)
SteelEye works with HP to deliver HP ProLiant DL380 Packaged Clusters to Linux customers.(SteelEye LifeKeeper for Linux 4.1.1)
Virtual storage and real confusion: a big disconnect between what vendors offer and what users want.(Industry Overview)
Sistina Software and Fujitsu Systems Europe partner to push Linux clusters into the enterprise data center.
Smart object-based storage cluster computing.(Storage Networking)
High Performance Computing: past, present and future.(Storage Networking)
Clustered storage: improved utility for production computer clusters.(Storage Clustering)
Clustered network storage.(first in/first out)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles