Printer Friendly
The Free Library
5,674,640 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

SMART Integrators Shop The DATAMART.


There are literally billions of dollars in exploitable customer information locked up in the operational databases of companies throughout the U.S. This data represents billions or trillions of transactions that reveal customer preferences, buying history, and other information that the businesses involved could use to gain a competitive advantage by establishing a closer relationship with their customers.

Little wonder that the Business Intelligence/Data Warehouse (BI/DW) market, which comprises the technologies used to extract and analyze information from operational databases, is booming. The Database Solutions III (DBS (Direct Broadcast Satellite) A one-way TV broadcast service from a communications satellite to a small round or oval dish antenna no larger than 20" in diameter.  III) study recently completed by the BI/DW Program at Survey.com (Mountain View, CA) leads us to forecast total revenue for this market of $148 billion by 2003.

To most people in the IT industry, the phrase "data warehouse" suggests massive projects, cost overruns, delays, and a high potential for failure. Certainly, the attempt to gather every bit of data in an enterprise into one massive data warehouse is likely to tax to the utmost even the most accomplished IT department. That's why the more usual approach is increasingly what's called a datamart: basically a small data warehouse that supports one department or aspect of a business. Using an approach called a federated Connected and treated as one. See federated database and federated directories.  data warehouse, these datamarts can later be combined into a data warehouse.

There lies the opportunity for integrators of every size. Not only is a datamart fundamentally easier to implement, but it can also be built using very inexpensive hardware and software: Intel-based computers running Microsoft Windows See Windows.

(operating system) Microsoft Windows - Microsoft's proprietary window system and user interface software released in 1985 to run on top of MS-DOS. Widely criticised for being too slow (hence "Windoze", "Microsloth Windows") on the machines available then.
 NT (and soon, Windows 2000) and Microsoft SQL Server A relational DBMS from Microsoft that is a major component of the Windows Server System. It is Microsoft's high-end client/server database and is closely integrated with Microsoft Visual Studio and the Microsoft Office System.  7.0, which includes almost all of the specialized functionality needed to construct a datamart. Indeed, the DBS III study revealed statistics about Microsoft mind-share. This is not surprising, given the extensive data warehousing See data warehouse.

data warehousing - data warehouse
 capabilities offered by Microsoft, as described briefly below. The only remaining obstacle lies in the specialized nature of the knowledge needed to construct a datamart: many integrators who are otherwise knowledgeable about databases may regard the additional expertise not worth acquiring, even given the availability of a low-cost Wintel platform. Yet a new company, Appsco Software Ltd. (Guildford, Surrey, UK; U.S. office Bellevue, WA; www.appsco.com) offers an almost push-button (electronics) push-button - A roughly fingertip-sized plastic cover attached to a spring-loaded, normally-open switch, which, when pressed, closes the switch. Typical examples are the keys on a computer or calculator keyboard and mouse buttons.  approach to datam art construction, a rapid application development tool called AppsMart, that brings this technology within the reach of almost any integrator with SQL Server An earlier relational DBMS from Sybase and from Microsoft. Sybase introduced SQL Server in 1988 for various Unix versions. In that same year, with help from IBM, Sybase created an OS/2 version that Microsoft licensed and branded as Microsoft SQL Server.  database expertise. To illustrate the potential for new business that this tool provides, I'll take a brief look at how a datamart is constructed and then discuss how AppsMart short-circuits much of the process, enabling integrators to bring the benefits of BI/DW to businesses of virtually any size.

Making Sense Of Data

Most businesses run their day-to-day operations using some form of On-Line Transaction Processing (database) On-Line Transaction Processing - (OLTP) The processing of transactions by computers in real time.  (OLTP (OnLine Transaction Processing) See transaction processing and OLCP.

OLTP - On-Line Transaction Processing
). For instance, every time you go through the checkout line at a supermarket or use your bank's ATM, you are interacting with an OLTP system. Data warehouse technology evolved because these systems, whose underlying datastore is called an operational database, are not designed to support analysis of the data they store. Their architecture is designed to support the day-to-day operations of the business. So it is practically impossible to query such a database for analytic purposes (e.g., how many customers in Store A bought the detergent promotion last week) without extensive custom programming. Even then, the operation is difficult, slow, and interferes with the real-time processing Noun 1. real-time processing - data processing fast enough to keep up with an outside process
real-time operation

data processing - (computer science) a series of operations on data by a computer in order to retrieve or transform or classify information
 that is the raison d'etre rai·son d'ê·tre  
n. pl. rai·sons d'être
Reason or justification for existing.



[French : raison, reason + de, of, for + être, to be.
 of the system.

A data warehouse or datamart solves this problem by extracting the data from the operational database, cleaning it up adding reference data (called metadata) that allows analysis along various dimensions (such as time) that are only implicit in Adj. 1. implicit in - in the nature of something though not readily apparent; "shortcomings inherent in our approach"; "an underlying meaning"
underlying, inherent
 the operational store and makes sense of the data in a larger business context, and then presenting it, using a variety of analytical tools from a datastore that is separate from the OLTP system. Some of the transformations that might be applied are date and time format conversions, data type conversions (i.e., character to integer), and the like, as well as more complex changes. The latter may include changes such as those that are necessary to make data from a financial application that tracks data by quarter and fiscal year, match that from a sales applications, which tracks data by month and calendar year.

The technologies for extracting, cleaning, adding metadata, and then putting the transformed data into the data warehouse or datamart is often called Extraction Transformation and Loading (ETL (Extract, Transform, Load) The functions performed when pulling data out of one database and placing it into another of a different type. ETL is used to migrate data, often from relational databases into decision support systems. ) technologies, or, as we prefer to call them, Data-Enabling Technologies (DET DET diethyltryptamine.

DET
n.
Diethyltryptamine; a hallucinogenic agent similar to DMT.
). In Microsoft SQL Server, this functionality is supplied by Data Transformation Services Data Transformation Services, or DTS, is a set of objects and utilities to allow the automation of extract, transform and load operations to or from a database. The objects are DTS packages and their components, and the utilities are called DTS tools.  (DTS (1) (Digital Theatre Sound) A digital audio encoding system used in movie and home theaters. Popularized by the movie Jurassic Park, the six-channel (5. ). While not as extensive as some of the third-party DET solutions out there, DTS are free compared to a cost for third party tools that can range from $100,000 to upwards of $300,000!

One of the most common of these analytical tools is On-Line Analytic Processing (OLAP (OnLine Analytical Processing) Decision support software that allows the user to quickly analyze information that has been summarized into multidimensional views and hierarchies. OLAP tools are used to perform trend analysis on sales and financial information. ), which presents the data in the form of a multidimensional cube (an OLAP cube A multidimensional database that holds data more like a 3D spreadsheet rather than a relational database. The cube allows different views of the data to be quickly displayed. See OLAP and multidimensional views. ) instead of a vast table of discrete entries representing each transaction. Thus, rather than a huge agglomeration ag·glom·er·a·tion  
n.
1. The act or process of gathering into a mass.

2. A confused or jumbled mass:
 of entries consisting of customer name, customer address, product bought, time bought, shipping address, etc., an OLAP cube presents a set of dimensions (e.g., customer, product, region, and time) that enable the data to be "sliced, diced, and drilled through" along any arbitrary division. This makes it possible for Line Of Business (LOB) managers to ask questions, like "What percentage of customers who used the Memorial Day coupon for steak in Region 2 also bought barbecue charcoal?"

Again, Microsoft offers this functionality as part of SQL Server 7.0: the Microsoft OLAP Server, formerly code-named Plato, which also works with a number of third-party databases, as well. The front end to OLAP Services See OLAP and Microsoft SQL Server OLAP Services.  can be a third party OLAP tool or it can be Excel, using a feature called Pivot Tables. The benefit of this approach, especially important to integrators for small and midsize companies, is that most managers likely already have Excel on their desktop--there's no new application to learn.

Microsoft has also created the Microsoft Repository, a database that stores descriptive information about data warehouse information such as database schema The definition of a database. It defines the structure and content in each data element within the structure. Schemas are often designed with visual modeling tools that automatically create the SQL code necessary to define the table structures. See subschema and XML schema. , data transformations, and On-Line Analytical Processing (database) On-Line Analytical Processing - (OLAP) A category of database software which provides an interface such that users can transform or limit raw data according to user-defined or pre-defined functions, and quickly and interactively examine the results in various dimensions  (OLAP), which permits many datamarts to be integrated over time into a federated data warehouse, as well as extensive integration of third-party solutions and many other data warehouse services and features.

The Art Of The Datamart

Yet even given the extensive data warehousing framework supplied by Microsoft and its low cost, another obstacle remains to integrator exploitation of this technology for small and midsize companies (or departmentally in large companies). That is the expertise needed to design and implement a data warehouse or datamart. The data in the datamart has to be split into two categories: factual information and reference information. Factual information is basically the individual transactions or events handled by the OLTP system. The reference information is data that is used to analyze the factual data and it is organized into dimensions to permit OLAP queries and the like. So, for instance, a piece of factual data might be a store transaction such as in the example above while the reference information might be a product hierarchy, customer, time periods often used for analysis, location of various business entities (warehouse, store, distribution center, etc.), and the like.

Factual data doesn't change: a transaction is a historical event. Reference data can and often does change as LOB managers and others demand new ways of looking at the data in response to competitive challenges. For this reason, and others, the data in a data warehouse is usually represented in a star schema A data warehouse design that enhances the performance of multidimensional queries on traditional relational databases. One fact table is surrounded by a series of related tables. Data is joined from one of the points to the center, providing a so-called "star query." See OLAP.  with the factual information at the center of the star, surrounded by the various types of reference data. Fig 1 illustrates a typical star schema as presented by the AppsMart user interface.

Although there are many decisions involved in data warehouse or datamart implementation, the design of the star schema is perhaps the most critical. The wrong decisions can render a datamart useless, unable to deliver the information managers need and unable to adapt to change. Designing the right schema requires a domain expert, someone who knows the business and can describe the essential processes and entities involved, and come up with what is called a business information model that matches the requirements of the managers who will use the datamart. This domain expert not only has to know the business, but also he or she must understand datamart design--and this expertise is neither common nor inexpensive.

That expertise bottleneck is precisely what AppsMart addresses. AppsMart makes it possible for a domain expert to design a template that describes the star schema for a class of business--something as broad as a "value chain" (resellers of finished products). This template can be applied to a specific business by a consultant who does not possess the high-level knowledge of the domain expert, yet has an understanding of business processes and how people need to analyze them. This expertise is far more common and less expensive. That is why I call AppsMart the "datamarts for the masses" application.

As delivered, AppsMart is actually available in two versions. The AppsMart Domain Expert Bundle (US $15,000) enables a domain expert to create a template for a given class of business; the End-User Bundle (a confusing name I hope they will change) (US $8,000) enables a consultant to apply existing templates to a specific customer's needs. Both versions come with a variety of pre-built templates for a broad range of businesses that can be adapted to specific industries and specific customer needs.

Fig 1 shows a star schema for a campaign effectiveness application, part of the value chain template supplied with the application. At the center is the factual data: consumers, responses, cost of campaign, and the time the campaign happened. Surrounding it are the various types of reference data by which the campaign can be analyzed: product, marketing channel, marketing campaign, demographics, and so forth. This star schema permits a manager to ask questions like: what was the cost per response for this campaign among women age 18-24 with household incomes between $30,000-$40,000 in Region 2?

Using the AppsMart Scratch Pad scratch pad
n.
1. A pad of paper for preliminary or hasty writing, notes, or sketches.

2. also scratch·pad Computer Science An internal register used for temporary storage of preliminary data or notes.
 application, the consultant can modify the template created by a domain expert to match specific needs. This process includes modeling any additional business processes specific to the customer, defining new measures and dimensions, and matching sample data to the customer's organization so that AppsMart can populate To plug in chips or components into a printed circuit board. A fully populated board is one that contains all the devices it can hold.  the datamart with appropriate test data. At this point, a click of the mouse delivers a detailed specifications document for customer review and signoff (or for further modification). From this point, another click of the mouse begins the construction of the datamart. (Fig 2)

AppsMart generates the DTS packages needed to bring operational data into the datamart, the star schema along with sample data for review, the OLAP cubes needed to analyze the data, and the Excel Pivot Tables as front ends to OLAP Services. At this point, the customer can take the datamart for a test drive. If something is missing, it's a simple, iterative process to define new business processes, measures, and dimensions and then regenerate the datamart with the new features.

Reportedly, the World Wrestling Federation developed and deployed a datamart encompassing 3 years of WWF See Windows Workflow Foundation.  home video sales data, 12 different views of sales order The sales order, sometimes abbreviated as SO, is an order received by a business from a customer. A sales order may be for products and/or services. Given the wide variety of businesses, this means that the orders can be fulfilled in several ways.  information, 40 standard reports, and the ability to run special queries and create custom reports--all in less than three weeks. Similar results were reported by Prandium, the operator of El Torito The format developed by Phoenix Technologies and IBM that has become the standard for creating bootable CD-ROMs on the Intel platform. El Torito provides only the format. In order to make a CD-ROM bootable, the correct boot images must be placed on the disc, and the target computer must , ChiChi's, Koo Koo Roo, and Hamburger Hamlet.

While AppsMart is not a panacea, still requiring some expertise above and beyond the usual database design expertise that many integrators in this field already possess, it so greatly eases and accelerates the design and implementation of a datamart that it should be seriously considered by any integrator who wants to cash in on this burgeoning market. To help integrators exploit this powerful tool, Appsco offers raining courses and is partnering with companies such as Arthur Andersen For the U.S. Supreme Court case commonly known as Arthur Andersen, see .
Arthur Andersen LLP, based in Chicago, was once one of the "Big Five" accounting firms (the other four are PricewaterhouseCoopers, Deloitte Touche Tohmatsu, Ernst & Young and KPMG), performing
 and others who will be able to create templates for vertical business applications that can be applied by integrators at all levels.
COPYRIGHT 1999 West World Productions, Inc.
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1999, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:Technology Information; data warehousing
Author:Trowbridge, Dave
Publication:Computer Technology Review
Geographic Code:1USA
Date:Nov 1, 1999
Words:2041
Previous Article:FC Demands New XOR Calculations.(Technology Information)
Next Article:Comdex Was Yesterday. These Companies Are TOMORROW Internet technologies to watch in 2000.(Industry Trend or Event)
Topics:



Related Articles
Brio Technology Announces Support For Informatica's MX Architecture; BrioQuery Enterprise's Open Metadata Interpreter Provides Direct, Dynamic Access...
PLATINUM Technology to Buy HP's Intelligent Warehouse; Deal Provides Customers Best-in-class Hardware and Software Solutions for Managing Data...
Sysix Technologies Joins Informix and HP Certified Data Warehousing Program; Performance Guarantees for Pre-Tested Configurations Dramatically Cut...
DataMirror And Silvon Software Announce e-Business Intelligence Alliance.
ETL PARADIGM "PUBLISHES" GENE EXPRESSION DATA.(Product Announcement)
Database and Network Journal Editorial Features 2000.(News Briefs)
Customer DataMart 2.0 Available From Aspect.(Brief Article)(Product Announcement)
Aspect and Performix integrate solutions. (Strategic Alliances).
Data warehouses deliver: cook up some cool business intelligence by slicing and dicing your data.(Data Warehousing)
BULK APPEAL HIGHER-INCOME CONSUMERS NO LONGER ASHAMED OF SHOPPING AT NO-FRILLS STORES.(Business)

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles