Seven misconceptions about data quality.The narrow definition of data quality is that concerns bad data ie: data that is missing or incorrect. A broader definition is that data quality is achieved when a business uses data that is comprehensive, consistent, relevant and timely. If you focus only on the narrow data definition you may be lulled into a sense of false security when, in fact, your efforts fall short. We will address several more misconceptions Misconceptions is an American sitcom television series for The WB Network for the 2005-2006 season that never aired. It features Jane Leeves, formerly of Frasier, and French Stewart, formerly of 3rd Rock From the Sun. about data quality. In order to fix a problem you have to recognize you have a problem. According to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. recent Gartner research, 25 percent of Fortune 1000 companies are working with poor quality data. The Data Warehousing See data warehouse. data warehousing - data warehouse Institute TDWI TDWI The Data Warehousing Institute TDWI The Doctor Weighs In (website) estimated that data quality problems cost U.S. businesses $600 billion each year. Regulatory initiatives such as Sarbanes-Oxley and Basel II Basel II is the second of the Basel Accords, which are recommendations on banking laws and regulations issued by the Basel Committee on Banking Supervision. The purpose of Basel II is to create an international standard that banking regulators can use when creating regulations dictate that companies must provide transparent data. But even with the documented high costs of poor data quality and the tight regulatory environment, many companies are turning a blind eye to their data quality problems. Why? Perhaps it is because of their mistaken belief that bad data is the only data quality issue they need to worry about. A corollary corollary: see theorem. to the above: to fix a problem you first have to take responsibility for it. That's the risk. Taking responsibility is the biggest roadblock to dealing with data quality. In order to achieve a high level of quality, data has to be viewed from an enterprise and holistic perspective. Data may be correct within each data silo (1) A separate database or set of data files that are not part of an organization's enterprise-wide data administration. See siloed application. (2) An external storage array or cabinet. See disk array. , but the information will not be consistent, relevant or timely when viewed across the enterprise. To make matters worse, you've got each report or analysis interpreting the data differently, so even when the numbers start off the same in each silo, the end results will not be consistent. Data is a corporate asset and has to be consistent across the entire corporation, not just within the business function or division where it originated. Misconception mis·con·cep·tion n. A mistaken thought, idea, or notion; a misunderstanding: had many misconceptions about the new tax program. 1.0 You Can Fix Data Fixing implies that there was something wrong with the original data, and you can fix it once and be done with it. In reality, the problem may have been not, with the data itself, but rather in the way it was used. When you manage data you manage data quaLity. It's an ongoing process. Data cleansing See address cleansing and data hygiene. is not the answer to data quality issues. Yes, data cleansing does address some important quality problems that data has, and offers a solid business value ROI (Return On Investment) The monetary benefits derived from having spent money on developing or revising a system. In the IT world, there are more ways to compute ROI than Carter has liver pills (and for those of you who never heard of that expression, it means a lot). , but it is only one element of the data quality puzzle. loo often the business purchases a data cleansing too[and thinks the problem is solved. In other cases, because the cost of data cleansing tools is high, a business may decide that it is too expensive for them to deal with the problem. Misconception 2.0 Data Quality is an IT Problem Data quality is a company problem that costs a business in many ways. Although IT can help address the problem of data quality, the business has to own the data and the business processes that create or use it. The business has to define the metrics metrics Managed care A popular term for standards by which the quality of a product, service, or outcome of a particular form of Pt management is evaluated. See TQM. for data quality its completeness, consistency, relevancy and timing. The business has to determine the threshold between data quality and ROI. IT can enable the processes and manage data through technology, but the business has to define it. For an enterprise-wide data quality effort to be initiated and successful on an ongoing basis, it needs to be truly a joint business and IT effort. Misconception 3.0 The Problem is in the Data Sources or Data Entry Data entry or operational systems are often blamed for data quality problems. Although incorrectly entered or missing data is a problem, it is far from the only data quality problem. Also, everyone blames their data quality problems on the systems that they sourced the data from. Although some of that may be true, a large part. of the data quality issue is the consistency, relevancy and timeliness of the data. If two divisions are using different customer identifiers or product numbers, does it mean that one of them has the wrong numbers or is the problem one of consistency between the divisions? If the problem is consistency, then it is an enterprise issue, not a divisional issue. The long-term solution may be for all divisions to use the same codes, but that has to be an enterprise decision. The larger issue is that you need to manage data from its creation all the way to information consumption. You need to be able to trace its flow from data entry, transactional systems, data warehouse, data marts A subset of a data warehouse for a single department or function. A data mart may have tens of gigabytes of data rather than hundreds of gigabytes for the entire enterprise. See data warehouse. and cubes all the way to the report or spreadsheet used for the business analysis. Data quality requires tracking, checking and monitoring data throughout the entire information ecosystem. To make this happen you need data responsibility 'people', data metrics (processes) and meta data management (technology). Misconception 4.0 The Data Warehouse will Provide a Single Version of the Truth In computerized business management, svot, or Single Version of the Truth, is a technical concept describing the sequence and structure of a database formed by a particular but arbitrary sequencing of records. In an ideal world, every report or analysis performed by the, business exclusively uses data sourced from the data warehouse.-data that has gone through data cleansing arid ar·id adj. 1. Lacking moisture, especially having insufficient rainfall to support trees or woody plants: an arid climate. 2. quality processes and includes constant interpretations such as profit or sales calculations. If everyone uses the data warehouse's data exclusively and it meets your data quality metrics then it is the single version of the truth. However, two significant conditions lessen the likelihood that the data warehouse solves your data quality issues by itself. First, people get data for their reports and analysis from a variety of data sources-data warehouse (sometimes there are multiple data warehouses in an enterprise), data marts and cubes (that you hope were sourced from the data warehouse. They also get data from systems such as ERP (Enterprise Resource Planning) An integrated information system that serves all departments within an enterprise. Evolving out of the manufacturing industry, ERP implies the use of packaged software rather than proprietary software written by or for one customer. , CRM (Customer Relationship Management) An integrated information system that is used to plan, schedule and control the presales and postsales activities in an organization. , and budgeting and planning systems See spreadsheet and financial planning system. that may be sourced into the data warehouse themselves. In these cases, ensuring data quality in the data warehouse alone is not enough. Multiple data silos mean multiple versions of the truth and multiple Interpretations of the truth. Data quality has to be addressed across these data si':os, not just in the data warehouse. Second, data quality involves the source data and its transformation into information. That means that even if every report and analysis gets data from the same data warehouse, if the business transformations and Interpretations in these reports are different then there still are significant data quality issues. Data quality processes need to involve data creation; the staging of data in data warehouses, data marts, cubes and data shadow systems; and information consumption in the form of reports and business analysis. Applying data quality to the data itself and not its usage as information is not sufficient. Misconception 5.0 The ERP System will Provide a Single Version of the Truth Ditto what I said for Misconception 4.0 Misconception 6.0 The Corporate Performance Management (CPM (1) (Critical Path Method) A project management planning and control technique implemented on computers. The critical path is the series of activities and tasks in the project that have no built-in slack time. ) System will Provide a Single Version of the Truth Ditto what I said for Misconception #4. Misconception 7.0 BI Standardization standardization In industry, the development and application of standards that make it possible to manufacture a large volume of interchangeable parts. Standardization may focus on engineering standards, such as properties of materials, fits and tolerances, and drafting will Eliminate the Problem of Different "Truths" Represented in the Reports or Analysis Yes, standardizing on BI tools can save money and may be a worthwhile project. But, don't lose sight of the fact that the use of different BI tools is a symptom of a data quality problem, not the cause. If you pull the same data and implement the same transformations (formulas) in different BI tools you get the same results. The report, chart or dashboard (1) See Mac Dashboard. (2) A software-based control panel for one or more applications, network devices or industrial machines. Dashboards display simulated gauges and dials that look somewhat like an automobile dashboard. may look a little different, but the numbers would be the same. The problem, therefore, is not that different BI tools are. being used, but that each project implementing these tools built a different data mart or cube and then applied different formulas in their reports or analysis. Using the same BI tool in different projects that use different data with different transformations is still going to yield different results, hence the data quality issues still remain. The cause of the data quality issues was the lack of consistency between the data used and data transformations, not the use of different BI tools. Data quality is defined as comprehensive, consistent, relevant and timely data for use by the business. Don't shrug it off as issue of bad data entry. Data needs to be addressed on an enterprise scale and in a holistic manner incorporating people, processes and technology. RELATED ARTICLE: West Europe Manufacturers Going Mobile A recent IDC study, shows that the adoption of mobile-centric solutions is still somewhat limited in the W. Europe manufacturing industry, the majority of organizations using only basic mobility functions such as accessing emails and, to a lesser extent, agenda scheduling and corporate directories. A European vertical markets survey investigated the implementation status of mobile technologies and future adoption plans. Overall, 45% of discrete and 46% of process respondents with a mobile solution, or with plans to adopt a mobile solution have already implemented mobile email. However, more advanced manufacturing organizations are beginning to move beyond enterprise mobile application deployments for email and the more progressive manufacturers have begun mobilizing mobilizing, v 1. freeing or making loose and able to move. 2. observing any ongoing movements in a client's body, whether small or large, assisted or not, that identify strengths and weaknesses, as well as the client's physical and strategic applications such as CRM, sales force automation Automating the sales activities within an organization. A comprehensive SFA package provides such functions as contact management, note and information sharing, quick proposal and presentation generation, product configurators, calendars and to-do lists. , and inventory management as they seek to increase worker productivity and strengthen their competitive positioning.' Growth opportunities are evident across all mobile areas investigated. Mobile sales force automation, mobile office/collaborative, XML XML in full Extensible Markup Language. Markup language developed to be a simplified and more structural version of SGML. It incorporates features of HTML (e.g., hypertext linking), but is designed to overcome some of HTML's limitations. , and mobile supply chain applications will show the most interesting opportunities, but these vary by vertical manufacturing industry and country. More than 70% of discrete manufacturing Fabricating products by assembling components and subsystems into larger systems. The automated assembly line is the prime example of discrete manufacturing such as in the making of automobiles, household appliances and computer systems. organizations with mobile solutions will access emails in a mobile environment, with pharmaceutical, other discrete, and automotive standing out. While pharmaceutical is expected to be the most advanced at year-end 2004, the highest growth opportunities are expected in other discrete, metal, and chemical sub-verticals. However, a significant number of manufacturers still have no plans to invest in mobility. www.idc.com |
|
||||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion