Home > Data management / BI News > Data quality management for business intelligence projects
Data management / BI News:
EMAIL THIS
COLUMN

Data quality management for business intelligence projects

By Rick Sherman
28 Jul 2005 | SearchDataManagement.com


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   

Plenty of business intelligence or data warehouse projects have been blindsided by complications related to data quality management. Sometimes these issues aren't apparent until business users start testing the system just before going live with the project. So what causes business intelligence project teams to get caught off guard by data quality issues? And why do these problems surface so late in the project?
Rick Sherman, Athen IT Solutions
Rick Sherman, Athena IT Solutions

There are two common pitfalls: defining data quality too narrowly and assuming data quality is the responsibility of the source systems.

People often assume that data quality simply means eliminating bad data -- data that is missing, inaccurate or incorrect. Bad data is certainly a problem, but it isn't the only problem. Good data quality programs also ensure that data is comprehensive, consistent, relevant and timely.

Don't blame the source systems
Defining data quality too narrowly often leads people to assume that source transactional systems -- either through data entry or systemic errors -- cause the bad data. Although they may be a source of some errors, the more likely culprits are either inconsistent dimensions across source systems (such as customer or product identifiers) or inconsistent definitions for derived data across organizations. Conforming dimensions -- developing consistent customer or product identifiers -- is important for accessing and analyzing data for a company. The source systems do not own the data quality issues across other systems, the BI project team does. The source systems need to ensure that the data within their data silo is correct. But the BI project team is responsible for providing the business with data that is consistent across the enterprise.
For more information on data quality and business intelligence

Read Rick's previous column, Business intelligence strategy: IT is from Mars, business users are from Venus

Learn more about data quality management in our data quality Learning Guide

Similarly, each organization within the enterprise may have valid business reasons to derive data differently than others. For example, their position in a set of business processes may determine how they view their data. The individual organizations aren't tasked with developing common definitions for derived data, but the BI project team is. Many BI project teams try to claim that data quality issues aren't their responsibility. However, from a practical viewpoint, the BI team does need to make these issues their own, since their job is to ensure the highest data quality possible. The BI project team is packaging the data for consumption by business users and they will be held accountable for the data quality. This may not seem fair, but the success of their project depends on it.

Don't shortchange the pilot
Surprises happen when the project does an initial pilot or release involving only a small subset of source systems. While there may be many good reasons to have a narrow scope for a pilot, you won't get an appreciation for the effort necessary to conform these dimensions as the number of source systems expands.

Sometimes pilots are only with a single organization, using only their definitions for derived data. Once again, the tough issue is often how to accommodate the differences in the derivation definitions between organizations. In both cases the real challenges are encountered when dealing with multiple systems and organizations. The business users need to look at the big picture, and that is only possible when they can access and analyze data across the enterprise.

Steps to address data quality
To ensure data quality, the BI project team has to address it from the very beginning. Here are several significant steps to consider:

  1. Require the business to define data quality in a broad sense, establish metrics to monitor and measure it, and determine what should be done if the data fails to meet these metrics.
  2. Undertake a comprehensive data profiling effort when performing a source systems analysis. Data anomalies across source systems and time (historical data does not always age well!) is needed so that the team can address them with the business early on.
  3. Incorporate data quality into all data integration and business intelligence processes from data sourcing to information consumption by the business user. Data quality issues need to be detected as early in the processes as possible and dealt with as defined in the business requirements.

Enterprises must present data that meets very stringent data quality levels, especially in light of recent compliance regulations and demands. The level of data transparency needed can only result from establishing a strong commitment to data quality and building the processes to ensure it.

About the author
Rick Sherman has more than 18 years of business intelligence and data warehousing experience, having worked on more than 50 implementations as an independent consultant and as a director/practice leader at a Big Five accounting firm. He founded Athena IT Solutions, a Stow, Mass.-based business intelligence consulting firm. He can be reached at rsherman@athena-solutions.com.

Tags: Data warehouse softwareData quality techniques and best practicesData quality mgmt. best practicesVIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



RELATED CONTENT
Data warehouse software
Microsoft details self-service business intelligence, data warehouse releases
Why pay for a data warehouse appliance when you can get one free?
In-database analytics pulls together SAS, data warehouse vendors
Teradata takes a logical approach to data warehousing appliances
BT taps open source BI software, homegrown DW to unlock customer data
Bill pushes for data warehouse, XBRL to track TARP funds
Teradata VP talks data warehouse appliances, reveals cloud and SSD plans
Data Warehouse Platforms Product Directory
Commodity hardware aiding data warehouse appliance performance, costs
What does MapReduce and in-database technology mean for data warehouses?

Data quality techniques and best practices
Understanding five major enterprise information management benefits
Gartner: Open source data quality software focuses on data profiling
Poor data quality costing companies millions of dollars annually
Are there data governance plans, templates or standard procedures?
Should we buy data quality management tools or focus on policies?
Where to find new academic resources on data quality best practices
How to improve data quality on a tight budget -- a guide
Data quality management tips and best practices
Peachtree Data uses SAP BOBJ Data Services to clean up mailing lists
Data quality software, including dashboards for non-IT users, gaining traction

Data quality mgmt. best practices
Data quality management for data warehouses
Information assurance: Dependability and security of networked information systems
What is high quality information?
Data quality and governance management quiz
Thirteen causes of enterprise data quality problems
How to maintain data quality and provide high quality information management and analysis
Data quality management pitfalls: Three common mistakes to avoid
How to develop and maintain an enterprise data quality management strategy, with Larry English
Five steps for weaving data quality management into your enterprise data integration processes
Data quality management: Problems and horror stories

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
data modeling  (SearchDataManagement.com)
extract, transform, load  (SearchDataManagement.com)
OLAP  (SearchDataManagement.com)
tree structure  (SearchDataManagement.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary



Data Management: Business Intelligence, Data Integration, Data Compliance
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2005 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts