Home > Data management / BI Tips > Rick Sherman's BI Column > Five steps for weaving data quality management into your enterprise data integration processes
Data Management Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

RICK SHERMAN'S BI COLUMN

Five steps for weaving data quality management into your enterprise data integration processes


Rick Sherman
05.23.2007
Rating: -3.90- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


Data quality management needs to be an integral part of the design and planning of your corporate performance management (CPM), business intelligence (BI) or data warehouse project -- as I discussed in my previous column, "Data quality management: Follow the doctor's orders."

You need to gather the business requirements and priorities for data quality; determine the appropriate business logic to handle the various data quality conditions you encounter; incorporate data quality processing throughout your data lifecycle from data sourcing through information consumption; and regularly report on the data quality levels through dashboards.

The following five steps will help you weave data quality management into your enterprise data integration efforts.

1. Set a baseline for data quality management

First, determine and establish the baseline data quality level in your data source systems. Use data profiling software to analyze source systems for completeness and accuracy. Trying to perform data profiling manually, by coding queries on the source systems and comparing the answers to expected results, is laborious and rarely thorough. The good news is that you can find excellent data profiling software. The even better news is this functionality is increasingly being bundled with data integration software.

2. Verify the data -- and don't pass the buck!

Second, verify that the data extracted from source systems is the same data that was imported into your data warehouse. This seems obvious, but too often source system data is not as correct as people assume.

In cases like these, IT sometimes assumes it is the source system's problem. But if you move the data into your CPM or BI solution, then you take responsibility for its quality. You can't pass the buck. Too often, IT assumes their duty is simply to compare record count and sum checks of the loaded records with the source system to make sure they match.


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
Rick Sherman's BI Column
How to decide whether it is time for BI consolidation
Data warehousing appliances: Are they market breakers?
Business intelligence and corporate performance management software: Build vs. buy
Business intelligence and corporate performance management software: What's the difference?
Data quality management: Follow the doctor's orders
Business intelligence appliances: Disruptive technology or distraction?
Why data governance projects fail
Corporate performance management tomorrow, Data Shadow Systems today
Business intelligence systems: Data shadow systems pros and cons
Data integration tools can only solve part of the integration mystery

Enterprise data integration (EDI) software
Peachtree Data uses SAP BOBJ Data Services to clean up mailing lists
EMR systems bring need for data integration, data quality into focus
Operational BI gets a boost from real-time data loading
Data integration costs hurting companies with poor negotiation skills
Wholesaler turns to data integration adapters for mainframe to Oracle migration
Exec explains IBM's Information On Demand (IOD) initiative
SaaS-based data quality and integration tools gaining momentum
Exploring key trends in data integration for data warehouses
Pervasive the most persuasive, cost-effective data integration platform, study says
IBM unveils new data management software, services

Data quality techniques and best practices
Peachtree Data uses SAP BOBJ Data Services to clean up mailing lists
Data quality software, including dashboards for non-IT users, gaining traction
IBM acquires data discovery vendor Exeros
Data Quality Management Software Product Directory
EMR systems bring need for data integration, data quality into focus
What level of data accuracy standards do we need for master data?
How to estimate customer data cleansing costs
Cleansing data after an acquisition often calls for data quality software
Enterprise-wide customer data quality still elusive at most organizations
BJ's IT overhaul lays groundwork for BI, MDM

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
data  (SearchDataManagement.com)
data governance  (SearchDataManagement.com)
data quality  (SearchDataManagement.com)
data scrubbing  (SearchDataManagement.com)
fixed data  (SearchDataManagement.com)
raw data  (SearchDataManagement.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary


But unless you know -- beyond a shadow of a doubt -- the data is already correct and consistent, this minimalist quality check is not enough.

Remember the data profiling you did on your source systems? You need to do the same for your data warehouse after it has been loaded with data. This post-load audit is really essential to validate the data loading process.

3. Clean up the data

After verifying the data, it's time to perform the data cleansing processes required by your business and industry.

For example, in businesses that sell to consumers, name and address matching is an important data cleansing function. Specific industries, such as healthcare and finance, will have their own best practices for data quality management.

To perform data cleansing, you'll likely need to use specialized data quality software packages. Some of these can even be invoked during your enterprise data integration work. That's because many data integration packages now include data cleansing capabilities.

4. Make the data consistent

Now you need to make sure the data is consistent, so it can safely be used as reliable business information across an enterprise. Although individual departments or processes may have their own customer or product lists, the enterprise needs a single view of customers and products.

In data modeling terms, this process is called conforming dimensions. The enterprise applications market has bundled these processes as master data management (MDM) and customer data integration (CDI).

Establishing a single customer or product list may sound easy from a conceptual and technical perspective. However, the difficult roadblocks are the business and political issues that you may encounter when trying to obtain definition, agreement and responsibility for these lists. IT has to work collaboratively with the business to be successful in this area.

5. Transform data into information for business users

Finally, in order to create business information and enable business analytics, data needs to be filtered, transformed, enriched and aggregated. This last step transforms raw data into useful business information.

But beware -- by this time, the IT group has often put all its energy into the enterprise data warehouse and assumes its job is done. Not so fast. Business users typically need filtering, summarization and aggregation in order to make data into useful information that they can use for business decision making. When left to their own devices, business users often create data shadow systems, which can be a headache for an enterprise.

These five steps are critical for CPM and BI projects. Transforming data into business information is the cornerstone of these kinds of projects. And ultimately, these systems are only as good as the quality of the information that resides in them.

  • Pose a question to Rick Sherman or other business intelligence and data quality management experts in our Ask the Expert section.
  • Check out all of Rick Sherman's articles on SearchDataManagement.com.

  • Rate this Tip
    To rate tips, you must be a member of SearchDataManagement.com.
    Register now to start rating these tips. Log in if you are already a member.




    DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



    Data Management Integration - EDI, EAI, ETL, MDM, CDI, PIM
    About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
    SEARCH 
    TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

    TechTarget Corporate Web Site  |  Media Kits  |  Site Map




    All Rights Reserved, Copyright 2005 - 2009, TechTarget | Read our Privacy Policy
      TechTarget - The IT Media ROI Experts