Home > Data Management Tips > Rick Sherman's BI Column > Five steps for weaving data quality management into your enterprise data integration processes
Data Management Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

RICK SHERMAN'S BI COLUMN

Five steps for weaving data quality management into your enterprise data integration processes


Rick Sherman
05.23.2007
Rating: -4.00- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


Data quality management needs to be an integral part of the design and planning of your corporate performance management (CPM), business intelligence (BI) or data warehouse project -- as I discussed in my previous column, "Data quality management: Follow the doctor's orders."

You need to gather the business requirements and priorities for data quality; determine the appropriate business logic to handle the various data quality conditions you encounter; incorporate data quality processing throughout your data lifecycle from data sourcing through information consumption; and regularly report on the data quality levels through dashboards.

The following five steps will help you weave data quality management into your enterprise data integration efforts.

1. Set a baseline for data quality management

First, determine and establish the baseline data quality level in your data source systems. Use data profiling software to analyze source systems for completeness and accuracy. Trying to perform data profiling manually, by coding queries on the source systems and comparing the answers to expected results, is laborious and rarely thorough. The good news is that you can find excellent data profiling software. The even better news is this functionality is increasingly being bundled with data integration software.

2. Verify the data -- and don't pass the buck!

Second, verify that the data extracted from source systems is the same data that was imported into your data warehouse. This seems obvious, but too often source system data is not as correct as people assume.

In cases like these, IT sometimes assumes it is the source system's problem. But if you move the data into your CPM or BI solution, then you take responsibility for its quality. You can't pass the buck. Too often, IT assumes their duty is simply to compare record count and sum checks of the loaded records with the source system to make sure they match. But unless you know -- beyond a shadow of a doubt -- the data is already correct and consistent, this minimalist quality check is not enough.

Remember the data profiling you did on your source systems? You need to do the same for your data warehouse after it has been loaded with data. This post-load audit is really essential to validate the data loading process.

For more on data quality management and enterprise data integration
Learn how Intellidyn implemented data quality management for 40 terabytes of data

Find out how Gartner ranked data quality vendors in its Magic Quadrant

Listen to a podcast about data quality, featuring thought-leader Larry English

3. Clean up the data

After verifying the data, it's time to perform the data cleansing processes required by your business and industry.

For example, in businesses that sell to consumers, name and address matching is an important data cleansing function. Specific industries, such as healthcare and finance, will have their own best practices for data quality management.

To perform data cleansing, you'll likely need to use specialized data quality software packages. Some of these can even be invoked during your enterprise data integration work. That's because many data integration packages now include data cleansing capabilities.

4. Make the data consistent

Now you need to make sure the data is consistent, so it can safely be used as reliable business information across an enterprise. Although individual departments or processes may have their own customer or product lists, the enterprise needs a single view of customers and products.

In data modeling terms, this process is called conforming dimensions. The enterprise applications market has bundled these processes as master data management (MDM) and customer data integration (CDI).

Establishing a single customer or product list may sound easy from a conceptual and technical perspective. However, the difficult roadblocks are the business and political issues that you may encounter when trying to obtain definition, agreement and responsibility for these lists. IT has to work collaboratively with the business to be successful in this area.

5. Transform data into information for business users

Finally, in order to create business information and enable business analytics, data needs to be filtered, transformed, enriched and aggregated. This last step transforms raw data into useful business information.

But beware -- by this time, the IT group has often put all its energy into the enterprise data warehouse and assumes its job is done. Not so fast. Business users typically need filtering, summarization and aggregation in order to make data into useful information that they can use for business decision making. When left to their own devices, business users often create data shadow systems, which can be a headache for an enterprise.

These five steps are critical for CPM and BI projects. Transforming data into business information is the cornerstone of these kinds of projects. And ultimately, these systems are only as good as the quality of the information that resides in them.

  • Pose a question to Rick Sherman or other business intelligence and data quality management experts in our Ask the Expert section.
  • Check out all of Rick Sherman's articles on SearchDataManagement.com.

  • Rate this Tip
    To rate tips, you must be a member of SearchDataManagement.com.
    Register now to start rating these tips. Log in if you are already a member.




    Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


    RELATED CONTENT
    Rick Sherman's BI Column
    Data warehousing appliances: Are they market breakers?
    Business intelligence and corporate performance management software: Build vs. buy
    Business intelligence and corporate performance management software: What's the difference?
    Data quality management: Follow the doctor's orders
    Business intelligence appliances: Disruptive technology or distraction?
    Why data governance projects fail
    Corporate performance management tomorrow, Data Shadow Systems today
    Business intelligence systems: Data shadow systems pros and cons
    Data integration tools can only solve part of the integration mystery
    Data manager best practices: Dear Santa, I've been a good data management manager

    Enterprise data integration (EDI)
    Data mashups meet business intelligence: "Bashups" explained
    Integration competency centers centralize data integration projects
    Informatica to buy identity resolution software maker for $85 million
    Understanding the top business intelligence data integration techniques
    What's the problem with hand-coding scripts for data integration, anyway?
    Integrating unstructured text into a structured environment
    Unlocking and integrating unstructured data, with Bill Inmon
    Managing unstructured data in the organization
    The real deal on data integration for business intelligence
    Companies choosing real-time data integration over batch-oriented techniques

    Data quality best practices
    Integration competency centers centralize data integration projects
    Informatica to buy identity resolution software maker for $85 million
    Information assurance: Dependability and security of networked information systems
    What is high quality information?
    Data governance success: No pain, no gain
    Data migration evolves from scripts to software
    Data quality programs: A common sense approach
    Data quality and governance management quiz
    Thirteen causes of enterprise data quality problems
    Data profiling tool pays off for mortgage risk intelligence firm

    RELATED GLOSSARY TERMS
    Terms from Whatis.com − the technology online dictionary
    data  (SearchDataManagement.com)
    data governance  (SearchDataManagement.com)
    data quality  (SearchDataManagement.com)
    data scrubbing  (SearchDataManagement.com)
    fixed data  (SearchDataManagement.com)
    raw data  (SearchDataManagement.com)

    RELATED RESOURCES
    2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
    Search Bitpipe.com for the latest white papers and business webcasts
    Whatis.com, the online computer dictionary

    DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.

    About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
    SEARCH 
    TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

    TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




    All Rights Reserved, Copyright 2005 - 2008, TechTarget | Read our Privacy Policy
      TechTarget - The IT Media ROI Experts