Home > Ask the Data Management Experts > Data governance and quality Questions & Answers > Data cleansing: The business impact of dirty data
Ask The Data Management Expert: Questions & Answers
EMAIL THIS

Data cleansing: The business impact of dirty data

William McKnight EXPERT RESPONSE FROM: William McKnight

Pose a Question
Other Data Management Categories
Meet all Data Management Experts
Become an Expert for this site


Enterprise IT tips and expert advice
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


>
QUESTION POSED ON: 22 February 2007
How do I even begin a data cleansing process? Do we start with our operational system or in the warehouse? And is there a way to determine (strategically) what data is worth being cleansed so we can save time and resources? (Is there some dirty data that won't affect us?)

>
EXPERT RESPONSE
It is always best to address data quality as early in the cycle as you can. Only when the operational system cannot be modified should data quality efforts be pushed to the data warehouse.

Please remember you don't necessarily "cleanse" all data that is determined to be less than par quality. Most of the time, you simply report on the data exception and find out your expected bounds are too tight or the exception was truly a business exception.

The best way I've found to determine strategically what data should be cleansed is to look at the business impact of not recognizing the less-than-ideal data quality and leaving it in place. If that business impact is greater than the effort to raise the awareness of the exception or cleanse the data, then the data certainly should be cleansed. Certainly there is some dirty data that will affect the business to a lesser degree than would the cost of fixing it. I have found that most data warehouse programs should take the important first step towards investigating data quality and establishing a data quality program: creating a framework for addressing quality violations.


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


RELATED CONTENT
Data governance and quality
Data quality management for data warehouses
What is high quality information?
Data quality management tools: Where to get unbiased information
Roles and salaries of data quality and governance analysts
The role of data governance in unstructured data
How to maintain data quality and provide high quality information management and analysis
Data quality management pitfalls: Three common mistakes to avoid
Data governance: Tips for techies and managers
Data quality management begins with data governance
Troubleshooting: Performing an insert with DB2 z/OS

Business intelligence and analytics
Who should the business intelligence team report to?
Do business intelligence tools require a data warehouse?
Starting a business intelligence career from a financial background
Data warehousing, data mining and data querying: Terms and definitions
Business intelligence career through Web development
Do you need managed reporting tools and business intelligence (BI) tools?
How to transition to real-time business intelligence and data warehousing
Business intelligence information management (BIIM) vs. BI
Application design for OLAP servers: Considerations and advice
Operational data store vs. operational business intelligence

Data quality best practices
Best practices for designing and implementing sustainable, long-term data quality programs
Effective data quality program management: Tips and advice
Creating successful data stewardship programs, with Jill Dyché
Data quality management for data warehouses
Gartner's data quality management software rankings show convergence with data integration
Master data management must start with data governance
Data quality assessment helps identify, fix data quality problems
Integration competency centers centralize data integration projects
Informatica to buy identity resolution software maker for $85 million
Information assurance: Dependability and security of networked information systems

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
data  (SearchDataManagement.com)
data governance  (SearchDataManagement.com)
data quality  (SearchDataManagement.com)
data scrubbing  (SearchDataManagement.com)
fixed data  (SearchDataManagement.com)
raw data  (SearchDataManagement.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary



Search and Browse the Expert Answer Center
Search and browse more than 25,000 question and answer pairs from more than 250 TechTarget industry experts.
Browse our Expert Advice

About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2005 - 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts