data scrubbing (data cleansing)

Data scrubbing, also called data cleansing, is the process of amending or removing data in a database that is incorrect, incomplete, improperly formatted, or duplicated. An organization in a data-intensive field like banking, insurance, retailing, telecommunications, or transportation might use a data scrubbing tool to systematically examine data for flaws by using rules, algorithms, and look-up tables. Typically, a database scrubbing tool includes programs that are capable of correcting a number of specific type of mistakes, such as adding missing zip codes or finding duplicate records. Using a data scrubbing tool can save a database administrator a significant amount of time and can be less costly than fixing errors manually.

This was last updated in August 2010

Continue Reading About data scrubbing (data cleansing)

Dig Deeper on Data quality techniques and best practices

Join the conversation


Send me notifications when other members comment.

Please create a username to comment.

Data Cleansing Tools are becoming popular now a days. You can find really big names in this area, some of them are -
IBM, HP, Symantec, Dataladder etc.

Hey Margerat,
I have a confusion. As you said that Data scrubbing, also called data cleansing but i have read a difference that is;
Data Cleansing - It is a process of removing errors and resolving inconsistencies in source data before loading into targets.

Data Scrubbing - It is a process of filtering, merging, decoding and translating the source data into the validated data for data warehouse.

Will you please help me to understand this that either these are different processes or same ?
Thank you