|
|
||||||||||||||||||||
| Home > Data Management News > CDI: How much does data matching technology matter? | |
| Data Management News: |
|
||
That's because matching customer data across heterogeneous sources is a difficult problem to solve. "Matching" in CDI is used as a catch-all term for the process of comparing like records, eliminating duplicates, and combining them into the best version of a record, according to Jill Dyche and Evan Levy, co-authors of Customer Data Integration: Reaching a Single Version of the Truth (Wiley 2006) and partners and co-founders of Sherman Oaks, Calif.-based Baseline Consulting Inc.
"With CDI, name and address matching is really only the tip of the iceberg," Dyche said. CDI matching engines look at a broad variety of attributes and use complex processes to match and link customer records. Not all matching engines are created equal, though. When evaluating CDI tools, here's how to assess matching engines. Understand matching methods Matching is a complex subject, but two types of matching methods are generally used by customer data integration systems, Dyche and Levy wrote in their recent CDI book.
But it's not a matter of picking one or the other -- or one being more accurate than the other. The matching style chosen for CDI has to do with an organization's goals, according to Scott Gidley, co-founder and chief technology officer for Cary, N.C.-based DataFlux Corp., a data quality and CDI tool vendor. "It's more about processing performance -- how fast is something going to run based on how much value it's going to give you," Gidley said. Deterministic matching is more cut and dry, and faster for processes such as real-time CDI hub lookups, Gidley said. But probabilistic matching allows for a higher level of variance between records and might catch more potential matches. Most companies that provide matching engines incorporate some variation of both options in their methodologies, he said. And the end result of different CDI vendors' matching engines is often very similar, though it's a stretch to say that one algorithm will work for all companies, Levy said. The tool, and the matching engine, should be tested and evaluated against a company's unique requirements. Here are some other things to keep in mind when assessing matching engines:
Overall, Dyche said, the matching engine should be only a part of the overall CDI tool choice. "Matching is one of many decisions to make," Dyche said. "When we see these vendor bake-offs and companies get down to [which CDI vendor] has the most accurate match, that's still only one component of the decision." So, in some cases, the CDI tool with the most accurate matching engine might not be the final choice. Companies must consider the big picture, including data volumes, processing speed and functional requirements, Dyche said. |
|||||||||||||||||||||||||||||
| About Us | Contact Us | For Advertisers | For Business Partners | Site Index | RSS |
| |
|
|||||||