Home > Introduction to data mining: Association analysis
Chapter Download Library:
EMAIL THIS LICENSING & REPRINTS

Introduction to data mining: Association analysis

13 Feb 2006 | Written by: Pang-Ning Tan, Michael Steinbach and Vipin Kumar

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   

The following is an exerpt on data mining techniques is from Introduction to Data Mining.

Association analysis: Basic concepts and algorithms
Introduction to data mining techniques

Many business enterprises accumulate large quantities of data from their day-to-day operations. For example, huge amounts of customer purchase data are collected daily at the checkout counters of grocery stores. Table 6.1 illustrates an example of such data, commonly known as market basket transactions. Each row in this table corresponds to a transaction, which contains a unique identifier labeled TID and a set of items bought by a given customer. Retailers are interested in analyzing the data to learn about the purchasing behavior of their customers. Such valuable information can be used to support a variety of business-related applications such as marketing promotions, inventory management, and customer relationship management.
Introduction to Data Mining:
Table 6.1. An example of market basket transactions.

          TID -- Items

  1. {Bread, Milk}
  2. {Bread, Diapers, Beer, Eggs}
  3. {Milk, Diapers, Beer, Cola}
  4. {Bread, Milk, Diapers, Beer}
  5. {Bread, Milk, Diapers, Cola}

This chapter presents a methodology known as association analysis, which is useful for discovering interesting relationships hidden in large data sets. The uncovered relationships can be represented in the form of association rules or sets of frequent items. For example, the following rule can be extracted from the data set shown in Table 6.1:

{Diapers}      -->       {Beer}.

The rule suggests that a strong relationship exists between the sale of diapers and beer because many customers who buy diapers also buy beer. Retailers can use this type of rules to help them identify new opportunities for cross-selling their products to the customers.
More information on data mining
Is data mining reliable?

Defing web business intelligence (WBI), data mining and data wareshousing

Besides market basket data, association analysis is also applicable to other application domains such as bioinformatics, medical diagnosis, Web mining, and scientific data analysis. In the analysis of Earth science data, for example, the association pattern may reveal interesting connections among the ocean, land, and atmospheric processes. Such information may help Earth scientists develop a better understanding of how the different elements of the Earth system interact with each other. Even though the techniques presented here are generally applicable to a wider variety of data sets, for illustrative purposes, our discussion will focus mainly on market basket data.
More info on this book

Introduction to Data Mining
By Pang-Ning Tan, Michael Steinbach and Vipin Kumar
ISBN: 0-321-32136-7
Publisher: Addison-Wesley
Copyright: 2006; 769 pages

There are two key issues that need to be addressed when applying association analysis to market basket data. First, discovering patterns from a large transaction data set can be computationally expensive. Second, some of the discovered patterns are potentially spurious because they may happen simply by chance. The remainder of this chapter is organized around these two issues. The first part of the chapter is devoted to explaining the basic concepts of association analysis and the algorithms used to efficiently mine such patterns. The second part of the chapter deals with the issue of evaluating the discovered patterns in order to prevent a generation of spurious results.



Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
Business intelligence data mining
Introduction to mining unstructured data
The business context for unstructured data mining: Three trends
Business intelligence, data mining maximize direct marketing for Intellidyn
Data mining uses in vertical industries
Rankings of customer data mining market show shift to user-friendly tools
Data warehousing, data mining and data querying: Terms and definitions
Howard Dresner predicts the future of business intelligence
Market maturity and Microsoft lead Gartner's latest business intelligence report
IBM brings enterprise search, business intelligence to Super Bowl coverage
Can data mining expertise help my career?

Data management books
Introduction to mining unstructured data
The business context for unstructured data mining: Three trends
Business process management and IT process entities
Understanding metrics for business process management
Master data management as a service-oriented architecture enabler
Why do we need master data management systems?
What is a master data management system?
Business benefits of managed master data
Introduction to enterprise master data management
What is Microsoft PerformancePoint Server 2007?

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
data dredging  (SearchDataManagement.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary


About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2005 - 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts