Open Access Open Access  Restricted Access Subscription or Fee Access

Pattern Discovery: An Inference Analysis Approach

Ajay Desai, D. Sujatha

Abstract


Data mining is a process of discovering knowledge of interest for the users from various information repositories like databases and data warehouses by extracting interesting data patterns that represent knowledge. These interesting data patterns are obtained by evaluating the existing data patterns based on some interestingness measures. This data mining process is carried out by using several data mining functionalities among which Association Rule Mining is most commonly used to find interesting data patterns based on the association relationship among various data items of a data set. This association relation is represented by an association rule, which is a data pattern. To evaluate the interestingness of an association rule, two interestingness measures called as support and confidence are used whose threshold values are set by domain experts. These threshold values called as minimum support and minimum confidence which represent domain knowledge. These threshold values cannot be always accurate, which leads to loss of interesting association rules and also affects the quality of association rules discovered. This leads a user to take wrong decisions .
To avoid this problem, we need to move from this support – confidence framework for a process which can find interesting association rules based on their logical correctness. An association rule is logically correct only when it satisfies a logical principle called Equivalence, such rules are called as coherent rules. As these coherent rules are discovered without using domain knowledge, no interesting association rules are lost. Though this process is accurate, it is a time consuming task to check the logical correctness of every association rule. So we need to use a technique called Pruning, through which we can get accurate results by consuming less time.


Keywords


Data Mining, Association Rules, Coherent Rule, Equivalence, Pruning.

Full Text:

PDF

References


R. Agrawal, T. Imielinski, and A. Swami, “Mining Association Rules between Sets of Items in Large Databases,” SIGMOD Record, vol. 22, pp. 207-216, 1993

S. Brin, R. Motwani, and C.Silverstein, “Beyond Market Baskets: Generalizing Association Rules to Correlations,” Proc. 1997 ACM SIGMOD, pp. 265-276, 1997.

J. Han and M. Kamber, Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2006.

M.-L. Antonie and O.R. Zaı¨ane, “Mining Positive and Negative Association Rules: An Approach for Confined Rules,” Proc . European Conf. Principles and Practice of Knowledge Discovery in Databases (PKDD ‟04), pp. 27-38, 2004.

R. Forsyth, “Zoo Data Set,” Orange, AI Lab,http://magix.fri.unilj.si/orange/doc/datasets/zoo.htm,1990


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.