Open Access Open Access  Restricted Access Subscription or Fee Access

Improved Data Mining Based on Semantic Similarity to Mine New Association Rules

Gagan Kumar, Neeraj Mangla, Aakanksha Mahajan

Abstract


The problems of mining association rules in a database are introduced. Most of association rule mining approaches aim to mine association rules considering exact matches between items in transactions. A new algorithm called ―Improved Data Mining Based on Semantic Similarity to mine new Association Rules‖ which considers not only exact matches between items, but also the semantic similarity between them. Improved Data Mining (IDM) Based on Semantic Similarity to mine new Association Rules uses the concepts of an expert to represent the similarity degree between items, and proposes a new way of obtaining support and confidence for the association rules containing these items. An association rule is for ex: i.e. for a grocery store say ―30% of transactions that contain bread also contain butter; 2% of all transactions contain both of these items‖. Here 30% is called the confidence of the rule, and 2% the support of the rule and this rule is represented as Bread  Butter. The problem is to find all association rules that satisfy user-specified minimum support and minimum confidence constraints. This paper then results that new rules bring more information about the database.

Keywords


Data Mining, Semantic Similarity, Association Rules, Support, Confidence, Fuzzy Logic

Full Text:

PDF

References


W.H. Au, K.C.C. Chan, An effective algorithm for discovering fuzzy rules in relational databases, in: Proc. IEEE Internat. Conf. Fuzzy Systems, vol. II, 1998, pp. 1314–1319.

W.H. Au, K.C.C. Chan, FARM: a data mining system for discovering fuzzy association rules, in: Proc. FUZZ-IEEE’99, vol. 3, 1999, pp. 22–25.

Han, J. and Kamber, M. (2001) "Data Mining - Concepts and Techniques", 1st Edition.Nova York: Morgan Kaufmann.

Chen, G. and Wei, Q. (2002) "Fuzzy association rules and the extended mining algorithms",

Fuzzy Sets and Systems, v. 147, n. 1-4, p. 201-228 X.Wu, C.Zhang, and S.Zhang, Mining both Positive and Negative Association Rules, Proc. Of 19th Int. Conf. on Data Machine Learning, pp.658-665,2002.

Zhang, S., Zhang, C., Yan X., "Post-mining: maintenance of association rules by weighting", Information System 28 (2003) pp.691-707.

Han. J and M. Kamber (2004), ―Data Mining Concepts and Techniques‖: San Franscisco, CA:. Morgan Kaufmann Publishers.

Daly O, Taniar D. Exception Rules Mining Based on Negative Association Rules, Leture Notes in Computer Science, Vol.3046,2004,pp543-552.

Chen G, Zhu YQ, Yang HB, Study of Some Key Techniques in Mining Association Rule, Journal of Computer Research and Development, 2005, 42 (10): 1785-1789.

Ji GL, Yang M. Song YQ. Sun ZH. Fast Updating Maximum Frequent Item sets. Journal of Computers, 2005, 28(1):128-135.

T. P. Hong, K. Y. Lin and S. L. Wang, ―Mining fuzzy association rules from quantitative transactions", Soft Computing, Vol. 10, No.10, pp. 925-932, 2006.

Yun, U(2007)., "Efficient mining of weighted interesting patterns with a strong weight and/or support affinity", Information Science 177 (2007) pp.3477-3499.

Preetham Kumar and Ananthanarayana V.S(2008) ― Discovery of frequent itemsets using Weighted Tree method‖ IJCSNS, Vol. 8 No. 8 pp. 195-200

Liewean Cheng, Su-Chuan Chen, and Jashen Chen (2009) ―Applying Weighted Association Rules with the Consideration of Product Item Relevancy‖, 978-1-4244-3662-0/09, 2009 IEEE

Preetham Kumar and Ananthanarayana V.S(2009) ―Discovery of Frequent Itemsets Based on Minimum Quantity and Support" Computer Science Journal of Malaysia, Vol 3, Issue 3, June 2009.

Gyenesei A, ―Mining weighted association rules for fuzzy quantitative items‖ TUCS Technical Report No. 346, 2000, pp 1-12.

M. Kaya, R. Alhajj, F. Polat and A. Arslan, ―Efficient Automated Mining of Fuzzy Association Rules,‖ Proc. Of aDEXA, 2002.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.