Open Access Open Access  Restricted Access Subscription or Fee Access

Scientific Understanding, Comprehensive Evolution and More Informed Evaluation of Various Sequential Pattern Mining Algorithms

Sahista Machchhar, C.K. Bhensdadia, A.M. Ganatra

Abstract


As database resources get complex and bulky it not only becomes difficult to access specific information but also to extract relevant information from them. A way to address this issue is through sequential pattern mining technique. Sequential pattern mining is new trend in the domain of data mining and has many useful and exciting applications. In the sequential pattern mining approach, we mainly deal with attempting to discover a pattern that is sequential in nature. This helps us to predicting next event after a sequence or sequence-of-event(s). The success of such techniques lies in the design of their algorithm. Today, there are several competitive and efficient algorithms that cope with the popular and computationally expensive task of sequential pattern mining. Actually, these algorithms are more or less described on their own. This paper mainly focuses on the need, merits and demerits of different sequential rule mining algorithms and categorizing them according to their mining method, search method adopted, database formatting employed and other constraints as applied to the database. The basic inspiration to undertake this study is to provide a single platform-of-information that will serve as a ready reference for both the researchers and practitioners interested in the designing and implementation of sequential pattern mining algorithms depending upon categorized databases.

Keywords


Sequential Pattern Mining, Database Formatting, Mining with Constraints, Pattern-Growth Method

Full Text:

PDF

References


R. Agrawal and R. Srikant, “Mining sequential patterns”, In Proceedings of the 1995 International Conference on Data Engineering, pp. 3-14, 1995.

R. Agrawal and R. Srikant, “Mining sequential patterns: generalizations and performance improvements”, In Proceedings of the 5th International Conference on Extending Database Technology, pp. 3-17, Avignon, France, 1996.

Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach Jian Pei, Jiawei Han, Behzad Mortazavi-Asl, Jianyong Wang, Helen Pinto, Qiming Chen, Umeshwar Dayal, and Mei-Chun Hsu, IEEE Transactions on Knowledge and Data Engineering, Vol. 16, No. 10, October 2004.

M. N. Garofalakis, R. Rastogi, K. Shim, “SPIRIT: Sequential Pattern Mining with Regular Expression Constraints”, In Proceedings of 25th VLDB Conference, pp. 223-234, San Francisco, California, 1999.

Y. L. Chen, M. C. Chiang, and M. T. Kao, “Discovering time-interval sequential patterns in sequence databases”, Expert Systems with Applications, Vol. 25, No. 3, pp. 343-354, 2003.

C. Antunes, A. L. Oliveira, “Generalization of Pattern-growth Methods for Sequential Pattern Mining with Gap Constraints”, Machine Learning and Data Mining in Pattern Recognition, Third International Conference, MLDM 2003, Leipzig, Germany, July 5-7, 2003, Proceedings.

M. J. Zaki, “SPADE: an efficient algorithm for mining frequent sequences”, Machine Learning Journal, Vol. 42, Iss. (1-2), pp. 31-60, 2001.

Show-Jane Yen and Yue-Shi Lee, “Mining Sequential Patterns with Item Constraints”, DaWaK 2004: data warehousing and knowledge discovery: International conference on data warehousing and knowledge discovery, Zaragoza, ESPAGNE, vol. 3181, pp. 381-390, 2004.

Yen-Liang Chen, Ya-Han Hu, “The consideration of recency and compactness in sequential pattern mining”, In Proceedings of the second workshop on Knowledge Economy and Electronic Commerce, Vol. 42, Iss. 2 ,pp. 1203-1215, 2006.

Jian Pei, Jiawei Han, Wei Wang, “Constraint-based sequential pattern mining : the pattern growth methods”, J Intell Inf Syst , Vol. 28, No.2, pp. 133 –160 , 2007.

Yu Hirate, Hayato Yamana, “Generalized Sequential Pattern Mining with Item Intervals”, Journal of Computers, Vol. 1, No. 3, June 2006.

A. Mortazavi, Q. Chen, U. Dayal, M. Hsu, J. Han and T. Pei, FREESPAN: Frequent Pattern Projected sequential pattern mining” Proc ACM SIGMOD, 2000.

Xiaogang Wang, Yan Bai, Yue Li,” An Information Retrieval method based on sequential access pat terns” in 2010 Asia-Pecific conference on Wearable computing system, 2010.

He Ziang, Runian Geng, Baoyou sun, “Research on mining sequential positive and negative associaltion rules”, International conference on intelligent computational Technology and automation, pp. 703-706, 2009.

Jean Francois Boulicaut, “If constraint based mining is the answer: what is the constraint? (Invited Talk)”, IEEE International conference on data mining workshops, 2008.

N Ju-Dong Ren, Yin-Bo Cheng, Lung-Lung Yang, “An Algorithm for Mining Generalized Sequential Patterns”, Proceedings of the Third International Conference on Machine Learning and Cybernetics, Shanghai, 26-29 August 2004.

Erich Allen Peterson, Peiyi Tang, “A Hybrid approach to mining frequent sequential patterns”, ACMSE, CLEMSON, SC, USA, 2009, Unpublished.

Agrawal, R. and Srikant, R. 1994. “Fast algorithms for mining association rules”, In Proceeding of 20th international conference on very large databases, VLDB, J.B. Bocca, M. Jarke, and C. Zaniolo.. Eds. Morgan Kauffman pp. 487-499.

Ching-Yao-Wang, Tzung-Pei Hong, S.S.T. 2001 “Maintainance of sequential pattern for record deletion”, International conference on data mining pp. 536-541.

Wang, J., & Han J., “BIDE: Efficient Mining of Frequent Closed Sequences”, Proceedings of the 20th International Conference of Data Engineering, Boston, USA, 2004, pp. 79-90.

An Efficient Sequential Pattern Mining Algorithm Based on the 2-Sequence Matrix , 2008 IEEE International Conference on Data Mining Workshops.

Incrementally Fast Updated Sequential Pattern Trees, Proceedings of the Seventh International Conference on Machine Learning and Cybernetics, Kunming, 12-15 July 2008.

An Efficient Frequent Item set Mining Algorithm, Proceedings of Sixth International Conference on Machine Learning Cybernetics, Hong K-ong, 19-22 August 2007.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.