Open Access Open Access  Restricted Access Subscription or Fee Access

Fast Mining of Maximal Web Navigation Patterns

M. Thilagu, S. Sathya Bama

Abstract


Discovering user navigation patterns in web log sessions has been an interesting problem and used with many applications including web site development, e-business, e-learning etc. Most of the proposed algorithms for mining web log patterns generate candidate sequences and test whether they are frequent or not, based on the given min-sup. In this paper, we present a fast method that aims at mining prefix based maximal contiguous sequence patterns without generating candidate sequences level-by-level. It first generates maximal potential sequences and mines only them in the database using minimized search space.   Performance evaluation of the proposed algorithm is done by conducting experimental studies on a real dataset and found satisfactory when compared to previous approach.  

Keywords


Maximal Sequence Pattern, Sequence Pattern, Web Log Database, Web Usage Mining.

Full Text:

PDF

References


Agrawal R. and Srikant R. (1995) Mining Sequential Patterns. In Proceedings ICDE'95, 3-14.

Ayres J., Gehrke J., Yu T., and Flannick J. (2002) Sequential PAttern Mining using a Bitmap Representation. In SIGKDD 429-435.

Chen, M.S., Park, J.S. & Yu, P.S. (1998). Efficient Data Mining for Path Traversal Patterns. In IEEE Transactions on Knowledge and Data Engineering , 209-220.

Chen J. (2008) Contiguous Item Sequential Pattern Mining Using UpDown Tree, Intelligent Data Analysis – An International Journal, Vol. 12, No. 1, pp. 25-49.

R. Cooley, B. Mobasher, J. Srivastava, Data preparation for mining world wide web browsing patterns, Knowledge and Information Systems 1 (1) (1999) 5–32.

R. Cooley, Web usage mining: discovery and application of interesting patterns from web data, Ph.D. thesis, University of Minnesota, 2000.

Dhany Saputra, Dayang R., A. Rambli, Oi Mean Foong, (2008) Mining Sequential Patterns Using I-PrefixSpan, International Journal of Computer Science and Engineering 2;2.

Federico Michele Facca, Pier Luca Lanzi, Mining interesting knowledge from weblogs: a survey, Data & Knowledge Engineering 53 (2005) 225–241

J. Han, M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann, 2001.

Han J., Pei J.,Mortazavi-Asl B., Chen Q., Dayal U.,and Hsu M.-C. (2001) FreeSpan: Frequent Pattern-Projected Sequential Pattern Mining. In Proc. ACM SIGKDD 355-359.

Hengshan Wang ,Cheng Yang Hua Zeng, Design and Implementation of a Web Usage Mining Model Based On Fpgrowth and Prefixspan, Communications of the IIMA, 2006 Volume 6 pp.71-86.

Jaideep Srivastava, Robert Cooley, Mukund Deshpande, and Pang-Ning Tan, Web usage mining: Discovery and applications of usage patterns from Web data, SIGKDD Explorations, 2000, Vol.1.pp. 12-23.

Lin M. and Lee S. (2005) Fast Discovery of Sequential Patterns through Memory Indexing and Database Partitioning. J. Info. Sci. and Eng., 21, 109-128.

B. Mortazavi-Asl, Discovering and mining user web-page traversal patterns, Master_s thesis, Simon Fraser University, 2001.

Pei, J., Han, J., Mortazavi-asi, B. and Zhu, H. (2000). Mining Access Patterns Efficiently from Web Logs. In Proceedings of 6th Pacific Area Conference on Knowledge Discovery and Data Mining (PAKDD), 396-407.

Pei J., Han J., Mortazavi-Asl B., Wang J., Pinto H., Chen Q., Dayal U. and Hsu M. C. (2004) Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach. IEEE TKDE, vol. 16, 1424-1440.

Show-Jane Yen,An Efficient Approach for Analyzing User Behaviors in a Web-Based Training Environment. Journal of Distance Education Technologies, 1(4), 55-71, Oct-Dec 2003.

Srikant R., Agrawal R., (1996) Mining Sequential Patterns: Generalizations and Performance Improvements. In Int'l Conf Extd. DB. Tech. 3-17.

Zaki M. (2001) SPADE: An Efficient Algorithm for Mining Frequent Sequences. Machine Lrng., 40, 31-60.

Zhenglu Yang and Kitsuregawa M. (2005), LAPIN-SPAM: An Improved Algorithm for Mining Sequential Pattern, Proc. of Int'l Special Workshop on Databases For Next Generation Researchers, pp. 8-11.

Zhenglu Yang, Yitong Wang, and Masaru Kitsuregawa, An Effective System for Mining Web Log, LNCS 3841, pp. 40–52, 2006.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.