Open Access Open Access  Restricted Access Subscription or Fee Access

Exploring User Navigation with the Synergy of Modified Ant Based Clustering and LCS Classification

M. Raji, N. Muthumani

Abstract


World Wide Web is a huge repository of web pages and links. It provides abundance information for the Internet users. The growth of web is incredible as it can be seen in present days. Users’ accesses are recorded in web logs. Web usage mining is application of mining techniques in logs. The proposed system consists of two phases, (i) Offline phase and (ii) Online phase. In the offline phase, preprocessing and clustering is performed, while the classification and prediction is performed during the online phase. Preprocessing is the step which transforms the raw log file into a form that is more suitable for mining. Four steps are used in preprocessing, they are, data cleaning, user identification, session identification and formatting the result to suit the clustering algorithm. Modified ant-based clustering is proposed in this paper. Here, the algorithm doesn’t have any parameters and assumptions. The proposed method will automatically calculate the number of ants required for clustering. In the online phase, a classification algorithm based on Longest Common Sequence algorithm is used. The experimental results suggest that the proposed technique for web log mining results in better prediction of user behaviors when compared to the conventional techniques.

Keywords


Preprocessing, Data Cleaning, Session Identification, Modified Ant-Based Clustering, Longest Common Sequence

Full Text:

PDF

References


Ed Chi, James Pitkow, Jock Mackinlay, Peter Pirolli, Rich Gossweiler, and Stuart Card, (1998). Visualizing the evolution of web ecologies. In CHI 98, pages 400-407, Los Angeles, CA.

Mobasher, B., Cooley, R. and Srivastava, J. (2000) Automatic Personalization Based on Web Usage Mining, Communications of the ACM, Vol. 43 , Issue 8, Pp: 142 - 151

Cyrus Shahabi, Farnoush Banaei-kashani, (2003), Efficient and anonymous web-usage mining for web personalization, Information Processing and Management

Magdalini Eirinaki, Michalis Vazirgiannis, (2003),Web mining for web personalization, ACM Transactions on Internet Technology (TOIT), Vol. 3, No. 1.

K. Devipriyaa and B. Kalpana. (2010) Users' Navigation Pattern Discovery using Ant Based Clustering and LCS Classification, Journal of Global Research in Computer Science, Pp. 1-5, Vo. 1, No. 1.

Jalali, M., Mustapha, N., Mamat, A., Md Sulaiman, N. (2008) OPWUMP An architecture for online predicting in WUM-based personalization system, 13th International CSI Computer Science, Springer Verlag, Vol. 6, Pp. 838-841.

Yan, W.T., Jacobsen, M., Garcia-Molina, H. and Umeshwar, (1996) From user access patterns to dynamic hypertext linking, Computer Networks and ISDN Systems, Proceedings of the Fifth International World Wide Web Conference, Elsevier, Computer Networks and ISDN Systems, Vol. 28, Issues 7-11, Pp. 1007-1014

Eui-Hong Han, Daniel Boley, Maria L. Gini, Robert Gross, Kyle Hastings, George Karypis, Vipin Kumar, Bamshad Mobasher, Jerome Moore, (1998), WebACE: a Web agent for document categorization and exploration, Pp. 408-415.

Joachims, T., Freitag, D. and Mitchell, T. (1997) Webwatcher: A tour guide for the world wide web. In The 15th International Conference on Artificial Intelligence, Nagoya, Japan.

Ngu, D.S.W. and Wu, X. (1997) Sitehelper: A localized agent that helps incremental exploration of the world wide web. In 6th International World Wide Web Conference, Santa Clara, CA.

Lieberman, H.(1995) Letizia: An agent that assists web browsing. In Proc. of the 1995 International Joint Conference on Artificial Intelligence, Montreal, Canada.

Mobasher, B. and Moore, J. (1998) Webace: a web agent for document categorization and exploration, Proc. of the 2nd International Conference on Autonomous Agents, ACM Press, Pp. 408—415.

Yan, T. W., Jacobsen, M., Garcia-Molina, H. and Umeshwar, D. (1996) From user access patterns to dynamic hypertext linking. Fifth International World Wide Web Conference, Pp. 52-54.

Fayyad, U., Piatetsky-Shapiro, G. and Smyth, P.(1994) From data mining to knowledge discovery: An overview, Proc. ACM KDD, Pp. 23-39.

Ivancsy, R. and Kovacs, F. (2006) Clustering techniques utilized in web usage mining, Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, World Scientific and Engineering Academy and Society (WSEAS), Knowledge Engineering and Data Bases, Pp. 237-24


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.