Open Access Open Access  Restricted Access Subscription or Fee Access

A Hybrid Web Access Prediction Algorithm Using Agglomerative Clustering, Modified Markov Model and Association Rule

A. Anitha

Abstract


The explosive growth of data in the internet makes the people with difficulty in accessing interested pages. Although several methods   including   Markov   model    and association rule are available for web access prediction, they have their own limitations in terms of predicting ability and state space complexity. In this paper, it is proposed to identify browsing pattern of people having similar interest using agglomerative clustering approach using k nearest neighbors, modified Markov model and association rule mining. The goal of   this   paper   is   to   improve   prediction accuracy. The homogeneity of clusters is improved very   well   by exact agglomeration. While doing agglomerative clustering there exist a trade-off between speed and accuracy. The slowness is overcome by reducing the object considered during agglomeration to k, instead of N-1 and by eliminating distant neighbors having similarity value above predefined threshold. Unlike rough sets, this approach   considers   objects that definitely belonging to a cluster during agglomeration.  Hence, cluster validity is improved and computational complexity is reduced. Then, a dynamic Markov model is applied to generate matching states dynamically using cluster for test session. When ambiguity arises, Association rule mining and time-stamp parameter are used to resolve prediction conflicts.  The comparative results are presented depicting the improvement in predictive accuracy of the proposed hybrid approach over other systems.

 


Keywords


Agglomerative Clustering, Association Rule, Markov Model, Pattern Discovery.

Full Text:

PDF

References


Pasi Franti,Olli Virmajoki, and Ville Hautamaki “Fast Agglomerative Clustering Using a k Nearest Neighbor graph”, IEEE transaction on pattern analysis and machine intelligence. Vol 28,No 11. pp 1875-1881, November 2006

Pasi Fränti, Timo Kaukoranta, Day-Fann Shen, and Kuo-Shu Chang ,”Fast and Memory Efficient Implementation of the Exact PNN “,IEEE Transactions On Image Processing, Vol. 9, No. 5,pp.773-777, May 2000

Yang, Y, Balaji Padmanabhan, ” GHIC: a hierarchical pattern- based clustering algorithm for grouping Web transactions”, IEEE Transactions on Knowledge and Data Engineering, Volume: 17 Issue:9, pp. 1300 – 1304, Sep 2005.

Nasraoui, O. Soliman, M. Saka, E. Badia, A.Germain, R. “A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites”,IEEE transaction on Knowledge and data engineering,Volume 20,Issue 2, pp. 202-215, Feb 2008

Anitha A, Krishnan Nallaperumal,” An exact agglomerative clustering algorithm for Web Usage Mining”, International Journal Of Imaging Science And Engineering (IJISE),Vol 3,No 5,pp.324-329, Oct 2009

Anitha A, Krishnan Nallaperumal, ” Web Usage Mining by Agglomerative Clustering Using Small Neighborhood”, International Journal Of Imaging Science And Engineering, Vol.4,No.1, pp.331-337, January 2010

Kolari, P. Joshi, A. “Web mining research and practice”, A journal of computing in science and Enginneering,Vol 06 , Issue:4 ,pp: 49 – 53, July-Aug, 2004

Faten Khalil Jiuyong Li Hua Wang,” Integrating Recommendation Models for Improved Web Page Prediction Accuracy”, Conferences in Research and Practice in Information Technology (CRPIT), Australia, 2008,Vol. 74.

Alexandros Nanopoulos, Dimitrios Katsaros, and Yannis Manolopoulos, “ A Data Mining Algorithm for Generalized Web Prefetching”,IEEE Transactions On Knowledge And Data Engineering, Vol. 15, No. 5,pp.1155-1169, September/October 2003.

Mamoun A. Awad and Latifur R. Khan,” Web Navigation Prediction Using Multiple Evidence Combination and Domain Knowledge”, IEEE Transactions On Systems, Man, And Cybernetics—Part A: Systems And Humans, Vol. 37, No. 6,pp.1054-1062, November 2007

Jyoti, ” A Novel Approach for clustering web user sessions using RST”, International Journal on Computer Science and Engineering Vol.2(1), pp.56-61, 2009.

Mathias G´ery, Hatem Haddad,” Evaluation of Web Usage Mining approaches for user’s next request prediction” WIDM , Boston, USA,2003

Siripom chimphlee,Naomie Salim,Mohd Salihin Bin Ngadiman, Witcha,Surat ,”Rough Sets Clustering and Markov Model for Web Access Prediction” , Proceedings of post graduate annual seminar ,2006, pp. 470-474


Refbacks

  • There are currently no refbacks.