
Effective Web Personalization from web logs using Tree Structure

P. Arun, Dr. K. Iyakutti

Abstract


Information on the World Wide Web [27], [41] is growing explosively, and use of the Web is growing with it. Every access to a web page creates a separate entry in the web log file, so the log file grows correspondingly. The web log file yields useful information about users' previous access sequences. From that information it is possible to predict their future access sequences and to personalize [2], [3] the patterns of greatest interest to each user. In this paper we create a model based on a tree structure. The model first scans the collected web log file and builds a tree for each user. It then traverses each tree level by level to find the patterns of interest for that user, which are used to personalize the user's future accesses. From the results of the level-by-level traversal, the model clusters the users who share the same pattern of interest. It counts the clusters to find the maximum number of patterns of interest on the website, and it counts the patterns in each cluster to find the path/pattern of greatest interest on the website during that period. The clustering results are then used to personalize the information presented to users in their future accesses. The proposed model is useful for understanding user behavior, identifying the objects of interest on a website, improving website design based on the results, and finding both the maximum number of patterns of interest and the path/pattern of greatest interest on the website. Finally, we report experimental studies of the proposed model on web log data from a reputed website and demonstrate the model's efficiency.
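The pipeline summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `Node`, `build_tree`, `interested_pattern`, and `cluster_users` are hypothetical, the log is assumed to be already split into per-user sessions, and "pattern of interest" is assumed here to mean the most visited page at each level of the user's tree, with ties broken alphabetically.

```python
from collections import Counter, defaultdict


class Node:
    """One page in a user's access tree, with a visit count (hypothetical structure)."""

    def __init__(self, page):
        self.page = page
        self.count = 0
        self.children = {}


def build_tree(sessions):
    """Insert each session (an ordered list of pages) as a root-to-leaf path."""
    root = Node(None)
    for session in sessions:
        node = root
        for page in session:
            node = node.children.setdefault(page, Node(page))
            node.count += 1
    return root


def interested_pattern(root):
    """Level-by-level traversal: keep the most visited page at each depth.

    Assumption: counts of the same page at the same depth are summed, and
    ties are broken by page name so the result is deterministic.
    """
    pattern = []
    level = list(root.children.values())
    while level:
        totals = Counter()
        for node in level:
            totals[node.page] += node.count
        pattern.append(min(totals, key=lambda p: (-totals[p], p)))
        # Move down one level: gather every child of the current level.
        level = [child for node in level for child in node.children.values()]
    return tuple(pattern)


def cluster_users(sessions_by_user):
    """Group users whose trees yield the same pattern of interest."""
    clusters = defaultdict(list)
    for user, sessions in sessions_by_user.items():
        clusters[interested_pattern(build_tree(sessions))].append(user)
    return dict(clusters)


# Made-up sessions standing in for a parsed web log.
sessions_by_user = {
    "alice": [["/home", "/products", "/cart"],
              ["/home", "/products", "/reviews"],
              ["/home", "/about"]],
    "bob": [["/home", "/products", "/cart"],
            ["/home", "/products", "/cart"]],
    "carol": [["/home", "/about"],
              ["/home", "/about", "/contact"]],
}

clusters = cluster_users(sessions_by_user)
most_popular = max(clusters, key=lambda p: len(clusters[p]))
print(len(clusters), "clusters; most shared pattern:", most_popular)
```

In this toy data, alice and bob fall into one cluster and carol into another, so the pattern shared by the larger cluster is the one a site would surface first. A real system would also need session splitting and log cleaning, which the sketch leaves out.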


Keywords


Web Usage Mining, Clustering, Traversal, Web Personalization.


References


R. Agrawal, T. Imielinski, and A. Swami, Database mining: A performance perspective, IEEE Transactions on Knowledge and Data Engineering, 5(6):914-925, December 1993. Special Issue on Learning and Discovery in Knowledge Based Database.

Massimiliano Albanese, Antonio Picariello, Carlo Sansone and Lucio Sansone, A Web Personalization System Based on Web Usage Mining Techniques, WWW2004, May 17-22, 2004, New York, USA, ACM 1-58113-912-8/04/05.

Jiaqian Zheng, Jing Yao and Junyu Niu, Web User De-Identification in Personalization, ACM 978-1-60558-085-2/08/04.

R. Denaux, L. Aroyo, V. Dimitrova, “An Approach for Ontology-based Elicitation of User Models to Enable Personalization on the Semantic Web”, International World Wide Web Conference, 2005.

Hongyu Zhang, “The Scale-Free Nature of Semantic Web Ontology”, School of Software, Tsinghua University, Beijing 100084, China. ACM 978-1-60558-085-2/08/04.

Santosh K. Rangarajan, Vir V. Phoha, Kiran Balagani, S. S. Iyengar, Rastko Selmic, Web User Clustering and Its Application to Prefetching Using ART Neural Networks.

Balaji Padmanabhan, Zhiqiang Zheng and Steven O. Kimbrough, Personalization from Incomplete Data: What You Don’t Know Can Hurt, ACM 2001 1-58113-391-/01/08.

Ching-Cheng Lee, Category-Based Web Personalization System, 0-7695-1372-7/01 @2001 IEEE.

Ramesh C. Agarwal, Charu C. Aggarwal, V.V.V. Prasad, A Tree Projection Algorithm for Generation of Frequent Itemsets. IBM T.J. Watson Research Center, Yorktown Heights, NY 10598.

Ming-Syan Chen, Jong Soo Park, Data Mining for Path Traversal Patterns in a Web Environment, IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY.

Jaideep Srivastava, Robert Cooley, Mukund Deshpande, Pang-Ning Tan, Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data, SIGKDD Explorations, ACM SIGKDD, Jan 2000.

Kenneth Wai-Ting Leung, Wilfred Ng and Dik Lun Lee, Personalized Concept-Based Clustering of Search Engine Queries. IEEE Transactions on Knowledge and Data Engineering, Vol. 20, No. 11, November 2008.

Gerd Stumme, Andreas Hotho, Bettina Berendt, “Usage Mining for and on the Semantic Web”, Institute for Applied Computer Science and Formal Description Methods (AIFB), University of Karlsruhe, D-76128 Karlsruhe, Germany.

S. SenthilKumar, T. V. Geetha, “Personalized Ontology for Web Search Personalization”, Proceedings of the 1st Bangalore Annual Compute Conference, Bangalore, India, 2008. ISBN: 978-1-59593-950-0.

Renata Ivancsy, Istvan Vajk, “Frequent Pattern Mining in Web Log Data”, Acta Polytechnica Hungarica, 2006. Department of Automation and Applied Informatics and HAS BUTE Control Research Group.

R. Agrawal and R. Srikant, Mining Sequential Patterns. In Proc. 1995 Int. Conf. Data Engineering, pages 3-14, Taipei, Taiwan, March 1995.

O. Zaiane, M. Xin and J. Han, Discovering Web access patterns and trends by applying OLAP and data mining technology on web logs. In Proc. Advances in Digital Libraries Conf. (ADL’98), Melbourne, Australia, pages 19-29, April 1998.

R. Agrawal and R. Srikant. Fast algorithms for mining association rules. Proceedings of the Twentieth International Conference on Very Large Databases, 1994, pp. 487-499.

E. Cohen, B. Krishnamurthy and J. Rexford, Efficient algorithms for predicting requests to web servers. In Proceedings of the IEEE INFOCOM’99 Conference, 1999.

J. Pitkow and P. Pirolli. Mining Longest Repeating Subsequences to Predict World Wide Web Surfing. In Second USENIX Symposium on Internet Technologies and Systems, Boulder, CO, 1999.

J. Srivastava, R. Cooley, M. Deshpande and P. Tan, Web usage mining: discovery and applications of usage patterns from web data. SIGKDD Explorations, 1(2):12–23, 2000.

S. Schechter, M. Krishnan and M. D. Smith, Using path profiles to predict HTTP requests. In 7th International World Wide Web Conference, pages 457–467, Brisbane, Qld., Australia, April 1998.

Fu Y., Sandhu K., and Shih M., “Clustering of Web Users Based on Access Patterns.” International Workshop on Web Usage Analysis and User Profiling (WEBKDD’99), San Diego, CA, 1999.

Zhang T., Ramakrishnan R., and Livny M., “Birch: An Efficient Data Clustering Method for Very Large Databases.” In Proceedings of the ACM SIGMOD Conference on Management of Data, pages 103-114, Montreal, Canada, June 1996.

Paliouras G., Papatheodorou C., Karkaletsis V., and Spyropoulos C.D., “Clustering the Users of Large Web Sites into Communities.” In Proceedings of the International Conference on Machine Learning (ICML), pages 719-726, Stanford, California, 2000.

Xie Y., and Phoha V. V., “Web User Clustering from Access Log Using Belief Function.” K-CAP’ 01, British Columbia, Canada, October 2001.

Cooley R., Mobasher B., and Srivastava J., “Web Mining: Information and Pattern Discovery on the World Wide Web.” ICTAI’97, 1997.

B. Mobasher, R. Cooley, and J. Srivastava. Automatic personalization based on Web usage mining. In Communications of the ACM, 43(8), August 2000.

M. Perkowitz and O. Etzioni. Adaptive Sites: Automatically learning from user access patterns. In Proc. 6th Int’l World Wide Web Conf., Santa Clara, California, April 1997.

R. Srikant and R. Agrawal. Mining quantitative association rules in large relational tables. In Proc. 1996 ACM-SIGMOD Int. Conf. Management of Data, pages 1-12, Montreal, Canada, June 1996.

M.S. Chen, J.S. Park, P.S. Yu: Data Mining for Path Traversal Patterns in a Web Environment. 16th Intl. Conference on Distributed Computing Systems, 1996. pp. 385-392.

T. Yan, M. Jacobsen, H. Garcia-Molina, and U. Dayal: From user access patterns to dynamic hypertext linking. Intl. World Wide Web Conference, Paris, France, 1996.

Ramesh Kumar Jain, R.S. Kasana, Suresh Jain, Efficient Web Log Mining using Doubly Linked Tree, International Journal of Computer Science and Information Security, Vol. 3, No. 1, 2009.

Wang Tong, He Pi-lian, Web Log Mining by an Improved AprioriAll Algorithm, World Academy of Science, Engineering and Technology 4, 2005.

Qiang Yang, Hui Wang, Wei Zhang, Web-log Mining for Quantitative Temporal-Event Prediction, IEEE Computational Intelligence Bulletin, Vol. 1, No. 1, Dec 2002.

F. Tao, F. Murtagh: Towards Knowledge Discovery from WWW Log Data. Intl. Conference on Information Technology: Coding and Computing. Mar. 2000. pp 302-307

O.R. Zaïane, M. Xin, J.W. Han: Discovering Web Access Patterns and Trends by Applying OLAP and Data Mining Technology on Web Logs. ADL 1998. pp. 19-29.

P. Contreras, F. Tao: The Integration of the log mining in the context of IRAIA. Presented at DIW technical meeting, Berlin, Feb. 2001.

Markatos E. P., and Chronaki C. E., “A Top-10 Approach to Prefetching on the Web.” In Proceedings of the Eighth Annual Conference of the Internet Society (INET'98), Geneva, Switzerland, July 1998.

Padmanabhan V. N., and Mogul J. C., “Using Predictive Prefetching to Improve World Wide Web Latency.” ACM Computer Communication Review, Vol. 26, No. 3, pages 22-36, July 1996.

Wen-Chen Hu, Xuli Zong, Chung-wei Lee and Jyh-haw Yeh, World Wide Web Usage Mining Systems and Technologies.

Olfa Nasraoui, Mrudula Pavuluri, Eager Learning in Two Stages for Precise and Complete Web Personalization, IEEE 2005, 0-7803-9158-6/05.

Tadeusz Morzy, Marek Wojciechowski, Maciej Zakrzewicz, Pattern-Oriented Hierarchical Clustering.




Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.