Open Access Open Access  Restricted Access Subscription or Fee Access

Knowledge Discovery Framework for Community Web Directories

Upendar Para, Podila Kondala Rao, Manasa Manasa

Abstract


In contrast to most of the work on Web usage mining,
the usage data that are analyzed here correspond to user navigation throughout the Web, rather than a particular Web site, exhibiting as a result a high degree of thematic diversity. For modeling the user communities, we introduce a novel methodology that combines the user’s browsing behavior with thematic information from the Web
directories. The proposed personalization methodology is evaluated in a general-purpose Web directory, indicating its potential value to the web user. A Web directory, such as Yahoo (www.yahoo.com) and the
Open Directory project (ODP) (dmoz.org), allows users to find Web sites related to the topic they are interested in, starting with broad categories and gradually narrowing down, choosing the category most related to their interests. For personalization Web directories uses the
OCDM, OPDM &OCPDM algorithms. The experiment results show the effectiveness of the different machine learning techniques on the task.


Keywords


OCDM, OPDM, OCPDM, Personalization

Full Text:

PDF

References


B. Mobasher, R. Cooley, and J. Srivastava, “Automatic Persona-lization

Based on Web Usage Mining,” Comm. ACM, vol. 43, no. 8,pp. 142-151,

Dimitrios Pierrakos, Member, IEEE, and Georgios

Paliouras,”Personalizing Web Directories with the Aid of Web Usage

Data”,pp.1331-1344,2010.

G. Paliouras,C.Papatheodorou,V. Karkaletsis, and C.D.Spyropoulos,

“Discovering User Communities on the Internet Using Unsupervised

Machine Learning Techniques,” Interacting with Computers J., pp.

-791, 2002.

C. Christophi,D. Zeinalipour-Yazti, M.D. Dikaiakos, and G. Paliouras,

“Automatically Annotating the ODP Web Taxonomy,”Proc. 11th

Panhellenic Conf. Informatics (PCI ’07), 2007,pp. 398-404.

R. Cooley, B. Mobasher, and J. Srivastava.” Data preparation for mining

world wide web browsing patterns”. Journal of Knowledge and

Information Systems, 1(1),pp. 1-25, 1999.

A. Dempster, N. Laird, and D. Rubin. “Maximum likelihood from

incomplete data via the em algorithm”.Journal of Royal Statistical

Society, pp. 1–38, 1977.

X.Jin, Y. Zhou, and B. Mobasher, “Web Usage Mining Based on

Probabilistic Latent Semantic Analysis,” Proc. ACM SIGKDD, pp.

-205, 2004.

J. Hartigan, “Clustering Algorithms”. John Wiley & Sons, pp.100-150,

D. Pierrakos, G. Paliouras, C. Papatheodorou, V. Karkaletsis, and M.

Dikaiakos, “Web Community Directories: A New Approach to Web

Personalization,” Web Mining: From Web to Semantic Web, B. Berendt

et al., eds., pp. 113-129, Springer, 2004.

Michael Wurst ,WVTOOL GUIDE pdf.

J. Srivastava, R. Cooley, M. Deshpande, and P.T. Tan, “Web Usage

Mining: Discovery and Applications of Usage Patterns from Web Data,”

SIGKDD Explorations, vol. 1, no. 2, pp. 12-23, 2000.

D. Pi errakos, G. Paliouras, C. Papatheodorou, and C.D.Spyropoulos,

“Web Usage Mining as a Tool for Personalization:A Survey,” User

Modeling and User-Adapted Interaction, vol. 13,no. 4, pp. 311-372,

G.Paliouras,C.Papatheodorou,V.Karkaletsis,andC.D.Spyropoulos,Disco

vering User Communities on the Internet Using Unsupervised Machine

Learning Techniques,” Interacting with Computers J., vol. 14, no. 6, pp.

-791, 2002.

G. Xu, Y. Zhang, and Y. Xun, “Modeling User Behaviour for Web

Recommendation Using lda Model,” Proc. IEEE/WIC/ACM Int’l Conf.

Web Intelligence and Intelligent Agent Technology, pp. 529-532,2008.

W. Chu and S.-T.P. Park, “Personalized Recommendation on Dynamic

Content Using Predictive Bilinear Models,” Proc. 18th Int’l Conf. World

Wide Web (WWW), pp. 691-700, 2009


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.