A Comparative Study on Hierarchical Clustering in Data Mining
Abstract
Data mining is largely concerned with building models. Model is simply an algorithm or set of rules that connects a collection of data (input) to a particular target or outcome. Data mining involves the tasks are classification, estimation, prediction, clustering, affinity grouping, description & profiling. The first 3 are all the examples of directed data mining, where the goal is to find the value of a particular target variable. Affinity grouping and clustering are undirected tasks where the goal is to uncover structure in data without respect to particular target variable.
Profiling in a descriptive task that may be either directed or undirected. In this paper we will review the main methods and approaches of clustering. Clustering is the task of segmenting a heterogeneous population into a number of more homogeneous sub groups or clusters. This survey concentrated on data mining, data mining issues, clusters, clustering, clustering analysis, clustering algorithms, clustering issues, comparison of clustering algorithm, and Requirements of clustering in data mining.
Keywords
Full Text:
PDFReferences
Michel J.A. Berry Gordon S. Linoff – DM Techniques – Second Edition.
Han J.and Kamber M., Datamining: Concepts and Techniques, Morgan Kaufmann publishers, 2001.
Hierarchical Clustering Algorithm – A comparative study by Dr. N. Rajalingam and K.Ranjini in International Journal of Computer Ap0plications April 2011.
http://cgm.cs.mcgill.ca/~soss/cs644/projects/siourbas/sect5.html
Comparisons between data clustering algorithms in The International Arab Journal of Information Technology, Vol.5, No. 3, July 2008 by Osama Abu Abbas, Computer Science department, Yarmouk University, Jordan.
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.