Open Access Open Access  Restricted Access Subscription or Fee Access

An Efficient and Fast Data Clustering using Fuzzy C-Means

L. Divya Sivanandini, M. Mohan Raj

Abstract


The chief objective of the clustering is to present a collection of similar records. Cluster Analysis (CA) is an exploratory data analysis technique for managing collected data into significant taxonomies, groups, or clusters, according to the combinations, which increases the similarity of cases inside a cluster at the same time increasing the dissimilarity between the other groups that are primarily unknown. In the proposed approach, the efficiency of the Modified Fuzzy C-means clustering is enhanced by density sensitive distance measure. Modified Fuzzy C-Means is the effective clustering algorithm available to cluster unlabeled data that produces both membership and typicality values during clustering process. The parameters used for the evaluation is Clustering accuracy, Mean Squared Error (MSE), Execution Time and Convergence behavior. The performance of the proposed approaches is evaluated, Irvine (UCI) machine repository datasets such as Iris, Wine, Lung Cancer and Lymphography.

Keywords


Fuzzy C-Means Clustering, Mean Squared Error (MSE), Convergence Behavior.

Full Text:

PDF

References


Fayyad, Usama, Gregory Piatetsky-Shapiro and Padhraic Smyth, “From Data Mining to Knowledge Discovery in Databases”, American Association for Artificial Intelligence- AI Magazine, Pp. 37-54, 1996.

David Hand, HeikkiMannila and Padhraic Smyth, “Principles of Data Mining”, The MIT Press , 2001. ISBN: 026208290x

Ojeda-Magafia B, Ruelas R, Corona-Nakamura M.A and Andina D, “An Improvement to the Possibilistic Fuzzy c-Means Clustering Algorithm”, Automation Congress, Pp. 1–8, 2006.

RuiXu and Donald Wunsch, “Survey of Clustering Algorithms”, IEEE Transactions on Neural Networks, Vol. 16, No. 3, May 2005.

Jiabin Deng, JuanLi Hu, Hehua Chi and Juebo Wu, “An Improved Fuzzy Clustering Method for Text Mining”, Second International Conference on Networks Security Wireless Communications and Trusted Computing (NSWCTC), Vol. 1, Pp. 65–69, 2010.

RuiqiongCai and Fusheng Yu, “Fuzzy Temporal Constraints Based Fuzzy Clustering Algorithm for Temporal Dataset”, Sixth International

Conference on Fuzzy Systems and Knowledge Discovery (FSKD '09), Vol. 1, Pp. 480–484, 2009.

Jieming Wu and Wenhu Yu, “Optimization and Improvement Based on K-Means Cluster Algorithm”, Second International Symposium on Knowledge Acquisition and Modeling (KAM '09), Vol. 3, Pp. 335–339, 2009.

J. Bezdek, “Pattern Recognition with Fuzzy Objective Function Algorithms”, New York: Plenum Press, 1981.

C. Havens Timothy, ChittaRadha, Jain Anil K and Jin Rong, “Speedup of fuzzy and possibilistic kernel c-means for large-scale clustering”, IEEE International Conference on Fuzzy Systems (FUZZ), Pp. 463–470, 2011.

J.C. Dunn, “A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters”, Journal of Cybernetics Vol. 3, Pp. 32-57, 1973.

Jiang-She Zhang and Yiu-Wing Leung, “Improved possibilistic C-means clustering algorithms”, IEEE Transactions on Fuzzy Systems, Vol. 4, No. 2, Pp. 209–217, 2004.

E. Chandra, V.P. Anuradha, “Survey on Clustering Algorithms for Data in Spatial Database Management Systems”, International Journal of Computer Applications vol. 24, no.9, pp.975 – 8887, June 2011.

A.K. Jain, M.N. Murty and P.J. Flynn, “Data Clustering: A Review”, ACM Computing Surveys, Vol. 31, No. 3, September 1999

Bin Wu, Lei Wang and CunliangXu, “Possibilistic clustering using non-Euclidean distance”, Chinese Control and Decision Conference (CCDC '09), Pp. 938 – 940, 2009.

M.Parimala, Daphne Lopez and N.C. Senthilkumar, “A Survey on Density Based Clustering Algorithms for Mining Large Spatial Databases”, International Journal of Advanced Science and Technology, Vol. 31, June, 2011.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.