Open Access Open Access  Restricted Access Subscription or Fee Access

K- Means and Genetic K- Means based Clustering Analysis of Tumorous and Non-Tumorous Tissues

Gotam Singh Lalotra, R.S. Thakur

Abstract


The aim of this paper is to explore the behavior of two clustering algorithms the time and accuracy at different population and configurations of k-means and Genetic Algorithm based k- means for Serial Analysis of Gene Expression (SAGE) Data, as the SAGE data is less explored data which is characterized by high dimensions, in this study an analysis is performed for SAGE data clustering.

Keywords


Clustering, Genetic Algorithm, k-Means, SAGE, Dimensions.

Full Text:

PDF

References


C. Pizzuti and T. Domenico, "P-AutoClass: scalable parallel clustering for mining large data sets." Knowledge and Data Engineering, IEEE Transactions on, Vol 15, No. 3, pp. 629-641, 2003.

J. Marr, "Comparison of several clustering algorithms for data rate compression of lpc parameters." Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'81, Vol. 6, pp. 964-966, 1981

A. Wong, K.C. Andrew and GCL Gary, “Simultaneous pattern and data clustering for pattern cluster analysis." Knowledge and Data Engineering, IEEE Transactions on, vol. 20. 7, pp. 911-923, 2008.

Shehroz S. Khan and Amir Ahmad, “Cluster center initialization algorithm for K-modes clustering” Expert Systems with Applications vol. 40 pp. 7444–7456. 2013

M. Lin, W. Hsu, “Mining GPS data for mobility patterns: A Survey". Pervasive Mobile Comput, Vol 12, pp.1-16, 2013.

Dongxia Chang, Yao Zhao , Changwen Zheng, Xianda Zhang, A genetic clustering algorithm using a message-based similarity measure Expert Systems with Applications, vol. 39, pp. 2194–2202, 2012

A. Freitas, Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer, 2002

A. A. Freitas, “A review of evolutionary algorithms for data mining,” in Soft Computing for Knowledge Discovery and Data Mining, Springer, pp. 79–11, 2008

Amir Ahmad and Lipika Dey, “A k-means clustering algorithm for mixed numeric and categorical data” Data & Knowledge Engineering vol. 63, pp. 503-527, 2007

J. Han, M. Kamber, and J. Pei, Data mining: concepts and techniques. Morgan kaufmann, 2006.

C. L. Yang, R. J. Kuo, C. H. Chien, N. T. P. Quyen, "Non-dominated sorting genetic algorithm using fuzzy membership chromosome for categorical data clustering", Applied Soft Computing, Vol. 30, pp. 113–122, 2015

T. Velmurugan, “Performance based analysis between k-Means and Fuzzy C-Means clustering algorithms for connection oriented telecommunication data” Applied Soft Computing vol. 19 pp. 134–146. 2014

K. A. Abdul Nazeer and M. P. Sebastian, “Improving the Accuracy and Efficiency of the k- means Clustering Algorithm” Proceedings of the World Congress on Engineering 2009 Vol I WCE 2009, London, U.K.,2009

A. Wong, K.C. Andrew and GCL Gary, “Simultaneous pattern and data clustering for pattern cluster analysis." Knowledge and Data Engineering, IEEE Transactions on, vol. 20. 7, pp. 911-923, 2008.

Ujjwal Maulik and Sanghamitra Bandyopadhyay, “Genetic algorithm-based clustering technique” Pattern Recognition vol. 33, pp. 1455-1465, 2000.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.