Open Access Open Access  Restricted Access Subscription or Fee Access

An Effective Cancer Classification using Machine Learning Algorithms

K. Anandakumar, C.S. Vijayasri, Dr. M. Punithavalli

Abstract


In this paper, the recently developed Extreme Learning Machine (ELM) is used for direct multicategory classification problems in the cancer diagnosis area. It uses Microarray gene expression cancer diagnosis for directing multicategory classification problems in the cancer diagnosis area. The common problems faced by iterative learning methods like local minima improper learning rate and over fitting are avoided by ELM. ELM completes the training at a faster rate. We have evaluated the multicategory classification performance of ELM on three benchmark microarray data sets for cancer diagnosis, namely, the Lymphoma data set. The results indicate that ELM produces comparatively better classification accuracies with reduced training time. The implementation complexity of ELM is very less compared to artificial neural networks methods like conventional back-propagation ANN, Linder‘s SANN and Support vector machine.

Keywords


ELM, ANOVA, Cancer Classification and Gene Expression

Full Text:

PDF

References


M. Ringner, C. Peterson, and J. Khan, ―Analyzing Array Data Using Supervised Methods,‖ Pharmacogenomics, vol. 3, no. 3, pp. 403-415, 2002.

G.-B. Huang and C.-K. Siew, ―Extreme Learning Machine: RBF Network Case,‖ Proc. Eighth Int‘l Conf. Control, Automation, Robotics, and Vision (ICARCV ‘04), Dec. 2004

D. Serre, Matrices: Theory and Applications. Springer-Verlag, 2002.

G.-B. Huang, L. Chen, and C.-K. Siew, ―Universal Approximation Using Incremental Constructive Feedforward Networks with Random Hidden Nodes,‖ IEEE Trans. Neural Networks, vol. 17, no. 4, pp. 879-892, 2006.

S. Dudoit, J. Fridlyand, and T.P. Speed, ―Comparison of Discrimination Methods for Classification of Tumors Using Gene Expression Data,‖ J. Am. Statistical Assoc., vol. 97, no. 457, pp. 77-87, 2002.

Runxuan Zhang, Guang-Bin Huang, Narasimhan Sundararajan, and P. Saratchandran, ― Multicategory Classification Using an Extreme Learning Machine for Microarray Gene Expression Cancer Diagnosis, ‖ vol 4,no 3,july-september 2007.

M. Schena, D. Shalon, R.W. Davis, and P.O. Brown, ―Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray,‖ Science, vol. 270, pp. 467-470, 1995.

S. Dudoit, J. Fridlyand, and T.P. Speed, ―Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data,‖ J. Am. Statistical Assoc., vol. 97, pp. 77-87, 2002.

R. Linder, D. Dew, H. Sudhoff, D. Theegarten, K. Remberger, S.J. Poppl, and M. Wagner, ―The ‘Subsequent Artificial Neural

Network‘ (SANN) Approach Might Bring More Classificatory Power to ANN-Based DNA Microarray Analyses,‖ Bioinformatics, vol. 20, no. 18, pp. 3544-3552, 2004

G.-B. Huang, Q.-Y. Zhu, and C.-K. Siew, ―Extreme Learning Machine: A New Learning Scheme of Feedforward Neural Networks,‖ Proc. Int‘l Joint Conf. Neural Networks (IJCNN ‘04), July 2004.

G.-B. Huang and C.-K. Siew, ―Extreme Learning Machine: RBF Network Case,‖ Proc. Eighth Int‘l Conf. Control, Automation, Robotics, and Vision (ICARCV ‘04), Dec. 2004.

G.-B. Huang and C.-K. Siew, ―Extreme Learning Machine with Randomly Assigned RBF Kernels,‖ Int‘l J. Information Technology, vol. 11, no. 1, 2005.

G.-B. Huang, Q.-Y. Zhu, K.Z. Mao, C.-K. Siew, P. Saratchandran, and N. Sundararajan, ―Can Threshold Networks Be Trained Directly?‖ IEEE Trans. Circuits and Systems II, vol. 53, no. 3, pp. 187-191, 2006.

M.-B. Li, G.-B. Huang, P. Saratchandran, and N. Sundararajan, ―Fully Complex Extreme Learning Machine,‖ Neurocomputing, vol. 68, pp. 306-314, 2005.

S. Ramaswamy, P. Tamayo, R. Rifkin, S. Mukherjee, C.-H. Yeang, M. Angelo, C. Ladd, M. Reich, E. Latulippe, J.P. Mesirov, T. Poggio, W. Gerald, M. Loda, E.S. Lander, and T.R. Golub, ―Multiclass Cancer Diagnosis Using Tumor Gene Expression Signatures,‖ Proc. Nat‘l Academy Sciences, USA, vol. 98, no. 26, pp. 15149-15154, 2002.

R. Linder, D. Dew, H. Sudhoff, D. Theegarten, K. Remberger, S.J. Poppl, and M. Wagner, ―The ‘Subsequent Artificial Neural Network‘ (SANN) Approach Might Bring More Classificatory Power to ANN-Based DNA Microarray Analyses,‖ Bioinformatics, vol. 20, no. 18, pp. 3544-3552, 2004.

O. Troyanskaya et al., ―Missing Value Estimation Methods for DNA Microarrays,‖ Bioinformatics, vol. 17, pp. 520-525, 2001.

M. West, C. Blanchette, H. Dressman, E. Huang, S. Ishida, R. Spang, H. Zuzan, J.A. Olson Jr., J.R. Marks, and J.R. Nevins, ―Predicting the Clinical Status of Human Breast Cancer by Using Gene Expression Profiles,‖ Proc. Nat‘l Academy of Sciences USA, vol. 98, pp. 11 462-11 467, 2001.


Refbacks

  • There are currently no refbacks.