Open Access Open Access  Restricted Access Subscription or Fee Access

Effect of Feature Selection on Breast Cancer Data Classification using Neural Networks

Pradnya N. Kumbhar, Manisha P. Mali, Mohammad Atique

Abstract


Over the past few decades, extensive death of women due to breast cancer has precipitated the need for early detection of breast cancer. Early detection of breast cancer can help in taking right decisions for the appropriate treatment plan. Breast cancer classification has become vital task for researchers and scientists. Artificial Neural Networks have been widely used in the classification task. The paper uses Multilayer Perceptron Neural Network for classifying benign and malignant tumors. One of the major challenges of classification is classifier accuracy, which can be improved using Feature Selection. Feature Selection is a strategy that aims at making classifiers more accurate and efficient by determining optimal feature subset among full set of features. The paper focuses on effect of feature selection strategies such as Correlation based Feature selection (CFS), Relief-F algorithm and Wrapper methods such as Genetic search (GA), Particle Swarm Optimization (PSO), Best First Search (BFS) and Linear Forward Selection on Breast cancer classification. The results show increased classification accuracy after applying Feature Selection. Genetic search gave more promising results in terms of accuracy as well as time required for classification compared to other strategies. Correlation coefficient, Mean Absolute Error, Root Mean Squared error, Relative Absolute Error and Root Relative Squared Error are the evaluation metrics used.

Keywords


Breast Cancer Classification, Feature Selection, Feature Selection Strategies, Neural Network.

Full Text:

PDF

References


Salama, G. I., Abdelhalim, M. B. Zeid, “Breast Cancer diagnosis on three different datasets using multi-classifiers,” International Journal of Computer and Information Technology, 2012.

http://www.breastcancer.org/symptoms/understand_bc/statistics

http://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/breast-cancer.

E. Davies, “Breast cancer”, Medicine,Elsevier, vol. 44, Issue 1, January 2016.

A. Karegowda, M. Jayaram, A.S. Manjunath, Feature Subset Selection Problem using Wrapper Approach in Supervised Learning, International Journal of Computer Applications 1(7):13–17, February 2010.

J. Han and M. Kamber, “Data Mining; Concepts and Techniques, Morgan Kaufmann Publishers”, 2000.

H. Thazin, T. Thein, K. Tun, “An approach for breast cancer diagnosis classification using neural network,” Advanced Computing: An International Journal (ACIJ), Vol.6, No.1, January 2015.

A. Abdel-Zaher , A. Eldeib, “Breast Cancer classification using deep belief networks,”Expert Systems with Applications, Elsevier, Oct 2015.

H. Asri, H. Mousannif,,H. Moatassime ,T. Noeld, “Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis,” The 6th International Symposium on Frontiers in Ambient and Mobile Systems, Elsevier, no.83, pp 1064-1069, 2016.

S. Sasikala, S. Appavu, S. Geetha, “A Novel Feature Selection Technique for Improved Survivability Diagnosis of Breast Cancer,” 2nd International Symposium on Big Data and Cloud Computing,2015.

M. Pawar, S. Talbar, “Genetic Fuzzy System (GFS) based Wavelet Co-occurrence Feature selection in Mammogram Classification for Breast Cancer Diagnosis,” Elsevier, April 2016.

U. Mazlan, P. Saad, “Classification of breast cancer microarray data using Radial Basis Function Network”, IEEE, 2012.

G. Anjan Babu, S. Bhukya, R. Sendhil,“ Feed forward network with back propagation algorithm for detection of breast cancer,” 8th International Conference on Computer Science & Education (ICCSE), IEEE, 2013.

S. Ali, A. Majid, “Can–Evo–Ens: Classifier stacking based evolutionary ensemble system for prediction of human breast cancer using amino acid sequences”, Journal of Biomedical Informatics, Elsevier, Jan 2015.

G. Salama, M. Abdelhalim, M. Zeid, “Breast Cancer Diagnosis on Three Different Datasets Using Multi-Classifiers,” International Journal of Computer and Information Technology (2277 – 0764), Vol. 01, Issue. 01, September 2012.

A. Saabith, E. Sundararajan, A. Bakar, “Comparative study on different classification techniques for breast cancer dataset,” International Journal of Computer Science and Mobile Computing, Vol. 3, Issue. 10, October 2014.

P.Sivagami, “Supervised learning approach for Breast cancer classification,” International Journal of Emerging Trends & Technology in Computer Science, Volume 1, Issue 4, November – December 2012.

M. Boisserie-Lacroix, G. Hurtevent-Labrot, S. Ferron, N. Lippa, H. Bonnefoi, G. Mac Grogan, “Correlation between imaging and molecular classification of breast cancers ,” Diagnostic and Interventional Imaging, vol. 94, issue 11, November 2013

D. Soria, J. Garibaldi, A. Green, D. Powe, C. Christopher, Nolan, C. Lemetre, G. Ball, I. Ellis, “A quantifier-based fuzzy classification system for breast cancer patients,” Artificial Intelligence in Medicine, vol. 58, issue 3, July 2013.

S.W. Lam, C.R. Jimenez, E. Boven, “Breast cancer classification by proteomic technologies: Current state of knowledge,” Cancer Treatment Reviews, Volume 40, Issue 1, February 2014.

A. Leo, G. Curigliano, V. Diéras, Luca Malorni, C. Sotiriou, C. Swanton, A. Thompson, A. Tutt, M. Piccart, “New approaches for improving outcomes in breast cancer in Europe,” The Breast, Elsevier, vol. 24, Issue 4, August 2015.

R. Jones, A. Constantinidou, J. Reis-Filho, “Molecular Classification of Breast Cancer,” Surgical Pathology Clinics,Elsevier, Volume 5, Issue 3, September 2012.

G. Schaefer, T. Nakashima “Hybrid cost-sensitive fuzzy classification for breast cancer diagnosis,” Annual International Conference of the IEEE Engineering in Medicine and Biology, 2010.

F. Keivanfard, M. Teshnehlab, M. Aliyari Shoorehdeli , K. Nie “Feature selection and classification of breast cancer on dynamic Magnetic Resonance Imaging by using artificial neural networks,” Conference on Biomedical Engineering, IEEE, Nov. 2010.

V. Chaurasia and S. Pal, “Data Mining Techniques: To Predict and Resolve Breast Cancer Survivability,” vol. 3, no. 1, pp. 10–22, 2014.

W. Version, “Machine Learning with WEKA,” 2004.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.