Open Access Open Access  Restricted Access Subscription or Fee Access

SMO, IBK and Logistic Classifiers for Classification of Breast Cancer

P. Hamsagayathri, P. Sampath

Abstract


Breast cancer has become common cancer among various ages of women in world. The symptoms of breast cancer include the lumps, change in shape, change in color of skin, liquid oozing out from nipple. The occurrence of breast cancer is increasing every year by year and increased incidence is not surprising since there has been increase in numbers of women with high breast cancer risk factors, including lower age of menarche, late age of first pregnancy, fewer pregnancies, shorter or no periods of breastfeeding, later menopause, prolactin hormone levels, use of birth control pills and radiation exposure in youth. In order to reduce the mortality of breast cancer effective early classification detection techniques are required for long term survival of the women. Classification is one of the most commonly used tool analyze and classify the data. This paper analyzes the classifier algorithms such as SMO, Logistic and MLP for seer breast cancer dataset using WEKA software. The performance of the classifiers are evaluated against the parameters like classification accuracy, RMSE, TP Rate, FP Rate, Precision, Recall, F-Measure, ROC, Specificity, Sensitivity and so on. MLP Classifier is superior to SMO and Logistic and it has the highest classification accuracy of 93.5%.


Keywords


Classification, SMO, IBK, Logistic Regression, Perceptron, Accuracy, RMSE, Confusion Matrix.

Full Text:

PDF

References


Aruna S, Rajagopalan SP, Nandakishore LV, “Knowledge based analysis of various statistical tools in detecting breast cancer,” Computer Science & Information Technology, 37–45, 2011.

Vaidehi K and Subashini TS, “Breast tissue characterization using combined K-NN classifier,” Indian Journal of Science and Technology, 6-23, 2015

Williams K, Idowu PA, Balogun JA, and Oluwaranti A, “Breast cancer risk prediction using data mining classification techniques,” IEEE Transactions on Networks and Communications, Vol. 3, No. 2, 1–11, 2015.

http://seer.cancer.gov/popdata/popdic.html-SEER dictionary

T. M. Cover, “Geometrical and Statistical Properties of Systems of Linear with Applications in Pattern Recognition,” IEEE Transactions on Electronic Computers EC-14, pp. 326-334, 1965

Ramnath Takiar et al, “Projections of Number of Cancer Cases in India (2010-2020) by Cancer Groups,” Asian Pacific Journal of Cancer Prevention, Vol.11, 2010

R. W. Brause, “Medical analysis and diagnosis by neural networks,” Lecture notes In Computer Science, vol. 2199, pp. 1-13, 2001

Evanthia E. Tripoliti et al, “Automated Diagnosis of Diseases Based on Classification: Dynamic Determination of the Number of Trees in Random Forests Algorithm,” IEEE Transactions On Information Technology In Bio medicine,Vol. 16,No.4, July 2012

Xindog Wu, Vipin Kumar et al, “Top 10 Algorithms in Data Mining,” Knowledge and Information Systems, Vol. 14, 1-13, 2008

H.L. Chen, B. Yang, G. Wang, S.J. Wang, J. Liu, D.Y. Liu, “Support vector machine based diagnostic system for breast cancer using swarm intelligence,” J. Med. Syst. Vol.36 No.4,pp 2505–2519, 2012.

G. Salama, M.B. Abdelhalim, and Magdy Abd-elghany Zeid. Son,” Breast Cancer Diagnosis on Three Different Datasets Using Multi-classifiers,” International Journal of Computer and Information Technology, Vol. 01, pp. 36-43, September 2012

Sharma AK, Sahni S.A,”Comparative study of classification algorithms for spam email data analysis,” International Journal on Computer Science and Engineering, Vol. 3, No.5, 2011.

Kaur G, Chhabra A,” Improved J48 classification algorithm for the prediction of diabetes,” International Journal of Computer Applications, Vol.98, No.22, pp 7-13, 2014

Aloraini A,” Different machine learning algorithms for breast cancer diagnosis,” International Journal of Artificial Intelligence and Applications, Vol.3, No.6, pp 21-30, 2012

Saabith ALS, Elankovan S, Abu Bakar A, “Comparative study on different classification techniques for breast cancer dataset”, Vol.3, No.10, 2010

Poomani N, Porkodi. R, “A comparison of Data Mining classification algorithms using breast cancer microarray dataset: A study”, International Journal for Scientific Research and Development, Vol.2, No.12, 2015


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.