Open Access Open Access  Restricted Access Subscription or Fee Access

Performance Analysis of Supervised Machine Learning Algorithms for Insulin Deficiency Disorder

T. Rubya, N. Radha, S. Shantha Kumari

Abstract


Data mining techniques are used to manipulate the existing a large volume of data. Biomedical datasets pretense an elite challenge for machine learning and data mining techniques in order to extract accurate, understandable and hidden knowledge. This paper extensively investigates the role of a biomedical dataset (diabetes dataset) has been taken for finding the classification accuracy of that disease. Diabetes is a common disorder caused by a lack of insulin or insulin resistance leading to impaired glucose metabolism. Glucose (a simple sugar) is the key component used by the body’s cells to make energy. In order for glucose to move from the blood into most cells insulin is required (exceptions are the brain and exercising muscles). Insulin is a hormone made in the cells of the pancreas and is secreted into the blood in response to the blood glucose level increasing. According to that the whole dataset were classified to predict the accuracy of disease. The data are retrieved from the real time environmental setup especially it consists of information about glucose, cholesterol and insulin secretion level in blood. These are all considered as attributes. Machine learning provides methods techniques and tools, which help to learn automatically and to make accurate predictions based on past observations. The data mining classification algorithms of Weka were used to classify the data. The predicted values of the classifiers were evaluated using Weka tool with respect to the specific algorithms. Comparison has been made among various algorithms from three different categories of classifiers such that Function, Rule, and Meta. In this paper the conclusion is made as to infer which algorithm provides the maximum accuracy of the disease.

Keywords


Accuracy Prediction, Data Classification, Insulin Deficiency, Machine Learning Techniques, and Weka Classifiers.

Full Text:

PDF

References


Tulai,A.F.,Oppacher, F., Carmody,.G. “GeneticsBased System For Genetic and General Data Classification”, Canada, pp 2-6.

Tan, Aik Choon and David Gilbert, Ensemble Machine Learning On Gene Expression Data For Cancer Classification, Glasgow, UK, pp 2,5-6.

Tulai.A., Oppacher, F., “Multiple Species Weighted Voting – a Genetic-Based Machine Learning System”, Genetic and Evolutionary Computation Conference, Seattle, USA, 1263-1274.

Tulai, A., Oppacher, F., Maintaining Diversity and Increasing the Accuracy the Accuracy of Classification Rules through Automatic Speciation. Congress of Evolutionary Computation,Portland, USA, 2241-2248.

Murphy P.M. and Aha.D.W. (1994), “UCI Repository of Machine Learning”, University of California, Department of Information and Computer Science, (1994), USA.

Aha, D., Kibler, D., Albert, M., Stancebased Learning algorithms, Machine Learning Vol. 6, 37-66, 1991.

Inza I., Larranaga P. and Sierra B., ” Feature Subset Selection by Bayesian Networks: A Comparison with Genetic and Sequential Algorithms”, International Journal of Approximate Reasoning 27, pp143-164, 2001.

Jourdan L., Dhaenens C. and Talbi E., ”A Genetic Algorithm for Feature Subset Selection in Data-Mining for Genetics”, MIC 2001 Proceedings, 4th Metaheuristics International Conference, 29-34, 2001.

Inza I., Merino M., Larranaga P., Quiroga J., Sierra B. and Girala M.(2001a), Feature Subset Selection by Genetic Algorithms and Estimation of Distribution Algorithms - A Case Study in the Survival of Cirrhotic Patients Treated with TIPS, Artificial Intelligence In Medicine 23 pp 187-205.

EfroymsonM.A.“Multiple Regression Analysis”, “Mathematical Methods for Digital Computers (Ralston, A. and Wilf, H.S., ed.)”, Vol.1, Wiley, New York, 1960.

Dzeroski, S., Zenko, B. Is Combining Classifiers with Stacking Better than Selecting the Best One?. Machine Learning 54 (2004) 255–273

Brazdil, P.B., Soares, C., Da Costa, J.P.: Ranking Learning Algorithms: Using IBL and Meta-Learning on Accuracy and Time Results. Machine Learning 50 (2003) 251-277

Pfahringer, B., Bensusan, H., Giraud-Carrier, C.: Meta-learning by landmarking various learning algorithms. In: International Conference on Machine Learning, (2000).

Kalousis, A., Theoharis, T., “ NDesign, Iimplementation and performance results of an intelligent assistant for classifier selection”, In: Intelligent Data Analysis, (1999).

Chen S., “Nonlinear time series modeling and prediction using Gaussian RBF networks with enhanced clustering and RLS learning”, Inst. Elect. Eng. Electron. Lett.31:17– 118, 1995.

Ian H. Witten, Eibe Frank, Len Trigg, Mark Hall, Geoffrey Holmes, Sally Jo Cunningham, “WEKA: Practical Machine Learning Tools and Techniques with Java Implementations,””.Introduction to Machine Learning and Data Mining”, Peng Du, Wenxiang Yao H. Poor

Wei, W., Zhang, Q., Wang, M.: A method of vehicle classification using models and neural networks. IEEE Vehicular Technology Conference, IEEE, 2001.

Yoshida, T., Mohottala, S., Kagesawa, M., Ikeuchi, K.: Vehicle Classification Systems with Local-Feature Based Algorithm using CG Model Images. IEICE Trans., Vol. E00-A, No. 12, December 2002.

Gupte, S., Masoud, O., Martin, R.F.K. Papanikolopoulos, N.P.: Detection and Classifica-tion.

Jiawei Han and Micheline Kamber (2001). Data Mining: concepts and techniques. Academic Press,San Diego, California.

Data Mining: Practical Machine Learning Tools and Techniques with JAVA Implementation, by I. H. Witten and E. Frank, Morgan Kanfmann Publishers, 2000

Introduction to Machine Learning and Data Mining: Peng Du, Wenxiang Yao.

P. O. Bobbie, C.-Z., Arif, H. Chauhdari,Homecare Telemedicine Analysis and Diagnosis of Tachycardia Condition in an M8051 Microcontroller, 2nd IEEE-EMBS International Summer School and Symposium on Medical Devices and Biosensors (ISSSMDBS), Hong Kong, June 25- July 2, 2004, (CD-Volume-ISBN 0-7803- 8613-2).

U. M. Fayyad, G. Piatetsky-Shapiro, P.Smyth, and R. G. R. Uthurusamy, Advances inKnowledge Discovery and Data Mining, AAAI Press / The MIT Press, Menlo Park, CA. 1996.

G. Piatetsky-Shapiro and W. J. Frawley, Knowledge Discovery in Databases, AAAI Press, Menlo Park, CA, 1991.

D. Michie, Methodologies from Machine Learning in Data Analysis and Software, Computer Journal, Vol. 34, No. 6, 1991, pp. 559-565.

M. Pazzani and D. Kibler, The Utility of Knowledge in Inductive Learning, Machine Learning, Vol. 9, No. 1, 1992, pp. 57-94.

G. Bortolan and W. Pedrycz, An Interactive framework for an analysis of ECG signals, Artificial Intelligence in Medicine, Vol. 24, 2002, pp. 109-132.

S. Palu, The Use of Java in Machine Learning, December 19, 2002, www.developer.com/java/other/article.php/1559871

J. de la Calleja and O. Fuentes, Machine learning and image analysis for morphological galaxy classification, Monthly Notices of the Royal Astronomical Society , Vol. 349, 2004, pp. 87-93.

M. Embrechts, B. Szymanski, K. Sternickel, T. Naenna, and R. Bragaspathi , Use of Machine Learning for Classification of Magnetocardiograms, Proc. IEEE Conference on System, Man and Cybernetics, Washington DC, October 2003, pp. 1400-1405.

H. Witten and E. Frank, Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann Publishers, San Francisco, CA, 2000.

UCI Machine Learning Repository http://www.ics.uci.edu/ ~mlearn /MLRepository.Html

N. Kerdprasop, and K. Kerdpraso, “Moving data mining tools toward a business intelligence system”, Enformatika;, vol. 19, pp. 117-122, 2007.

Falquet, L., Pagni, M., Bucher, P., Hulo, N., Sigrist, C.J., Hofmann, K., Bairoch, A.: The PROSITE database, its status in 2002. Nucleic Acids Res. 30 (2002) 235–238

Bateman, A., Birney, E., Durbin, R., Eddy, S.R., Howem, K.L., Sonnhammer, E.L.L.: The Pfam protein families database. Nucleic Acids Res. 28 (2000) 263–266

T.Rubya received her B.Sc (Computer Science) from Mother Theresa University in 2006 and MCA from P S G R krishnammal College for women and 2009 respectively and she is doing her M.Phil(Computer Science) in PSGR Krishnammal College for women.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.