Open Access Open Access  Restricted Access Subscription or Fee Access

A Survey in Health Care Data Using Data Mining Techniques

R. Karthiyayini, Dr. R. Balasubramanian


Data mining is the process of analyzing the enormous set of data. Data mining techniques have been used in healthcare research and known to be effective. Medical data has much information that needs to be exploited in order to get intelligence on medical events. Medical information is various in range and very large in content and its size is voluminous that conventional diagnostic technique disclose very little of the potential conclusion. Medical data mining can help to obtain latent patterns or actionable knowledge. It plays a significant function can spot trends and anomalies in their data in healthcare organization and disclose invaluable knowledge which in turn more useful for the healthcare professionals for decision making. In this paper we survey the effectiveness of diverse techniques in data mining such as classification, clustering, association, regression. These techniques can be applied to medical data to recognize trends and profiles hidden in mounds of data which may be essential to effective treatment for patients, management of healthcare organization and clinical feature of healthcare. This survey also highlights healthcare domain, requisite of data mining in Medicare field, algorithms used in today’s healthcare domains.


Data Mining, Contemporary Data Mining Techniques, Medical Data Mining.

Full Text:




Jyothi Soni, “Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction”

D. Hand, H. Mannila and P. Smyth, “Principles of data mining”, MIT, (2001).

H. C. Koh and G. Tan, “Data Mining Application in healthcare”, Journal of Healthcare Information Management, vol. 19, no. 2, (2005).

G. Beller, “The rising cost of health care in the United States: is it making the United States globally noncompetitive?” J. Nucl. Cardiol., vol. 15, no. 4, (2008), pp. 481-482.

T. H. A. Soliman, A. A. Sewissy and H. A. Latif, “A Gene Selection Approach for Classifying Diseases Based on Microarray Datasets”, 2nd International Conference on Computer Technology and Development (lCCTD 2010), (2010).

Manjusha K. K, K. Sankaranarayanan and Seena P, “Prediction of Different Dermatological Conditions Using Naïve Bayesian Classification”, International Journal of Advanced Research in Computer Science and Software Engineering 4(1), January - 2014, pp. 864-868.

R. Bhuvaneswari and K. Kalaiselvi, “Naive Bayesian Classification Approach in Healthcare Applications”, International Journal of Computer Science and Telecommunications”, Volume 3, Issue 1, January 2012.

Han Jiawei, MichelineKamber, “Data Mining: Concepts and Technique”, Morgan Kaufmann Publishers, 2000.

Jie Cheng and Russell Greiner, “Comparing Bayesian Network Classifiers”, University of Alberta Edmonton, Canada.

Zurada J.M., “An introduction to artificial neural networks systems”, St. Paul: West Publishing (1992).

M. Silver, T. Sakara, H. C. Su, C. Herman, S. B. Dolins and M. J. O’shea, “Case study: how to apply data mining techniques in a healthcare data warehouse”, Healthc. Inf. Manage, vol. 15, no. 2, (2001), pp. 155-164.

J. Han and M. Kamber, “Data mining: concepts and techniques”, 2nd ed. The Morgan Kaufmann Series, (2006).

L. Lenert, A. Lin, R. Olshen and C. Sugar, “Clustering in the Service of the Public's Health”, http:/ / /~olshen/manuscripts/helsinki.PDF.

T. Balasubramanian and R. Umarani, “An Analysis on the Impact of Fluoride in Human Health (Dental) using Clustering Data mining Technique”, Proceedings of the International Conference on Pattern Recognition, Informatics and Medical Engineering, (2012) March 21-23.

J. J. Tapia, E. Morett and E. E. Vallejo, “A Clustering Genetic Algorithm for Genomic Data Mining”, Foundations of Computational Intelligence, vol. 4 Studies in Computational Intelligence, vol. 204, (2009), pp. 249-275.

Z. Liu, T. Sokka, K. Maas, N. J. Olsen and T. M. Aune, “Prediction of Disease Severity in Patients with Early Rheumatoid Arthritis by Gene Expression Profiling”, Human Genomics and Proteomics, (2009).

Belciug, F. Gorunescu, A. Salem and M. Gorunescu, “Clustering-based approach for detecting breast cancer recurrence”, 10th International Conference on Intelligent Systems Design and Applications, (2010)

S. Belciug, “Patients length of stay grouping using the hierarchical clustering algorithm”, Annals of University of Craiova, Math. Comp. Sci. Ser., ISSN: 1223-6934, vol. 36, no. 2, (2009), pp. 79-84.

T. S. Chen, T. H. Tsai, Y. T. Chen, C. C. Lin, R. C. Chen, S. Y. Li and H. Y. Chen, “A Combined K-Means and Hierarchical Clustering Method for improving the Clustering Efficiency of Microarray”, Proceedings of 2005 International Symposium on Intelligent Signal Processing and Communication Systems, (2005).

M. E. Celebi, Y. A. Aslandogan and R. P. Bergstresser, “Mining Biomedical Images with Density-based Clustering”, Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC’05), (2005).

S. Soni and O. P. Vyas, “Using Associative Classifiers for Predictive Analysis in Health Care Data Mining”, International Journal of Computer Applications (0975 – 8887), vol. 4, no. 5, (2010) July.

J. Yanqing, H. Ying, J. Tran, P. Dews, A. Mansour and R. Michael Massanari, “Mining Infrequent Causal Associations in Electronic Health Databases”, 11th IEEE International Conference on Data Mining Workshops, (2011).

Carlos Ordonez, “Improving Heart Disease Prediction Using Constrained Association Rules”

A. Bakar, Z. Kefli, S. Abdullah and M. Sahani, “Predictive Models for Dengue Outbreak Using Multiple Rulebase Classifiers”, 2011 International Conference on Electrical Engineering and Informatics, Bandung, Indonesia, (2011) July 17-19.

Maria-Luiza Antonie et al., “Application of Data Mining Techniques for Medical Image Classification”

Mohammad Mudassar Khan and Prof. AnandRajavat,“An Efficient Algorithm for Extracting Frequent Item Sets from a Data Set”, International Journal of Advanced Research in Computer Science and Software Engineering 3(6), June - 2013, pp. 1373-1375.

Arun K. Pujari, “Data Mining Techniques” Edition 2001.

Han Jiawei, MichelineKamber, Data Mining:Concepts and Technique. Morgan Kaufmannl Publishers, 2000

MsShweta and Dr. KanwalGarg, “Mining Efficient Association Rules Through Apriori Algorithm Using Attributes and Comparative Analysis of Various Association Rule Algorithm”, In: Proceeding of IJARCSSE, ISSN 2277-128X, Vol. 3, Issue 6, June 2013.

M. Patil, R. C. Joshi and D. Toshniwal, “Association rule for classification of type -2 diabetic patients”, Second International Conference on Machine Learning and Computing, (2010).

R. Agrawal and R. Srikant, “Fast Algorithms for Mining Association Rules”, VLDB, Chile, ISBN 1-55860-153-8, (1994) September 12-15, pp. 487-99.

N. G. Noma and M. K. A. Ghani, “Discovering Pattern in Medical Audiology Data with FP-Growth Algorithm”, IEEE EMBS International Conference on Biomedical Engineering and Sciences, Langkawi, (2012) December 17-19.

J. Fox, “Applied Regression Analysis, Linear Models, and Related Methods”, (1997).

Divya and S. Agarwal, “Weighted Support Vector Regression approach for Remote Healthcare monitoring”, IEEE-International Conference on Recent Trends in Information Technology, ICRTIT 2011, 978-1-4577-0590-8/11/$26.00 ©2011 IEEE MIT, Anna University, Chennai, (2011) June 3-5.

P. A. Gutiérrez, C. Hervás-Martínez and F. J. Martínez-Estudillo, “Logistic Regression by Means of Evolutionary Radial Basis Function Neural Networks”, IEEE Transactions on Neural Networks, vol. 22, no. 2, (2011), pp. 246-263.


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.