Open Access Open Access  Restricted Access Subscription or Fee Access

Preprocessing and Generation of Association Rules for Bone Marrow Analysis Data of Hematology using Abnormal Attribute Values

D. Minnie, S. Srinivasan

Abstract


Clinical Pathology uses laboratory tests on body fluids such as blood and urine to diagnose diseases. Haematologyis the study of blood and blood forming organs such as bone marrow. In this paper we analyze the components of the bone marrow and the structure of the bone marrow analysis database. The Knowledge Discovery in Databases (KDD) steps arebriefly explained.18,000 bone marrow analysis records are collected from a reputed Hospital and this raw data is transformed into a preprocessed data using the pre-processing phases of KDD such as Data Cleaning, Data Selection and Data Transformation. The eliminate_the_tuple technique is used to clean the data. The attributes related to the bone marrow components are selected. The ranges of low, high and normal values for the individual attributes are used to transform the data. The Data Mining techniques are studied and theapriori algorithm is selected for finding frequent itemsets that are used for the generation of association rules. The transformed bone marrow data with low values is used to generate associations between the attributes of the bone marrow dataset.

Keywords


Association Rule Mining, Bone Marrow Analysis, Haematology, Knowledge Discovery In Databases

Full Text:

PDF

References


Jaiwei Han, Michelinne Kamber, Data Mining : Concepts and Techniques, 2nd Edition, Morgan Kaufmann Publishers, 2006

Margaret H.Dunham, Data Mining: Introductory and advanced topics”, Pearson Education, 2007.

Rakesh Agrawal, T. Imielinski, A. Swami, “Database Mining: A performance perspective”, IEEE Transactions on Knowledge and Data Engineering, Volume 5 Issue 6, December 1993, pp. 914 – 925.

Rakesh Agrawal, T. Imielinski, A. Swami, “Mining associations between sets of items in large databases”, Proc. of the ACM-SIGMOD 1993 Int'l Conference on Management of Data, Washington D.C., May 1993, pp. 207 - 216.

Ramakrishnan Srikant and Rakesh Agrawal, “Mining generalized association rules”, Proceedings of the 21st International Conference on Very Large Data Bases, Zurich, Swizerland, September 1995

Rakesh Agrawal and Ramakrishnan Srikant, “Fast algorithms for mining association rules”, Proceedings of the 20th International Conference on Very Large Data Bases, Santiago, Chile, September 1994

Patricia Cerrito, John C. Cerrito, “Data and Text Mining the Electronic Medical Record to improve care and to lower costs”, Proceedings of SUGI 31, March 26 – 29, 2006 paper 077-31, 2006

Cios KJ, Moore GW, “Uniqueness of medical data mining”, Artificial Intelligence in Medicine, 2002 Sep-Oct; 26(1-2): 2002, pp. 1- 24.

Karen Quillen and Kate Murphy, “Quality improvement to decrease specimen mislabeling in transfusion medicine”, Archives of Pathology and Laboratory Medicine, Vol 130, August 2006, pp. 1196 - 1198.

Dale J. Duca, “Auto verification in a laboratory information system”, Laboratory Medicine, January 2002, number 1, Volume 33, pp. 21 – 25.

Alp Aslandogan Y. and Gauri A.Mahajani, “Evidence combination in medical data mining”, Proceedings of the International Conference on

Information Technology: Coding and Computing (ITCC’04), Volume 2, 2004, pp. 465 – 469

Massoud Toussi, Jean-Baptiste Lamy, Philippe Le Toumelin, and Alain Venot, “Using data mining techniques to explore physicians' therapeutic decisions when clinical guidelines do not provide recommendations: methods and example for type 2 diabetes”, BMC Medical Informatics and Decision Making 2009; pp. 9:28

Sengul Dogan and Ibrahim Turkoglu.“Diagnosing Hyperlipidemia using association rules”, Mathematical and Computational Applications, Association for Scientific Research, Vol.13,No. 3, 2008, pp. 193-202

Dion H.Goh and Rebecca P.Ang. “An introduction to association rule mining: an application in counseling and help-seeking behavior of adolescents”. Behaviour Research Methods, 39(2), 2007, pp. 259-266.

Jiuyong Li, Ada Wai-chee Fu and Hongxing He Et. Al, “Mining risk patterns in medical data”, KDD’05, Chicago, Illinois, USA, 2005, pp. 770 – 775.

Michael Goebel, Le Gruenwald, “A survey of data mining and knowledge discovery software tools”, SIGKDD Explorations, ACM SIGKDD, June 1999.

Minnie D, Srinivasan S, “Application of knowledge discovery in database to blood cell counter data to improve quality control in clinical pathology”, Proceedings of BIC-TA 2011, September 2011, pp 338 – 342

Minnie D, Srinivasan S, “Preprocessing and generation of association rules for automated blood cell counter data in haematology”, Proceedings of RACSS 2012, April 2012, pp 27 – 32.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.