Open Access Open Access  Restricted Access Subscription or Fee Access

Mushroom Plant Analysis through Reduct Technique

Ayesha Butalia

Abstract


The issues of Real World are Very large data sets,Mixed types of data (continuous valued, symbolic data), Uncertainty (noisy data), Incompleteness (missing, incomplete data), Data change, Use of background knowledge etc. Lot of knowledge related to the application can be generated through these large data sets. Rough set is the methodology which can be used to deduce rules from these data sets. The main goal of the rough set analysis is induction of approximations of concepts [4]. Rough sets constitute a sound basis for KDD. It offers mathematical tools to discover patterns hidden in data [4] and hence used in the field of data mining. Rough Sets does not require any preliminary information as Fuzzy sets require membership values or probability is required in statistics. Hence this is its specialty. Two novel algorithms to find optimal Reducts of condition attributes based on the relative atttribute dependency, out of which the first algorithms gives simple Reduct whereas the second one gives the Reduct with minimum attributes. This project highlights on the case study of mushroom which consists of twenty two attributes depending on which the decision is taken whether the mushroom plant is edible or poisonous. The technique of Reduct is very useful as when tested, through the algorithms, the twenty one attributes, excluding the decision attribute gets reduced to two to three attributes


Keywords


Data mining, rough sets, reducts, mushroom.

Full Text:

PDF

References


Almuallim H., Dietterich, T., G., Learning Boolean concepts in the presence of many irrelevant features, Artificial Intelligence, Vol. 69(1-2), pp 279-305, 1994.

B lake, C. L. and Merz, C. J. (1998). UCI Repository of machine learning databases.

[http://www.ics.uci.edu/kmlearn/MLRepository.html]. Irvine, CA: University of California, Department of Information and Computer Science.

Han, J., Hu, X., and Lin T. Y., A New Computation Modelfor Rough Set Theory Based on Database Systems, 5th International Conference on Data Warehousing and Knowledge Discovery, Lecture Notes in Computer Science 2737, pp. 381 - 390, 2003.

Han, J., Hu, X., and Lin T. Y., Feature Subset Selection Based on Relative Dependency Between Attributes, 4h International Conference on Rough Sets and Current Trends in Computing, Lecture Notes in Computer Conference on Artificial Intelligence (AAAI), pp 129-134,1992.

Liu, H. and Setiono, R., Chi2: Feature Selection and Discretization of Numeric Attributes, 7t IEEE International Conference on Tools with Artificial Intelligence, 1995.

Modrzejewski, M., Feature Selection Using Rough Sets Theory, European Conference on Machine Learning, pp.213-226, 1993.

Pawlak, Z., Rough Sets, International Journal of Information and Computer Science, 11(5), pp.341-356, 1982.

Pawlak, Z., Rough Sets: Theoretical Aspects of Reasoning About Data,Kluwer, Academic Publishers, 1991.

Quafafou, M. and Boussouf, M., Generalized Rough Sets Based Feature Selection, Intelligent Data Analysis, Vol. 4, pp. 3-17, 2000.

Quinlan, J.R., C4.5: Programs for Machine Learning, Morgan Kaufmann, 1993.

Ramadevi, Yellasiri, C.R.Rao,Vivekchan Reddy, Decision Tree Induction Using Rough Set Theory – Comparative Study in Journal of Theoretical and Applied Information Technology, 2005 - 2007 JATIT.

Sever, H., Raghavan, V., and Johnsten, D. T., The Status of Research on Rough Sets for Knowledge Discovery in Databases, 2nd International Conference on Nonlinear Problems in Aviation and Aerospace, Vol. 2,pp. 673-680, 1998.

Jun Wu, Anastasiya olesnikova, Chi-Hwa Song, Won Don Lee, The Development and Application of Decision Tree for Agriculture Data in Second International Symposium on Intelligent Information Technology and Security Informatics, IEEE Computer Society, 2009.

Deng-Yiv Chiu, Ya-Chen Pan and Wen-Chih Chang, Using Rough set Theory to Construct E-Learning FAQ Retrieval Infrastructure in IEEE explore 2008.

Yang Liu, M l B d M Multi-Agent Based Multi-Knowledge Acquisition Method for Rough Set, (PhD thesis) Blekinge Institute of Technology, Sweden Xi’an Jiaotong University, P. R China 2008.

Ramadevi YELLASIRI ,C.R.RAO, Hari RAMAKRISHNA and Prathima T. Reduct Based Decision Tree (RDT) in IJCSES International Journal of Computer Sciences and Engineering Systems,Vol.2, No.4, October 2008 CSES International 2008 ISSN 0973-4406 Manuscript received May 25, 2008. Manuscript revised August 15,2008.

Hyrudaya Ku. Tripathy, B.K.Tripathy, A Rough Set Approach for Clustering the Data Using Knowledge Discovery in World Wide Web for E-Business in IEEE International Conference on e-Business,Engineering 2007.

Qiang Shen_ Richard Jensen, Rough Sets, their Extensions and Application International Journal of Automation and Computing, Jan 2007.


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.