Incremental Discretization for Naïve Bayes Learning with Optimum Binning
Abstract
Incremental Flexible Frequency Discretization (IFFD)
is a recently proposed discretization approach for Naïve Bayes (NB).IFFD performs satisfactory by setting the minimal interval frequency for discretized intervals as a fixed number. In this paper, we first argue that this setting cannot guarantee that the selecting MinBinSize is on always optimal for all the different datasets. So the performance of Naïve Bayes is not good in terms of classification error. We thus proposed a sequential search method for NB: named Optimum Binning. Experiments were conducted on 4 datasets from UCI machine learning repository and performance was compared between NB trained on the data discretized by OB, IFFD, and PKID.
Keywords
Full Text:
PDFReferences
HAN & KAMBER “Data mining - concept and techniques”
Ian H. Witten & Eibe Data mining - practical machine learning tools and
techniques.
Pat Langley, Wayne IBA and Kevin Thompson. “An analysis of Bayesian
Classifiers”, Tenth national conference on AI. [Page no. 223-228].1992.
Harry Z, Charles L”A Fundamental issue in Naïve Bayes” computer
science, university of new bunswick. [Page no. 1-5]
Geoffrey Webb, Ying Yang “A Comparative Study of Discretization
Methods for naïve bayes classification “in proceeding of PKAW. [Page
no. 159-173]. 2002.
Ying.Yang, Geoffery Webb “Proportional k-Interval Discretization for
Naive-Bayes Classifiers” ECML. [Page no. 564-575] 2001.
Ying Yang, Geoffrey Webb “Weighted Proportional k-Interval
Discretization for Naive-Bayes Classifiers” In proceedings PAKDD.
[Page no. 501-512] 2003.
Carlos Pinto “Partition incremental discretization” in proceedings IEEE.
[Page no. 168-174] 2005.
Geoffrey Webb, Ying Yang “Discretization for Naïve bayes learning:
managing Bias and variance” Machin learning 74(1). [Page no. 39-74]
Jingali LU, Ying Yang and Geoffrey Webb. “Incremental Discretization
for Naïve-Bayes Classifier.” ADMA. [Page no. 223-238].2006
Ying Yang “Discretization for Naïve bayes learning” Ph.D.Thesis,
School of Computer Science and Software Engineering Monash Uni,
Australia.2006.
I. Rish,“An empirical study of the naïve Bayesian classifier.” T.J. Watson
Research Center, IBM.
Y.Yang “On why discretization works for naïve bayes classifier” School
of Computer Science and Software Engineering Monash University,
Australia.
Tony R Martinez “An empirical comparison of discretization methods”
Computer Science Department, Brigham Young University, Provo.
Liu, H., Hussain, F., Tan, C.L., Dash, M.: Discretization: An enabling
technique. Data Mining & Knowledge Discovery 6(4), [page no.
–423] 2002.
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution 3.0 License.