Improving Simple Bayes (1997)
| Citations: | 45 - 1 self |
BibTeX
@MISC{Kohavi97improvingsimple,
author = {Ron Kohavi and Barry Becker and Dan Sommerfield},
title = {Improving Simple Bayes},
year = {1997}
}
Years of Citing Articles
OpenURL
Abstract
The simple Bayesian classifier (SBC), sometimes called Naive-Bayes, is built based on a conditional independence model of each attribute given the class. The model was previously shown to be surprisingly robust to obvious violations of this independence assumption, yielding accurate classification models even when there are clear conditional dependencies. We examine different approaches for handling unknowns and zero counts when estimating probabilities. Large scale experiments on 37 datasets were conducted to determine the effects of these approaches and several interesting insights are given, including a new variant of the Laplace estimator that outperforms other methods for dealing with zero counts. Using the bias-variance decomposition [15, 10], we show that while the SBC has performed well on common benchmark datasets, its accuracy will not scale up as the dataset sizes grow. Even with these limitations in mind, the SBC can serve as an excellenttool for initial exp...







