Looking for Lumps: Boosting and Bagging for Density Estimation
| Citations: | 4 - 0 self |
BibTeX
@MISC{Ridgeway_lookingfor,
author = {Greg Ridgeway},
title = {Looking for Lumps: Boosting and Bagging for Density Estimation},
year = {}
}
OpenURL
Abstract
The solution to data mining problems often involves discovering non-linear relationships in large, noisy datasets. Bagging, boosting, and their variations have produced an interesting new class of techniques for finding these relationships in prediction problems. In this paper I extend these methods to the design of algorithms for density estimation for large, noisy, high dimensional datasets. Analogous to the boosting framework, the algorithms iteratively mix the current density estimator with an additional density chosen in a greedy fashion to optimize a fit criterion. A bagging step helps to control overfitting by providing better estimates of the fit criterion. I derive optimization algorithms for the boosting steps, discuss strategies for massive datasets, and show results from real and simulated problems.







