Distributed Multivariate Regression Using Wavelet-based Collective Data Mining (1999)
| Venue: | Journal of Parallel and Distributed Computing |
| Citations: | 22 - 7 self |
BibTeX
@ARTICLE{Hershberger99distributedmultivariate,
author = {Daryl E. Hershberger and Hillol Kargupta},
title = {Distributed Multivariate Regression Using Wavelet-based Collective Data Mining},
journal = {Journal of Parallel and Distributed Computing},
year = {1999},
volume = {61},
pages = {2001}
}
Years of Citing Articles
OpenURL
Abstract
This paper presents a method for distributed multivariate regression using wavelet-based Collective Data Mining (CDM). The method seamlessly blends machine learning and information theory with the statistical methods employed in multivariate regression to provide an effective data mining technique for use in a distributed data and computation environment. Evaluation of the method in terms of model accuracy as a function of appropriateness of the selected wavelet function, relative number of non-linear cross-terms, and sample size demonstrates that accurate multivariate regression models can be generated from distributed, heterogeneous, data sets with minimal data communication overhead compared to that required to aggregate a centralized data set. Application of this method to Linear Discriminant Analysis, which is closely related to multivariate regression, produced classification results on the Iris data set that are comparable to those obtained with centralized data analysis. 1 Intr...







