Enhancing Data Analysis with Noise Removal
Cached
Download Links
- [www-users.cs.umn.edu]
- [cimic.rutgers.edu]
- DBLP
Other Repositories/Bibliography
| Citations: | 11 - 4 self |
BibTeX
@MISC{Xiong_enhancingdata,
author = {Hui Xiong and Gaurav Pandey and Michael Steinbach and Vipin Kumar},
title = {Enhancing Data Analysis with Noise Removal},
year = {}
}
OpenURL
Abstract
Removing objects that are noise is an important goal of data cleaning as noise hinders most types of data analysis. Most existing data cleaning methods focus on removing noise that is the result of low-level data errors that result from an imperfect data collection process, but data objects that are irrelevant or only weakly relevant can also significantly hinder data analysis. Thus, if the goal is to enhance the data analysis as much as possible, these objects should also be considered as noise, at least with respect to the underlying analysis. Consequently, there is a need for data cleaning techniques that remove both types of noise. Because data sets can contain large amount of noise, these techniques also need to be able to discard a potentially large fraction of the data. This paper explores four techniques intended for noise removal to enhance data analysis in the presence of high noise levels. Three of







