Results 1 - 10 of 712

Handling of data containing outliers

by Wolfram Stacklies, Henning Redestig, 2010
"... 1 PCA robust to outliers. Apart from often showing missing values, microarray or metabolite data are often corrupted with extreme values (outliers). Standard SVD is highly susceptible to outliers. In the extreme case, an individual data point, if sufficiently outlying, can draw even the leading princi ..."

Suboptimal LULU-estimators in Measurements Containing Outliers

by Stefan Ludwig Astl, Prof. Hans C. Eggers, Dr. Carl H. Rohwer, 2013
"... By submitting this thesis electronically, I declare that the entirety of the work contained therein is my own, original work, that I am the sole author thereof (save to the extent explicitly otherwise stated), that reproduction and publication thereof by Stellenbosch University will not infringe any ..."

Regression Analysis for Data Containing Outliers and High Leverage Points

by Asim Kumer Dey, Md. Amir Hossain, Kumer Pial Das
"... The strong impact of outliers and leverage points on the ordinary least squares (OLS) regression estimator has been studied for a long time. Situations in which a relatively small percentage of the data has a significant impact on the model may not be acceptable to the user of the model. A vast literature ..."

Efficient Algorithms for Mining Outliers from Large Data Sets

by Sridhar Ramaswamy, Rajeev Rastogi, Kyuseok Shim, 2000
"... In this paper, we propose a novel formulation for distance-based outliers that is based on the distance of a point from its k-th nearest neighbor. We rank each point on the basis of its distance to its k-th nearest neighbor and declare the top n points in this ranking to be outliers. In addition ..."
Cited by 322 (0 self)
, and then prunes entire partitions as soon as it is determined that they cannot contain outliers. This results in substantial savings in computation. We present the results of an extensive experimental study on real-life and synthetic data sets. The results from a real-life NBA database highlight and reveal

Mining Outliers in Spatial Networks

by Wen Jin, Yuelong Jiang, Weining Qian, Anthony K. H. Tung
"... Abstract. Outlier analysis is an important task in data mining and has attracted much attention in both research and applications. Previous work on outlier detection involves different types of databases such as spatial databases, time series databases, biomedical databases, etc. However, few of the ..."
Cited by 1 (0 self)
, then quickly identifies the outliers in the remaining edges after pruning those unnecessary edges which cannot contain outliers. We also present algorithms that can be applied when the spatial network is updating points or the input parameters of outlier measures are changed. The experimental results verify

FP-Outlier: frequent pattern based outlier detection

by Zengyou He, Xiaofei Xu, Joshua Zhexue Huang, Shengchun Deng, 2002
"... An outlier in a dataset is an observation or a point that is considerably dissimilar to or inconsistent with the remainder of the data. Detection of such outliers is important for many applications and has recently attracted much attention in the data mining research community. In this paper, we pr ..."
Cited by 6 (0 self)
present a new method to detect outliers by discovering frequent patterns (or frequent itemsets) from the data set. The outliers are defined as the data transactions that contain less frequent patterns in their itemsets. We define a measure called FPOF (Frequent Pattern Outlier Factor) to detect

Learning an Outlier-Robust Kalman Filter

by Jo-anne Ting, Evangelos Theodorou, Stefan Schaal
"... Abstract. We introduce a modified Kalman filter that performs robust, real-time outlier detection, without the need for manual parameter tuning by the user. Systems that rely on high quality sensory data (for instance, robotic systems) can be sensitive to data containing outliers. The standard Kalma ..."
Cited by 6 (0 self)

D-optimality for minimum volume ellipsoid with outliers

by Er N. Dolia, Neil M. White, Chris J. Harris - In Proceedings of the Seventh International Conference on Signal/Image Processing and Pattern Recognition (UkrOBRAZ’2004), 2004
"... A family of one-class classification methods is extended by the determinant maximization novelty detection (DMND) model based on the D-optimum experimental design approach for the ellipsoid estimation. Similar to the one-class classification methods based on the support vector machine or the so-call ..."
Cited by 5 (3 self)
... so-called support vector data description (SVDD) approach, DMND is a method that fits a geometrical object around the training data. However, in contrast to SVDD, DMND finds the hyperellipsoid of the smallest volume covering the target objects that can contain outliers by maximizing the determinant

Multivariate Outlier Detection Using Independent Component Analysis

by Md. Shamim Reza, Sabba Ruhi - Science Journal of Applied Mathematics and Statistics, Science Publishing Group, USA
"... Abstract: The recent developments by considering a rather unexpected application of the theory of Independent component analysis (ICA) found in outlier detection, data clustering and multivariate data visualization etc. Accurate identification of outliers plays an important role in statistical analy ..."
Cited by 2 (2 self)
analysis. If classical statistical models are blindly applied to data containing outliers, the results can be misleading at best. In addition, outliers themselves are often the special points of interest in many practical situations and their identification is the main purpose of the investigation

Variance estimation for complex surveys in the presence of outliers

by Beat Hulliger, Ralf Münnich - In Proceedings of the Section on Survey Research Methods, 2006
"... Quantitative variables in surveys often have a markedly skew distribution and, in addition, contain outliers. Robust estimators, which may be used in this situation, generally are biased. In addition linearized variance estimators tend to underestimate the true variance considerably. Alternatives ..."
Cited by 1 (0 self)

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University