Results 11 - 20
of
48
Volumetric features for video event detection
, 2008
"... Real-world actions occur often in crowded, dynamic environments. This poses a difficult challenge for current approaches to video event detection because it is difficult to segment the actor from the background due to distracting motion from other objects in the scene. We propose a technique for eve ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Real-world actions occur often in crowded, dynamic environments. This poses a difficult challenge for current approaches to video event detection because it is difficult to segment the actor from the background due to distracting motion from other objects in the scene. We propose a technique for event recognition in crowded videos that reliably identifies actions in the presence of partial occlusion and background clutter. Our approach is based on three key ideas: (1) we efficiently match the volumetric representation of an event against oversegmented spatio-temporal video volumes; (2) we augment our shape-based features using flow; (3) rather than treating an event template as an atomic entity, we separately match by parts (both in space and time), enabling robustness against occlusions and actor variability. Our experiments on human actions, such as picking up a dropped object or waving in a crowd show reliable detection with few false positives. 1.
A geometric invariant shape descriptor based on the Radon, FOURIER, AND MELLIN TRANSFORMS
- IN PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION
, 2010
"... A new shape descriptor invariant to geometric transformation based on the Radon, Fourier, and Mellin transforms is proposed. The Radon transform converts the geometric transformation applied on a shape image into transformation in the columns and rows of the Radon image. Invariances to translation, ..."
Abstract
-
Cited by 3 (3 self)
- Add to MetaCart
A new shape descriptor invariant to geometric transformation based on the Radon, Fourier, and Mellin transforms is proposed. The Radon transform converts the geometric transformation applied on a shape image into transformation in the columns and rows of the Radon image. Invariances to translation, rotation, and scaling are obtained by applying 1D Fourier-Mellin and Fourier transforms on the columns and rows of the shape’s Radon image respectively. Experimental results on different datasets show the usefulness of the proposed shape descriptor.
3D Shape Matching by Geodesic Eccentricity
"... This paper makes use of the continuous eccentricity transform to perform 3D shape matching. The eccentricity transform has already been proved useful in a discrete graph-theoretic setting and has been applied to 2D shape matching. We show how these ideas extend to higher dimensions. The eccentricity ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
This paper makes use of the continuous eccentricity transform to perform 3D shape matching. The eccentricity transform has already been proved useful in a discrete graph-theoretic setting and has been applied to 2D shape matching. We show how these ideas extend to higher dimensions. The eccentricity transform is used to compute descriptors for 3D shapes. These descriptors are defined as histograms of the eccentricity transform and are naturally invariant to euclidean motion and articulation. They show promising results for shape discrimination.
Outlier Detection with Globally Optimal Exemplar-Based GMM
"... Outlier detection has recently become an important problem in many data mining applications. In this paper, a novel unsupervised algorithm for outlier detection is proposed. First we apply a provably globally optimal Expectation Maximization (EM) algorithm to fit a Gaussian Mixture Model (GMM) to a ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Outlier detection has recently become an important problem in many data mining applications. In this paper, a novel unsupervised algorithm for outlier detection is proposed. First we apply a provably globally optimal Expectation Maximization (EM) algorithm to fit a Gaussian Mixture Model (GMM) to a given data set. In our approach, a Gaussian is centered at each data point, and hence, the estimated mixture proportions can be interpreted as probabilities of being a cluster center for all data points. The outlier factor at each data point is then defined as a weighted sum of the mixture proportions with weights representing the similarities to other data points. The proposed outlier factor is thus based on global properties of the data set. This is in contrast to most existing approaches to outlier detection, which are strictly local. Our experiments performed on several simulated and real life data sets demonstrate superior performance of the proposed approach. Moreover, we also demonstrate the ability to detect unusual shapes. 1
3D Deformation Using Moving Least Squares
"... Figure 1: Deformation results using our method. Point handles are inside the mesh and marked as red dots. (Models courtesy of Cyberware.) We present a 3d deformation method based on Moving Least Squares that extends the work by Schaefer et al. [Schaefer et al. 2006] to the 3d setting. The user contr ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Figure 1: Deformation results using our method. Point handles are inside the mesh and marked as red dots. (Models courtesy of Cyberware.) We present a 3d deformation method based on Moving Least Squares that extends the work by Schaefer et al. [Schaefer et al. 2006] to the 3d setting. The user controls the deformation by manipulating a set of point handles. Locally, the deformation takes the form of either a rigid transformation or optionally a similarity transformation, and tends to preserve local features. Our derivation of the closed-form solution is based on singular value decomposition, and is applicable to deformation in arbitrary dimensions, as opposed to the planar case in [Schaefer et al. 2006]. Our prototype implementation allows interactive deformation of meshes of over 100k vertices. For the application of 3d mesh deformation, we further introduce a weighting scheme that determines the influence of point handles on vertices based on approximate mesh geodesics. In practice, the new scheme gives much better deformation results for limbed character models, compared with simple Euclidean distance based weighting. The new weighting scheme can be of use to the traditional skinny based deformation technique as well. 1
1 Rotation Invariant Kernels and Their Application to Shape Analysis
"... Shape analysis requires invariance under translation, scale and rotation. Translation and scale invariance can be realized by normalizing shape vectors with respect to their mean and norm. This maps the shape feature vectors onto the surface of a hypersphere. After normalization, the shape vectors c ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Shape analysis requires invariance under translation, scale and rotation. Translation and scale invariance can be realized by normalizing shape vectors with respect to their mean and norm. This maps the shape feature vectors onto the surface of a hypersphere. After normalization, the shape vectors can be made rotational invariant by modelling the resulting data using complex scalar rotation invariant distributions defined on the complex hypersphere, e.g., using the complex Bingham distribution. However, the use of these distributions is hampered by the difficulty in estimating their parameters and the nonlinear nature of their formulation. In the present paper, we show how a set of kernel functions, that we refer to as rotation invariant kernels, can be used to convert the original nonlinear problem into a linear one. As their name implies, these kernels are defined to provide the much needed rotation invariance property allowing one to bypass the difficulty of working with complex spherical distributions. The resulting approach provides an easy, fast mechanism for 2D & 3D shape analysis. Extensive validation using a variety of shape modelling and classification problems demonstrates the accuracy of this proposed approach.
Two Perceptually Motivated Strategies for Shape Classification
"... In this paper, we propose two new, perceptually motivated strategies to better measure the similarity of 2D shape instances that are in the form of closed contours. The first strategy handles shapes that can be decomposed into a base structure and a set of inward or outward pointing “strand” structu ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
In this paper, we propose two new, perceptually motivated strategies to better measure the similarity of 2D shape instances that are in the form of closed contours. The first strategy handles shapes that can be decomposed into a base structure and a set of inward or outward pointing “strand” structures, where a strand structure represents a very thin, elongated shape part attached to the base structure. The similarity of two such shape contours can be better described by measuring the similarity of their base structures and strand structures in different ways. The second strategy handles shapes that exhibit good bilateral symmetry. In many cases, such shapes are invariant to a certain level of scaling transformation along their symmetry axis. In our experiments, we show that these two strategies can be integrated into available shape matching methods to improve the performance of shape classification on several widelyused shape data sets. Shape matching through nonrigid shape deformation is a typical approach to measure shape similarity [6, 14, 10, 21, 13, 16]. In general, this approach measures the amount of energy required to deform one shape contour into another based on some physical or mathematical model. The model is then optimized using methods such as dynamic programming to obtain a set of corresponded points on the two shape contours that minimize the deformation cost of this model. However, this approach is often very sensitive to strong, local shape variations that human vision may handle very well. For example, the two shape contours shown in Figs. 1(a) and (b) are similar in general, but their outward parts, represented by the dashed curves, are quite different from each other. A large deformation cost may be required to match these two shape contours. 1.
Articulation-Invariant Representation of Non-planar Shapes
"... Abstract. Given a set of points corresponding to a 2D projection of a non-planar shape, we would like to obtain a representation invariant to articulations (under no self-occlusions). It is a challenging problem since we need to account for the changes in 2D shape due to 3D articulations, viewpoint ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Abstract. Given a set of points corresponding to a 2D projection of a non-planar shape, we would like to obtain a representation invariant to articulations (under no self-occlusions). It is a challenging problem since we need to account for the changes in 2D shape due to 3D articulations, viewpoint variations, as well as the varying effects of imaging process on different regions of the shape due to its non-planarity. By modeling an articulating shape as a combination of approximate convex parts connected by non-convex junctions, we propose to preserve distances between a pair of points by (i) estimating the parts of the shape through approximate convex decomposition, by introducing a robust measure of convexity and (ii) performing part-wise affine normalization by assuming a weak perspective camera model, and then relating the points using the inner distance which is insensitive to planar articulations. We demonstrate the effectiveness of our representation on a dataset with non-planar articulations, and on standard shape retrieval datasets like MPEG-7.
Shape Classification Through Structured Learning of Matching Measures
"... Many traditional methods for shape classification involve establishing point correspondences between shapes to produce matching scores, which are in turn used as similarity measures for classification. Learning techniques have been applied only in the second stage of this process, after the matching ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Many traditional methods for shape classification involve establishing point correspondences between shapes to produce matching scores, which are in turn used as similarity measures for classification. Learning techniques have been applied only in the second stage of this process, after the matching scores have been obtained. In this paper, instead of simply taking for granted the scores obtained by matching and then learning a classifier, we learn the matching scores themselves so as to produce shape similarity scores that minimize the classification loss. The solution is based on a max-margin formulation in the structured prediction setting. Experiments in shape databases reveal that such an integrated learning algorithm substantially improves on existing methods. Figure 1. (a) Existing methods; learning happens after the matching scores have been obtained. (b) Our approach; both matching and classification are optimized within a unified learning scheme. 1.
Robust Outdoor Text Detection Using Text Intensity and Shape Features
"... Recognizing texts from camera images is a known hard problem because of the difficulties in text segmentation from the varied and complicated backgrounds. In this paper, we propose an algorithm that employs two novel filters and a basic component-based text detection framework. The framework uses th ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Recognizing texts from camera images is a known hard problem because of the difficulties in text segmentation from the varied and complicated backgrounds. In this paper, we propose an algorithm that employs two novel filters and a basic component-based text detection framework. The framework uses the Niblack algorithm to threshold images and groups components into regions with commonly used geometry features. The intensity filter considers the overlap between the intensity histogram of a component and that of its adjoining area. For non-text regions, we have found that this overlap is large, and so we can prune out components with large values of this measure. The shape filter, on the other hand, deletes regions whose constituent components come from a same object, as most words consist of different characters. The proposed method is evaluated with the text locating database with 249 images used in the ICDAR2003 robust reading competition. The result shows that the algorithm is robust to both indoor images and outdoor images, even for the images of complex background, which usually is a hard factor to overcome for traditional component-based algorithms. In terms of performance statistics, we tested the algorithm on the ICDAR 2003 challenge experiment, and the algorithm achieves 66 % precision rate (p), 46 % recall rate (r), and 54 % the combined rate ( f), which is the best reported in the literature on this dataset. 1.

