Results 1 - 10
of
52
Multidimensional indexing for recognizing visual shapes
- PAMI
, 1994
"... Abstract-This paper introduces an analytical framework for studying some properties of model acquisition and recognition techniques based on indexing. The goal is to demonstrate that several problems previously associated with the approach can be attributed to the low dimensionality of invariants us ..."
Abstract
-
Cited by 74 (0 self)
- Add to MetaCart
Abstract-This paper introduces an analytical framework for studying some properties of model acquisition and recognition techniques based on indexing. The goal is to demonstrate that several problems previously associated with the approach can be attributed to the low dimensionality of invariants used. These include limited index selectivity, excessive accumulation of votes in the look-up table buckets, and excessive sensitivity to quantization parameters. Theoretical results demonstrate that using high-dimensional, highly descriptive global invariants produces better results in terms of accuracy, false positive suppression, and computation time. A practical example of high-dimensional global invariants is introduced and used to implement a 2-D shape acquisitionhecognition system. The acquisitiodrecognition system is based on a two-step table look-up mechanism. First, local curve descriptors are obtained by correlating image contour information at short range. Then, seven-dimensional global invariants are computed by correlating triplets of local curve descriptors at longer range. This experimental system is meant to illustrate the behavior of a high-dimensional indexing scheme. Indeed, its performance shows good agreement with the analytical model with respect to database size, fault tolerance, and recognition speed. Model acquisition time is linear to cubic in the number of object features. Object recognition time is constant to linear in the number of models in the database and linear to cubic in the number of features in the image. The system has been tested extensively, with more than 250 arbitrary shapes in the database. Unsupervised shape and subpart acquisition is demonstrated. I.
Feature-Based Human Face Detection
- IMAGE AND VISION COMPUTING
, 1996
"... Human face detection has always been an important problem for face, expression and gesture recognition. Though numerous attempts have been made to detect and localize faces, these approaches have made assumptions that restrict their extension to more general cases. We identify that the key factor in ..."
Abstract
-
Cited by 66 (3 self)
- Add to MetaCart
Human face detection has always been an important problem for face, expression and gesture recognition. Though numerous attempts have been made to detect and localize faces, these approaches have made assumptions that restrict their extension to more general cases. We identify that the key factor in a generic and robust system is that of using a large amount of image evidence, related and reinforced by model knowledge through a probabilistic framework. In this paper, we propose a featurebased algorithm for detecting faces that is sufficiently generic and is also easily extensible to cope with more demanding variations of the imaging conditions. The algorithm detects feature points from the image using spatial filters and groups them into face candidates using geometric and gray level constraints. A probabilistic framework is then used to reinforce probabilities and to evaluate the likelihood of the candidate as a face. We provide results to support the validity of the approach and demo...
Image segmentation based on oscillatory correlation
- Neural Computation
, 1997
"... We study image segmentation on the basis of locally excitatory globally inhibitory oscillator networks (LEGION), whereby the phases of oscillators encode the binding of pixels. We introduce a potential for each oscillator so that only those oscillators with strong connections from their neighborhood ..."
Abstract
-
Cited by 63 (18 self)
- Add to MetaCart
We study image segmentation on the basis of locally excitatory globally inhibitory oscillator networks (LEGION), whereby the phases of oscillators encode the binding of pixels. We introduce a potential for each oscillator so that only those oscillators with strong connections from their neighborhood can develop high potentials. Based on the concept of potential, a solution to remove noisy regions in an image is proposed for LEGION, so that it suppresses the oscillators corresponding to noisy regions, without affecting those corresponding to major regions. We show analytically that the resulting oscillator network separates an image into several major regions, plus a background consisting of all noisy regions, and illustrate network properties by computer simulation. The network exhibits a natural capacity in segmenting images. The oscillatory dynamics leads to a computer algorithm, which is applied successfully to segmenting real graylevel images. A number of issues regarding biological plausibility and perceptual organization are discussed. We argue that LEGION provides a novel and effective framework for image segmentation and figure-ground segregation. DeLiang Wang and David Terman Image Segmentation 1.
An Active Contour Model For Mapping The Cortex
- IEEE TRANS. ON MEDICAL IMAGING
, 1995
"... A new active contour model for finding and mapping the outer cortex in brain images is developed. A cross-section of the brain cortex is modeled as a ribbon, and a constant speed mapping of its spine is sought. A variational formulation, an associated force balance condition, and a numerical approac ..."
Abstract
-
Cited by 59 (13 self)
- Add to MetaCart
A new active contour model for finding and mapping the outer cortex in brain images is developed. A cross-section of the brain cortex is modeled as a ribbon, and a constant speed mapping of its spine is sought. A variational formulation, an associated force balance condition, and a numerical approach are proposed to achieve this goal. The primary difference between this formulation and that of snakes is in the specification of the external force acting on the active contour. A study of the uniqueness and fidelity of solutions is made through convexity and frequency domain analyses, and a criterion for selection of the regularization coefficient is developed. Examples demonstrating the performance of this method on simulated and real data are provided.
Model-Based Object Recognition - A Survey of Recent Research
, 1994
"... We survey the main ideas behind recent research in model-based object recognition. The survey covers representations for models and images and the methods used to match them. Perceptual organization, the use of invariants, indexing schemes, and match verification are also reviewed. We conclude that ..."
Abstract
-
Cited by 48 (1 self)
- Add to MetaCart
We survey the main ideas behind recent research in model-based object recognition. The survey covers representations for models and images and the methods used to match them. Perceptual organization, the use of invariants, indexing schemes, and match verification are also reviewed. We conclude that there is still much room for improvement in the scope, robustness, and efficiency of object recognition methods. We identify what we believe are the ways improvements will be achieved. ii Contents 1. Introduction .................................................................................................................................... 1 2. Representation ................................................................................................................................ 3 2.1 What makes a good shape representation? ............................................................................ 3 2.2 The choice of coordinate system ..........................................
Embedding Gestalt Laws in Markov Random Fields -- a theory for shape modeling and perceptual organization
, 1999
"... The goal of this paper is to study a mathematical framework of 2D object shape modeling and learning for middle level vision problems, such as image segmentation and perceptual organization. For this purpose, we pursue generic shape models which characterize the most common features of 2D object sha ..."
Abstract
-
Cited by 48 (7 self)
- Add to MetaCart
The goal of this paper is to study a mathematical framework of 2D object shape modeling and learning for middle level vision problems, such as image segmentation and perceptual organization. For this purpose, we pursue generic shape models which characterize the most common features of 2D object shapes. In this paper, shape models are learned from observed natural shapes based on a minimax entropy learning theory (Zhu and Mumford 1997, Zhu, Wu and Mumford 1997)[31, 32]. The learned shape models are Gibbs distributions dened on Markov random elds (MRFs). The neighborhood structures of these MRFs correspond to Gestalt laws {co-linearity, co-circularity, proximity, parallelism, and symmetry. Thus both contour-based and region-based features are accounted for. Stochastic Markov chain Monte Carlo (MCMC) algorithms are proposed for learning and model verication. Furthermore, this paper provides a quantitative measure for the so-called non-accidental statistics, and thus justies some empi...
Superquadrics for Segmenting and Modeling Range Data
- IEEE Transactions on Pattern Analysis and Machine Intelligence
, 1997
"... We present a novel approach to reliable and efficient recovery of part-descriptions in terms of superquadric models from range data. We show that superquadrics can directly be recovered from unsegmented data, thus avoiding any presegmentation steps (e.g., in terms of surfaces). The approach is b ..."
Abstract
-
Cited by 47 (4 self)
- Add to MetaCart
We present a novel approach to reliable and efficient recovery of part-descriptions in terms of superquadric models from range data. We show that superquadrics can directly be recovered from unsegmented data, thus avoiding any presegmentation steps (e.g., in terms of surfaces). The approach is based on the recover-andselect paradigm [10]. We present several experiments on real and synthetic range images, where we demonstrate the stability of the results with respect to viewpoint and noise.
Stochastic Relaxation on Partitions with Connected Components and Its Application to Image Segmentation
, 1998
"... We present a new method of segmentation in which images are segmented by partitions with connected components. For this, first we define two different types of neighborhoods on the space of partitions with connected components of a general graph; neighborhoods of the first type are simple but sma ..."
Abstract
-
Cited by 26 (0 self)
- Add to MetaCart
We present a new method of segmentation in which images are segmented by partitions with connected components. For this, first we define two different types of neighborhoods on the space of partitions with connected components of a general graph; neighborhoods of the first type are simple but small, while those of the second type are large but complex; second, we give algorithms which are not computationally costly, for probability simulation and simulated annealing on such spaces using the neighborhoods. In particular Hastings algorithms and generalized Metropolis algorithms are defined to avoid heavy computations in the case of the second type of neighborhoods. To realize segmentation, we propose a hierarchical approach which at each step minimizes a cost function on the space of partitions with connected components of a graph.
A Framework for performance characterization of intermediate-level grouping modules
- IEEE Transactions on Pattern Analysis and Machine Intelligence
, 1997
"... Abstract—We present five performance measures to evaluate grouping modules in the context of constrained search and indexing based object recognition. Using these measures, we demonstrate a sound experimental framework, based on statistical ANOVA tests, to compare and contrast three edge based organ ..."
Abstract
-
Cited by 25 (5 self)
- Add to MetaCart
Abstract—We present five performance measures to evaluate grouping modules in the context of constrained search and indexing based object recognition. Using these measures, we demonstrate a sound experimental framework, based on statistical ANOVA tests, to compare and contrast three edge based organization modules, namely, those of Etemadi et al., Jacobs, and Sarkar-Boyer in the domain of aerial objects using 50 images. With adapted parameters, the Jacobs module performs overall the best for constraint based recognition. For fixed parameters, the Sarkar-Boyer module is the best in terms of recognition accuracy and indexing speedup. Etemadi et al.’s module performs equally well with fixed and adapted parameters while the Jacobs module is most sensitive to fixed and adapted parameter choices. The overall performance ranking of the modules is Jacobs, Sarkar-Boyer, and Etemadi et al. Index Terms—Perceptual organization, performance evaluation, analysis of variance, ANOVA, experimental vision, intermediate level computer vision, feature grouping, performance characterization. 1
ARTISAN - a shape retrieval system based on boundary family indexing
, 1996
"... Successful retrieval of images by shape feature is likely to be achieved only if we can mirror human similarity judgements. Following Biederman's theory of recognition-by-components, we postulate that shape analysis for retrieval should characterize an image by identifying properties such as colline ..."
Abstract
-
Cited by 21 (1 self)
- Add to MetaCart
Successful retrieval of images by shape feature is likely to be achieved only if we can mirror human similarity judgements. Following Biederman's theory of recognition-by-components, we postulate that shape analysis for retrieval should characterize an image by identifying properties such as collinearity, shape similarity and proximity in component boundaries. Such properties can then be used to group image components into families, from which indexing features can be derived. We are currently applying these principles in the development of the ARTISAN shape retrieval system for the UK Patent Office. The trademark images, supplied in compressed bit-map format, are processed using standard edge-extraction techniques to derive a set of region boundaries, which are approximated as a sequence of straight-line and circular-arc segments. These are then grouped into families using criteria such as proximity and shape similarity. Shape features for retrieval are then extracted from the image a...

