• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Three-dimensional object recognition from single two-dimensional images (1987)

by D G Lowe
Venue:Artif. Intell
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 484
Next 10 →

Object Recognition from Local Scale-Invariant Features

by David G. Lowe
"... An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in ..."
Abstract - Cited by 2739 (13 self) - Add to MetaCart
An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest-neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low-residual least-squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially-occluded images with a computation time of under 2 seconds.
(Show Context)

Citation Context

...t partially invariant to the image formation process and matching only to those features. Many candidate feature types have been proposed and explored, including line segments [6], groupings of edges =-=[11, 14]-=-, and regions [2], among many other proposals. While these features have worked well for certain object classes, they are often not detected frequently enough or with sufficient stability to form a ba...

Fitting Parameterized Three-Dimensional Models to Images

by David G. Lowe - IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , 1991
"... Model-based recognition and motion tracking depends upon the ability to solve for projection and model parameters that will best fit a 3-D model to matching 2-D image features. This paper extends current methods of parameter solving to handle objects with arbitrary curved surfaces and with any nu ..."
Abstract - Cited by 361 (8 self) - Add to MetaCart
Model-based recognition and motion tracking depends upon the ability to solve for projection and model parameters that will best fit a 3-D model to matching 2-D image features. This paper extends current methods of parameter solving to handle objects with arbitrary curved surfaces and with any number of internal parameters representing articulations, variable dimensions, or surface deformations. Numerical

A Survey of Shape Analysis Techniques

by Sven Loncaric - Pattern Recognition , 1998
"... This paper provides a review of shape analysis methods. Shape analysis methods play an important role in systems for object recognition, matching, registration, and analysis. Researchin shape analysis has been motivated, in part, by studies of human visual form perception systems. ..."
Abstract - Cited by 267 (2 self) - Add to MetaCart
This paper provides a review of shape analysis methods. Shape analysis methods play an important role in systems for object recognition, matching, registration, and analysis. Researchin shape analysis has been motivated, in part, by studies of human visual form perception systems.
(Show Context)

Citation Context

...oped for engineering purposes and not necessarily to study perception. Fractal representations produce objects which correspond much better to the human model of visual perception and cognition. Lowe =-=[106]-=- proposed a computer vision system that can recognize three-dimensional objects from unknown viewpoints and single two-dimensional images. The procedure is non-typical and uses three mechanisms of per...

Principles of Object Perception

by Elizabeth S. Spelke - Cognitive Science , 1990
"... Research on human infants has begun to shed light on early-develpping processes for segmenting perceptual arrays into objects. Infants appear to perceive objects by analyzing three-dimensional surface arrangements and motions. Their per-ception does not accord with a general tendency to maximize fig ..."
Abstract - Cited by 259 (6 self) - Add to MetaCart
Research on human infants has begun to shed light on early-develpping processes for segmenting perceptual arrays into objects. Infants appear to perceive objects by analyzing three-dimensional surface arrangements and motions. Their per-ception does not accord with a general tendency to maximize figural goodness or to attend-to nonaccidental geometric relations in visual arrays. Object perception does accord with principles governing the motions of material bodies: Infants divide perceptual arrays into units that move as connected wholes, that move separately from one another, that tend to maintain their size and shape over motion, and that tend to act upon each other only on contact. These findings sug-gest that o general representation of object unity and boundaries is interposed between representations of surfaces and representations of obiects of familiar kinds. The processes that construct this representation may be related to pro-cesses of physical reasoning. This article is animated by two proposals about perception and perceptual development. One proposal is substantive: In situations where perception
(Show Context)

Citation Context

...ts incompleteness, therefore, a general process for organizing surfaces into objects might contribute to the process of forming more complete and meaningful representations of visual scenes (see also =-=Lowe, 1987-=-). ’ My previous attempts to characterize infants’ perception and knowledge of objects centered on four principles: cohesion, boundedness, spatiotemporal continuity, and solidity or substance (e.g., S...

Adaptive Execution in Complex Dynamic Worlds

by Robert James Firby , 1989
"... A robot acting in the real world must use flexible plans because actions will sometimes fail to produce desired effects, and unexpected events will sometimes demand the robot shift its attention. A plan is usually construed as a list of primitive robot actions to be executed one after another but i ..."
Abstract - Cited by 237 (9 self) - Add to MetaCart
A robot acting in the real world must use flexible plans because actions will sometimes fail to produce desired effects, and unexpected events will sometimes demand the robot shift its attention. A plan is usually construed as a list of primitive robot actions to be executed one after another but in a complex domain, a plan must be structured to cope effectively with the myriad unpredictable details it will encounter during execution. However, adding structure to a plan involves more than augmenting the primitive plan representation; it requires a complete model of interaction with the world called situation-driven execution. Situation-driven execution assumes that a plan consists of tasks with three major components: a satisfaction test, a window of activity, and a set of execution methods that are appropriate in different circumstances. Execution of such a plan proceeds by selecting an unsatisfied t...

Visual servo control Part I: basic approaches

by François Chaumette, Seth Hutchinson - IEEE ROBOTICS AND AUTOMATION MAGAZINE , 2006
"... This article is the first of a two-part series on the topic of visual servo control—using computer vision data in the servo loop to control the motion of a robot. In the present article, we describe the basic techniques that are by now well established in the field. We first give a general overview ..."
Abstract - Cited by 212 (34 self) - Add to MetaCart
This article is the first of a two-part series on the topic of visual servo control—using computer vision data in the servo loop to control the motion of a robot. In the present article, we describe the basic techniques that are by now well established in the field. We first give a general overview of the formulation of the visual servo control problem. We then describe the two archetypal visual servo control schemes: image-based and position-based visual servo control. Finally, we discuss performance and stability issues that pertain to these two schemes, motivating the second article in the series, in which we consider advanced techniques.
(Show Context)

Citation Context

...omputer vision problem is called the 3D localization problem. While this problem is beyond the scope of the present tutorial, many solutions have been presented in the literature (see, e.g., [16] and =-=[17]-=-). It is then typical to define s in terms of the parameterization used to represent the camera pose. Note that the parameters a involved in the definition (1) of s are now the camera intrinsic parame...

Mental rotation and orientation-dependence in shape recognition

by Michael J. Tarrandsteven Pinker - Cognitive Psychology , 1989
"... How do we recognize objects despite differences in their retinal projections when they are seen at different orientations? Marr and Nishihara (1978) proposed that shapes are represented in memory as structural descriptions in objectcentered coordinate systems, so that an object is represented identi ..."
Abstract - Cited by 212 (20 self) - Add to MetaCart
How do we recognize objects despite differences in their retinal projections when they are seen at different orientations? Marr and Nishihara (1978) proposed that shapes are represented in memory as structural descriptions in objectcentered coordinate systems, so that an object is represented identically regardless of its orientation. An alternative hypothesis is that an object is represented in memory in a single representation corresponding to a canonical orientation, and a mental rotation operation transforms an input shape into that orientation before input and memory are compared. A third possibility is that shapes are stored in a set of representations, each corresponding to a different orientation. In four experiments, subjects studied several objects each at a single orientation, and were given extensive practice at naming them quickly, or at classifying them as normal or mirror-reversed, at several orientations. At first, response times increased with departure from the study orientation, with a slope similar to those obtained in classic mental rotation experiments. This suggests that subjects made both judgments by mentally transforming the orientation of the input shape to the one they

Groups of Adjacent Contour Segments for Object Detection

by V. Ferrari, L. Fevrier, F. Jurie, C. Schmid , 2007
"... We present a family of scale-invariant local shape features formed by chains of k connected, roughly straight contour segments (kAS), and their use for object class detection. kAS are able to cleanly encode pure fragments of an object boundary, without including nearby clutter. Moreover, they offer ..."
Abstract - Cited by 188 (7 self) - Add to MetaCart
We present a family of scale-invariant local shape features formed by chains of k connected, roughly straight contour segments (kAS), and their use for object class detection. kAS are able to cleanly encode pure fragments of an object boundary, without including nearby clutter. Moreover, they offer an attractive compromise between information content and repeatability, and encompass a wide variety of local shape structures. We also define a translation and scale invariant descriptor encoding the geometric configuration of the segments within a kAS, making kAS easy to reuse in other frameworks, for example as a replacement or addition to interest points. Software for detecting and describing kAS is released on lear.inrialpes.fr/software. We demonstrate the high performance of kAS within a simple but powerful sliding-window object detection scheme. Through extensive evaluations, involving eight diverse object classes and more than 1400 images, we 1) study the evolution of performance as the degree of feature complexity k varies and determine the best degree; 2) show that kAS substantially outperform interest points for detecting shape-based classes; 3) compare our object detector to the recent, state-of-the-art system by Dalal and
(Show Context)

Citation Context

...al feature detector before being applicable within frameworks not based on sliding windows. Perceptual grouping. Perceptual grouping of contours has a long history in computer vision [6], [13], [18], =-=[19]-=-, [25], [26], [28], [32]. The idea behind these works is that pieces of contour related by some perceptually salient property are more likely to belong to the same object. The perceptual properties ex...

Local Feature View Clustering for 3D Object Recognition

by David G. Lowe - IEEE Conference on Computer Vision and Pattern Recognition , 2001
"... There have been important recent advances in object recognition through the matching of invariant local image features. However, the existing approaches are based on matching to individual training images. This paper presents a method for combining multiple images of a 3D object into a single model ..."
Abstract - Cited by 167 (6 self) - Add to MetaCart
There have been important recent advances in object recognition through the matching of invariant local image features. However, the existing approaches are based on matching to individual training images. This paper presents a method for combining multiple images of a 3D object into a single model representation. This provides for recognition of 3D objects from any viewpoint, the generalization of models to non-rigid changes, and improved robustness through the combination of features acquired under a range of imaging conditions. The decision of whether to cluster a training image into an existing view representation or to treat it as a new view is based on the geometric accuracy of the match to previous model views. A new probabilistic model is developed to reduce the false positive matches that would otherwise arise due to loosened geometric constraints on matching 3D and non-rigid models. A system has been developed based on these approaches that is able to robustly recognize 3D objects in cluttered natural images in sub-second times.
(Show Context)

Citation Context

...ult, as there is an increased probability that mistaken matches will happen to appear in a valid configuration. It is not sufficient to use a metric such as just the number or type of feature matches =-=[3, 5]-=- to determine the presence of a model under these circumstances. Schmid [13] has proposed a more complete model based on combining feature reliabilities. However, it does not take account of the varyi...

Is Vision Continuous with Cognition? The Case for Cognitive Impenetrability of Visual Perception

by Zenon Pylyshyn , 1998
"... This article defends the claim that a significant part of visual perception (called "early vision") is impervious to the influence of beliefs, expectations or knowledge. We examine a wide range of empirical evidence that has been cited in support of the continuity of vision and cognition a ..."
Abstract - Cited by 165 (10 self) - Add to MetaCart
This article defends the claim that a significant part of visual perception (called "early vision") is impervious to the influence of beliefs, expectations or knowledge. We examine a wide range of empirical evidence that has been cited in support of the continuity of vision and cognition and argue that the evidence either shows withinvision top-down effects, or else the extra-visual effects that are demonstrated occur before the operation of the autonomous early vision system (through the allocation of focal attention) or after the visual system has produced its 3D shape-description (through the intervention of post-visual decision processes).
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University