Results 1 - 10
of
15
Object Recognition from Local Scale-Invariant Features
- PROC. OF THE INTERNATIONAL CONFERENCE ON COMPUTER VISION, CORFU
, 1999
"... An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons i ..."
Abstract
-
Cited by 1032 (14 self)
- Add to MetaCart
An object recognition system has been developed that uses a new class of local image features. The features are invariant to image scaling, translation, and rotation, and partially invariant to illumination changes and affine or 3D projection. These features share similar properties with neurons in inferior temporal cortex that are used for object recognition in primate vision. Features are efficiently detected through a staged filtering approach that identifies stable points in scale space. Image keys are created that allow for local geometric deformations by representing blurred image gradients in multiple orientation planes and at multiple scales. The keys are used as input to a nearest-neighbor indexing method that identifies candidate object matches. Final verification of each match is achieved by finding a low-residual least-squares solution for the unknown model parameters. Experimental results show that robust object recognition can be achieved in cluttered partially-occluded images with a computation time of under 2 seconds.
Local Feature View Clustering for 3D Object Recognition
- IEEE Conference on Computer Vision and Pattern Recognition
, 2001
"... There have been important recent advances in object recognition through the matching of invariant local image features. However, the existing approaches are based on matching to individual training images. This paper presents a method for combining multiple images of a 3D object into a single model ..."
Abstract
-
Cited by 69 (7 self)
- Add to MetaCart
There have been important recent advances in object recognition through the matching of invariant local image features. However, the existing approaches are based on matching to individual training images. This paper presents a method for combining multiple images of a 3D object into a single model representation. This provides for recognition of 3D objects from any viewpoint, the generalization of models to non-rigid changes, and improved robustness through the combination of features acquired under a range of imaging conditions. The decision of whether to cluster a training image into an existing view representation or to treat it as a new view is based on the geometric accuracy of the match to previous model views. A new probabilistic model is developed to reduce the false positive matches that would otherwise arise due to loosened geometric constraints on matching 3D and non-rigid models. A system has been developed based on these approaches that is able to robustly recognize 3D objects in cluttered natural images in sub-second times.
A limit to the speed of processing in ultra-rapid visual categorization of novel natural scenes
- Journal of Cognitive Neuroscience
, 2001
"... & The processing required to decide whether a briefly flashed natural scene contains an animal can be achieved in 150 msec (Thorpe, Fize, & Marlot, 1996). Here we report that extensive training with a subset of photographs over a 3-week period failed to increase the speed of the processing underlyi ..."
Abstract
-
Cited by 38 (9 self)
- Add to MetaCart
& The processing required to decide whether a briefly flashed natural scene contains an animal can be achieved in 150 msec (Thorpe, Fize, & Marlot, 1996). Here we report that extensive training with a subset of photographs over a 3-week period failed to increase the speed of the processing underlying such rapid visual categorizations: Completely novel scenes could be categorized just as fast as highly familiar ones. Such data imply that the visual system processes new stimuli at a speed and with a number of stages that cannot be compressed. This rapid processing mode was seen with a wide range of visual complex images challenging the idea that short reaction times can only be seen with simple visual stimuli and implying that highly automatic feed-forward mechanisms underlie a far greater proportion of the sophisticated image analysis needed for everyday vision than is generally assumed. & Both humans and monkeys are able to categorize natural images accurately and very rapidly (Fabre-Thorpe, Richard, & Thorpe, 1998; Thorpe, Fize, & Marlot, 1996). The nature of the underlying mechanisms is currently
The similarity-in-topography principle: reconciling theories of conceptual deficits
- Cognitive Neuropsychology
, 2003
"... Three theories currently compete to explain the conceptual deficits that result from brain damage: sensory-functional theory, domain-specific theory, and conceptual structure theory. We argue that all three theories capture important aspects of conceptual deficits, and offer different insights into ..."
Abstract
-
Cited by 32 (8 self)
- Add to MetaCart
Three theories currently compete to explain the conceptual deficits that result from brain damage: sensory-functional theory, domain-specific theory, and conceptual structure theory. We argue that all three theories capture important aspects of conceptual deficits, and offer different insights into their origins. Conceptual topography theory (CTT) integrates these insights, beginning with A. R. Damasio’s (1989) convergence zone theory and elaborating it with the similarity-in-topography (SIT) principle. According to CTT, feature maps in sensory-motor systems represent the features of a category’s exemplars. A hierarchical system of convergence zones then conjoins these features to form both property and category representations. According to the SIT principle, the proximity of two conjunctive neurons in a convergence zone increases with the similarity of the features they conjoin. As a result, conjunctive neurons become topographically organised into local regions that represent properties and categories. Depending on the level and location of a lesion in this system, a wide variety of deficits is possible. Consistent with the literature, these deficits range from the loss of a single category to the loss of multiple categories that share sensory-motor properties.
Noticing Familiar Objects in Real World Scenes: The Role of Temporal Cortical Neurons in Natural Vision
- Journal of Neuroscience
, 2001
"... During natural vision, the brain efficiently processes views of the external world as the eyes actively scan the environment. To better understand the neural mechanisms underlying this process, we recorded the activity of individual temporal cortical neurons while monkeys looked for and identified f ..."
Abstract
-
Cited by 29 (0 self)
- Add to MetaCart
During natural vision, the brain efficiently processes views of the external world as the eyes actively scan the environment. To better understand the neural mechanisms underlying this process, we recorded the activity of individual temporal cortical neurons while monkeys looked for and identified familiar targets embedded in natural scenes. We found a group of visual neurons that exhibited stimulus-selective neuronal bursts just before the monkey’s response. Most of these cells showed similar selectivity whether effective targets were viewed in isolation or encountered in the course of exploring complex scenes. In addition, by embedding target stimuli in natural scenes, we could examine the activity of these stimulus-selective cells during visual search and at the time targets were fixated and Convergent evidence from behavioral, neuropsychological, and neurophysiological experiments indicates that, in the primate
Towards a Computational Model for Object Recognition in IT Cortex
- IN IT CORTEX, IN: BIOLOGICALLY MOTIVATED COMPUTER VISION
, 2000
"... There is considerable evidence that object recognition in primates is based on the detection of local image features of intermediate complexity that are largely invariant to imaging transformations. A computer vision system has been developed that performs object recognition using features with ..."
Abstract
-
Cited by 20 (0 self)
- Add to MetaCart
There is considerable evidence that object recognition in primates is based on the detection of local image features of intermediate complexity that are largely invariant to imaging transformations. A computer vision system has been developed that performs object recognition using features with similar properties. Invariance to image translation, scale and rotation is achieved by first selecting stable key points in scale space and performing feature detection only at these locations. The features measure local image gradients in a manner modeled on the response of complex cells in primary visual cortex, and thereby obtain partial invariance to illumination, affine change, and other local distortions. The features are used as input to a nearest-neighbor indexing method and Hough transform that identify candidate object matches. Final verification of each match is achieved by finding a best-fit solution for the unknown model parameters and integrating the features consistent with these parameter values. This verification procedure provides a model for the serial process of attention in human vision that integrates features belonging to a single object. Experimental results show that this approach can achieve rapid and robust object recognition in cluttered partially-occluded images.
How Visual Cortex Recognizes Objects: The Tale of the Standard Model
, 2002
"... A host of experimental data has been accumulating on the properties and mechanisms of object recognition in cortex. We review the main findings, and summarize them using a quantitative, biologically plausible, Standard Model. The model is a tool to interpret and understand the available data, and ..."
Abstract
-
Cited by 14 (3 self)
- Add to MetaCart
A host of experimental data has been accumulating on the properties and mechanisms of object recognition in cortex. We review the main findings, and summarize them using a quantitative, biologically plausible, Standard Model. The model is a tool to interpret and understand the available data, and generate questions and predictions for new experiments.
Sensorimotor cognition and natural language syntax
, 2010
"... This book is about the interface between natural language and the sensorimotor system. It is obvious that there is an interface between language and sensorimotor cognition, because we can talk about what we see and do. The main proposal in the book is that the interface is more direct than is common ..."
Abstract
-
Cited by 5 (3 self)
- Add to MetaCart
This book is about the interface between natural language and the sensorimotor system. It is obvious that there is an interface between language and sensorimotor cognition, because we can talk about what we see and do. The main proposal in the book is that the interface is more direct than is commonly assumed. To argue for this proposal I focus on a simple concrete episode—a man grabbing a cup—which can be reported in a simple transitive sentence (e.g. the English sentence The man grabbed a cup). In the first part of the book I present a detailed model of the sensorimotor processes involved in experiencing this episode, both as the agent bringing it about and as an observer watching it happen. The model draws on a large body of research in neuroscience and psychology. I also present a model of the syntactic structure of the associated transitive sentence, developed within the entirely separate discipline of theoretical linguistics. This latter model is a version of Chomsky’s ‘Minimalist ’ syntactic theory, which assumes that a sentence reporting the episode has the same underlying syntactic structure (called ‘logical form’) regardless of which language it is in. My main proposal is that these two independently motivated models are in fact closely
Success and failure of new speech category learning in adulthood: Consequences of learned Hebbian attractors in topographic maps
, 2007
"... The influence of a native language on learning new speech sounds in adulthood is addressed using a network model in which speech categories are attractors implemented through interactive activation and Hebbian learning. The network has a representation layer that receives topographic projections fro ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
The influence of a native language on learning new speech sounds in adulthood is addressed using a network model in which speech categories are attractors implemented through interactive activation and Hebbian learning. The network has a representation layer that receives topographic projections from an input layer and has reciprocal excitatory connections with deeper layers. When applied to an experiment in which Japanese adults were trained to distinguish the English /r/–/l / contrast (McCandliss, Fiez, Protopapas, Conway, & McClelland, 2002), the model can account for many aspects of the experimental results, such as the time course and outcome of the learning, how it varies as a function of feedback, the relative efficacy of adaptive and initially easy training stimuli versus nonadaptive and difficult stimuli, and the development of a discrimination peak at the acquired category boundary. The model is also able to capture some aspects of the individual differences in learning.

