Results 1 - 10
of
42
Hierarchical Models of Object Recognition in Cortex
, 1999
"... The classical model of visual processing in cortex is a hierarchy of increasingly sophisticated representations, extending in a natural way the model of simple to complex cells of Hubel and Wiesel. Somewhat surprisingly, little quantitative modeling has been done in the last 15 years to explore th ..."
Abstract
-
Cited by 344 (67 self)
- Add to MetaCart
The classical model of visual processing in cortex is a hierarchy of increasingly sophisticated representations, extending in a natural way the model of simple to complex cells of Hubel and Wiesel. Somewhat surprisingly, little quantitative modeling has been done in the last 15 years to explore the biological feasibility of this class of models to explain higher level visual processing, such as object recognition. We describe a new hierarchical model that accounts well for this complex visual task, is consistent with several recent physiological experiments in inferotemporal cortex and makes testable predictions. The model is based on a novel MAX-like operation on the inputs to certain cortical neurons which may have a general role in cortical function.
Representation of spatial orientation by the intrinsic dynamics of the head-direction cell ensemble: A theory
- J. Neurosci
, 1996
"... The head-direction (HD) cells found in the limbic system in freely moving rats represent the instantaneous head direction of the animal in the horizontal plane regardless of the location of the animal. The internal direction represented by these cells uses both self-motion information for inet-tiall ..."
Abstract
-
Cited by 94 (1 self)
- Add to MetaCart
The head-direction (HD) cells found in the limbic system in freely moving rats represent the instantaneous head direction of the animal in the horizontal plane regardless of the location of the animal. The internal direction represented by these cells uses both self-motion information for inet-tially based updating and familiar visual landmarks for calibration. Here, a model of the dynamics of the HD cell ensemble is presented. The sta-bility of a localized static activity profile in the network and a dynamic shift mechanism are explained naturally by synaptic weight distribution components with even and odd symmetry, respectively. Under symmetric weights or symmetric reciprocal connections, a stable activity profile close to the known direc-tional tuning curves will emerge. By adding a slight asymmetry to the weights, the activity profile will shift continuously without 1
Representation is Representation of Similarities
- Behavioral and Brain Sciences
, 1996
"... Advanced perceptual systems are faced with the problem of securing a principled relationship between the world and its internal representation. I propose a unified approach to visual representation, based on Shepard's (1968) notion of second-order isomorphism. According to the proposed theory, a sha ..."
Abstract
-
Cited by 60 (15 self)
- Add to MetaCart
Advanced perceptual systems are faced with the problem of securing a principled relationship between the world and its internal representation. I propose a unified approach to visual representation, based on Shepard's (1968) notion of second-order isomorphism. According to the proposed theory, a shape is represented internally by the responses of a few tuned modules, each of which is broadly selective for some reference shape, whose similarity to the stimulus it measures. The result is a philosophically appealing, computationally feasible, biologically credible, and formally veridical representation of a distal shape space. This approach supports representation of and discrimination among shapes radically different from the reference ones, while bypassing the need for the computationally problematic decomposition into parts; it also addresses the needs of shape categorization, and can be used to derive a range of models of perceived similarity. Representation is Representation of Sim...
A theory of object recognition: computations and circuits in the feedforward path of the ventral stream in primate visual cortex
, 2005
"... ..."
Representation, Similarity, and the Chorus of Prototypes
- Minds and Machines
, 1995
"... It is proposed to conceive of representation as an emergent phenomenon that is supervenient on patterns of activity of coarsely tuned and highly redundant feature detectors. The computational underpinnings of the outlined theory of representation are (1) the properties of collections of overlappi ..."
Abstract
-
Cited by 38 (8 self)
- Add to MetaCart
It is proposed to conceive of representation as an emergent phenomenon that is supervenient on patterns of activity of coarsely tuned and highly redundant feature detectors. The computational underpinnings of the outlined theory of representation are (1) the properties of collections of overlapping graded receptive fields, as in the biological perceptual systems that exhibit hyperacuity-level performance, and (2) the sufficiency of a set of proximal distances between stimulus representations for the recovery of the corresponding distal contrasts between stimuli, as in multidimensional scaling. The present preliminary study appears to indicate that this concept of representation is computationally viable, and is compatible with psychological and neurobiological data. 1 Introduction A perceptual system confronted with a stimulus must (i) decide whether the stimulus belongs to an already encountered category, and (ii) if necessary, create a new category record for the stimulus a...
Implementation of an Attentional Prototype for Early Vision
- In Proceedings of the 2nd European Conference on Computer Vision
, 1992
"... Researchers have long argued that an attentional mechanism is required to perform many vision tasks. This thesis includes an implementation and evaluation of an attentional prototype as it applies to early and intermediate levels of visual computation. The model is composed of a processing hierarchy ..."
Abstract
-
Cited by 32 (4 self)
- Add to MetaCart
Researchers have long argued that an attentional mechanism is required to perform many vision tasks. This thesis includes an implementation and evaluation of an attentional prototype as it applies to early and intermediate levels of visual computation. The model is composed of a processing hierarchy and an attention beam that traverses the hierarchy, passing through the regions of greatest interest and inhibiting the regions that are not relevant. The amount of computation required is crucial to the derivation of this model. As a result, this scheme "scales up" extremely well with the size of the problem and in fact scales to human-size problems. In addition, the domain of input to the prototype is not limited to visual stimuli, making this system applicable to many different sensory modalities. Dimensions of attention such as localizing spatial regions of interset and ordering their importance are addressed, whereas other aspects of attention such as the role of task guidance are not....
A Coupled Attractor Model of the Rodent Head Direction System
, 1996
"... . Head direction (HD) cells, abundant in the rat postsubiculum and anterior thalamic nuclei, fire maximally when the rat's head is facing a particular direction. The activity of a population of these cells forms a distributed representation of the animal's current heading. We describe a ..."
Abstract
-
Cited by 27 (3 self)
- Add to MetaCart
.<F3.733e+05> Head direction (HD) cells, abundant in the rat postsubiculum and anterior thalamic nuclei, fire maximally when the rat's head is facing a particular direction. The activity of a population of these cells forms a distributed representation of the animal's current heading. We describe a neural network model that creates a stable, distributed representation of head direction and updates that representation in response to angular velocity information. In contrast to earlier models, our model of the head direction system accurately tracks a series of actual rat head rotations, and, using biologically plausible neurons, it fits the single-cell tuning curves of real HD cells recorded from rats executing those same rotations. The model makes neurophysiological predictions that can be tested using current technologies.<F3.74e+05> Introduction<F3.733e+05> Head direction cells in the postsubiculum (PoS, also known as dorsal presubiculum) were first described by Ranck<F3.967e+05> et...
Primitive Auditory Segregation Based On Oscillatory Correlation
- Cognitive Science
, 1996
"... Auditory scene analysis is critical for complex auditory processing. We study auditory segregation from the neural network perspective, and develop a framework for primitive auditory scene analysis. The architecture is a laterally coupled two-dimensional network of relaxation oscillators with a glob ..."
Abstract
-
Cited by 22 (6 self)
- Add to MetaCart
Auditory scene analysis is critical for complex auditory processing. We study auditory segregation from the neural network perspective, and develop a framework for primitive auditory scene analysis. The architecture is a laterally coupled two-dimensional network of relaxation oscillators with a global inhibitor. One dimension represents time and another one represents frequency. We show that this architecture, plus systematic delay lines, can in real time group auditory features into a stream by phase synchrony and segregate different streams by desynchronization. The network demonstrates a set of psychological phenomena regarding primitive auditory scene analysis, including dependency on frequency proximity and the rate of presentation, sequential capturing, and competition among different perceptual organizations. We offer a neurocomputational theory - shifting synchronization theory - for explaining how auditory segregation might be achieved in the brain, and the psychological pheno...
Neural Representation of Space Using Sinusoidal Arrays
- Neural Computation
, 1993
"... O'Keefe (1991) has proposed that spatial information in rats might be represented as phasors: phase and amplitude of a sine wave encoding angle and distance to a landmark. We describe computer simulations showing that operations on phasors can be efficiently realized by arrays of spiking neurons tha ..."
Abstract
-
Cited by 18 (3 self)
- Add to MetaCart
O'Keefe (1991) has proposed that spatial information in rats might be represented as phasors: phase and amplitude of a sine wave encoding angle and distance to a landmark. We describe computer simulations showing that operations on phasors can be efficiently realized by arrays of spiking neurons that re-code the temporal dimension of the sine wave spatially. Some cells in motor and parietal cortex exhibit response properties compatible with this proposal. 1 Address all correspondence to the first author. Electronic mail address: dst@cs.cmu.edu. This work was supported by a contract from Fujitsu Corporation. Hank Wan and David Redish were supported by NSF Graduate Fellowships. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Fujitsu Corporation, the National Science Foundation, or the U.S. government. Keywords: Neural Modelling, Spatial Reasoning, Pari...
Bilinear Sparse Coding for Invariant Vision
, 2004
"... Recent algorithms for sparse coding and independent component analysis (ICA) have demonstrated how localized features can be learned from natural images. However, these approaches do not take image transformations into account. We describe an unsupervised algorithm for learning both localized featur ..."
Abstract
-
Cited by 17 (1 self)
- Add to MetaCart
Recent algorithms for sparse coding and independent component analysis (ICA) have demonstrated how localized features can be learned from natural images. However, these approaches do not take image transformations into account. We describe an unsupervised algorithm for learning both localized features and their transformations directly from images using a sparse bilinear generative model. We show that from an arbitrary set of natural images, the algorithm produces oriented basis filters that can simultaneously represent features in an image and their transformations. The learned generative model can be used to translate features to different locations, thereby reducing the need to learn the same feature at multiple locations, a limitation of previous approaches to sparse coding and ICA. Our results suggest that by explicitly modeling the interaction between local image features and their transformations, the sparse bilinear approach can provide a basis for achieving transformation-invariant vision.

