Results 1 - 10
of
49
Recognition-by-components: A theory of human image understanding
- Psychological Review
, 1987
"... The perceptual recognition of objects is conceptualized to be a process in which the image of the input is segmented at regions of deep concavity into an arrangement of simple geometric components, such as blocks, cylinders, wedges, and cones. The fundamental assumption of the proposed theory, recog ..."
Abstract
-
Cited by 550 (8 self)
- Add to MetaCart
The perceptual recognition of objects is conceptualized to be a process in which the image of the input is segmented at regions of deep concavity into an arrangement of simple geometric components, such as blocks, cylinders, wedges, and cones. The fundamental assumption of the proposed theory, recognition-by-components (RBC), is that a modest set of generalized-cone components, called geons (N ^ 36), can be derived from contrasts of five readily detectable properties of edges in a two-dimensional image: curvature, collinearity, symmetry, parallelism, and cotermmation. The detection of these properties is generally invariant over viewing position and image quality and consequently allows robust object perception when the image is projected from a novel viewpoint or is degraded. RBC thus provides a principled account of the heretofore undecided relation between the classic principles of perceptual organization and pattern recognition: The constraints toward regularization (Pragnanz) characterize not the complete object but the object's components. Representational power derives from an allowance of free combinations of the geons. A Principle of Componential Recovery can account for the major phenomena of object recognition: If an arrangement of two or three geons can be recovered from the input, objects can be quickly recognized even when they are occluded, novel, rotated in depth, or extensively degraded. The results from experiments on the perception of briefly presented pictures by human observers provide empirical support for the theory. Any single object can project an infinity of image configura-tions to the retina. The orientation of the object to the viewer can vary continuously, each giving rise to a different two-dimen-sional projection. The object can be occluded by other objects or texture fields, as when viewed behind foliage. The object need not be presented as a full-colored textured image but in-stead can be a simplified line drawing. Moreover, the object can even be missing some of its parts or be a novel exemplar of its
Conjunction search revisited
- Journal of Experimental Psychology: Human Perception and Performance
, 1990
"... Search for conjunctions of highly discriminable features can be rapid or even parallel. This article explores, three possible accounts based on (a) perceptual segregation, (b) conjunction detectors, and (c) inhibition controlled separately by two or more distractor features. Search rates for conjunc ..."
Abstract
-
Cited by 86 (1 self)
- Add to MetaCart
Search for conjunctions of highly discriminable features can be rapid or even parallel. This article explores, three possible accounts based on (a) perceptual segregation, (b) conjunction detectors, and (c) inhibition controlled separately by two or more distractor features. Search rates for conjunctions of color, size, orientation, and direction of motion correlated closely with an independent measure of perceptual segregation. However, they appeared unrelated to the physi-ology of single-unit responses. Each dimension contributed additively to conjunction search rates, suggesting that each was checked independently of the others. Unknown targets appear to be found only by serial search for each in turn. Searching through 4 sets of distractors was slower than searching through 2. The results suggest a modification of feature integration theory, in which attention is controlled not only by a unitary "window " but also by a form of feature-based inhibition. Objects in the real world vary in a large number of prop-erties, at least some of which appear to be coded by special-ized, independent channels or modules in the perceptual
The Role of the Primary Visual Cortex in Higher Level Vision
, 1998
"... In the classical feed-forward, modular view of visual processing, the primary visual cortex (area V1) is a module that serves to extract local features such as edges and bars. Representation and recognition of objects are thought to be functions of higher extrastriate cortical areas. This paper pres ..."
Abstract
-
Cited by 67 (3 self)
- Add to MetaCart
In the classical feed-forward, modular view of visual processing, the primary visual cortex (area V1) is a module that serves to extract local features such as edges and bars. Representation and recognition of objects are thought to be functions of higher extrastriate cortical areas. This paper presents neurophysiological data that show the later part of V1 neurons' responses reflecting higher order perceptual computations related to Ullman's (Cognition 1984;18:97 -- 159) visual routines and Marr's (Vision NJ: Freeman 1982) full primal sketch, 2 1 2 D sketch and 3D model. Based on theoretical reasoning and the experimental evidence, we propose a possible reinterpretation of the functional role of V1. In this framework, because of V1 neurons' precise encoding of orientation and spatial information, higher level perceptual computations and representations that involve high resolution details, fine geometry and spatial precision would necessarily involve V1 and be reflected in the later...
Visual Attention
- In B. Goldstein (Ed.), Blackwell Handbook of Perception
, 2001
"... Spatial attention: Visual selection and deployment over space The attentional spotlight and spatial cueing Attentional shifts, splits, and resolution Object-based Selection The visual search paradigm Top-down and bottom-up control of attention Inhibitory mechanisms of attention Invalid cueing Negati ..."
Abstract
-
Cited by 47 (2 self)
- Add to MetaCart
Spatial attention: Visual selection and deployment over space The attentional spotlight and spatial cueing Attentional shifts, splits, and resolution Object-based Selection The visual search paradigm Top-down and bottom-up control of attention Inhibitory mechanisms of attention Invalid cueing Negative priming Inhibition of return Temporal attention: Visual selection and deployment over time Single target search Attentional blink and attentional dwell time Repetition blindness NEURAL MECHANISMS OF SELECTION Single-cell physiological method Event-related potentials Functional imaging: PET and fMRI
Collisions and Perception
- ACM Transactions on Graphics
, 2001
"... Level of Detail (LoD) techniques for real-time... In this paper, we are particularly interested in the problem of realistic collision simulation in scenes where large numbers of objects are colliding and processing must occur in real-time. An interruptible and therefore degradable collision handling ..."
Abstract
-
Cited by 42 (4 self)
- Add to MetaCart
Level of Detail (LoD) techniques for real-time... In this paper, we are particularly interested in the problem of realistic collision simulation in scenes where large numbers of objects are colliding and processing must occur in real-time. An interruptible and therefore degradable collision handling mechanism is used and the perceptual impact of this degradation is explored. We look for ways in which we can optimise the realism of such simulations and describe a series of psychophysical experiments that investigated different factors affecting collision perception, including eccentricity, separation, distractors, causality and accuracy of physical response. Finally, strategies for incorporating these factors into a perceptually adaptive real-time simulation of large numbers of visually similar objects are presented.
Feature binding, attention and object perception
, 1998
"... The seemingly effortless ability to perceive meaningful objects in an integrated scene actually depends on complex visual processes. The `binding problem' concerns the way in which we select and integrate the separate features of objects in the correct combinations. Experiments suggest that attentio ..."
Abstract
-
Cited by 38 (1 self)
- Add to MetaCart
The seemingly effortless ability to perceive meaningful objects in an integrated scene actually depends on complex visual processes. The `binding problem' concerns the way in which we select and integrate the separate features of objects in the correct combinations. Experiments suggest that attention plays a central role in solving this problem. Some neurological patients show a dramatic breakdown in the ability to see several objects; their deficits suggest a role for the parietal cortex inthe binding process. However, indirect measures of priming and interference suggest that more information may be implicitly available than we can consciously access.
Learning to Segment Images Using Dynamic Feature Binding
- Neural Computation
, 1991
"... Despite the fact that complex visual scenes contain multiple, overlapping objects, people perform object recognition with ease and accuracy. One operation that facilitates recognition is an early segmentation process in which features of objects are grouped and labeled according to which object t ..."
Abstract
-
Cited by 36 (9 self)
- Add to MetaCart
Despite the fact that complex visual scenes contain multiple, overlapping objects, people perform object recognition with ease and accuracy. One operation that facilitates recognition is an early segmentation process in which features of objects are grouped and labeled according to which object they belong. Current computational systems that perform this operation are based on predefined grouping heuristics. We describe a system called MAGIC that learns how to group features based on a set of presegmented examples. In many cases, MAGIC discovers grouping heuristics similar to those previously proposed, but it also has the capability of finding nonintuitive structural regularities in images. Grouping is performed by a relaxation network that attempts to dynamically bind related features. Features transmit a complex-valued signal (amplitude and phase) to one another; binding can thus be represented by phase locking related features. MAGIC's training procedure is a generalizatio...
Filtering by movement in visual search
- Journal of Experimental Psychology: Human Perception & Performance
, 1991
"... Search for a target defined by a conjunction of movement and form (e.g., an X moving up in a display of intermingled Os moving up and stationary Xs) is parallel. This result is also found if (a) the moving Os and target X move in unpredictable directions so that the moving stimuli do not form a clea ..."
Abstract
-
Cited by 24 (0 self)
- Add to MetaCart
Search for a target defined by a conjunction of movement and form (e.g., an X moving up in a display of intermingled Os moving up and stationary Xs) is parallel. This result is also found if (a) the moving Os and target X move in unpredictable directions so that the moving stimuli do not form a clear perceptual group or (b) the nontarget Xs also move but in a known, different direction from the Os and target X. In contrast, search is slow and serial if the target may be unpredictably among either moving or stationary stimuli. These results suggest that a component of the visual system operates as a movement filter that can direct attention to stimuli with a common movement characteristic. The filtering cue can be moving (vs. stationary), or movement in 1 particular direction. The results do not support the view that attention can only be directed to groups defined by common fate. McLeod, Driver, and Crisp (1988) showed that search for a target defined by a conjunction of movement and form (e.g., an X moving up among an intermingled set of Os moving up and stationary Xs) was parallel (i.e., detection time was independent of the number of nontarget stimuli). This
Consumer Eye Movement Patterns on Yellow Pages Advertising
, 1997
"... Process tracing data help understand how yellow pages advertisement characteristics influence consumer information processing behavior. A laboratory experiment collected eye movement data while consumers chose businesses from phone directories. Consumers scan listings in alphabetic order. Their scan ..."
Abstract
-
Cited by 23 (2 self)
- Add to MetaCart
Process tracing data help understand how yellow pages advertisement characteristics influence consumer information processing behavior. A laboratory experiment collected eye movement data while consumers chose businesses from phone directories. Consumers scan listings in alphabetic order. Their scan is not exhaustive. As a result, some ads are never seen. Consumers noticed over 93% of the quarter page display ads but only 26% of the plain listings. Consumers perceived color ads before ads without color, noticed more color ads than non-color ads and viewed color ads 21% longer than equivalent ads without color. Users viewed 42% more bold listings than plain listings. Consumers spent 54% more time viewing ads they end up choosing which demonstrates the importance of attention on subsequent choice behavior. 1 INTRODUCTION In 1992, yellow pages directories were a $9.4 billion dollar information services business that reached 98% of American households (Mangel 1992). It is the fourth larg...
Visual Search and Mouse Pointing in Labeled Versus Unlabeled Two-Dimensional Visual Hierarchies
, 2001
"... An experiment investigates (1) how the physical structure of a computer screen layout affects visual search and (2) how people select a found target object with a mouse. Two structures are examinedlabeled visual hierarchies (groups of objects with one label per group) and unlabeled visual hierarchie ..."
Abstract
-
Cited by 22 (14 self)
- Add to MetaCart
An experiment investigates (1) how the physical structure of a computer screen layout affects visual search and (2) how people select a found target object with a mouse. Two structures are examinedlabeled visual hierarchies (groups of objects with one label per group) and unlabeled visual hierarchies (groups without labels). Search and selection times were separated by imposing a point-completion deadline that discouraged participants from moving the mouse until they found the target. The observed search times indicate that labeled visual hierarchies can be searched much more efficiently than unlabeled visual hierarchies, and suggest that people use a fundamentally different strategy for each of the two structures. The results have implications for screen layout design and cognitive modeling of visual search. The observed mouse pointing times suggest that people use a slower and more accurate speed-accuracy operating characteristic to select a target with a mouse when visual distractor...

