Results 1 - 10
of
12
Class-specific, top-down segmentation
- In ECCV
, 2002
"... Abstract. In this paper we present a novel class-based segmentation method, which is guided by a stored representation of the shape of objects within a general class (such as horse images). The approach is different from bottom-up segmentation methods that primarily use the continuity of grey-level, ..."
Abstract
-
Cited by 108 (3 self)
- Add to MetaCart
Abstract. In this paper we present a novel class-based segmentation method, which is guided by a stored representation of the shape of objects within a general class (such as horse images). The approach is different from bottom-up segmentation methods that primarily use the continuity of grey-level, texture, and bounding contours. We show that the method leads to markedly improved segmentation results and can deal with significant variation in shape and varying backgrounds. We discuss the relative merits of class-specific and general image-based segmentation methods and suggest how they can be usefully combined. Keywords: Grouping and segmentation; Figure-ground; Top-down processing; Object classification
Combining top-down and bottom-up segmentation
- In Proceedings IEEE workshop on Perceptual Organization in Computer Vision, CVPR
, 2004
"... In this work we show how to combine bottom-up and topdown approaches into a single figure-ground segmentation process. This process provides accurate delineation of object boundaries that cannot be achieved by either the topdown or bottom-up approach alone. The top-down approach uses object represen ..."
Abstract
-
Cited by 103 (2 self)
- Add to MetaCart
In this work we show how to combine bottom-up and topdown approaches into a single figure-ground segmentation process. This process provides accurate delineation of object boundaries that cannot be achieved by either the topdown or bottom-up approach alone. The top-down approach uses object representation learned from examples to detect an object in a given input image and provide an approximation to its figure-ground segmentation. The bottomup approach uses image-based criteria to define coherent groups of pixels that are likely to belong together to either the figure or the background part. The combination provides a final segmentation that draws on the relative merits of both approaches: The result is as close as possible to the top-down approximation, but is also constrained by the bottom-up process to be consistent with significant image discontinuities. We construct a global cost function that represents these top-down and bottom-up requirements. We then show how the global minimum of this function can be efficiently found by applying the sum-product algorithm. This algorithm also provides a confidence map that can be used to identify image regions where additional top-down or bottom-up information may further improve the segmentation. Our experiments show that the results derived from the algorithm are superior to results given by a pure top-down or pure bottom-up approach. The scheme has broad applicability, enabling the combined use of a range of existing bottom-up and top-down segmentations. 1.
Interleaved object categorization and segmentation
- In BMVC
, 2003
"... Historically, figure-ground segmentation has been seen as an important and even necessary precursor for object recognition. In that context, segmentation is mostly defined as a data driven, that is bottom-up, process. As for humans object recognition and segmentation are heavily intertwined processe ..."
Abstract
-
Cited by 96 (6 self)
- Add to MetaCart
Historically, figure-ground segmentation has been seen as an important and even necessary precursor for object recognition. In that context, segmentation is mostly defined as a data driven, that is bottom-up, process. As for humans object recognition and segmentation are heavily intertwined processes, it has been argued that top-down knowledge from object recognition can and should be used for guiding the segmentation process. In this paper, we present a method for the categorization of unfamiliar objects in difficult real-world scenes. The method generates object hypotheses without prior segmentation that can be used to obtain a category-specific figure-ground segmentation. In particular, the proposed approach uses a probabilistic formulation to incorporate knowledge about the recognized category as well as the supporting information in the image to segment the object from the background. This segmentation can then be used for hypothesis verification, to further improve recognition performance. Experimental results show the capacity of the approach to categorize and segment object categories as diverse as cars and cows. 1
Six Principles for Biologically-Based Computational Models of Cortical Cognition
- TRENDS IN COGNITIVE SCIENCES
, 1998
"... This paper describes and motivates six principles for computational cognitive neuroscience models: biological realism, distributed representations, inhibitory competition, bidirectional activation propagation, errordriven task learning, and Hebbian model learning. Although these principles are suppo ..."
Abstract
-
Cited by 43 (14 self)
- Add to MetaCart
This paper describes and motivates six principles for computational cognitive neuroscience models: biological realism, distributed representations, inhibitory competition, bidirectional activation propagation, errordriven task learning, and Hebbian model learning. Although these principles are supported by a number of cognitive, computational, and biological motivations, the prototypical neural network model (a feedforward backpropagation network) incorporates only two of them, and no widely used model incorporates all of them. This paper argues that these principles should be integrated into a coherent overall framework, and discusses some potential synergies and conflicts in doing so.
Object-based attention and occlusion: Evidence from normal participants and a computational model
- Journal of Experimental Psychology: Human Perception and Performance
, 1998
"... One way of perceptually organizing a complex visual scene is to attend selectively to information in a particular physical location. Another way of reducing the complexity in the input is to attend selectively to an individual object in the scene and to process its elements preferentially. This latt ..."
Abstract
-
Cited by 32 (4 self)
- Add to MetaCart
One way of perceptually organizing a complex visual scene is to attend selectively to information in a particular physical location. Another way of reducing the complexity in the input is to attend selectively to an individual object in the scene and to process its elements preferentially. This latter, object-based attention process was examined, and the predicted superiority for reporting features from 1 relative to 2 objects was replicated in a series of experiments. This object-based process was robust even under conditions of occlusion, although there were some boundary conditions on its operation. Finally, an account of the data is provided via simulations of the findings in a computational model. The claim is that object-based attention arises from a mechanism that groups together those features based on internal representations developed over perceptual experience and then preferentially gates these features for later, selective processing. Humans are exceptionally good at recognizing objects in natural visual scenes despite the fact that such scenes usually contain multiple, overlapping objects. One way in which individuals organize this complex input to minimize the
Figure-ground organization and object recognition processes: An interactive account
- Journal of Experimental Psychology: Human Perception and Performance
, 1998
"... Traditional bottom-up models of visual processing assume that figure-ground organization precedes object recognition. This assumption seems logically necessary: How can object recognition occur before a region is labeled as figure? However, some behavioral studies find that familiar regions are more ..."
Abstract
-
Cited by 15 (4 self)
- Add to MetaCart
Traditional bottom-up models of visual processing assume that figure-ground organization precedes object recognition. This assumption seems logically necessary: How can object recognition occur before a region is labeled as figure? However, some behavioral studies find that familiar regions are more likely to be labeled figure than less familiar regions, a-problematic finding for bottom-up models. An interactive account is proposed in which figure-ground processes receive top-down input from object representations in a hierarchical system. A graded, interactive computational model is presented that accounts for behavioral results in which familiarity effects are found. The interactive model offers an alternative conception of visual processing to bottom-up models. In a typical visual scene multiple objects partially occlude one another, which makes object recognition a computation-ally complex task. Traditional information-processing theo-ries of visual perception have suggested that prior to object representation and recognition, an earlier stage of perceptual organization occurs to determine which features, locations, or surfaces most likely belong together (for examples, see
Toward a biased competition account of object-based segregation and attention
- Brain and Mind
, 2000
"... Abstract. Because the visual system cannot process all of the objects, colors, and features present in a visual scene, visual attention allows some visual stimuli to be selected and processed over others. Most research on visual attention has focused on spatial or location-based attention, in which ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
Abstract. Because the visual system cannot process all of the objects, colors, and features present in a visual scene, visual attention allows some visual stimuli to be selected and processed over others. Most research on visual attention has focused on spatial or location-based attention, in which the locations occupied by stimuli are selected for further processing. Recent research, however, has demonstrated the importance of objects in organizing (or segregating) visual scenes and guiding attentional selection. Because of the long history of spatial attention research, theories of spatial attention are more mature than theories of other visual processes, such as object segregation and object attention. In the present paper, I outline a biased competition account of object segregation and attention, following similar accounts that have been developed for spatial attention (Desimone and Duncan, 1995). In my biased competition account, I seek to understand how some objects can be segregated and selected over other objects in a complex visual scene. Under this account, there are two sources of visual information that allow an object to be processed over other objects: bottom-up information carried by the physical stimulus and top-down information based on an observer’s goals. I use the biased competition account to combine many diverse findings from the object segregation and attention literatures into a common framework.
Computational Models of Perceptual Organization
- Robotics Institute, Carnegie Mellon University
, 2003
"... Perceptual organization refers to the process of organizing sensory input into coherent and interpretable perceptual structures. This process is challenging due to the chicken-and-egg nature between the various sub-processes such as image segmentation, figure-ground segregation and object recognitio ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Perceptual organization refers to the process of organizing sensory input into coherent and interpretable perceptual structures. This process is challenging due to the chicken-and-egg nature between the various sub-processes such as image segmentation, figure-ground segregation and object recognition. Low-level processing requires the guidance of high-level knowledge to overcome noise; while high-level processing relies on low-level processes to reduce the computational complexity. Neither process can be sufficient on its own. Consequently, any system that carries out these processes in a sequence is bound to be brittle. An alternative system is one in which all processes interact with each other simultaneously. In this thesis, we develop a set of simple yet realistic interactive processing models for perceptual organization. We model the processing in the framework of spectral graph theory, with a criterion encoding the overall goodness of perceptual organization. We derive fast solutions for near-global optima of the criterion, and demonstrate the efficacy of the models on segmenting a wide range of real images. Through these models, we are able to capture a variety of perceptual phenomena: a unified treatment of various grouping, figure-ground and depth cues to produce popout, region segmentation and depth segregation in one step; and a unified framework for integrating bottom-up and top-down information to produce an object segmentation from spatial and object attention. We achieve these goals by empowering current spectral graph methods with a principled solution for multiclass spectral graph partitioning; expanded repertoire of grouping cues to include similarity, dissimilarity and ordering relationships; a theory for integrating sparse grouping cues; and a model ...
Geometric and neural models of object perception
- In T. Shipley & P. Kellman (Eds.), From
, 2001
"... It is an exciting time to study visual object perception. Although object perception research has a long tradition, lately its visibility in cognitive science and neuroscience has greatly increased. One reason for heightened interest is that diverse areas of research now suggest a central role for o ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
It is an exciting time to study visual object perception. Although object perception research has a long tradition, lately its visibility in cognitive science and neuroscience has greatly increased. One reason for heightened interest is that diverse areas of research now suggest a central role for objects in many aspects of human cognition, including the

