• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

P.H.S.: Graph Cut Based Inference with Co-occurrence Statistics. (2010)

by L Ladicky, C Russell, P Kohli, Torr
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 100
Next 10 →

Fast approximate energy minimization with label costs

by Andrew Delong, Anton Osokin, Hossam N. Isack, Yuri Boykov , 2010
"... The α-expansion algorithm [7] has had a significant impact in computer vision due to its generality, effectiveness, and speed. Thus far it can only minimize energies that involve unary, pairwise, and specialized higher-order terms. Our main contribution is to extend α-expansion so that it can simult ..."
Abstract - Cited by 110 (9 self) - Add to MetaCart
The α-expansion algorithm [7] has had a significant impact in computer vision due to its generality, effectiveness, and speed. Thus far it can only minimize energies that involve unary, pairwise, and specialized higher-order terms. Our main contribution is to extend α-expansion so that it can simultaneously optimize “label costs ” as well. An energy with label costs can penalize a solution based on the set of labels that appear in it. The simplest special case is to penalize the number of labels in the solution. Our energy is quite general, and we prove optimality bounds for our algorithm. A natural application of label costs is multi-model fitting, and we demonstrate several such applications in vision: homography detection, motion segmentation, and unsupervised image segmentation. Our C++/MATLAB implementation is publicly available.
(Show Context)

Citation Context

...me form. For example, in [18] we proposed our subset costs in (⋆) as a form of co-occurrence cost in object recognition. This application was thoroughly and independently developed by Ladick´y et al. =-=[39]-=-, also within an α-expansion framework but with a heuristic extension; see Section 7 for discussion. Others have independently proposed label cost energies for specific applications. For example, we l...

Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images

by Saurabh Gupta, Pablo Arbeláez, Jitendra Malik
"... We address the problems of contour detection, bottomup grouping and semantic segmentation using RGB-D data. We focus on the challenging setting of cluttered indoor scenes, and evaluate our approach on the recently introduced NYU-Depth V2 (NYUD2) dataset [27]. We propose algorithms for object boundar ..."
Abstract - Cited by 48 (3 self) - Add to MetaCart
We address the problems of contour detection, bottomup grouping and semantic segmentation using RGB-D data. We focus on the challenging setting of cluttered indoor scenes, and evaluate our approach on the recently introduced NYU-Depth V2 (NYUD2) dataset [27]. We propose algorithms for object boundary detection and hierarchical segmentation that generalize the gP b − ucm approach of [2] by making effective use of depth information. We show that our system can label each contour with its type (depth, normal or albedo). We also propose a generic method for long-range amodal completion of surfaces and show its effectiveness in grouping. We then turn to the problem of semantic segmentation and propose a simple approach that classifies superpixels into the 40 dominant object categories in NYUD2. We use both generic and class-specific features to encode the appearance and geometry of objects. We also show how our approach can be used for scene classification, and how this contextual information in turn improves object recognition. In all of these tasks, we report significant improvements over the state-of-the-art. 1.
(Show Context)

Citation Context

.... We observe that approaches like [4, 1], which focus on classifying bottom-up regions candidates using strong features on the region have obtained significantly better results than MRF-based methods =-=[16]-=-. We build on this motivation and propose new features to represent bottom-up region proposals (which in our case are non-overlapping superpixels and their amodal completion), and use randomized decis...

Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials

by Vladlen Koltun
"... Most state-of-the-art techniques for multi-class image segmentation and labeling use conditional random fields defined over pixels or image regions. While regionlevel models often feature dense pairwise connectivity, pixel-level models are considerably larger and have only permitted sparse graph str ..."
Abstract - Cited by 46 (2 self) - Add to MetaCart
Most state-of-the-art techniques for multi-class image segmentation and labeling use conditional random fields defined over pixels or image regions. While regionlevel models often feature dense pairwise connectivity, pixel-level models are considerably larger and have only permitted sparse graph structures. In this paper, we consider fully connected CRF models defined on the complete set of pixels in an image. The resulting graphs have billions of edges, making traditional inference algorithms impractical. Our main contribution is a highly efficient approximate inference algorithm for fully connected CRF models in which the pairwise edge potentials are defined by a linear combination of Gaussian kernels. Our experiments demonstrate that dense connectivity at the pixel level substantially improves segmentation and labeling accuracy. 1
(Show Context)

Citation Context

...ons on which the model operates. This limits the ability of region-based approaches to produce accurate label assignments around complex object boundaries, although significant progress has been made =-=[9, 13, 14]-=-. In this paper, we explore a different model structure for accurate semantic segmentation and labeling. We use a fully connected CRF that establishes pairwise potentials on all pairs of pixels in the...

Describing the Scene as a Whole: Joint Object Detection, Scene Classification and Semantic Segmentation

by Jian Yao, Sanja Fidler, Raquel Urtasun
"... In this paper we propose an approach to holistic scene understanding that reasons jointly about regions, location, class and spatial extent of objects, presence of a class in the image, as well as the scene type. Learning and inference in our model are efficient as we reason at the segment level, an ..."
Abstract - Cited by 39 (12 self) - Add to MetaCart
In this paper we propose an approach to holistic scene understanding that reasons jointly about regions, location, class and spatial extent of objects, presence of a class in the image, as well as the scene type. Learning and inference in our model are efficient as we reason at the segment level, and introduce auxiliary variables that allow us to decompose the inherent high-order potentials into pairwise potentials between a few variables with small number of states (at most the number of classes). Inference is done via a convergent message-passing algorithm, which, unlike graph-cuts inference, has no submodularity restrictions and does not require potential specific moves. We believe this is very important, as it allows us to encode our ideas and prior knowledge about the problem without the need to change the inference engine every time we introduce a new potential. Our approach outperforms the state-of-the-art on the MSRC-21 benchmark, while being much faster. Importantly, our holistic model is able to improve performance in all tasks. 1.
(Show Context)

Citation Context

...e: CITY body chair bird book cow dog car cat boat flower bird chair body boat car book sign cow sign scene type: CITY While there has been significant progress in solving tasks such as image labeling =-=[14]-=-, object detection [5] and scene classification [26], existing approaches could benefit from solving these problems jointly [9]. For example, segmentation should be easier if we know where the object ...

Submodularity beyond submodular energies: coupling edges in graph cuts

by Stefanie Jegelka, Jeff Bilmes - IN CVPR , 2011
"... We propose a new family of non-submodular global energy functions that still use submodularity internally to couple edges in a graph cut. We show it is possible to develop an efficient approximation algorithm that, thanks to the internal submodularity, can use standard graph cuts as a subroutine. We ..."
Abstract - Cited by 32 (17 self) - Add to MetaCart
We propose a new family of non-submodular global energy functions that still use submodularity internally to couple edges in a graph cut. We show it is possible to develop an efficient approximation algorithm that, thanks to the internal submodularity, can use standard graph cuts as a subroutine. We demonstrate the advantages of edge coupling in a natural setting, namely image segmentation. In particular, for finestructured objects and objects with shading variation, our structured edge coupling leads to significant improvements over standard approaches.

Energy based multiple model fitting for non-rigid structure from motion

by Chris Russell, Joao Fayad, Lourdes Agapito - In Proceedings of IEEE Conference on Computer Vision and Pattern , 2007
"... In this paper we reformulate the 3D reconstruction of deformable surfaces from monocular video sequences as a labeling problem. We solve simultaneously for the assignment of feature points to multiple local deformation models and the fitting of models to points to minimize a geometric cost, subject ..."
Abstract - Cited by 21 (7 self) - Add to MetaCart
In this paper we reformulate the 3D reconstruction of deformable surfaces from monocular video sequences as a labeling problem. We solve simultaneously for the assignment of feature points to multiple local deformation models and the fitting of models to points to minimize a geometric cost, subject to a spatial constraint that neighboring points should also belong to the same model. Piecewise reconstruction methods rely on features shared between models to enforce global consistency on the 3D surface. To account for this overlap between regions, we consider a super-set of the classic labeling problem in which a set of labels, instead of a single one, is assigned to each variable. We propose a mathematical formulation of this new model and show how it can be efficiently optimized with a variant of α-expansion. We demonstrate how this framework can be applied to Non-Rigid Structure from Motion and leads to simpler explanations of the same data. Compared to existing methods run on the same data, our approach has up to half the reconstruction error, and is more robust to over-fitting and outliers. 1.
(Show Context)

Citation Context

...ly increasing cost which is linear (proposed in [12] and used in [2]), concave [8] with optimal moves proposed by α-expansion, or an arbitrary monotone increasing with sub-optimal moves by αexpansion =-=[16]-=-. A significant contribution of these works, was in proving that these label set costs could be efficiently solved with α-expansion. We make use of this in Sections 4 and 5.1 by showing how the costs ...

Filter-based mean-field inference for random fields with higher order terms and product labelspaces

by Vibhav Vineet, Jonathan Warrell, Philip H. S. Torr - In ECCV , 2012
"... Abstract. Recently, a number of cross bilateral filtering methods have been proposed for solving multi-label problems in computer vision, such as stereo, optical flow and object class segmentation that show an order of magnitude improvement in speed over previous methods. These methods have achieved ..."
Abstract - Cited by 17 (5 self) - Add to MetaCart
Abstract. Recently, a number of cross bilateral filtering methods have been proposed for solving multi-label problems in computer vision, such as stereo, optical flow and object class segmentation that show an order of magnitude improvement in speed over previous methods. These methods have achieved good results despite using models with only unary and/or pairwise terms. However, previous work has shown the value of using models with higher-order terms e.g. to represent label consistency over large regions, or global co-occurrence relations. We show how these higher-order terms can be formulated such that filter-based inference remains possible. We demonstrate our techniques on joint stereo and object labeling problems, as well as object class segmentation, showing in addition for joint object-stereo labeling how our method provides an efficient approach to inference in product label-spaces. We show that we are able to speed up inference in these models around 10-30 times with respect to competing graph-cut/move-making methods, as well as maintaining or improving accuracy in all cases. We show results on PascalVOC-10 for object class segmentation, and Leuven for joint object-stereo labeling. 1
(Show Context)

Citation Context

...homogeneous regions has been demonstrated using P n -Potts models [7], and co-occurrence relations between classes at the image level have also been shown to provide important priors for segmentation =-=[8]-=-. For stereo and optical flow, second-order priors have proved to be effective [9], as have higher-order image priors for denoising [10]. In this paper, we propose a number of methods by which higher-...

Markov Random Field Modeling, Inference & Learning in Computer Vision & Image Understanding: A Survey

by Chaohui Wang , Nikos Komodakis , Nikos Paragios , 2013
"... ..."
Abstract - Cited by 15 (6 self) - Add to MetaCart
Abstract not found

Discriminative re-ranking of diverse segmentations

by Payman Yadollahpour, Dhruv Batra, Gregory Shakhnarovich - In CVPR , 2013
"... This paper introduces a two-stage approach to semantic image segmentation. In the first stage a probabilistic model generates a set of diverse plausible segmentations. In the second stage, a discriminatively trained re-ranking model selects the best segmentation from this set. The re-ranking stage c ..."
Abstract - Cited by 14 (1 self) - Add to MetaCart
This paper introduces a two-stage approach to semantic image segmentation. In the first stage a probabilistic model generates a set of diverse plausible segmentations. In the second stage, a discriminatively trained re-ranking model selects the best segmentation from this set. The re-ranking stage can use much more complex features than what could be tractably used in the probabilistic model, allowing a better exploration of the solution space than possible by simply producing the most probable solution from the probabilistic model. While our proposed approach already achieves state-of-the-art results (48.1%) on the challenging VOC 2012 dataset, our machine and human analyses suggest that even larger gains are possible with such an approach. 1.
(Show Context)

Citation Context

... and superpixels, and incorporates many different potentials such as unary potentials based on textonboost features, Pn Potts terms between pixels and superpixels and a global co-occurrence potential =-=[19]-=-. Much of the complex dependency between regions of the image is captured by the graph structure of the CRF and high-order cliques. In contrast, O2P incorporates high-order dependencies between region...

A Hierarchical Conditional Random Field Model for Labeling and Segmenting Images of Street Scenes

by Qixing Huang, Mei Han, Bo Wu, Sergey Ioffe
"... Simultaneously segmenting and labeling images is a fundamental problem in Computer Vision. In this paper, we introduce a hierarchical CRF model to deal with the problem of labeling images of street scenes by several distinctive object classes. In addition to learning a CRF model from all the labeled ..."
Abstract - Cited by 12 (0 self) - Add to MetaCart
Simultaneously segmenting and labeling images is a fundamental problem in Computer Vision. In this paper, we introduce a hierarchical CRF model to deal with the problem of labeling images of street scenes by several distinctive object classes. In addition to learning a CRF model from all the labeled images, we group images into clusters of similar images and learn a CRF model from each cluster separately. When labeling a new image, we pick the closest cluster and use the associated CRF model to label this image. Experimental results show that this hierarchical image labeling method is comparable to, and in many cases superior to, previous methods on benchmark data sets. In addition to segmentation and labeling results, we also showed how to apply the image labeling result to rerank Google similar images. 1.
(Show Context)

Citation Context

...cognition. The goal is to assign every pixel of the image with an object class label. Most solutions fall into two general categories: parametric methods and nonparametric methods. Parametric methods =-=[2, 4, 7, 12, 14, 17, 18]-=- usually involve optimizing a Conditional Random Field (CRF) model which evaluates the probability of assigning a particular label to each pixel, and the probability of assigning each pair of labels t...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University