Results 1 - 10
of
24
Bilateral filtering-based optical flow estimation with occlusion detection
- In ECCV, volume I
, 2006
"... Abstract. Using the variational approaches to estimate optical flow between two frames, the flow discontinuities between different motion fields are usually not distinguished even when an anisotropic diffusion operator is applied. In this paper, we propose a multi-cue driven adaptive bilateral filte ..."
Abstract
-
Cited by 26 (0 self)
- Add to MetaCart
Abstract. Using the variational approaches to estimate optical flow between two frames, the flow discontinuities between different motion fields are usually not distinguished even when an anisotropic diffusion operator is applied. In this paper, we propose a multi-cue driven adaptive bilateral filter to regularize the flow computation, which is able to achieve the smoothly varied optical flow field with highly desirable motion discontinuities. First, we separate the traditional one-step variational updating model into a two-step filtering-based updating model. Then, employing our occlusion detector, we reformulate the energy functional of optical flow estimation by explicitly introducing an occlusion term to balance the energy loss due to the occlusion or mismatches. Furthermore, based on the twostep updating framework, a novel multi-cue driven bilateral filter is proposed to substitute the original anisotropic diffusion process, and it is able to adaptively control the diffusion process according to the occlusion detection, image intensity dissimilarity, and motion dissimilarity. After applying our approach on various video sources (movie and TV) in the presence of occlusion, motion blurring, non-rigid deformation, and weak textureness, we generate a spatial-coherent flow field between each pair of input frames and detect more accurate flow discontinuities along the motion boundaries. 1
Human-Assisted Motion Annotation
"... Figure 1. We designed a system to allow the user to specify layer configurations and motion hints (b). Our system uses these hints to calculate a dense flow field for each layer. We show that the flow (c) is repeatable and accurate. (d): The output of a representative optical flow algorithm [8], tra ..."
Abstract
-
Cited by 17 (4 self)
- Add to MetaCart
Figure 1. We designed a system to allow the user to specify layer configurations and motion hints (b). Our system uses these hints to calculate a dense flow field for each layer. We show that the flow (c) is repeatable and accurate. (d): The output of a representative optical flow algorithm [8], trained on the Yosemite sequence, shows many differences from the labeled ground truth for this and other realistic sequences we have labeled. This indicates the value of our database for training and evaluating optical flow algorithms. Obtaining ground-truth motion for arbitrary, real-world video sequences is a challenging but important task for both algorithm evaluation and model design. Existing groundtruth databases are either synthetic, such as the Yosemite sequence, or limited to indoor, experimental setups, such as the database developed in [5]. We propose a human-inloop methodology to create a ground-truth motion database for the videos taken with ordinary cameras in both indoor and outdoor scenes, using the fact that human beings are experts at segmenting objects and inspecting the match between two frames. We designed an interactive computer vision system to allow a user to efficiently annotate motion. Our methodology is cross-validated by showing that human annotated motion is repeatable, consistent across annotators, and close to the ground truth obtained by [5]. Using our system, we collected and annotated 10 indoor and outdoor real-world videos to form a ground-truth motion database. The source code, annotation tool and database is online for public evaluation and benchmarking. 1.
Consistent Depth Maps Recovery from a Video Sequence
, 2009
"... This paper presents a novel method for recovering consistent depth maps from a video sequence. We propose a bundle optimization framework to address the major difficulties in stereo reconstruction, such as dealing with image noise, occlusions, and outliers. Different from the typical multiview stere ..."
Abstract
-
Cited by 12 (5 self)
- Add to MetaCart
This paper presents a novel method for recovering consistent depth maps from a video sequence. We propose a bundle optimization framework to address the major difficulties in stereo reconstruction, such as dealing with image noise, occlusions, and outliers. Different from the typical multiview stereo methods, our approach not only imposes the photo-consistency constraint, but also explicitly associates the geometric coherence with multiple frames in a statistical way. It thus can naturally maintain the temporal coherence of the recovered dense depth maps without oversmoothing. To make the inference tractable, we introduce an iterative optimization scheme by first initializing the disparity maps using a segmentation prior and then refining the disparities by means of bundle optimization. Instead of defining the visibility parameters, our method implicitly models the reconstruction noise as well as the probabilistic visibility. After bundle optimization, we introduce an efficient space-time fusion algorithm to further reduce the reconstruction noise. Our automatic depth recovery is evaluated using a variety of challenging video examples.
Learning optical flow
- In Proc. ECCV
, 2008
"... Abstract. Assumptions of brightness constancy and spatial smoothness underlie most optical flow estimation methods. In contrast to standard heuristic formulations, we learn a statistical model of both brightness constancy error and the spatial properties of optical flow using image sequences with as ..."
Abstract
-
Cited by 8 (3 self)
- Add to MetaCart
Abstract. Assumptions of brightness constancy and spatial smoothness underlie most optical flow estimation methods. In contrast to standard heuristic formulations, we learn a statistical model of both brightness constancy error and the spatial properties of optical flow using image sequences with associated ground truth flow fields. The result is a complete probabilistic model of optical flow. Specifically, the ground truth enables us to model how the assumption of brightness constancy is violated in naturalistic sequences, resulting in a probabilistic model of “brightness inconstancy”. We also generalize previous high-order constancy assumptions, such as gradient constancy, by modeling the constancy of responses to various linear filters in a high-order random field framework. These filters are free variables that can be learned from training data. Additionally we study the spatial structure of the optical flow and how motion boundaries are related to image intensity boundaries. Spatial smoothness is modeled using a Steerable Random Field, where spatial derivatives of the optical flow are steered by the image brightness structure. These models provide a statistical motivation for previous methods and enable the learning of all parameters from training data. All proposed models are quantitatively compared on the Middlebury flow dataset. 1
Variational Optic Flow Computation: From Continuous Models to Algorithms
- International Workshop on Computer Vision and Image Analysis (ed. L. Alvarez), IWCVIA’03, Las Palmas de Gran Canaria
, 2003
"... Variational methods belong to the most successful techniques for computing the displacement field in image sequences. In this paper we analyse the different terms in the energy functional and sketch some of our recent contributions in this area. ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
Variational methods belong to the most successful techniques for computing the displacement field in image sequences. In this paper we analyse the different terms in the energy functional and sketch some of our recent contributions in this area.
Sparse Occlusion Detection with Optical Flow
- INT J COMPUT VIS
, 2011
"... We tackle the problem of detecting occluded regions in a video stream. Under assumptions of Lambertian reflection and static illumination, the task can be posed as a variational optimization problem, and its solution approximated using convex minimization. We describe efficient numerical schemes tha ..."
Abstract
-
Cited by 4 (3 self)
- Add to MetaCart
We tackle the problem of detecting occluded regions in a video stream. Under assumptions of Lambertian reflection and static illumination, the task can be posed as a variational optimization problem, and its solution approximated using convex minimization. We describe efficient numerical schemes that reach the global optimum of the relaxed cost functional, for any number of independently moving objects, and any number of occlusion layers. We test the proposed algorithm on benchmark datasets, expanded to enable evaluation of occlusion detection performance, in addition to optical flow.
A Survey on Variational Optic Flow Methods for Small Displacements
, 2005
"... Optic flow describes the displacement field in an image sequence. Its reliable computation constitutes one of the main challenges in computer vision, and variational methods belong to the most successful techniques for achieving this goal. Variational methods recover the optic flow field as a minimi ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Optic flow describes the displacement field in an image sequence. Its reliable computation constitutes one of the main challenges in computer vision, and variational methods belong to the most successful techniques for achieving this goal. Variational methods recover the optic flow field as a minimiser of a suitable energy functional that involves data and smoothness terms. In this paper we present a survey on different model assumptions for each of these terms and illustrate their impact by experiments. We restrict ourselves to rotationally invariant convex functionals with a linearised data term. Such models are appropriate for small displacements. Regarding the data term, constancy assumptions on the brightness, the gradient, the Hessian, the gradient magnitude, the Laplacian, and the Hessian determinant are investigated. Local integration and nonquadratic penalisation are considered in order to improve robustness under noise. With respect to the smoothness term, we review a recent taxonomy that links regularisers to diffusion processes. It allows to distinguish five types of regularisation strategies: homogeneous, isotropic image-driven, anisotropic image-driven,
Multi-image Interpolation based on Graph-cuts and Symmetric Optical Flow Technical details
"... Our correspondence estimation algorithm is based on the approach presented by Steinbruecker et al. [Steinbruecker et al. 2009a]. This approach separates the data-term, i.e. the brightness constancy assumption I1 − I2(x + w1,2) ≈ 0, and the smoothness-term, i.e. ∇w1,2 ≈ ⃗0, that are the basis for th ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
Our correspondence estimation algorithm is based on the approach presented by Steinbruecker et al. [Steinbruecker et al. 2009a]. This approach separates the data-term, i.e. the brightness constancy assumption I1 − I2(x + w1,2) ≈ 0, and the smoothness-term, i.e. ∇w1,2 ≈ ⃗0, that are the basis for the estimation of the correspondence map w1,2. It hence allows for the integration of arbitrary data-terms, especially data-terms that are not differentiable. Steinbruecker et al. already showed how to integrate patch-based dataterms into this framework [Steinbruecker et al. 2009b]. The key idea of this approach is based on the work of Zach et al. [Zach et al. 2007] where instead of direct minimization of the total-variation L1 formulation min w1,2 Z α|I1 − I2(x + w1,2) | + |∇w1,2 | dx, (1)
Segmentation Framework Based on Label Field Fusion
"... Abstract—In this paper, we put forward a novel fusion framework that mixes together label fields instead of observation data as is usually the case. Our framework takes as input two label fields: a quickly estimated and to-be-refined segmentation map and a spatial region map that exhibits the shape ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
Abstract—In this paper, we put forward a novel fusion framework that mixes together label fields instead of observation data as is usually the case. Our framework takes as input two label fields: a quickly estimated and to-be-refined segmentation map and a spatial region map that exhibits the shape of the main objects of the scene. These two label fields are fused together with a global energy function that is minimized with a deterministic iterative conditional mode algorithm. As explained in the paper, the energy function may implement a pure fusion strategy or a fusion-reaction function. In the latter case, a data-related term is used to make the optimization problem well posed. We believe that the conceptual simplicity, the small number of parameters, the use of a simple and fast deterministic optimizer that admits a natural implementation on a parallel architecture are among the main advantages of our approach. Our fusion framework is adapted to various computer vision applications among which are motion segmentation, motion estimation and occlusion detection. Index Terms—Color segmentation, label fusion, motion estimation, motion segmentation, occlusion. I.
Consistent multi-modal nonrigid registration based on a variational approach
, 2005
"... www.elsevier.com/locate/patrec ..."

