Results 1 - 10
of
18
Video epitomes
- in Proc. IEEE Conf. Comput. Vis. Pattern Recog
, 2005
"... Recently, “epitomes ” were introduced as patch-based probability models that are learned by compiling together a large number of examples of patches from input images. In this paper, we describe how epitomes can be used to model video data and we describe significant computational speedups that can ..."
Abstract
-
Cited by 25 (0 self)
- Add to MetaCart
Recently, “epitomes ” were introduced as patch-based probability models that are learned by compiling together a large number of examples of patches from input images. In this paper, we describe how epitomes can be used to model video data and we describe significant computational speedups that can be incorporated into the epitome inference and learning algorithm. In the case of videos, epitomes are estimated so as to model most of the small space-time cubes from the input data. Then, the epitome can be used for various modeling and reconstruction tasks, of which we show results for video super-resolution, video interpolation, and object removal. Besides computational efficiency, an interesting advantage of the epitome as a representation is that it can be reliably estimated even from videos with large amounts of missing data. We illustrate this ability on the task of reconstructing the dropped frames in video broadcast using only the degraded video and also in denoising a severely corrupted video. 1
Using Photographs to Enhance Videos of a Static Scene
- EUROGRAPHICS SYMPOSIUM ON RENDERING (2007) JAN KAUTZ AND SUMANTA PATTANAIK (EDITORS)
, 2007
"... We present a framework for automatically enhancing videos of a static scene using a few photographs of the same scene. For example, our system can transfer photographic qualities such as high resolution, high dynamic range and better lighting from the photographs to the video. Additionally, the user ..."
Abstract
-
Cited by 24 (2 self)
- Add to MetaCart
We present a framework for automatically enhancing videos of a static scene using a few photographs of the same scene. For example, our system can transfer photographic qualities such as high resolution, high dynamic range and better lighting from the photographs to the video. Additionally, the user can quickly modify the video by editing only a few still images of the scene. Finally, our system allows a user to remove unwanted objects and camera shake from the video. These capabilities are enabled by two technical contributions presented in this paper. First, we make several improvements to a state-of-the-art multiview stereo algorithm in order to compute view-dependent depths using video, photographs, and structure-from-motion data. Second, we present a novel image-based rendering algorithm that can re-render the input video using the appearance of the photographs while preserving certain temporal dynamics such as specularities and dynamic scene lighting.
Soft edge smoothness prior for alpha channel super resolution
- in CVPR
, 2007
"... Effective image prior is necessary for image super resolution, due to its severely under-determined nature. Although the edge smoothness prior can be effective, it is generally difficult to have analytical forms to evaluate the edge smoothness, especially for soft edges that exhibit gradual intensit ..."
Abstract
-
Cited by 19 (6 self)
- Add to MetaCart
Effective image prior is necessary for image super resolution, due to its severely under-determined nature. Although the edge smoothness prior can be effective, it is generally difficult to have analytical forms to evaluate the edge smoothness, especially for soft edges that exhibit gradual intensity transitions. This paper finds the connection between the soft edge smoothness and a soft cut metric on an image grid by generalizing the Geocuts method [5], and proves that the soft edge smoothness measure approximates the average length of all level lines in an intensity image. This new finding not only leads to an analytical characterization of the soft edge smoothness prior, but also gives an intuitive geometric explanation. Regularizing the super resolution problem by this new form of prior can simultaneously minimize the length of all level lines, and thus resulting in visually appealing results. In addition, this paper presents a novel combination of this soft edge smoothness prior and the alpha matting technique for color image super resolution, by normalizing edge segments with their alpha channel description, to achieve a unified treatment of edges with different contrast and scale. 1.
Fast Image/Video Upsampling
, 2008
"... We propose a simple but effective upsampling method for automatically enhancing the image/video resolution, while preserving the essential structural information. The main advantage of our method lies in a feedback-control framework which faithfully recovers the high-resolution image information f ..."
Abstract
-
Cited by 9 (0 self)
- Add to MetaCart
We propose a simple but effective upsampling method for automatically enhancing the image/video resolution, while preserving the essential structural information. The main advantage of our method lies in a feedback-control framework which faithfully recovers the high-resolution image information from the input data, without imposing additional local structure constraints learned from other examples. This makes our method independent of the quality and number of the selected examples, which are issues typical of learning-based algorithms, while producing high-quality results without observable unsightly artifacts. Another advantage is that our method naturally extends to video upsampling, where the temporal coherence is maintained automatically. Finally, our method runs very fast. We demonstrate the effectiveness of our algorithm by experimenting with different image/video data.
High-Zoom Video Hallucination by Exploiting Spatio-Temporal Regularities
- Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, Vol.2
, 2004
"... In this paper, we consider the problem of super-resolving a human face video by a very high (16) zoom factor. Inspired by recent literature on hallucination and examplebased learning, we formulate this task using a graphical model that encodes 1) spatio-temporal consistencies, and 2) image formation ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
In this paper, we consider the problem of super-resolving a human face video by a very high (16) zoom factor. Inspired by recent literature on hallucination and examplebased learning, we formulate this task using a graphical model that encodes 1) spatio-temporal consistencies, and 2) image formation & degradation processes. A video database of facial expressions is used to learn a domainspecific prior for high-resolution videos. The problem is posed as one of probabilistic inference, in which we aim to find the high resolution video that best satisfies the constraints expressed through the graphical model. Traditional approaches to this problem using video data first estimate the relative motion between frames and then compensate for it, effectively resulting in multiple measurements of the scene. Our use of time is rather direct: We define data structures that span multiple consecutive frames, enriching our feature vectors with a temporal signature. We then exploit these signatures to find consistent solutions over time. In our experiments, a 8 6 pixel-wide face video, subject to translational jitter and additive noise, gets magnified to a 128 96 pixel video. Our results show that by exploiting both space and time, drastic improvements can be achieved in both video flicker artifacts and mean-squared-error.
Limits of LearningBased Superresolution Algorithms. Microsoft Research
, 2007
"... Learning-based superresolution (SR) are popular SR techniques that use application dependent priors to infer the missing details in low resolution images (LRIs). However, their performance still deteriorates quickly when the magnification factor is moderately large. This leads us to an important pro ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Learning-based superresolution (SR) are popular SR techniques that use application dependent priors to infer the missing details in low resolution images (LRIs). However, their performance still deteriorates quickly when the magnification factor is moderately large. This leads us to an important problem: “Do limits of learning-based SR algorithms exist? ” In this paper, we attempt to shed some light on this problem when the SR algorithms are designed for general natural images (GNIs). We first define an expected risk for the SR algorithms that is based on the root mean squared error between the superresolved images and the ground truth images. Then utilizing the statistics of GNIs, we derive a closed form estimate of the lower bound of the expected risk. The lower bound can be computed by sampling real images. By computing the curve of the lower bound w.r.t. the magnification factor, we can estimate the limits of learning-based SR algorithms, at which the lower bound of expected risk exceeds a relatively large threshold. We also investigate the sufficient number of samples to guarantee an accurate estimation of the lower bound. 1.
Contextual and non-combinatorial approach to feature extraction
- In Int'l Workshop on EMMCVPR
, 2003
"... Abstract. Extracting features from an image is the first step in many computer vision applications. As nearby features are closely correlated, the joint distribution among features is highly constrained. Feature extraction techniques can take advantage of the correlation to extract a good set of fea ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
Abstract. Extracting features from an image is the first step in many computer vision applications. As nearby features are closely correlated, the joint distribution among features is highly constrained. Feature extraction techniques can take advantage of the correlation to extract a good set of features that satisfies the correlation constraints. Furthermore, they can refine the representation of the extracted features in terms of a set of attributes. To reduce the dimension of the joint PDF, we consider a set of conditional PDFs and maximize them iteratively. It can be shown that the process finds a local maximum of the joint PDF when it is differentiable with respect to each feature attribute. We can apply the approach to many feature extraction tasks. In this paper, we demonstrate our approach with sub-pixel contour representation and surface reconstruction problems. 1
Fast image super-resolution using connected component enhancement
, 2008
"... The paper focuses on reconstructing the discontinuity between homogenous color regions in an interpolated image to improve its perceptual quality. A low-resolution input image is firstly interpolated and then decomposed into several patches. Each patch is then segmented into multiple homogenous regi ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
The paper focuses on reconstructing the discontinuity between homogenous color regions in an interpolated image to improve its perceptual quality. A low-resolution input image is firstly interpolated and then decomposed into several patches. Each patch is then segmented into multiple homogenous regions using Connected Component Analysis technique. Then a spatial-filter is applied to enhance the color/intensity transition between neighboring components. The designed spatialfilter combines the advantages of both bilateral-filtering and unsharp masking methods, with high computational efficiency. The proposed method can be used for image/video superresolution applications. Experimental results are promising. 1.
Keywords Super-resolution · Principal components · Wavelet · Contourlet decomposition · Filter banks · Edge primitives · Image interpolation · Aliasing
, 2006
"... Single frame image super-resolution: should we process ..."
Estimation of High Resolution Images and Registration Parameters from Low Resolution Observations
"... In this paper we consider the problem of reconstructing a high resolution image from a set of undersampled and degraded frames, all of them obtained from high resolution images with unknown shifting displacements between them. We derive an iterative method to estimate the unknown shifts and the ..."
Abstract
- Add to MetaCart
In this paper we consider the problem of reconstructing a high resolution image from a set of undersampled and degraded frames, all of them obtained from high resolution images with unknown shifting displacements between them. We derive an iterative method to estimate the unknown shifts and the high resolution image given the low resolution observations. Finally, the proposed method is tested on real images.

