Results 1 - 10
of
19
Image Quality Assessment: From Error Visibility to Structural Similarity
- IEEE TRANSACTIONS ON IMAGE PROCESSING
, 2004
"... Objective methods for assessing perceptual image quality have traditionally attempted to quantify the visibility of errors between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly adapt ..."
Abstract
-
Cited by 301 (26 self)
- Add to MetaCart
Objective methods for assessing perceptual image quality have traditionally attempted to quantify the visibility of errors between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, we introduce an alternative framework for quality assessment based on the degradation of structural information. As a specific example of this concept, we develop a Structural Similarity Index and demonstrate its promise through a set of intuitive examples, as well as comparison to both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000.
Image Quality Assessment: From Error Measurement to Structural Similarity
- IEEE Trans. Image Processing
, 2004
"... Objective methods for assessing perceptual image quality traditionally attempt to quantify the visibility of errors (di#erences) between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly ..."
Abstract
-
Cited by 68 (10 self)
- Add to MetaCart
Objective methods for assessing perceptual image quality traditionally attempt to quantify the visibility of errors (di#erences) between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, we introduce an alternative complementary framework for quality assessment based on the degradation of structural information. As a specific example of this concept, we develop a Structural Similarity Index and demonstrate its promise through a set of intuitive examples, as well as comparison to both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000. A MatLab implementation of the proposed algorithm is available online at http://www.cns.nyu.edu/~lcv/ssim/.
Neuronal adaptation to visual motion in area MT of the macaque
- Neuron
, 2003
"... whether adaptation is a universal feature of cortical computation or whether it mostly occurs early in a sensory processing stream and is simply inherited as adapted signals pass to downstream cortical areas. With this motivation in mind, we studied adaptation in neurons in cortical area MT (or V5), ..."
Abstract
-
Cited by 10 (1 self)
- Add to MetaCart
whether adaptation is a universal feature of cortical computation or whether it mostly occurs early in a sensory processing stream and is simply inherited as adapted signals pass to downstream cortical areas. With this motivation in mind, we studied adaptation in neurons in cortical area MT (or V5), an extrastriate visual area that contains a high proportion of neurons that are selective for the direction of motion of visual stimuli (Zeki, 1974; Maunsell and Van Essen, 1983a). Adaptation in MT is of interest for several reasons. First, psychophysical studies suggest that visual motion pro-cessing is strongly affected by adaptation (for a review, see Mather et al, 1998). For instance, the prolonged viewing of a moving stimulus causes subsequently viewed static or motion-balanced stimuli to appear to
Nonlinear Extraction of Independent Components of Natural Images Using Radial Gaussianization
, 2009
"... We consider the problem of efficiently encoding a signal by transforming it to a new representation whose components are statistically independent. A widely studied linear solution, known as independent component analysis (ICA), exists for the case when the signal is generated as a linear transforma ..."
Abstract
-
Cited by 9 (3 self)
- Add to MetaCart
We consider the problem of efficiently encoding a signal by transforming it to a new representation whose components are statistically independent. A widely studied linear solution, known as independent component analysis (ICA), exists for the case when the signal is generated as a linear transformation of independent nongaussian sources. Here, we examine a complementary case, in which the source is nongaussian and elliptically symmetric. In this case, no invertible linear transform suffices to decompose the signal into independent components, but we show that a simple nonlinear transformation, which we call radial gaussianization (RG), is able to remove all dependencies. We then examine this methodology in the context of natural image statistics. We first show that distributions of spatially proximal bandpass filter responses are better described as elliptical than as linearly transformed independent sources. Consistent with this, we demonstrate that the reduction in dependency achieved by applying RG to either nearby pairs or blocks of bandpass filter responses is significantly greater than that achieved by ICA. Finally, we show that the RG transformation may be closely approximated by divisive normalization, which has been used to model the nonlinear response properties of visual neurons.
Group sparse coding with a laplacian scale mixture prior
- Zemel,R.,andCulotta,A.,editors,Advances in Neural Information Processing Systems
, 2010
"... We propose a class of sparse coding models that utilizes a Laplacian Scale Mixture (LSM) prior to model dependencies among coefficients. Each coefficient is modeled as a Laplacian distribution with a variable scale parameter, with a Gamma distribution prior over the scale parameter. We show that, du ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
We propose a class of sparse coding models that utilizes a Laplacian Scale Mixture (LSM) prior to model dependencies among coefficients. Each coefficient is modeled as a Laplacian distribution with a variable scale parameter, with a Gamma distribution prior over the scale parameter. We show that, due to the conjugacy of the Gamma prior, it is possible to derive efficient inference procedures for both the coefficients and the scale parameter. When the scale parameters of a group of coefficients are combined into a single variable, it is possible to describe the dependencies that occur due to common amplitude fluctuations among coefficients, which have been shown to constitute a large fraction of the redundancy in natural images [1]. We show that, as a consequence of this group sparse coding, the resulting inference of the coefficients follows a divisive normalization rule, and that this may be efficiently implemented in a network architecture similar to that which has been proposed to occur in primary visual cortex. We also demonstrate improvements in image coding and compressive sensing recovery using the LSM model. 1
Decision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics
, 2009
"... A decision-theoretic formulation of visual saliency, first proposed for top-down processing (object recognition) (Gao & Vasconcelos, 2005a), is extended to the problem of bottom-up saliency. Under this formulation, optimality is defined in the minimum probability of error sense, under a constraint o ..."
Abstract
-
Cited by 7 (4 self)
- Add to MetaCart
A decision-theoretic formulation of visual saliency, first proposed for top-down processing (object recognition) (Gao & Vasconcelos, 2005a), is extended to the problem of bottom-up saliency. Under this formulation, optimality is defined in the minimum probability of error sense, under a constraint of computational parsimony. The saliency of the visual features at a given location of the visual field is defined as the power of those features to discriminate between the stimulus at the location and a null hypothesis. For bottom-up saliency, this is the set of visual features that surround the location under consideration. Discrimination is defined in an information-theoretic sense and the optimal saliency detector derived for a class of stimuli that complies with known statistical properties of natural images. It is shown that under the assumption that saliency is driven by linear filtering, the optimal detector consists of what is usually referred to as the standard architecture of V1: a cascade of linear filtering, divisive normalization, rectification, and spatial pooling. The optimal detector is also shown to replicate the fundamental properties of the psychophysics of saliency: stimulus pop-out, saliency asymmetries for stimulus presence versus absence, disregard of feature conjunctions, and Weber’s law. Finally, it is shown that the optimal saliency architecture can be applied to the solution of generic inference problems. In particular, for the class of stimuli studied, it performs the three fundamental operations of statistical inference: assessment of probabilities, implementation of Bayes decision rule, and feature selection.
Visual adaptation: Neural, psychological and computational aspects
, 2007
"... The term visual adaptation describes the processes by which the visual system alters its operating properties in response to changes in the environment. These continual adjustments in sensory processing are diagnostic as to the computational principles underlying the neural coding of information and ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
The term visual adaptation describes the processes by which the visual system alters its operating properties in response to changes in the environment. These continual adjustments in sensory processing are diagnostic as to the computational principles underlying the neural coding of information and can have profound consequences for our perceptual experience. New physiological and psychophysical data, along with emerging statistical and computational models, make this an opportune time to bring together experimental and theoretical perspectives. Here, we discuss functional ideas about adaptation in the light of recent data and identify exciting directions for future research.
Directly invertible nonlinear divisive normalization pyramid for image representation
- in Visual Content Processing and Representation- Proc 8th Int’l Workshop
, 2003
"... Abstract. We present a multiscale nonlinear image representation that permits an efficient coding of natural images. The input image is first decomposed into a set of subbands at multiple scales and orientations using near-orthogonal symmetric quadrature mirror filters. This is followed by a nonline ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Abstract. We present a multiscale nonlinear image representation that permits an efficient coding of natural images. The input image is first decomposed into a set of subbands at multiple scales and orientations using near-orthogonal symmetric quadrature mirror filters. This is followed by a nonlinear “divisive normalization ” stage, in which each linear coefficient is divided by a value computed from a small set of neighboring coefficients in space, orientation and scale. This neighborhood is chosen to allow this nonlinear operation to be efficiently inverted. The parameters of the normalization operation are optimized in order to maximize the independence of the normalized responses for natural images. We demonstrate the near-independence of these nonlinear responses, and suggest a number of applications for which this representation should be well suited. 1
Local luminance and contrast in natural images
- Vision Research
, 2006
"... Within natural images there is substantial spatial variation in both local contrast and local luminance. Understanding the statistics of these variations is important for understanding the dynamics of receptive field stimulation that occur under natural viewing conditions and for understanding the r ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
Within natural images there is substantial spatial variation in both local contrast and local luminance. Understanding the statistics of these variations is important for understanding the dynamics of receptive field stimulation that occur under natural viewing conditions and for understanding the requirements for effective luminance and contrast gain control. Local luminance and contrast were measured in a large set of calibrated 12-bit gray-scale natural images, for a number of analysis patch sizes. For each image and patch size we measured the range of contrast, the range of luminance, the correlation in contrast and luminance as a function of the distance between patches, and the correlation between contrast and luminance within patches. The same analyses were also performed on hand segmented regions containing only ‘‘sky’’, ‘‘ground’’, ‘‘foliage’’, or ‘‘backlit foliage’’. Within the typical image, the 95 % range (2.5–97.5 percentile) for both local luminance and local contrast is somewhat greater than a factor of 10. The correlation in contrast and the correlation in luminance diminish rapidly with distance, and the typical correlation between luminance and contrast within patches is small (e.g., 0.2 compared to 0.8 for 1/f noise). We show that eye movements are frequently large enough that there will be little correlation in the contrast or luminance on a receptive field from one fixation to the next, and thus rapid contrast and luminance gain control are essential. The low correlation between local luminance and contrast implies that efficient contrast gain control mechanisms can operate largely independently of luminance gain control mechanisms. Ó 2005 Elsevier Ltd. All rights reserved. 1.
Is the Homunculus “Aware” of Sensory Adaptation?
, 2009
"... Neural activity and perception are both affected by sensory history. The work presented here explores the relationship between the physiological effects of adaptation and their perceptual consequences. Perception is modeled as arising from an encoder-decoder cascade, in which the encoder is defined ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Neural activity and perception are both affected by sensory history. The work presented here explores the relationship between the physiological effects of adaptation and their perceptual consequences. Perception is modeled as arising from an encoder-decoder cascade, in which the encoder is defined by the probabilistic response of a population of neurons, and the decoder transforms this population activity into a perceptual estimate. Adaptation is assumed to produce changes in the encoder, and we examine the conditions under which the decoder behavior is consistent with observed perceptual effects in terms of both bias and discriminability. We show that for all decoders, discriminability is bounded from below by the inverse Fisher information. Estimation bias, on the other hand, can arise for a variety of different reasons and can range from zero to substantial. We specifically examine biases that arise when the decoder is fixed, “unaware ” of the changes in the encoding population (as opposed to “aware ” of the adaptation and changing accordingly). We simulate the effects of adaptation on two well-studied sensory attributes, motion direction and contrast, assuming a gain change description of encoder adaptation. Although we cannot uniquely constrain the source of decoder bias, we find for both motion and contrast that an “unaware ” decoder that maximizes the likelihood of the percept given by the preadaptation encoder leads to predictions that are consistent with behavioral data. This model implies that adaptation-induced biases arise as a result of temporary suboptimality of the decoder.

