Results 1 - 10
of
26
Universal Discrete Denoising: Known Channel
- IEEE Trans. Inform. Theory
, 2003
"... A discrete denoising algorithm estimates the input sequence to a discrete memoryless channel (DMC) based on the observation of the entire output sequence. For the case in which the DMC is known and the quality of the reconstruction is evaluated with a given single-letter fidelity criterion, we pr ..."
Abstract
-
Cited by 55 (23 self)
- Add to MetaCart
A discrete denoising algorithm estimates the input sequence to a discrete memoryless channel (DMC) based on the observation of the entire output sequence. For the case in which the DMC is known and the quality of the reconstruction is evaluated with a given single-letter fidelity criterion, we propose a discrete denoising algorithm that does not assume knowledge of statistical properties of the input sequence. Yet, the algorithm is universal in the sense of asymptotically performing as well as the optimum denoiser that knows the input sequence distribution, which is only assumed to be stationary and ergodic. Moreover, the algorithm is universal also in a semi-stochastic setting, in which the input is an individual sequence, and the randomness is due solely to the channel noise.
A tutorial introduction to the minimum description length principle
- in Advances in Minimum Description Length: Theory and Applications. 2005
"... ..."
An empirical study of minimum description length model selection with infinite parametric complexity
- JOURNAL OF MATHEMATICAL PSYCHOLOGY
, 2006
"... Parametric complexity is a central concept in Minimum Description Length (MDL) model selection. In practice it often turns out to be infinite, even for quite simple models such as the Poisson and Geometric families. In such cases, MDL model selection as based on NML and Bayesian inference based on J ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
Parametric complexity is a central concept in Minimum Description Length (MDL) model selection. In practice it often turns out to be infinite, even for quite simple models such as the Poisson and Geometric families. In such cases, MDL model selection as based on NML and Bayesian inference based on Jeffreys ’ prior can not be used. Several ways to resolve this problem have been proposed. We conduct experiments to compare and evaluate their behaviour on small sample sizes. We find interestingly poor behaviour for the plug-in predictive code; a restricted NML model performs quite well but it is questionable if the results validate its theoretical motivation. A Bayesian marginal distribution with Jeffreys’ prior can still be used if one sacrifices the first observation to make a proper posterior; this approach turns out to be most dependable.
Model Selection by Normalized Maximum Likelihood
, 2005
"... The Minimum Description Length (MDL) principle is an information theoretic approach to inductive inference that originated in algorithmic coding theory. In this approach, data are viewed as codes to be compressed by the model. From this perspective, models are compared on their ability to compress a ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
The Minimum Description Length (MDL) principle is an information theoretic approach to inductive inference that originated in algorithmic coding theory. In this approach, data are viewed as codes to be compressed by the model. From this perspective, models are compared on their ability to compress a data set by extracting useful information in the data apart from random noise. The goal of model selection is to identify the model, from a set of candidate models, that permits the shortest description length (code) of the data. Since Rissanen originally formalized the problem using the crude ‘two-part code ’ MDL method in the 1970s, many significant strides have been made, especially in the 1990s, with the culmination of the development of the refined ‘universal code’ MDL method, dubbed Normalized Maximum Likelihood (NML). It represents an elegant solution to the model selection problem. The present paper provides a tutorial review on these latest developments with a special focus on NML. An application example of NML in cognitive modeling is also provided.
The Empirical Distribution of Rate-Constrained Source Codes
- IEEE Trans. Inform. Theory
, 2003
"... Let X = (X1 , . . .) be a stationary ergodic finite-alphabet source, X denote its first n symbols, and Y be the codeword assigned to X by a lossy source code. The empirical kth-order joint distribution Q ) along the pair (X ). Our main interest is in the sample behavior of t ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
Let X = (X1 , . . .) be a stationary ergodic finite-alphabet source, X denote its first n symbols, and Y be the codeword assigned to X by a lossy source code. The empirical kth-order joint distribution Q ) along the pair (X ). Our main interest is in the sample behavior of this (random) distribution. Letting I(Q when (X we show that for any (sequence of) lossy source code(s) of rate a.s., where H(X) denotes the entropy rate of X. This is shown to imply, for a large class of sources including all i.i.d. sources and all sources satisfying the Shannon lower bound with equality, that for any sequence of codes which is good in the sense of asymptotically attaining a point on the rate distortion curve P X k , Y k a.s., whenever P X k , Y k is the unique distribution attaining the minimum in the definition of the kth-order rate distortion function. Further consequences of these results are explored. These include a simple proof of Kie#er's sample converse to lossy source coding, as well as pointwise performance bounds for compression-based denoisers. 1
On sequentially normalized maximum likelihood models
- in: Workshop on Information Theoretic Methods in Science and Engineering (WITMSE-08
, 2008
"... The important normalized maximum likelihood (NML) distribution is obtained via a normalization over all sequences of given length. It has two short-comings: the resulting model is usually not a random process, and in ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
The important normalized maximum likelihood (NML) distribution is obtained via a normalization over all sequences of given length. It has two short-comings: the resulting model is usually not a random process, and in
Universally Attainable Error-Exponents for Rate-Constrained Denoising of Noisy Sources
, 2002
"... Consider the problem of rate-constrained reconstruction of a finite-alphabet discrete memoryless signal X (X1 , . . . , Xn ), based on a noise-corrupted observation sequence Z , which is the finite-alphabet output of a Discrete Memoryless Channel (DMC) whose input is X . Suppose that there ..."
Abstract
-
Cited by 2 (2 self)
- Add to MetaCart
Consider the problem of rate-constrained reconstruction of a finite-alphabet discrete memoryless signal X (X1 , . . . , Xn ), based on a noise-corrupted observation sequence Z , which is the finite-alphabet output of a Discrete Memoryless Channel (DMC) whose input is X . Suppose that there is some uncertainty in the source distribution, in the channel characteristics, or in both. Equivalently, suppose that the distribution of the pairs (X i , Z i ), rather than completely being known, is only known to belong to a set #. Suppose further that the relevant performance criterion is the probability of excess distortion, i.e., letting ) denote the reconstruction, we are interested in the behavior of P # , where # is a (normalized) block distortion induced by a single-letter distortion measure and P # denotes the probability measure corresponding to the case where (X i , Z i ) #, # #.
An empirical study of MDL model selection with infinite parametric complexity
- J. Mathematical Psychology
, 2006
"... Parametric complexity is a central concept in MDL model selection. In practice it often turns out to be infinite, even for quite simple models such as the Poisson and Geometric families. In such cases, MDL model selection as based on NML and Bayesian inference based on Jeffreys ’ prior can not be us ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Parametric complexity is a central concept in MDL model selection. In practice it often turns out to be infinite, even for quite simple models such as the Poisson and Geometric families. In such cases, MDL model selection as based on NML and Bayesian inference based on Jeffreys ’ prior can not be used. Several ways to resolve this problem have been proposed. We conduct experiments to compare and evaluate their behaviour on small sample sizes. We find interestingly poor behaviour for the plug-in predictive code; a restricted NML model performs quite well but it is questionable if the results validate its theoretical motivation. The Bayesian model with the improper Jeffreys ’ prior is the most dependable. 1

