Results 1 -
9 of
9
Minimum Description Length Induction, Bayesianism, and Kolmogorov Complexity
- IEEE Transactions on Information Theory
, 1998
"... The relationship between the Bayesian approach and the minimum description length approach is established. We sharpen and clarify the general modeling principles MDL and MML, abstracted as the ideal MDL principle and defined from Bayes's rule by means of Kolmogorov complexity. The basic condition un ..."
Abstract
-
Cited by 60 (7 self)
- Add to MetaCart
The relationship between the Bayesian approach and the minimum description length approach is established. We sharpen and clarify the general modeling principles MDL and MML, abstracted as the ideal MDL principle and defined from Bayes's rule by means of Kolmogorov complexity. The basic condition under which the ideal principle should be applied is encapsulated as the Fundamental Inequality, which in broad terms states that the principle is valid when the data are random, relative to every contemplated hypothesis and also these hypotheses are random relative to the (universal) prior. Basically, the ideal principle states that the prior probability associated with the hypothesis should be given by the algorithmic universal probability, and the sum of the log universal probability of the model plus the log of the probability of the data given the model should be minimized. If we restrict the model class to the finite sets then application of the ideal principle turns into Kolmogorov's mi...
Algorithmic Statistics
- IEEE Transactions on Information Theory
, 2001
"... While Kolmogorov complexity is the accepted absolute measure of information content of an individual finite object, a similarly absolute notion is needed for the relation between an individual data sample and an individual model summarizing the information in the data, for example, a finite set (or ..."
Abstract
-
Cited by 41 (8 self)
- Add to MetaCart
While Kolmogorov complexity is the accepted absolute measure of information content of an individual finite object, a similarly absolute notion is needed for the relation between an individual data sample and an individual model summarizing the information in the data, for example, a finite set (or probability distribution) where the data sample typically came from. The statistical theory based on such relations between individual objects can be called algorithmic statistics, in contrast to classical statistical theory that deals with relations between probabilistic ensembles. We develop the algorithmic theory of statistic, sufficient statistic, and minimal sufficient statistic. This theory is based on two-part codes consisting of the code for the statistic (the model summarizing the regularity, the meaningful information, in the data) and the model-to-data code. In contrast to the situation in probabilistic statistical theory, the algorithmic relation of (minimal) sufficiency is an absolute relation between the individual model and the individual data sample. We distinguish implicit and explicit descriptions of the models. We give characterizations of algorithmic (Kolmogorov) minimal sufficient statistic for all data samples for both description modes in the explicit mode under some constraints. We also strengthen and elaborate earlier results on the "Kolmogorov structure function" and "absolutely non-stochastic objects" those rare objects for which the simplest models that summarize their relevant information (minimal sucient statistics) are at least as complex as the objects themselves. We demonstrate a close relation between the probabilistic notions and the algorithmic ones: (i) in both cases there is an "information non-increase" law; (ii) it is shown that a function is a...
Syntactic Measures of Complexity
, 1999
"... page 14 Declaration - page 15 Notes of copyright and the ownership of intellectual property rights - page 15 The Author - page 16 Acknowledgements - page 16 1 - Introduction - page 17 1.1 - Background - page 17 1.2 - The Style of Approach - page 18 1.3 - Motivation - page 19 1.4 - Style of ..."
Abstract
-
Cited by 18 (2 self)
- Add to MetaCart
page 14 Declaration - page 15 Notes of copyright and the ownership of intellectual property rights - page 15 The Author - page 16 Acknowledgements - page 16 1 - Introduction - page 17 1.1 - Background - page 17 1.2 - The Style of Approach - page 18 1.3 - Motivation - page 19 1.4 - Style of Presentation - page 20 1.5 - Outline of the Thesis - page 21 2 - Models and Modelling - page 23 2.1 - Some Types of Models - page 25 2.2 - Combinations of Models - page 28 2.3 - Parts of the Modelling Apparatus - page 33 2.4 - Models in Machine Learning - page 38 2.5 - The Philosophical Background to the Rest of this Thesis - page 41 Syntactic Measures of Complexity - page 3 - 3 - Problems and Properties - page 44 3.1 - Examples of Common Usage - page 44 3.1.1 - A case of nails - page 44 3.1.2 - Writing a thesis - page 44 3.1.3 - Mathematics - page 44 3.1.4 - A gas - page 44 3.1.5 - An ant hill - page 45 3.1.6 - A car engine - page 45 3.1.7 - A cell as part of an organism -...
Kolmogorov's Structure Functions and Model Selection
- IEEE TRANS. INFORM. THEORY
, 2003
"... In 1974 Kolmogorov proposed a non-probabilistic approach to statistics, an individual combinatorial relation between the data and its model, expressed by the so-called "structure function" of the data. We show that the structure function determines all stochastic properties of the data in the sense ..."
Abstract
-
Cited by 17 (7 self)
- Add to MetaCart
In 1974 Kolmogorov proposed a non-probabilistic approach to statistics, an individual combinatorial relation between the data and its model, expressed by the so-called "structure function" of the data. We show that the structure function determines all stochastic properties of the data in the sense of determining the best- tting model at every model-complexity level. A consequence is this: minimizing the data-to-model code length (finding the ML estimator or MDL estimator), in a class of contemplated models of prescribed maximal (Kolmogorov) complexity, always results in a model of best fit, irrespective of whether the source producing the data is in the model class considered. In this setting, code minimization always separates optimal model information from the remaining accidental information, and not only with high probability. The function that maps the maximal allowed model complexity to the goodness-of-fit (expressed as minimal "randomness deficiency") of the best model cannot itself be monotonically approximated. However, the shortest one-part or two-part code above can -- implicitly optimizing this elusive goodness-of-fit. We show that -- within the obvious constraints -- every graph is realized by the structure function of some data. We determine the (un)computability properties of the various functions contemplated and of the "algorithmic minimal sufficient statistic."
Kolmogorov's Structure Functions with an Application to the Foundations of Model Selection
- In Proceedings of the 43rd Annual Symposium on Foundations of Computer Science. IEEE Computer Society
, 2002
"... In 1974 Kolmogorov proposed a non-probabilistic approach to statistics, an individual combinatorial relation between the data and its model. We vindicate, for the first time, the rightness of the original "structure function", proposed by Kolmogorov: minimizing the data-to-model code length (finding ..."
Abstract
-
Cited by 9 (0 self)
- Add to MetaCart
In 1974 Kolmogorov proposed a non-probabilistic approach to statistics, an individual combinatorial relation between the data and its model. We vindicate, for the first time, the rightness of the original "structure function", proposed by Kolmogorov: minimizing the data-to-model code length (finding the ML estimator or MDL estimator), in a class of contemplated models of prescribed maximal (Kolmogorov) complexity, always results in a model of best fit (expressed as minimal randomness deficiency). We show that both the structure function and the minimum randomness deficiency function can assume all shapes over their full domain (improving an old result of L.A. Levin and both an old and a recent one of V.V. Vyugin). We determine the (un)computability properties of the various functions and "algorithmic sufficient statistic." 1
PAGODA: A Model for Autonomous Learning in Probabilistic Domains
, 1992
"... as a testbed for designing intelligent agents. The system consists of an overall agent architecture and five components within the architecture. The five components are: 1. Goal-Directed Learning (GDL), a decision-theoretic method for selecting learning goals. 2. Probabilistic Bias Evaluation (PBE) ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
as a testbed for designing intelligent agents. The system consists of an overall agent architecture and five components within the architecture. The five components are: 1. Goal-Directed Learning (GDL), a decision-theoretic method for selecting learning goals. 2. Probabilistic Bias Evaluation (PBE), a technique for using probabilistic background knowledge to select learning biases for the learning goals. 3. Uniquely Predictive Theories (UPTs) and Probability Computation using Independence (PCI), a probabilistic representation and Bayesian inference method for the agent's theories. 4. A probabilistic learning component, consisting of a heuristic search algorithm and a Bayesian method for evaluating proposed theories. 5. A decision-theoretic probabilistic planner, which searches through the probability space defined by the agent's current theory to select the best action. PAGODA is given as input an initial planning goal (its ove
Sophistication Revisited
- Proceedings of the 30th International Colloquium on Automata, Languages and Programming
, 2001
"... The Kolmogorov structure function divides the smallest program producing a string in two parts: the useful information present in the string, called sophistication if based on total functions, and the remaining accidental information. We revisit the notion of sophistication due to Koppel, formal ..."
Abstract
-
Cited by 4 (3 self)
- Add to MetaCart
The Kolmogorov structure function divides the smallest program producing a string in two parts: the useful information present in the string, called sophistication if based on total functions, and the remaining accidental information. We revisit the notion of sophistication due to Koppel, formalize a connection between sophistication and a variation of computational depth (intuitively the useful or nonrandom information in a string), prove the existence of strings with maximum sophistication and show that they encode solutions of the halting problem, i.e., they are the deepest of all strings.
Towards an Algorithmic Statistics (Extended Abstract)
"... ) Peter G'acs ? , John Tromp, and Paul Vit'anyi ?? Abstract. While Kolmogorov complexity is the accepted absolute measure of information content of an individual finite object, a similarly absolute notion is needed for the relation between an individual data sample and an individual model su ..."
Abstract
- Add to MetaCart
) Peter G'acs ? , John Tromp, and Paul Vit'anyi ?? Abstract. While Kolmogorov complexity is the accepted absolute measure of information content of an individual finite object, a similarly absolute notion is needed for the relation between an individual data sample and an individual model summarizing the information in the data, for example, a finite set where the data sample typically came from. The statistical theory based on such relations between individual objects can be called algorithmic statistics, in contrast to ordinary statistical theory that deals with relations between probabilistic ensembles. We develop a new algorithmic theory of typical statistic, sufficient statistic, and minimal sufficient statistic. 1 Introduction We take statistical theory to ideally consider the following problem: Given a data sample and a family of models (hypotheses) one wants to select the model that produced the data. But a priori it is possible that the data is atypical for the...
Sophisticated Infinite Sequences
"... Abstract. In this paper we revisit the notion of sophistication for infinite sequences. Koppel defined sophistication of an object as the length of the shortest (finite) total program (p) that with some (finite or infinite) data (d) produce it and |p | + |d | is smaller than the shortest description ..."
Abstract
- Add to MetaCart
Abstract. In this paper we revisit the notion of sophistication for infinite sequences. Koppel defined sophistication of an object as the length of the shortest (finite) total program (p) that with some (finite or infinite) data (d) produce it and |p | + |d | is smaller than the shortest description of the object plus a constant. However the notion of “description of infinite sequences” is not appropriately defined. In this work, we propose a new definition of sophistication for infinite sequences as the limit of the ratio of sophistication of the initial segments and its length. As the main results we prove that highly sophisticated sequences are dense when the sophistication is defined with lim sup and the set of sequences with sophistication equal to zero is also dense when we consider the definition with lim inf. We also prove that, similarly to what happens for finite strings, sophistication and depth, for infinite sequences are distinct complexity measures.

