Bayesian Variable Order Markov Models
BibTeX
@MISC{Dimitrakakis_bayesianvariable,
author = {Christos Dimitrakakis},
title = {Bayesian Variable Order Markov Models},
year = {}
}
OpenURL
Abstract
We present a simple, effective generalisation of variable order Markov models to full online Bayesian estimation. The mechanism used is close to that employed in context tree weighting. The main contribution is the addition of a prior, conditioned on context, on the Markov order. The resulting construction uses a simple recursion and can be updated efficiently. This allows the model to make predictions using more complex contexts, as more data is acquired, if necessary. In addition, our model can be alternatively seen as a mixture of tree experts. Experimental results show that the predictive model exhibits consistently good performance in a variety of domains. We consider Bayesian estimation of variable order Markov models (see Begleiter et al., 2004, for an overview). Such models create a tree of partitions, where the disjoint sets of every partition correspond to different contexts. We can associate a sub-model or expert with each context in order to make predictions. The main contribution of this paper is a conditional prior on the Markov order—or equivalently the context depth. This is based on a recursive construction that estimates, for each context at a certain depth k, whether it makes better predictions than the predictions of contexts at depths smaller than k. This simple model defines a mixture of variable order Marko models and its parameters can be updated in closed form in time O (D) for trees of depth D with each new observation. For unbounded length contexts, the complexity of the algorithm is O ( T 2) for an input sequence of length T. Furthermore, it exhibits robust performance in a variety of tasks. Finally, the model is easily extensible to controlled processes.
Citations
| 375 | Factorial hidden markov models - Ghahramani, Jordan - 1997 |
| 289 | Data compression using adaptive coding and partial string matching - Cleary, Witten - 1984 |
| 156 | Being Bayesian about network structure. a Bayesian approach to structure discovery in Bayesian networks - Friedman, Koller |
| 131 | Prior distributions on spaces of probability measures - Ferguson - 1974 |
| 91 | Learning with mixtures of trees - Meilǎ, Jordan |
| 66 | Variable length markov chains - Buhlmann, Wyner - 1999 |
| 42 | On Prediction Using Variable Order Markov Models - Begleiter, El-Yaniv, et al. |
| 15 | Bayes-adaptive POMDPs - Ross, Chalb-draa, et al. - 2008 |
| 7 | Lossless compression based on the sequence memoizer - Gasthaus, Wood, et al. - 2010 |
| 7 | The infinite Markov model - Mochihashi, Sumita |
| 1 | Variable order Markov decision processes: Exact Bayesian inference with an application to POMDPs. Submitted - Dimitrakakis - 2010 |
| 1 | Stationary autoregressive models via a bayesian nonparametric approach - Mena, Walker - 2005 |







