Inducing Probabilistic Grammars by Bayesian Model Merging (1994)
| Citations: | 112 - 0 self |
BibTeX
@MISC{Stolcke94inducingprobabilistic,
author = {Andreas Stolcke and Stephen Omohundro},
title = {Inducing Probabilistic Grammars by Bayesian Model Merging},
year = {1994}
}
Years of Citing Articles
OpenURL
Abstract
We describe a framework for inducing probabilistic grammars from corpora of positive samples. First, samples are incorporated by adding ad-hoc rules to a working grammar; subsequently, elements of the model (such as states or nonterminals) are merged to achieve generalization and a more compact representation. The choice of what to merge and when to stop is governed by the Bayesian posterior probability of the grammar given the data, which formalizes a trade-off between a close fit to the data and a default preference for simpler models (`Occam's Razor'). The general scheme is illustrated using three types of probabilistic grammars: Hidden Markov models, class-based n-grams, and stochastic context-free grammars. 1 Introduction Probabilistic modeling has become increasingly important for applications such as speech recognition, information retrieval, machine translation, and biological sequence processing. The types of models used vary widely, ranging from simple n-grams to Hidden Mark...







