Guiding unsupervised grammar induction using contrastive estimation (2005)
| Venue: | In Proc. of IJCAI Workshop on Grammatical Inference Applications |
| Citations: | 21 - 6 self |
BibTeX
@INPROCEEDINGS{Smith05guidingunsupervised,
author = {Noah A. Smith and Jason Eisner},
title = {Guiding unsupervised grammar induction using contrastive estimation},
booktitle = {In Proc. of IJCAI Workshop on Grammatical Inference Applications},
year = {2005},
pages = {73--82}
}
Years of Citing Articles
OpenURL
Abstract
We describe a novel training criterion for probabilistic grammar induction models, contrastive estimation [Smith and Eisner, 2005], which can be interpreted as exploiting implicit negative evidence and includes a wide class of likelihood-based objective functions. This criterion is a generalization of the function maximized by the Expectation-Maximization algorithm [Dempster et al., 1977]. CE is a natural fit for log-linear models, which can include arbitrary features but for which EM is computationally difficult. We show that, using the same features, log-linear dependency grammar models trained using CE can drastically outperform EMtrained generative models on the task of matching human linguistic annotations (the MATCHLIN-GUIST task). The selection of an implicit negative evidence class—a “neighborhood”—appropriate to a given task has strong implications, but a good neighborhood one can target the objective of grammar induction to a specific application. 1







