Parameter Estimation for Statistical Parsing Models: Theory and Practice of Distribution-Free Methods (2001)
Cached
Download Links
- [www.research.att.com]
- [www.research.att.com]
- [www.ai.mit.edu]
- [www.ai.mit.edu]
- DBLP
Other Repositories/Bibliography
| Citations: | 45 - 9 self |
BibTeX
@MISC{Collins01parameterestimation,
author = {Michael Collins},
title = {Parameter Estimation for Statistical Parsing Models: Theory and Practice of Distribution-Free Methods},
year = {2001}
}
Years of Citing Articles
OpenURL
Abstract
A fundamental problem in statistical parsing is the choice of criteria and algorithms used to estimate the parameters in a model. The predominant approach in computational linguistics has been to use a parametric model with some variant of maximum-likelihood estimation. The assumptions under which maximum-likelihood estimation is justified are arguably quite strong. This paper discusses the statistical theory underlying various parameter-estimation methods, and gives algorithms which depend on alternatives to (smoothed) maximumlikelihood estimation. We first give an overview of results from statistical learning theory. We then show how important concepts from the classification literature -- specifically, generalization results based on margins on training data -- can be derived for parsing models. Finally, we describe parameter estimation algorithms which are motivated by these generalization bounds.







