Unsupervised Learning of Disambiguation Rules for Part of Speech Tagging (1995)
| Venue: | In Natural Language Processing Using Very Large Corpora |
| Citations: | 101 - 1 self |
BibTeX
@INPROCEEDINGS{Brill95unsupervisedlearning,
author = {Eric Brill},
title = {Unsupervised Learning of Disambiguation Rules for Part of Speech Tagging},
booktitle = {In Natural Language Processing Using Very Large Corpora},
year = {1995},
pages = {1--13},
publisher = {Kluwer Academic Press}
}
Years of Citing Articles
OpenURL
Abstract
In this paper we describe an unsupervised learning algorithm for automatically training a rule-based part of speech tagger without using a manually tagged corpus. We compare this algorithm to the Baum-Welch algorithm, used for unsupervised training of stochastic taggers. Next, we show a method for combining unsupervised and supervised rule-based training algorithms to create a highly accurate tagger using only a small amount of manually tagged text.







