• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

A Simple Rule-Based Part of Speech Tagger (1992)

Cached

  • Download as a PDF

Download Links

  • [acl.ldc.upenn.edu]
  • [aclweb.org]
  • [www.rohan.sdsu.edu]
  • [www.ifi.unizh.ch]
  • [www.ifi.unizh.ch]
  • [www.ifi.unizh.ch]

  • Other Repositories/Bibliography

  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Eric Brill
Citations:433 - 10 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Brill92asimple,
    author = {Eric Brill},
    title = {A Simple Rule-Based Part of Speech Tagger},
    year = {1992}
}

Years of Citing Articles

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

Automatic part of speech tagging is an area of natural language processing where statistical techniques have been more successful than rule- based methods. In this paper, we present a sim- ple rule-based part of speech tagger which automatically acquires its rules and tags with accuracy coinparable to stochastic taggers. The rule-based tagger has many advantages over these taggers, including: a vast reduction in stored information required, the perspicuity of a sinall set of meaningful rules, ease of finding and implementing improvements to the tagger, and better portability from one tag set, cor- pus genre or language to another. Perhaps the biggest contribution of this work is in demonstrating that the stochastic method is not the only viable method for part of speech tagging. The fact that a simple rule-based tagger that automatically learns its rules can perform so well should offer encouragement for researchers to further explore rule-based tagging, searching for a better and more expressive set of rule templates and other variations on the simple but effective theme described below.

Citations

649 A stochastic parts program and noun phrase parser for unrestricted text - Church - 1988
325 A Practical Part-of-Speech Tagger - Cutting, Kupiec, et al. - 1991
259 Frequency analysis of English usage: Lexicon and grammar - Francis, Kucera - 1982
148 Grammatical category disambiguation by statistical optimization - DeRose - 1988
94 The computational analysis of English. A corpus-based approach - Garside, Leech, et al. - 1987
61 Acquiring disambiguation rules from text - Hindle - 1989
49 Markov source modeling of text generation - Jelinek - 1985
46 Automatic grammatical tagging of English - Greene, Rubin - 1971
39 Natural language modeling for phoneme-to-text transcription - Derouault, Merialdo - 1986
33 A computational approach to grammatical coding of english words - Klein, Simmons - 1963
21 Augmenting a hidden Markov model for phrase dependent word tagging - Kupiec - 1989
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University