Abstract:
This paper presents a corpus-based approach for deriving heuristics to locate the antecedents of relative pronouns. The technique duplicates the performance of hand-coded rules and requires human intervention only during the training phase. Because the training instances are built on parser output rather than word cooccurrences, the technique requires a small number of training examples and can be used on small to medium-sized corpora. Our initial results suggest that the approach may provide a general method for the automated acquisition of a variety of disambiguation heuristics for natural language systems, especially for problems that require the assimilation of syntactic and semantic knowledge. 1 INTRODUCTION State-of-the-art natural language processing (NLP) systems typically rely on heuristics to resolve many classes of ambiguities, e.g., prepositional phrase attachment, part of speech disambiguation, word sense disambiguation, conjunction, pronoun resolution, and concept activat...
Citations
|
523
|
Knowledge Acquisition via Incremental Concept Formation
– Fisher
- 1987
|
|
232
|
Structural Ambiguity and Lexical Relations
– Hindle, Rooth
- 1993
|
|
165
|
Noun classification from predicate-argument structure
– Hindle
- 1990
|
|
148
|
Seven principles of surface structure parsing in natural language
– Kimball
- 1973
|
|
146
|
Word-sense disambiguation using statistical methods
– Brown, Pietra, et al.
- 1991
|
|
141
|
Resolving Pronoun References
– Hobbs
- 1978
|
|
132
|
On comprehending sentences: Syntactic parsing strategies
– Frazier
- 1978
|
|
96
|
Automatic acquisition of subcategorization frames from untagged text
– Brent
- 1991
|
|
72
|
Information, uncertainty, and the utility of categories
– Gluck, Corter
- 1985
|
|
64
|
Parsing the LOB corpus
– Marcken
- 1990
|
|
54
|
Constituent attachment and thematic role assignment in sentence processing: influences of content based expectations
– Taraban, McClelland
- 1988
|
|
47
|
Symbolic/Subsymbolic Sentence Analysis: Exploiting the Best of Two Worlds
– Lehnert
- 1990
|
|
33
|
Inside Computer Understanding: Five Programs plus Miniatures. Lawrence Erlbaum and Associates
– Schank
- 1981
|
|
25
|
A statistical filter for resolving pronoun references
– Dagan, Itai
- 1991
|
|
23
|
A computational mechanism for pronominal reference
– Ingria, Stallard
- 1989
|
|
20
|
Overview of the Third Message Understanding Evaluation and Conference
– Sundheim
- 1991
|
|
18
|
A Cognitively Plausible Approach to Understanding Complicated Syntax
– Cardie, Lehnert
- 1991
|
|
14
|
User Manual for Fidditch
– Hindle
- 1983
|
|
11
|
A syntactic filter on pronominal anaphora for slot grammar
– Lappin, McCord
- 1990
|
|
10
|
A Binding Rule for Government-Binding Parsing
– Correa
- 1988
|
|
3
|
Ã’University of Massachusetts
– Lehnert, Cardie, et al.
- 1992
|
|
1
|
Disambignating and interpreting verb definitions. Proceedings, 28th Annual Meeting of the Association for Computational Linguists
– Ravin
- 1990
|