Abstract:
. A major knowledge-engineering bottleneck for information extraction systems is the process of constructing an appropriate dictionary of extraction patterns. AutoSlog is a dictionary construction system that has been shown to substantially reduce the time required for knowledge engineering by learning extraction patterns automatically. However, an open question was whether these extraction patterns were useful for tasks other than information extraction. We describe a series of experiments that show how the extraction patterns learned by AutoSlog can be used for text classification. Three dictionaries produced by AutoSlog for different domains performed well in our text classification experiments. 1 Introduction Many researchers in natural language processing have turned their attention recently to a problem called information extraction (IE). Information extraction is a natural language processing task that involves extracting predefined types of information from text. Information e...
Citations
|
2526
|
Induction of decision trees
– Quinlan
- 1986
|
|
322
|
Explanation-based learning: An alternative view
– DeJong, Mooney
- 1986
|
|
151
|
Automatically Constructing a Dictionary for Information Extraction Tasks
– Riloff
- 1993
|
|
125
|
Inducing a conceptual dictionary
– Soderland, Fisher, et al.
|
|
100
|
Information extraction as a basis for highprecision text classification
– Riloff, Lehnert
- 1994
|
|
82
|
A performance evaluation of text analysis technologies
– Lehnert, Sundheim
- 1991
|
|
78
|
SCISOR: Extracting information from on-line news
– Jacobs, Rau
- 1990
|
|
68
|
An Empirical Study of Automated Dictionary Construction for Information Extraction in Three Domains
– Riloff
- 1996
|
|
60
|
FOULUP: a program that figures out meanings of words from context
– Granger
- 1977
|
|
47
|
Symbolic/Subsymbolic Sentence Analysis: Exploiting the Best of Two Worlds
– Lehnert
- 1990
|
|
40
|
Explanation-Based Generalization: A Unifying View’. Machine Learning 1:47–80
– Mitchell
- 1986
|
|
35
|
Acquiring Lexical Knowledge from Text: A Case Study
– Jacobs, Zernik
- 1988
|
|
34
|
University of massachusetts: Description of the CIRCUS system as used for MUC-4
– Lchncrt, Cardie, et al.
- 1992
|
|
24
|
Acquisition of Semantic Patterns for Information Extraction from Corpora
– Kim, Moldovan
- 1993
|
|
21
|
Automatically Acquiring Conceptual Patterns Without an Annotated Corpus
– Riloff, Shoen
- 1995
|
|
20
|
Towards a Self-Extending Parser
– Carbonell
- 1979
|
|
16
|
University of Massachusetts: MUC-4 Test Results and Analysis
– Lehnert, Cardie, et al.
- 1992
|