MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Using Learned Extraction Patterns for Text Classification (1996)

by Language Processing ,  S. Wermter ,  Ellen Riloff ,  G. Scheler ,  Ellen Riloff
Add To MetaCart

Abstract:

. A major knowledge-engineering bottleneck for information extraction systems is the process of constructing an appropriate dictionary of extraction patterns. AutoSlog is a dictionary construction system that has been shown to substantially reduce the time required for knowledge engineering by learning extraction patterns automatically. However, an open question was whether these extraction patterns were useful for tasks other than information extraction. We describe a series of experiments that show how the extraction patterns learned by AutoSlog can be used for text classification. Three dictionaries produced by AutoSlog for different domains performed well in our text classification experiments. 1 Introduction Many researchers in natural language processing have turned their attention recently to a problem called information extraction (IE). Information extraction is a natural language processing task that involves extracting predefined types of information from text. Information e...

Citations

2526 Induction of decision trees – Quinlan - 1986
322 Explanation-based learning: An alternative view – DeJong, Mooney - 1986
151 Automatically Constructing a Dictionary for Information Extraction Tasks – Riloff - 1993
125 Inducing a conceptual dictionary – Soderland, Fisher, et al.
100 Information extraction as a basis for highprecision text classification – Riloff, Lehnert - 1994
82 A performance evaluation of text analysis technologies – Lehnert, Sundheim - 1991
78 SCISOR: Extracting information from on-line news – Jacobs, Rau - 1990
68 An Empirical Study of Automated Dictionary Construction for Information Extraction in Three Domains – Riloff - 1996
60 FOULUP: a program that figures out meanings of words from context – Granger - 1977
47 Symbolic/Subsymbolic Sentence Analysis: Exploiting the Best of Two Worlds – Lehnert - 1990
40 Explanation-Based Generalization: A Unifying View’. Machine Learning 1:47–80 – Mitchell - 1986
35 Acquiring Lexical Knowledge from Text: A Case Study – Jacobs, Zernik - 1988
34 University of massachusetts: Description of the CIRCUS system as used for MUC-4 – Lchncrt, Cardie, et al. - 1992
24 Acquisition of Semantic Patterns for Information Extraction from Corpora – Kim, Moldovan - 1993
21 Automatically Acquiring Conceptual Patterns Without an Annotated Corpus – Riloff, Shoen - 1995
20 Towards a Self-Extending Parser – Carbonell - 1979
16 University of Massachusetts: MUC-4 Test Results and Analysis – Lehnert, Cardie, et al. - 1992