• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Bottom-Up Relational Learning of Pattern Matching Rules for Information Extraction (2003)

Cached

  • Download as a PDF
  •  
  • Download as a PS

Download Links

  • [www.jmlr.org]
  • [www.cs.utexas.edu]
  • [acl.ldc.upenn.edu]
  • [ucrel.lancs.ac.uk]
  • [lcg-www.uia.ac.be]
  • [ftp.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [ftp.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [ftp.cs.utexas.edu]
  • [ftp.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.cs.utexas.edu]
  • [www.jmlr.org]

  • Other Repositories/Bibliography

  • CiteULike
  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Mary Elaine Califf , Raymond J. Mooney , David Cohn
Citations:277 - 16 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Califf03bottom-uprelational,
    author = {Mary Elaine Califf and Raymond J. Mooney and David Cohn},
    title = {Bottom-Up Relational Learning of Pattern Matching Rules for Information Extraction},
    booktitle = {},
    year = {2003},
    pages = {328--334}
}

Years of Citing Articles

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

Information extraction is a form of shallow text processing that locates a specified set of relevant items in a natural-language document. Systems for this task require significant domain-specific knowledge and are time-consuming and difficult to build by hand, making them a good application for machine learning. We present an algorithm, RAPIER, that uses pairs of sample documents and filled templates to induce pattern-match rules that directly extract fillers for the slots in the template. RAPIER is a bottom-up learning algorithm that incorporates techniques from several inductive logic programming systems. We have implemented the algorithm in a system that allows patterns to have constraints on the words, part-of-speech tags, and semantic classes present in the filler and the surrounding text. We present encouraging experimental results on two domains.

Citations

1302 WordNet: an online lexical database - Miller, Beckwith, et al. - 1990
962 Efficient induction of logic programs - Muggleton, Feng - 1990
784 Learning logical definitions from relations - Quinlan - 1989
560 Inverse entailment and progol - Muggleton - 1995
428 A note on inductive generalization - Plotkin - 1970
296 Learning Information Extraction Rules for Semistructured and free texts - Soderland
279 Doorenbos, “A Scalable Comparison Shopping Agent for The World Wide Web - Etzioni, Weld, et al. - 1996
258 Eliza: a computer program for the study of natural language communication between man and machine - Weizenbaum - 1966
244 Automatically generating extraction patterns from untagged text - Riloff - 1996
183 Automatically Constructing a Dictionary for Information Extraction Tasks - Riloff - 1993
136 Crystal: Inducing a conceptual dictionary - Soderland, Fisher, et al. - 1995
132 Some Advances in Rulebased Part of Speech Tagging - Brill - 1994
100 Using decision trees for coreference resolution - McCarthy, Lehnert - 1995
90 Machine learning for information extraction in informal domains - Freitag - 2000
81 A performance evaluation of text-analysis technologies - Lehnert, Sundheim - 1991
76 Multi-strategy learning for information extraction - Freitag - 1998
73 Relational learning techniques for natural language information extraction. Doctoral dissertation, The - Califf - 1998
68 Induction of firstorder decision lists: results on learning the past tense of English verbs - Mooney, Califf - 1995
63 Learning information extraction patterns from examples - Huffman - 1995
53 Text Categorization and Relational Learning - Cohen - 1995
34 Acquisition of linguistic patterns for knowledge-based information extraction - Kim, Moldovan - 1995
27 Combining top-down and bottom-up methods in inductive logic programming - Zelle, Mooney - 1994
20 Learning to tag multilingual texts through observation - Bennett, Aone, et al. - 1997
13 Issues in inductive learning of domainspecific text extraction rules - Soderland, Fisher, et al. - 1996
1 14 Relational Learning - Plotkin - 1970
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University