• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

DMCA

The Tradeoffs Between Open and Traditional Relation Extraction

Cached

  • Download as a PDF

Download Links

  • [turing.cs.washington.edu]
  • [ai.cs.washington.edu]
  • [www.sis.pitt.edu]
  • [www.sis.pitt.edu]
  • [aclweb.org]
  • [www.aclweb.org]
  • [wing.comp.nus.edu.sg]
  • [www.aclweb.org]
  • [aclweb.org]
  • [aclweb.org]
  • [www.aclweb.org]
  • [wing.comp.nus.edu.sg]
  • [www.cs.washington.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Michele Banko , Oren Etzioni
Citations:112 - 10 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Banko_thetradeoffs,
    author = {Michele Banko and Oren Etzioni},
    title = {The Tradeoffs Between Open and Traditional Relation Extraction},
    year = {}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

Traditional Information Extraction (IE) takes a relation name and hand-tagged examples of that relation as input. Open IE is a relationindependent extraction paradigm that is tailored to massive and heterogeneous corpora such as the Web. An Open IE system extracts a diverse set of relational tuples from text without any relation-specific input. How is Open IE possible? We analyze a sample of English sentences to demonstrate that numerous relationships are expressed using a compact set of relation-independent lexico-syntactic patterns, which can be learned by an Open IE system. What are the tradeoffs between Open IE and traditional IE? We consider this question in the context of two tasks. First, when the number of relations is massive, and the relations themselves are not pre-specified, we argue that Open IE is necessary. We then present a new model for Open IE called O-CRF and show that it achieves increased precision and nearly double the recall than the model employed by TEXTRUNNER, the previous stateof-the-art Open IE system. Second, when the number of target relations is small, and their names are known in advance, we show that O-CRF is able to match the precision of a traditional extraction system, though at substantially lower recall. Finally, we show how to combine the two types of systems into a hybrid that achieves higher precision than a traditional extractor, with comparable recall. 1

Keyphrases

open ie    traditional relation extraction    tradeoff open    open ie system    traditional extraction system    relation-specific input    relational tuples    traditional extractor    diverse set    comparable recall    traditional ie    numerous relationship    english sentence    relation name    traditional information extraction    new model    heterogeneous corpus    relation-independent lexico-syntactic pattern    relationindependent extraction paradigm    previous stateof-the-art open ie system    hand-tagged example    compact set    target relation   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University