Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora

by Shiqi Zhao, Haifeng Wang, Ting Liu, Sheng Li
Venue: Proceedings of ACL-08: HLT
Citations: 24 (3 self)

BibTeX

@INPROCEEDINGS{Zhao_2008b.pivot,
    author = {Shiqi Zhao and Haifeng Wang and Ting Liu and Sheng Li},
    title = {Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora},
    booktitle = {Proceedings of ACL-08: HLT},
    year = {2008},
    pages = {780--788}
}


Abstract

Paraphrase patterns are useful in paraphrase recognition and generation. In this paper, we present a pivot approach for extracting paraphrase patterns from bilingual parallel corpora, whereby the English paraphrase patterns are extracted using the sentences in a foreign language as pivots. We propose a log-linear model to compute the paraphrase likelihood of two patterns and exploit feature functions based on maximum likelihood estimation (MLE) and lexical weighting (LW). Using the presented method, we extract over 1,000,000 pairs of paraphrase patterns from 2M bilingual sentence pairs, with a precision exceeding 67%. The evaluation results show that: (1) The pivot approach is effective in extracting paraphrase patterns and significantly outperforms the conventional method DIRT. In particular, the log-linear model with the proposed feature functions achieves high performance. (2) The coverage of the extracted paraphrase patterns is high, at above 84%. (3) The extracted paraphrase patterns can be classified into 5 types, which are useful in various applications.
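
The pivot idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes pattern alignments are already available as (English pattern, pivot pattern) pairs, uses only the two MLE-style features (the lexical weighting features are omitted), and sums a log-linear score over shared pivots. The function names, the feature weights, and the toy patterns and pivot labels (`c1`, `c2`) are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def mle_tables(aligned_pairs):
    """Relative-frequency estimates p(c|e) and p(e|c) from (english, pivot) pairs."""
    pair_counts = Counter(aligned_pairs)
    e_totals, c_totals = Counter(), Counter()
    for (e, c), n in pair_counts.items():
        e_totals[e] += n
        c_totals[c] += n
    p_c_given_e = defaultdict(dict)
    p_e_given_c = defaultdict(dict)
    for (e, c), n in pair_counts.items():
        p_c_given_e[e][c] = n / e_totals[e]
        p_e_given_c[c][e] = n / c_totals[c]
    return p_c_given_e, p_e_given_c


def paraphrase_scores(e1, p_c_given_e, p_e_given_c, weights=(1.0, 1.0)):
    """Score candidate paraphrases of e1 by summing a log-linear score over pivots.

    Features (assumed for this sketch): log p(c|e1) and log p(e2|c).
    With unit weights the score reduces to sum_c p(c|e1) * p(e2|c).
    """
    w1, w2 = weights
    scores = defaultdict(float)
    for c, p1 in p_c_given_e.get(e1, {}).items():
        for e2, p2 in p_e_given_c[c].items():
            if e2 != e1:  # a pattern is not its own paraphrase
                scores[e2] += math.exp(w1 * math.log(p1) + w2 * math.log(p2))
    return dict(scores)


# Toy alignments: hypothetical English patterns paired with hypothetical pivot labels.
pairs = [
    ("X solves Y", "c1"), ("X solves Y", "c1"), ("X solves Y", "c2"),
    ("X resolves Y", "c1"),
    ("X deals with Y", "c2"),
]
p_ce, p_ec = mle_tables(pairs)
scores = paraphrase_scores("X solves Y", p_ce, p_ec)
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy data, "X resolves Y" ranks above "X deals with Y" because it shares the more frequent pivot with "X solves Y". Non-unit weights would let some features count more than others, which is the usual motivation for a log-linear combination.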

Keyphrases

pivot approach    paraphrase pattern    extracting paraphrase pattern    bilingual corpus    extracted paraphrase pattern    maximum likelihood estimation    lexical weighting    paraphrase likelihood    log-linear model    various application    exploit feature function    feature function    high performance    evaluation result    loglinear model    foreign language    conventional method dirt    bilingual sentence pair    paraphrase recognition    english paraphrase pattern    bilingual parallel corpus   
