Paraphrase Acquisition for Information Extraction
Abstract:
We aim to acquire paraphrases from Japanese news articles for use in Information Extraction. Our approach relies on the fact that a single event is often reported in more than one article, in different ways. However, certain kinds of noun phrases, such as names, dates and numbers, behave as "anchors" that are unlikely to change across articles. Our key idea is to identify these anchors among comparable articles and extract the portions of expressions that share them. In this way we can extract expressions that convey the same information. The obtained paraphrases are generalized as templates and stored for future use.
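The anchor idea can be illustrated with a small sketch. This is a toy illustration under stated assumptions, not the authors' implementation: it works on English sentences rather than Japanese, it uses a crude rule (digits and non-initial capitalized words) as a stand-in for real named entity tagging, and the function names `find_anchors`, `to_template` and `extract_paraphrases` are hypothetical.

```python
import re


def tokens(sentence):
    # Crude tokenizer: runs of letters or runs of digits.
    return re.findall(r"[A-Za-z]+|\d+", sentence)


def find_anchors(sentence):
    # Assumption: numbers and non-initial capitalized words act as anchors.
    # A real system would use a named entity tagger instead.
    toks = tokens(sentence)
    return {
        t for i, t in enumerate(toks)
        if t.isdigit() or (i > 0 and t[0].isupper())
    }


def to_template(sentence, slots):
    # Replace each shared anchor with its slot name (X1, X2, ...).
    return " ".join(slots.get(t, t) for t in tokens(sentence))


def extract_paraphrases(article_a, article_b, min_shared=2):
    # Pair sentences from two comparable articles that share enough anchors,
    # then generalize both sentences into templates over the same slots.
    for sent_a in article_a:
        for sent_b in article_b:
            shared = find_anchors(sent_a) & find_anchors(sent_b)
            if len(shared) < min_shared:
                continue
            slots = {a: f"X{i}" for i, a in enumerate(sorted(shared), 1)}
            yield to_template(sent_a, slots), to_template(sent_b, slots)


article_a = [
    "Police arrested Tanaka on 12 May.",
    "Stocks fell 3 percent in Tokyo.",
]
article_b = ["On 12 May, Tanaka was taken into custody by police."]

pairs = list(extract_paraphrases(article_a, article_b))
for template_a, template_b in pairs:
    print(template_a, "<=>", template_b)
```

Here the two sentences about the arrest share the anchors "Tanaka", "12" and "May", so they are paired and generalized into templates over the same slots, while the unrelated stock sentence is skipped. The `min_shared` threshold is an illustrative parameter chosen for this sketch.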
Citations
| 115 | Discovery of inference rules for question-answering – Lin, Pantel - 2001 |
| 104 | Extracting paraphrases from a parallel corpus – Barzilay, McKeown - 2001 |
| 60 | Syntax-based alignment of multiple translations: Extracting paraphrases and generating new sentences – Pang, Knight, et al. - 2003 |
| 47 | Automatic paraphrase acquisition from news articles – Shinyama, Sekine, et al. - 2002 |
| 41 | Japanese Morphological Analysis System JUMAN version 3.5 – Kurohashi, Nagao - 1998 |
| 15 | Automatic pattern acquisition for Japanese information extraction – Sudo, Sekine, et al. - 2001 |
| 12 | UMASS Approaches to Detection and Tracking at TDT2 – Papka, Allan, et al. - 1999 |
| 4 | Named Entity Extraction Based on A Maximum Entropy Model and Transformation Rules – Uchimoto, Ma, et al. - 2000 |
| 1 | Kurohashi-Nagao parser. Kyoto University, version 2.0 b6 edition – Kurohashi - 1998 |

