• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

The PLUG Project: Parallel corpora in Linkoping, uppsala, goteborg: Aims and achievements (1999)

by Anna Sagvail Hein
Venue:Uppsala University
Add To MetaCart

Tools

Sorted by:
Results 1 - 3 of 3

Combining Clues for Word Alignment

by Jörg Tiedemann - In Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL): 12–17 April 2003; Budapest Programme chairs Copestake A, Hajic J , 2003
"... In this paper, a word alignment approach is presented which is based on a combination of clues. Word alignment clues indicate associations between words and phrases. They can be based on features such as frequency, part-of-speech, phrase type, and the actual wordform strings. Clues can be found by c ..."
Abstract - Cited by 16 (0 self) - Add to MetaCart
In this paper, a word alignment approach is presented which is based on a combination of clues. Word alignment clues indicate associations between words and phrases. They can be based on features such as frequency, part-of-speech, phrase type, and the actual wordform strings. Clues can be found by calculating similarity measures or learned from word aligned data. The clue alignment approach...

Evaluation of Word Alignment Systems

by Lars Ahrenberg, Magnus Merkel, Anna Sågvall Hein, Jörg Tiedemann , 2000
"... Recent years have seen a few serious attempts to develop methods and measures for the evaluation of word alignment systems, notably the Blinker project (Melamed, 1998) and the ARCADE project (Vronis and Langlais, forthcoming). In this paper we discuss different approaches to the problem and report o ..."
Abstract - Cited by 9 (1 self) - Add to MetaCart
Recent years have seen a few serious attempts to develop methods and measures for the evaluation of word alignment systems, notably the Blinker project (Melamed, 1998) and the ARCADE project (Vronis and Langlais, forthcoming). In this paper we discuss different approaches to the problem and report on results from a project where two word alignment systems have been evaluated. These results include methods and tools for the generation of reference data and a set of measures for system performance. We note that the selection and sampling of reference data can have a great impact on scoring results.

Extracting Phrasal Terms using Bitext

by Jörg Tiedemann
"... This paper focuses on the improvement of automatically generated phrase lists by applying word alignment approaches to parallel bitext. Such phrase lists, in terms of multi-word collocations, serve several tasks such as the compilation of terminology databases or translation database in the multilin ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
This paper focuses on the improvement of automatically generated phrase lists by applying word alignment approaches to parallel bitext. Such phrase lists, in terms of multi-word collocations, serve several tasks such as the compilation of terminology databases or translation database in the multilingual case. Our investigations are based on the assumption that word alignment favors well-formed phrase structures rather than irregular text segments. If this is the case, word alignment will lter out irregular structures from automatically generated phrase lists. As a result, an improved phrase list may be compiled based on the bilingual lexicon that could be extracted. Furthermore, word alignment approaches can be used to identify additional multi-word units by comparing corresponding multiword units from the bitext. Our investigations will be focused on a Swedish/English text collection that has been aligned with the Uppsala Word Aligner (UWA). 1 Introduction Domain specic terminolog...
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University