Results 1 -
3 of
3
Example-based Machine Translation of the Basque Language
- In Proceedings of the 7th Conference of the Association for Machine Translation in the Americas
, 2006
"... Basque is both a minority and a highly inflected language with free order of sentence constituents. Machine Translation of Basque is thus both a real need and a test bed for MT techniques. In this paper, we present a modular Data-Driven MT system which includes different chunkers as well as chunk al ..."
Abstract
-
Cited by 5 (5 self)
- Add to MetaCart
Basque is both a minority and a highly inflected language with free order of sentence constituents. Machine Translation of Basque is thus both a real need and a test bed for MT techniques. In this paper, we present a modular Data-Driven MT system which includes different chunkers as well as chunk aligners which can deal with the free order of sentence constituents of Basque. We conducted Basque to English translation experiments, evaluated on a large corpus (270, 000 sentence pairs). The experimental results show that our system significantly outperforms state-of-the-art approaches according to several common automatic evaluation metrics. 1
Thot: a Toolkit To Train Phrase-based Statistical Translation Models ∗
"... In this paper, we present the Thot toolkit, a set of tools to train phrase-based models for statistical machine translation, which is publicly available as open source software. The toolkit obtains phrase-based models from word-based alignment models; to our knowledge, this functionality has not bee ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
In this paper, we present the Thot toolkit, a set of tools to train phrase-based models for statistical machine translation, which is publicly available as open source software. The toolkit obtains phrase-based models from word-based alignment models; to our knowledge, this functionality has not been offered by any publicly available toolkit. The Thot toolkit also implements a new way for estimating phrase models, this allows to obtain more complete phrase models than the methods described in the literature, including a segmentation length submodel. The toolkit output can be given in different formats in order to be used by other statistical machine translation tools like Pharaoh, which is a beam search decoder for phrase-based alignment models which was used in order to perform translation experiments with the generated models. Additionally, the Thot toolkit can be used to obtain the best alignment between a sentence pair at phrase level. 1
Bilingual Corpora Segmentation Using Bilingual Recursive Alignments
"... A bilingual recursive alignment is a set of nested phrase-based alignments. This set is represented through a binary tree where the inner nodes can model a direct or an inverse translation. In this work, we compute recursive alignments using a greedy algorithm based on a statistical translation dict ..."
Abstract
- Add to MetaCart
A bilingual recursive alignment is a set of nested phrase-based alignments. This set is represented through a binary tree where the inner nodes can model a direct or an inverse translation. In this work, we compute recursive alignments using a greedy algorithm based on a statistical translation dictionary. Different bilingual segmentations can be obtained from a bilingual recursive alignment of a pair of sentences, depending on the degree of granularity to be achieved. Bilingual segmentation results are reported for a Spanish-English task and a Spanish-Basque task. 1.

