• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 104,303
Next 10 →

Pivot-based Triangulation for Low-Resource Languages

by Rohit Dholakia, Anoop Sarkar
"... This paper conducts a comprehensive study on the use of triangulation for four very low-resource languages: Mawukakan and Maninkakan, Haitian Kreyol and Malagasy. To the best of our knowledge, ours is the first effective translation system for the first two of these languages. We improve translation ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
the weighted mixture of direct and pivot based phrase pairs to improve translation quality. 1

Improving Pivot-Based Statistical Machine Translation Using Random Walk

by Xiaoning Zhu, Conghui Zhu, Tiejun Zhao, Zhongjun He, Hua Wu, Haifeng Wang
"... * This work was done when the first author was visiting Baidu. This paper proposes a novel approach that uti-lizes a machine learning method to improve pivot-based statistical machine translation (SMT). For language pairs with few bilingual data, a possible solution in pivot-based SMT using another ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
* This work was done when the first author was visiting Baidu. This paper proposes a novel approach that uti-lizes a machine learning method to improve pivot-based statistical machine translation (SMT). For language pairs with few bilingual data, a possible solution in pivot-based SMT using another

Translation Quality Indicators for Pivot-based Statistical MT

by Michael Paul, Eiichiro Sumita
"... Recent research on multilingual statisti-cal machine translation focuses on the us-age of pivot languages in order to over-come resource limitations for certain lan-guage pairs. This paper provides new in-sights into what factors make a good pivot language and investigates the impact of these factor ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Recent research on multilingual statisti-cal machine translation focuses on the us-age of pivot languages in order to over-come resource limitations for certain lan-guage pairs. This paper provides new in-sights into what factors make a good pivot language and investigates the impact

Pivot-based Machine Translation between Statistical and Black Box systems

by Antonio Toral
"... This paper presents a novel approach to pivot-based machine translation (MT): while the state-of-the-art uses two statistical systems, this proposal treats the second system as a black box. Our approach effecively provides pivot-based MT to target languages for which no suitable bilingual corpora ar ..."
Abstract - Add to MetaCart
This paper presents a novel approach to pivot-based machine translation (MT): while the state-of-the-art uses two statistical systems, this proposal treats the second system as a black box. Our approach effecively provides pivot-based MT to target languages for which no suitable bilingual corpora

Dialect Translation: Integrating Bayesian Co-segmentation Models with Pivot-based SMT

by Michael Paul, Andrew Finch, Paul R. Dixon, Eiichiro Sumita
"... Recent research on multilingual statistical machine translation (SMT) focuses on the usage of pivot languages in order to overcome resource limitations for certain language pairs. This paper proposes a new method to translate a dialect language into a foreign language by integrating transliteration ..."
Abstract - Add to MetaCart
Recent research on multilingual statistical machine translation (SMT) focuses on the usage of pivot languages in order to overcome resource limitations for certain language pairs. This paper proposes a new method to translate a dialect language into a foreign language by integrating transliteration

Discriminative Training and Maximum Entropy Models for Statistical Machine Translation

by Franz Josef Och, Hermann Ney , 2002
"... We present a framework for statistical machine translation of natural languages based on direct maximum entropy models, which contains the widely used source -channel approach as a special case. All knowledge sources are treated as feature functions, which depend on the source language senten ..."
Abstract - Cited by 497 (30 self) - Add to MetaCart
We present a framework for statistical machine translation of natural languages based on direct maximum entropy models, which contains the widely used source -channel approach as a special case. All knowledge sources are treated as feature functions, which depend on the source language

Pivoted Document Length Normalization

by Amit Singhal, Chris Buckley, Mandar Mitra - SIGIR'96 , 1996
"... Automatic information retrieval systems have to deal with documents of varying lengths in a text collection. Document length normalization is used to fairly retrieve documents of all lengths. In this study, we ohserve that a normalization scheme that retrieves documents of all lengths with similar c ..."
Abstract - Cited by 471 (16 self) - Add to MetaCart
different collections. We present pivoted normalization, a technique that can be used to modify any normalization function thereby reducing the gap between the relevance and the retrieval probabilities. Training pivoted normalization on one collection, we can successfully use it on other (new) text

Large-scale Japanese-Chinese Scientific Dictionary Construction via Pivot-based Statistical Machine Translation

by Chenhui Chu, Raj Dabre, Toshiaki Nakazawa, Sadao Kurohashi
"... Pivot-based statistical machine translation (SMT) (Wu and Wang, 2007) has been shown a possible way of constructing a dictionary for the language pairs that have scarce parallel data (Tsunakawa et al., ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Pivot-based statistical machine translation (SMT) (Wu and Wang, 2007) has been shown a possible way of constructing a dictionary for the language pairs that have scarce parallel data (Tsunakawa et al.,

Machine Learning in Automated Text Categorization

by Fabrizio Sebastiani - ACM COMPUTING SURVEYS , 2002
"... The automated categorization (or classification) of texts into predefined categories has witnessed a booming interest in the last ten years, due to the increased availability of documents in digital form and the ensuing need to organize them. In the research community the dominant approach to this p ..."
Abstract - Cited by 1658 (22 self) - Add to MetaCart
to this problem is based on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of preclassified documents, the characteristics of the categories. The advantages of this approach over the knowledge engineering approach (consisting in the manual

A comparison of pivot methods for phrase-based statistical machine translation

by Masao Utiyama, Hitoshi Isahara - in Proceedings of the conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (NAACL-HLT
"... We compare two pivot strategies for phrase-based statistical machine translation (SMT), namely phrase translation and sentence translation. The phrase translation strategy means that we directly construct a phrase translation table (phrase-table) of the source and target language pair from two phras ..."
Abstract - Cited by 45 (0 self) - Add to MetaCart
We compare two pivot strategies for phrase-based statistical machine translation (SMT), namely phrase translation and sentence translation. The phrase translation strategy means that we directly construct a phrase translation table (phrase-table) of the source and target language pair from two
Next 10 →
Results 1 - 10 of 104,303
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2018 The Pennsylvania State University