On Some Pitfalls in Automatic Evaluation and Significance Testing for MT (2005)

by Stefan Riezler
Citations:29 - 3 self

Documents Related by Co-Citation

1470 BLEU: a Method for Automatic Evaluation of Machine Translation – Kishore Papineni, Salim Roukos, Todd Ward, Wei-jing Zhu - 2002
452 Minimum Error Rate Training in Statistical Machine Translation – Franz Josef Och - 2003
875 Moses: Open Source Toolkit for Statistical Machine Translation”. ACL-2007 – Hieu Hoang, Alexandra Birch, Chris Callison-burch, Richard Zens, Marcello Federico, Nicola Bertoldi, Chris Dyer, Brooke Cowan, Wade Shen, Christine Moran, Ondřej Bojar, Alexandra Constantin, Evan Herbst - 2007
633 Statistical phrase-based translation – Franz Josef Och, Daniel Marcu - 2003
375 Hierarchical phrase-based translation – David Chiang - 2007
1250 A Systematic Comparison of Various Statistical Alignment Models – Franz Josef Och, Hermann Ney, Franz Josef, Och Hermann Ney - 2003
147 Alignment by agreement – Percy Liang, et al. - 2006
309 Automatic Evaluation of Machine Translation Quality Using N-gram CoOccurrence Statistics – G Doddington - 2003
42 A unigram orientation model for statistical machine translation – C Tillmann - 2004
13 Phrasal: A statistical machine translation toolkit for Exploring new model features – Daniel Cer, Michel Galley, Daniel Jurafsky, Christopher D Manning - 2010
348 The alignment template approach to statistical machine translation – F Och, H Ney - 2004
367 Discriminative Training and Maximum Entropy Models for Statistical Machine Translation – Franz Josef Och, Hermann Ney - 2002
293 Online passiveaggressive algorithms – Koby Crammer, Ofer Dekel, Shai Shalev-shwartz, Yoram Singer - 2006
69 Online Large-Margin Training of Syntactic and Structural Translation Features – David Chiang, Yuval Marton, Philip Resnik
78 Computer Intensive Methods for Testing Hypothesis: An Introduction – E W Noreen - 1989
121 A Smorgasbord of Features for Statistical Machine Translation – Franz Josef Och, Kenji Yamada, Alex Fraser, Daniel Gildea, Viren Jain, et al. - 2004
15 Statistical significance of muc-6 results – N Chinchor - 1992
57 Unsupervised topic modelling for multi-party spoken discourse – Matthew Purver, Konrad P. Körding, Thomas L. Griffiths - 2006
71 Modeling Online Reviews with Multi-grain Topic Models – Ivan Titov, Ryan McDonald - 2008