Why Initialization Matters for IBM Model 1: Multiple Optima and Non-Strict Convexity (2011)

by Kristina Toutanova, Michel Galley
Venue: In Proceedings of the ACL