• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Why initialization matters for ibm model 1: Multiple optima and nonstrict convexity (2011)

by K Toutanova, M Galley
Venue:In Proc. of ACL
Add To MetaCart

Tools

Sorted by:
Results 1 - 1 of 1

Concavity and Initialization for Unsupervised Dependency Grammar Induction

by Kevin Gimpel, Noah A. Smith
"... We examine models for unsupervised learning with concave log-likelihood functions. We begin with the most well-known example, IBM Model 1 for word alignment (Brown et al., 1993), and study its properties, discussing why other models for unsupervised learning are so seldom concave. We then present co ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
We examine models for unsupervised learning with concave log-likelihood functions. We begin with the most well-known example, IBM Model 1 for word alignment (Brown et al., 1993), and study its properties, discussing why other models for unsupervised learning are so seldom concave. We then present concave models for dependency grammar induction and validate them experimentally. Despite their simplicity, we find that initializing the dependency model with valence using our concave models can approach state of the art grammar induction results for English and Chinese. 1
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University