## Learning Common Grammar from Multilingual Corpus

Citations: 2 (1 self)

### Citations

2692 | Aspects of the theory of syntax.
- Chomsky
- 1965
Citation Context ...eanings in different languages. The reasons for these common properties can be attributed to: 1) a common ancestor language, 2) borrowing from nearby languages, and 3) the innate abilities of humans (Chomsky, 1965). We assume hidden commonalities in syntax across languages, and try to extract a common grammar from non-parallel multilingual corpora. For this purpose, we propose a generative model for multilingu... |

571 | Stochastic inversion transduction grammars and bilingual parsing of parallel corpora.
- Wu
- 1997
Citation Context ...icity. The proposed framework can be used for probabilistic grammar models other than PCFG. Grammar induction using bilingual parallel corpora has been studied mainly in machine translation research (Wu, 1997; Melamed, 2003; Eisner, 2003; Chiang, 2005; Blunsom et al., 2009; Snyder et al., 2009). These methods require sentence-aligned parallel data, which can be costly to obtain and difficult to scale to ma... |

429 | The estimation of stochastic context-free grammars using the inside-outside algorithm.
- Lari, Young
- 1990
Citation Context ...ang et al., 2007) while the updating of common grammar parameters by (9) and (10) is new. The inference can be carried out efficiently using the inside-outside algorithm based on dynamic programming (Lari and Young, 1990). After the inference, the probability of a common grammar rule $A \to BC$ is calculated by $\hat{\phi}_{A \to BC} = \hat{\theta}_1 \hat{\phi}_{ABC}$, where $\hat{\theta}_1 = \alpha^{\theta}_1 / (\alpha^{\theta}_0 + \alpha^{\theta}_1)$ and $\hat{\phi}_{ABC} = \alpha^{\phi}_{ABC} / \sum_{B', C'} \alpha^{\phi}_{AB'C'}$ represent ... |

370 | Origins of the coalescent:
- Kingman
- 2000
Citation Context ...l., 2007), and use probabilistic grammar models other than PCFGs. In our model, all the multilingual grammars are generated from a general model. We can extend it hierarchically using the coalescent (Kingman, 1982). That model may help to infer an evolutionary tree of languages in terms of grammatical structure without the etymological information that is generally used (Gray and Atkinson, 2003). Finally, the ... |

346 | The language instinct: How the mind creates language.
- Pinker
- 1994
Citation Context ...fficient inference. Experiments on a non-parallel multilingual corpus of eleven languages demonstrate the feasibility of the proposed method. 1 Introduction Languages share certain common properties (Pinker, 1994). For example, the word order in most European languages is subject-verb-object (SVO), and some words with similar forms are used with similar meanings in different languages. The reasons for these c... |

229 | Corpus-based induction of syntactic structure: Models of dependency and constituency.
- Klein, Manning
- 2004
Citation Context ... distribution (Yu et al., 2005). 2 Related work The unsupervised grammar induction task has been extensively studied (Carroll and Charniak, 1992; Stolcke and Omohundro, 1994; Klein and Manning, 2002; Klein and Manning, 2004; Liang et al., 2007). Recently, models have been proposed that outperform PCFG in the grammar induction task (Klein and Manning, 2002; Klein and Manning, 2004). We used PCFG as a first step for captu... |

156 | Inducing probabilistic grammars by Bayesian model merging.
- Stolcke, Omohundro
- 1994
Citation Context ...hat the model parameters are drawn from a common prior distribution (Yu et al., 2005). 2 Related work The unsupervised grammar induction task has been extensively studied (Carroll and Charniak, 1992; Stolcke and Omohundro, 1994; Klein and Manning, 2002; Klein and Manning, 2004; Liang et al., 2007). Recently, models have been proposed that outperform PCFG in the grammar induction task (Klein and Manning, 2002; Klein and Mann... |

131 | Language-tree divergence times support the Anatolian theory of Indo-European origin. - Gray, Atkinson - 2003 |

118 | The infinite PCFG using hierarchical Dirichlet processes.
- Liang, Petrov, et al.
- 2007
Citation Context ... 2005). 2 Related work The unsupervised grammar induction task has been extensively studied (Carroll and Charniak, 1992; Stolcke and Omohundro, 1994; Klein and Manning, 2002; Klein and Manning, 2004; Liang et al., 2007). Recently, models have been proposed that outperform PCFG in the grammar induction task (Klein and Manning, 2002; Klein and Manning, 2004). We used PCFG as a first step for capturing commonalities i... |

110 | Two experiments on learning probabilistic dependency grammars from corpora.
- Carroll, Charniak
- 1992
Citation Context ...ividual models by assuming that the model parameters are drawn from a common prior distribution (Yu et al., 2005). 2 Related work The unsupervised grammar induction task has been extensively studied (Carroll and Charniak, 1992; Stolcke and Omohundro, 1994; Klein and Manning, 2002; Klein and Manning, 2004; Liang et al., 2007). Recently, models have been proposed that outperform PCFG in the grammar induction task (Klein and ... |

34 | Bayesian synchronous grammar induction.
- Blunsom, Cohn, et al.
- 2008
Citation Context ...listic grammar models other than PCFG. Grammar induction using bilingual parallel corpora has been studied mainly in machine translation research (Wu, 1997; Melamed, 2003; Eisner, 2003; Chiang, 2005; Blunsom et al., 2009; Snyder et al., 2009). These methods require sentence-aligned parallel data, which can be costly to obtain and difficult to scale to many languages. On the other hand, our model does not require sente... |

30 | Bayesian inference for PCFGs via Markov chain Monte Carlo |

27 | An application of the variational Bayesian approach to probabilistic context-free grammars.
- Kurihara, Sato
- 2004
Citation Context ...ptimal approximated posterior can be obtained by updating parameters by (2) - (10) alternatively until convergence. The updating of language dependent distributions by (2) - (8) is also described in (Kurihara and Sato, 2004; Liang et al., 2007) while the updating of common grammar parameters by (9) and (10) is new. The inference can be carried out efficiently using the inside-outside algorithm based on dynamic programmi... |

11 | A probabilistic approach to language change - Bouchard-Côté, Liang, et al. - 2008 |

8 | Unsupervised multilingual grammar induction |