An Empirical Study of Smoothing Techniques for Language Modeling (1998)

by Stanley F. Chen , Stanley F. Chen , Joshua Goodman , Joshua Goodman
Citations:631 - 19 self

Documents Related by Co-Citation

577 Estimation of probabilities from sparse data for the language model component of a speech recognizer – Slava M. Katz - 1987
288 Interpolated estimation of Markov source parameters from sparse data – F Jelinek, R L Mercer - 1980
149 On structuring probabilistic dependencies in stochastic language modeling. Computer Speech and Language – H Ney, U Essen, R Kneser - 1994
286 The population frequencies of species and the estimation of population parameters – I J Good - 1953
540 Class-Based n-gram Models of Natural Language – Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai - 1992
179 Improved backing-off for m-gram language modeling – Kneser, Hermann Ney - 1995
67 A Hierarchical Dirichlet Language Model – David J.C. MacKay, Linda C. Bauman Peto - 1994
6234 Maximum likelihood from incomplete data via the EM algorithm – A. P. Dempster, N. M. Laird, D. B. Rubin - 1977
891 The Mathematics of Statistical Machine Translation: Parameter Estimation – Peter F. Brown, Vincent J.Della Pietra, Stephen A. Della Pietra, Robert. L. Mercer - 1993
33 Estimation of probabilities in the language model of the IBM speech recognition system – Arthur Nadas - 1984
478 Distributional Clustering Of English Words – Fernando Pereira, Naftali Tishby, Lillian Lee - 1993
355 Generalized Iterative Scaling for Log-Linear Models – J N Darroch, D Ratcliff - 1972
63 A Spelling Correction Program Based on a Noisy Channel Model – Mark D Kernighan, Kenneth W Church, William A Gale - 1990
502 A statistical approach to machine translation – Peter F. Brown, John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Fredrick Jelinek, John D. Lafferty, Robert L. Mercer, Paul S. Roossin - 1990
422 An inequality and associated maximization technique in statistical estimations of probabilistic functions of Markov processes. Inequalities – L E Baum - 1972
125 Dimensions of Meaning – Hinrich Schütze - 1992
119 A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams. Computer Speech and Language – Kenneth W Church, William A Gale - 1991
465 Inducing Features of Random Fields – Stephen Della Pietra, Vincent Della Pietra, John Lafferty - 1997
10 Lattice Based Language Models – Pierre Dupont, Ronald Rosenfeld - 1997