|
577
|
Estimation of probabilities from sparse data for the language model component of a speech recognizer
– Slava M. Katz
- 1987
|
|
288
|
Interpolated estimation of Markov source parameters from sparse data
– F Jelinek, R L Mercer
- 1980
|
|
149
|
On structuring probabilistic dependencies in stochastic language modeling. Computer Speech and Language
– H Ney, U Essen, R Kneser
- 1994
|
|
286
|
The population frequencies of species and the estimation of population parameters
– I J Good
- 1953
|
|
540
|
Class-Based n-gram Models of Natural Language
– Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai
- 1992
|
|
179
|
Improved backing-off for m-gram language modeling
– Reinhard Kneser, Hermann Ney
- 1995
|
|
67
|
A Hierarchical Dirichlet Language Model
– David J.C. MacKay, Linda C. Bauman Peto
- 1994
|
|
6234
|
Maximum likelihood from incomplete data via the EM algorithm
– A. P. Dempster, N. M. Laird, D. B. Rubin
- 1977
|
|
891
|
The Mathematics of Statistical Machine Translation: Parameter Estimation
– Peter F. Brown, Vincent J. Della Pietra, Stephen A. Della Pietra, Robert L. Mercer
- 1993
|
|
33
|
Estimation of probabilities in the language model of the IBM speech recognition system
– Arthur Nadas
- 1984
|
|
478
|
Distributional Clustering Of English Words
– Fernando Pereira, Naftali Tishby, Lillian Lee
- 1993
|
|
355
|
Generalized Iterative Scaling for Log-Linear Models
– J N Darroch, D Ratcliff
- 1972
|
|
63
|
A Spelling Correction Program Based on a Noisy Channel Model
– Mark D Kernighan, Kenneth W Church, William A Gale
- 1990
|
|
502
|
A statistical approach to machine translation
– Peter F. Brown, John Cocke, Stephen A. Della Pietra, Vincent J. Della Pietra, Frederick Jelinek, John D. Lafferty, Robert L. Mercer, Paul S. Roossin
- 1990
|
|
422
|
An inequality and associated maximization technique in statistical estimations of probabilistic functions of Markov processes. Inequalities
– L E Baum
- 1972
|
|
125
|
Dimensions of Meaning
– Hinrich Schütze
- 1992
|
|
119
|
A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams. Computer Speech and Language
– Kenneth W Church, William A Gale
- 1991
|
|
465
|
Inducing Features of Random Fields
– Stephen Della Pietra, Vincent Della Pietra, John Lafferty
- 1997
|
|
10
|
Lattice Based Language Models
– Pierre Dupont, Ronald Rosenfeld
- 1997
|