Efficient Sampling and Feature Selection in Whole Sentence Maximum Entropy Language Models

by Stanley F. Chen , Ronald Rosenfeld
Citations:22 - 5 self

Documents Related by Co-Citation

1087 A Maximum Entropy approach to Natural Language Processing – Adam L. Berger, Stephen A. Della Pietra , Vincent J. Della Pietra - 1996
178 Adaptive Statistical Language Modeling: A Maximum Entropy Approach – Ronald Rosenfeld, Jaime Carbonell, Alexander Rudnicky - 1994
554 Inducing Features of Random Fields – Stephen Della Pietra, Vincent Della Pietra, John Lafferty - 1997
246 A Maximum Entropy Approach to Adaptive Statistical Language Modeling – Ronald Rosenfeld - 1996
28 A Whole Sentence Maximum Entropy Language Model – R. Rosenfeld - 1997
230 A Gaussian Prior for Smoothing Maximum Entropy Models – Stanley F. Chen, Ronald Rosenfeld - 1999
431 Generalized iterative scaling for log-linear models – J N Darroch, D Ratcliff - 1972
8 Linguistic features for whole sentence maximum entropy language models – X Zhu, S F Chen, R Rosenfeld - 1999
31 A multispan language modeling framework for large vocabulary speech recognition – Jerome R. Bellegarda, Senior Member - 1998
740 Statistical methods for speech recognition – F Jelinek - 1997
5 Interactive Feature Induction And Logistic Regression For Whole Sentence Exponential Language Models – Ronald Rosenfeld , Larry Wasserman, Can Cai, Xiaojin Zhu - 1999
5 The 1996 broadcast news speech and language model corpus – David Graff - 1997
672 Information theory and statistical mechanics – E T Jaynes - 1957
857 An Empirical Study of Smoothing Techniques for Language Modeling – Stanley F. Chen - 1998
702 Class-Based n-gram Models of Natural Language – Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai - 1992
274 Improved backing-off for m-gram language modeling – Reinhard Kneser, Hermann Ney - 1995
95 Modeling Long Distance Dependence in Language: Topic Mixtures vs. Dynamic Cache Models – R. Iyer, M. Ostendorf - 1996
100 Improved Clustering Techniques for Class-based Statistical Language Modelling – R Kneser, H Ney
337 Interpolated estimation of Markov source parameters from sparse data – Frederick Jelinek, Robert L Mercer - 1980