## Smoothing Methods In Maximum Entropy Language Modeling (1999)

Venue: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, volume I

Citations: 7 (1 self)

### BibTeX

@INPROCEEDINGS{Martin99smoothingmethods,

author = {S. C. Martin and H. Ney and J. Zaplo},

title = {Smoothing Methods In Maximum Entropy Language Modeling},

booktitle = {Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, volume I},

year = {1999},

pages = {545--548}

}


### Abstract

This paper discusses various aspects of smoothing techniques in maximum entropy language modeling, a topic not sufficiently covered by previous publications. We show (1) that straightforward maximum entropy models with nested features, e.g. tri-, bi-, and unigrams, result in unsmoothed relative frequencies models; (2) that maximum entropy models with nested features and discounted feature counts approximate backing-off smoothed relative frequencies models with Kneser's advanced marginal back-off distribution; this explains some of the reported success of maximum entropy models in the past; (3) perplexity results for nested and non-nested features, e.g. trigrams and distance-trigrams, on a 4-million word subset of the Wall Street Journal Corpus, showing that the smoothing method has more effect on the perplexity than the method to combine information.
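Claim (1) of the abstract can be illustrated on a toy example. The following is a minimal sketch, not the paper's experimental setup: the corpus, the function name `fit_nested_maxent`, and the use of plain gradient ascent as the training procedure are all assumptions made for illustration. It fits a conditional maximum entropy bigram model with nested features (one feature per vocabulary word, one per seen bigram) and compares the result with unsmoothed bigram relative frequencies.

```python
import math
from collections import Counter


def fit_nested_maxent(tokens, steps=20000, lr=0.5):
    """Fit p(w | v) proportional to exp(lam_uni[w] + lam_bi[v, w]).

    Hypothetical helper for illustration. Features are nested: one unigram
    feature per word and one bigram feature per bigram seen in training.
    Training is plain gradient ascent on the average log-likelihood.
    """
    vocab = sorted(set(tokens))
    bigrams = Counter(zip(tokens, tokens[1:]))  # N(v, w)
    hist = Counter(tokens[:-1])                 # N(v), history counts
    uni_obs = Counter(tokens[1:])               # counts of predicted words
    total = len(tokens) - 1                     # number of training events

    lam_uni = {w: 0.0 for w in vocab}
    lam_bi = {vw: 0.0 for vw in bigrams}

    def cond_probs():
        probs = {}
        for v in hist:
            scores = {w: lam_uni[w] + lam_bi.get((v, w), 0.0) for w in vocab}
            top = max(scores.values())  # subtract the max for stability
            z = sum(math.exp(s - top) for s in scores.values())
            for w in vocab:
                probs[v, w] = math.exp(scores[w] - top) / z
        return probs

    for _ in range(steps):
        p = cond_probs()
        # Gradient per feature = observed count - expected count.
        for w in vocab:
            expected = sum(hist[v] * p[v, w] for v in hist)
            lam_uni[w] += lr * (uni_obs[w] - expected) / total
        for (v, w) in lam_bi:
            lam_bi[v, w] += lr * (bigrams[v, w] - hist[v] * p[v, w]) / total

    rel_freq = {(v, w): bigrams[v, w] / hist[v] for v in hist for w in vocab}
    return vocab, cond_probs(), rel_freq


# Toy corpus (an assumption for illustration only).
tokens = "a b a c a b b a c b".split()
vocab, model, relfreq = fit_nested_maxent(tokens)

# Largest absolute gap between the fitted model and relative frequencies.
max_gap = max(abs(model[k] - relfreq[k]) for k in relfreq)
print(f"max |p_maxent - relative frequency| = {max_gap:.4f}")
for v in vocab:
    print(v, [round(model[v, w], 3) for w in vocab])
```

In this sketch the bigram constraints force the model toward N(v, w) / N(v) for seen bigrams, so the probability left for unseen bigrams shrinks toward zero as training proceeds. The unsmoothed optimum is only reached in the limit of unbounded parameters, which is why the comparison uses a tolerance rather than exact equality.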

