|
81
|
A Neural Probabilistic Language Model
– Yoshua Bengio, Réjean Ducharme, Pascal Vincent, Christian Jauvin
- 2003
|
|
353
|
Training Products of Experts by Minimizing Contrastive Divergence
– Geoffrey Hinton
- 2000
|
|
154
|
Learning distributed representations of concepts
– G. E. Hinton
- 1986
|
|
449
|
SRILM—An extensible language modeling toolkit
– Andreas Stolcke
- 2002
|
|
16
|
Connectionist Language Modeling For Large Vocabulary Continuous Speech Recognition
– Holger Schwenk, Jean-Luc Gauvain
- 2002
|
|
540
|
Class-Based n-gram Models of Natural Language
– Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai
- 1992
|
|
1996
|
WordNet: An Electronic Lexical Database
– Christiane Fellbaum, editor
- 1998
|
|
2168
|
Indexing by latent semantic analysis
– Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman
- 1990
|
|
11
|
Hierarchical probabilistic neural network language model
– Frederic Morin, Yoshua Bengio
- 2005
|
|
18
|
Sequential Neural Text Compression
– Jürgen Schmidhuber, Stefan Heil
- 1996
|
|
77
|
Natural Language Processing with Modular PDP Networks and Distributed Lexicon
– Risto Miikkulainen, Michael G. Dyer
- 1991
|
|
28
|
Comparison Of Part-Of-Speech And Automatically Derived Category-Based Language Models For Speech Recognition
– T.R. Niesler, E. W. D. Whittaker, P.C. Woodland
- 1998
|
|
288
|
Interpolated estimation of Markov source parameters from sparse data
– F. Jelinek, R. L. Mercer
- 1980
|
|
70
|
A Bit of Progress in Language Modeling
– Joshua T. Goodman
- 2001
|
|
1313
|
Finding structure in time
– Jeffrey L. Elman
- 1990
|
|
847
|
A Maximum Entropy approach to Natural Language Processing
– Adam L. Berger, Stephen A. Della Pietra, Vincent J. Della Pietra
- 1996
|
|
198
|
Distributional Clustering of Words for Text Classification
– L. Douglas Baker, Andrew Kachites McCallum
- 1998
|
|
478
|
Distributional Clustering Of English Words
– Fernando Pereira, Naftali Tishby, Lillian Lee
- 1993
|
|
577
|
Estimation of probabilities from sparse data for the language model component of a speech recognizer
– Slava M. Katz
- 1987
|