Quick Training of Probabilistic Neural Nets by Importance Sampling (2003)

by Yoshua Bengio, Jean-Sébastien Senécal
Citations: 7 (4 self)

Documents Related by Co-Citation

81 A Neural Probabilistic Language Model – Yoshua Bengio, Réjean Ducharme, Pascal Vincent, Christian Jauvin - 2003
353 Training Products of Experts by Minimizing Contrastive Divergence – Geoffrey Hinton - 2000
154 Learning distributed representations of concepts – G E Hinton - 1986
449 SRILM—An extensible language modeling toolkit – Andreas Stolcke - 2002
16 Connectionist Language Modeling For Large Vocabulary Continuous Speech Recognition – Holger Schwenk, Jean-Luc Gauvain - 2002
540 Class-Based n-gram Models of Natural Language – Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai - 1992
1996 WordNet: An Electronic Lexical Database – Christiane Fellbaum, editor - 1998
2168 Indexing by latent semantic analysis – Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman - 1990
11 Hierarchical probabilistic neural network language model – Frederic Morin, Yoshua Bengio - 2005
18 Sequential Neural Text Compression – Jürgen Schmidhuber, Stefan Heil - 1996
77 Natural Language Processing with Modular PDP Networks and Distributed Lexicon – Risto Miikkulainen, Michael G. Dyer - 1991
28 Comparison Of Part-Of-Speech And Automatically Derived Category-Based Language Models For Speech Recognition – T.R. Niesler, E. W. D. Whittaker, P.C. Woodland - 1998
288 Interpolated estimation of Markov source parameters from sparse data – F Jelinek, R L Mercer - 1980
70 A Bit of Progress in Language Modeling – Joshua T. Goodman - 2001
1313 Finding structure in time – Jeffrey L. Elman - 1990
847 A Maximum Entropy approach to Natural Language Processing – Adam L. Berger, Stephen A. Della Pietra, Vincent J. Della Pietra - 1996
198 Distributional Clustering of Words for Text Classification – L. Douglas Baker, Andrew Kachites McCallum - 1998
478 Distributional Clustering Of English Words – Fernando Pereira, Naftali Tishby, Lillian Lee - 1993
577 Estimation of probabilities from sparse data for the language model component of a speech recognizer – Slava M. Katz - 1987