Quick Training of Probabilistic Neural Nets by Importance Sampling (2003)

by Yoshua Bengio , Jean-Sébastien Senécal
Citations:11 - 5 self

Documents Related by Co-Citation

145 A Neural Probabilistic Language Model – Yoshua Bengio, Réjean Ducharme, Pascal Vincent, Christian Jauvin - 2003
33 Hierarchical probabilistic neural network language model – Frederic Morin, Yoshua Bengio - 2005
509 Training Products of Experts by Minimizing Contrastive Divergence – Geoffrey Hinton - 2000
755 SRILM -- An extensible language modeling toolkit – Andreas Stolcke - 2002
27 Connectionist Language Modeling For Large Vocabulary Continuous Speech Recognition – Holger Schwenk, Jean-luc Gauvain - 2002
172 Learning distributed representations of concepts – G Hinton - 1986
43 Three New Graphical Models for Statistical Language Modelling – Andriy Mnih, Geoffrey Hinton
113 A unified architecture for natural language processing: Deep neural networks with multitask learning – Ronan Collobert, Jason Weston - 2008
44 A scalable hierarchical distributed language model – Andriy Mnih, Geoffrey Hinton - 2008
698 Class-Based n-gram Models of Natural Language – Peter F. Brown, Peter V. deSouza, Robert L. Mercer, Vincent J. Della Pietra, Jenifer C. Lai - 1992
1082 A Maximum Entropy approach to Natural Language Processing – Adam L. Berger, Stephen A. Della Pietra , Vincent J. Della Pietra - 1996
2663 WordNet: An Electronic Lexical Database – Christiane Fellbaum, ed - 1998
2703 Indexing by latent semantic analysis – Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, Richard Harshman - 1990
549 Distributional Clustering Of English Words – Fernando Pereira, Naftali Tishby, Lillian Lee - 1993
55 Word representations: A simple and general method for semisupervised learning – Joseph Turian, Département D’informatique Et, Recherche Opérationnelle (diro, Université De Montréal, Lev Ratinov, Yoshua Bengio - 2010
5 Adaptive Importance Sampling to Accelerate Training of a Neural Probabilistic Language Model – Jean-sébastien Senécal, Jean-sébastien Senécal, Yoshua Bengio - 2003
21 Sequential Neural Text Compression – Jürgen Schmidhuber, Stefan Heil - 1996
30 Comparison Of Part-Of-Speech And Automatically Derived Category-Based Language Models For Speech Recognition – T.R. Niesler, E. W. D. Whittaker, P.C. Woodland - 1998
83 Natural Language Processing with Modular PDP Networks and Distributed Lexicon – Risto Miikkulainen, Michael G. Dyer - 1991