Greedy layer-wise training of deep networks (2007)

Cached

Download Links

by Yoshua Bengio , Pascal Lamblin , Dan Popovici , Hugo Larochelle , Université De Montréal , Montréal Québec
Venue:In NIPS
Citations:105 - 18 self

Documents Related by Co-Citation

241 A fast learning algorithm for deep belief nets – Geoffrey E. Hinton, Simon Osindero - 2006
204 Reducing the dimensionality of data with neural networks – G Hinton, R Salakhutdinov
353 Training Products of Experts by Minimizing Contrastive Divergence – Geoffrey Hinton - 2000
71 Efficient learning of sparse representations with an energy-based model – Marc'Aurelio Ranzato, Christopher Poultney, Sumit Chopra, Yann Lecun - 2006
487 Gradient-based learning applied to document recognition – Yann Lecun, Léon Bottou, Yoshua Bengio, Patrick Haffner - 1998
45 An empirical evaluation of deep architectures on problems with many factors of variation – Hugo Larochelle, Dumitru Erhan, Aaron Courville, James Bergstra, Yoshua Bengio - 2007
427 Sparse coding with an overcomplete basis set: a strategy employed by V1 – Bruno A. Olshausen, David J. Fieldt - 1997
495 Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories – Cordelia Schmid
111 Self-taught learning: Transfer learning from unlabeled data – Rajat Raina, Alexis Battle, Honglak Lee, Benjamin Packer, Andrew Y. Ng - 2007
42 Scaling learning algorithms towards ai – Y Bengio, Y LeCun
683 Emergence of simple-cell receptive field properties by learning a sparse code for natural images – B A Olshausen, D J Field - 1996
43 Sparse Feature Learning for Deep Belief Networks – Y-lan Boureau, Yann Lecun
43 Sparse deep belief net model for visual area V2 – Chaitanya Ekanadham - 2008
93 Convolutional Deep Belief Networks for Scalable Unsupervised Learning of Hierarchical Representations – Honglak Lee, Roger Grosse, Rajesh Ranganath, Andrew Y. Ng
108 Efficient sparse coding algorithms – Honglak Lee, Alexis Battle, Rajat Raina, Andrew Y. Ng - 2007
17 The curse of highly variable functions for local kernel machines – Yoshua Bengio, Olivier Delalleau, Nicolas Le Roux - 2006
51 Modeling human motion using binary latent variables – Graham W. Taylor, Geoffrey E. Hinton, Sam Roweis - 2006
35 A hierarchical Bayesian model for learning nonlinear statistical regularities in natural signals – Y Karklin, M S Lewicki - 2005
54 On Contrastive Divergence Learning – Miguel A. Carreira-Perpinan, Geoffrey E. Hinton