Variational learning and bits-back coding: an information-theoretic view to Bayesian learning
| Venue: | IEEE Transactions on Neural Networks |
| Citations: | 16 - 7 self |
BibTeX
@ARTICLE{Honkela_variationallearning,
author = {Antti Honkela and Harri Valpola},
title = {Variational learning and bits-back coding: an information-theoretic view to Bayesian learning},
journal = {IEEE Transactions on Neural Networks},
year = {},
pages = {2004}
}
Years of Citing Articles
OpenURL
Abstract
Abstract—The bits-back coding first introduced by Wallace in 1990 and later by Hinton and van Camp in 1993 provides an interesting link between Bayesian learning and information-theoretic minimum-description-length (MDL) learning approaches. The bits-back coding allows interpreting the cost function used in the variational Bayesian method called ensemble learning as a code length in addition to the Bayesian view of misfit of the posterior approximation and a lower bound of model evidence. Combining these two viewpoints provides interesting insights to the learning process and the functions of different parts of the model. In this paper, the problem of variational Bayesian learning of hierarchical latent variable models is used to demonstrate the benefits of the two views. The code-length interpretation provides new views to many parts of the problem such as model comparison and pruning and helps explain many phenomena occurring in learning. Index Terms—Bits-back coding, ensemble learning, hierarchical latent variable models, minimum description length, variational Bayesian learning. I.







