• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

On Bayesian model assessment and choice using cross-validation predictive densities (0)

by A Vehtari, J Lampinen
Add To MetaCart

Tools

Sorted by:
Results 1 - 4 of 4

Bayesian Model Assessment and Comparison Using Cross-Validation Predictive Densities

by Aki Vehtari, Jouko Lampinen - Neural Computation , 2002
"... In this work, we discuss practical methods for the assessment, comparison, and selection of complex hierarchical Bayesian models. A natural way to assess the goodness of the model is to estimate its future predictive capability by estimating expected utilities. Instead of just making a point estimat ..."
Abstract - Cited by 21 (9 self) - Add to MetaCart
In this work, we discuss practical methods for the assessment, comparison, and selection of complex hierarchical Bayesian models. A natural way to assess the goodness of the model is to estimate its future predictive capability by estimating expected utilities. Instead of just making a point estimate, it is important to obtain the distribution of the expected utility estimate, as it describes the uncertainty in the estimate. The distributions of the expected utility estimates can also be used to compare models, for example, by computing the probability of one model having a better expected utility than some other model. We propose an approach using crossvalidation predictive densities to obtain expected utility estimates and Bayesian bootstrap to obtain samples from their distributions. We also discuss the probabilistic assumptions made and properties of two practical cross-validation methods, importance sampling and k-fold cross-validation. As illustrative examples, we use MLP neural networks and Gaussian Processes (GP) with Markov chain Monte Carlo sampling in one toy problem and two challenging real-world problems.

Bayesian Input Variable Selection Using Posterior Probabilities and Expected Utilities

by Aki Vehtari, Jouko Lampinen , 2002
"... We consider the input variable selection in complex Bayesian hierarchical models. Our goal is to find a model with the smallest number of input variables having statistically or practically at least the same expected utility as the full model with all the available inputs. A good estimate for the ..."
Abstract - Cited by 5 (1 self) - Add to MetaCart
We consider the input variable selection in complex Bayesian hierarchical models. Our goal is to find a model with the smallest number of input variables having statistically or practically at least the same expected utility as the full model with all the available inputs. A good estimate for the expected utility can be computed using cross-validation predictive densities. In the case of input selection and a large number of input combinations, the computation of the cross-validation predictive densities for each model easily becomes computationally prohibitive. We propose to use the posterior probabilities obtained via variable dimension MCMC methods to find out potentially useful input combinations, for which the final model choice and assessment is done using the expected utilities.

Model Selection via Predictive Explanatory Power 20

by Aki Vehtari, Jouko Lampinen - Helsinki University of Technology, Laboratory of Computational Engineering , 1998
"... We consider model selection as a decision problem from a predictive perspective. The optimal Bayesian way of handling model uncertainty is to integrate over model space. Model selection can then be seen as point estimation in the model space. We propose a model selection method based on Kullback-Lei ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
We consider model selection as a decision problem from a predictive perspective. The optimal Bayesian way of handling model uncertainty is to integrate over model space. Model selection can then be seen as point estimation in the model space. We propose a model selection method based on Kullback-Leibler divergence from the predictive distribution of the full model to the predictive distributions of the submodels. The loss of predictive explanatory power is defined as the expectation of this predictive discrepancy. The goal is to find the simplest submodel which has a similar predictive distribution as the full model, that is, the simplest submodel whose loss of explanatory power is acceptable. To compute the expected predictive discrepancy between complex models, for which analytical solutions do not exist, we propose to use predictive distributions obtained via k-fold cross-validation. We compare the performance of the method to posterior probabilities (Bayes factors), deviance information criteria (DIC) and direct maximization of the expected utility via crossvalidation.

Expected Utility Estimation via Cross-Validation

by Aki Vehtari, Jouko Lampinen, J. M. Bernardo, M. J. Bayarri, J. O. Berger, A. P. Dawid, D. Heckerman, A. F. M. Smith, M. West (eds, Via Cross-validation , 2003
"... SS-VALIDATION; MODEL ASSESSMENT; MODEL COMPARISON; PREDICTIVE DENSITIES; INFORMATION CRITERIA. 1. INTRODUCTION 1.1. Expected Utilities In prediction and decision problems, it is natural to assess the predictive ability of the model by estimating the expected utilities, that is, the relative valu ..."
Abstract - Add to MetaCart
SS-VALIDATION; MODEL ASSESSMENT; MODEL COMPARISON; PREDICTIVE DENSITIES; INFORMATION CRITERIA. 1. INTRODUCTION 1.1. Expected Utilities In prediction and decision problems, it is natural to assess the predictive ability of the model by estimating the expected utilities, that is, the relative values of consequences of using the model (Good, 1952; Bernardo and Smith, 1994). The posterior predictive distribution of output y for the new input x given the training data D = ); i = 1, 2, . . . , n} is obtained by , D,M) = Z , #, D,M)p(# , D,M)d#. where # denotes all the model parameters and hyperparameters of the prior structures and M is all the prior knowledge in the model specification. We assume that knowing does not give more information about #, that is, p(# , D,M) = p(# D,M ). We would like to estimate how good our model is by estimating the quality of the predictions the model makes for future observations from the same process that generate
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University