Results 1 - 10
of
10
Strictly Proper Scoring Rules, Prediction, and Estimation
, 2007
"... Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper if the forecaster maximizes the expected score for an observation drawn from the distribution F if he ..."
Abstract
-
Cited by 86 (13 self)
- Add to MetaCart
Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper if the forecaster maximizes the expected score for an observation drawn from the distribution F if he or she issues the probabilistic forecast F, rather than G ̸ = F. It is strictly proper if the maximum is unique. In prediction problems, proper scoring rules encourage the forecaster to make careful assessments and to be honest. In estimation problems, strictly proper scoring rules provide attractive loss and utility functions that can be tailored to the problem at hand. This article reviews and develops the theory of proper scoring rules on general probability spaces, and proposes and discusses examples thereof. Proper scoring rules derive from convex functions and relate to information measures, entropy functions, and Bregman divergences. In the case of categorical variables, we prove a rigorous version of the Savage representation. Examples of scoring rules for probabilistic forecasts in the form of predictive densities include the logarithmic, spherical, pseudospherical, and quadratic scores. The continuous ranked probability score applies to probabilistic forecasts that take the form of predictive cumulative distribution functions. It generalizes the absolute error and forms a special case of a new and very general type of score, the energy score. Like many other scoring rules, the energy score admits a kernel representation in terms of negative definite functions, with links to inequalities of Hoeffding type, in both univariate and multivariate settings. Proper scoring rules for quantile and interval forecasts are also discussed. We relate proper scoring rules to Bayes factors and to cross-validation, and propose a novel form of cross-validation known as random-fold cross-validation. A case study on probabilistic weather forecasts in the North American Pacific Northwest illustrates the importance of propriety. We note optimum score approaches to point and quantile
A new understanding of prediction markets via no-regret learning
- In ACM EC
, 2010
"... We explore the striking mathematical connections that exist between market scoring rules, cost function based prediction markets, and no-regret learning. We first show that any cost function based prediction market can be interpreted as an algorithm for the commonly studied problem of learning from ..."
Abstract
-
Cited by 16 (5 self)
- Add to MetaCart
We explore the striking mathematical connections that exist between market scoring rules, cost function based prediction markets, and no-regret learning. We first show that any cost function based prediction market can be interpreted as an algorithm for the commonly studied problem of learning from expert advice by equating the set of outcomes on which bets are placed in the market with the set of experts in the learning setting, and equating trades made in the market with losses observed by the learning algorithm. If the loss of the market organizer is bounded, this bound can be used to derive an O ( √ T) regret bound for the corresponding learning algorithm. We then show that the class of markets with convex cost functions exactly corresponds to the class of Follow the Regularized Leader learning algorithms, with the choice of a cost function in the market corresponding to the choice of a regularizer in the learning problem. Finally, we show an equivalence between market scoring rules and prediction markets with convex cost functions. This implies both that any market scoring rule can be implemented as a cost function based market maker, and that market scoring rules can be interpreted naturally as Follow the Regularized Leader algorithms. These connections provide new insight into how it is that commonly studied markets, such as the Logarithmic Market Scoring Rule, can aggregate opinions into accurate estimates of the likelihood of future events.
Subjective Bayesian Analysis: Principle and practice
- BAYESIAN ANALYSIS
, 2006
"... We address the position of subjectivism within Bayesian statistics. We argue, first, that the subjectivist Bayes approach is the only feasible method for tackling many important practical problems. Second, we describe the essential role of the subjectivist approach in scientific analysis. Third, we ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
We address the position of subjectivism within Bayesian statistics. We argue, first, that the subjectivist Bayes approach is the only feasible method for tackling many important practical problems. Second, we describe the essential role of the subjectivist approach in scientific analysis. Third, we consider possible modifications to the Bayesian approach from a subjectivist viewpoint. Finally, we address the issue of pragmatism in implementing the subjectivist approach.
Collective revelation: A mechanism for self-verified, weighted, and truthful predictions
- In: Proc. 10th ACM Conf. on Electronic Commerce
, 2009
"... Decision makers can benefit from the subjective judgment of experts. For example, estimates of disease prevalence are quite valuable, yet can be difficult to measure objectively. Useful features of mechanisms for aggregating expert opinions include the ability to: (1) incentivize participants to be ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
Decision makers can benefit from the subjective judgment of experts. For example, estimates of disease prevalence are quite valuable, yet can be difficult to measure objectively. Useful features of mechanisms for aggregating expert opinions include the ability to: (1) incentivize participants to be truthful; (2) adjust for the fact that some experts are better informed than others; and (3) circumvent the need for objective, “ground truth ” observations. Subsets of these properties are attainable by previous elicitation methods, including proper scoring rules, prediction markets, and the Bayesian truth serum. Our mechanism of collective revelation, however, is the first to simultaneously achieve all three. Furthermore, we introduce a general technique for constructing budget-balanced mechanisms—where no net payments are made to participants—that applies both to collective revelation and to past peer-prediction methods.
Eliciting Expert Judgments: Literature Review
"... Expert judgements are routine in biosecurity risk analysis. This report reviews methods for eliciting probabilities, quantities, and conceptual models. It defines ‘expert’ status, reviews the literature on biases and heuristics in expert judgements and outlines methods for detecting and eliciting va ..."
Abstract
- Add to MetaCart
Expert judgements are routine in biosecurity risk analysis. This report reviews methods for eliciting probabilities, quantities, and conceptual models. It defines ‘expert’ status, reviews the literature on biases and heuristics in expert judgements and outlines methods for detecting and eliciting values, attitudes and motivations. The report describes direct and indirect techniques for eliciting point estimates and uncertainties for quantities, frequencies and probabilities, and for eliciting the structure of conceptual models. It evaluates the use of language-based risk categories and describes methods to detect and adjust for bias and variability in expert judgements. Feedback and training are likely to make useful additions to elicitation protocols. Few of the formal techniques for elicitation, calibration or verification have been evaluated in conditions typical of biosecurity risk analysis, creating an opportunity to test
Default Estimation and Expert Information
, 2008
"... Default is a rare event, even in segments in the midrange of a bank’s portfolio. Inference about default rates is essential for risk management and for compliance with the requirements of Basel II. Most commercial loans are in the middle-risk categories and are to unrated companies. Expert informati ..."
Abstract
- Add to MetaCart
Default is a rare event, even in segments in the midrange of a bank’s portfolio. Inference about default rates is essential for risk management and for compliance with the requirements of Basel II. Most commercial loans are in the middle-risk categories and are to unrated companies. Expert information is crucial in inference about defaults. A Bayesian approach is proposed and illustrated using a prior distribution assessed from an industry expert. The binomial model, most common in applications, is extended to allow correlated defaults. A check of robustness is illustrated with an ɛ − mixture of priors.
Elicitation of Multivariate Prior Distributions: A nonparametric Bayesian approach
"... In the context of Bayesian statistical analysis, elicitation is the process of formulating a prior density f(·) about one or more uncertain quantities to represent a person’s knowledge and beliefs. Several different methods of eliciting prior distributions for one unknown parameter have been propose ..."
Abstract
- Add to MetaCart
In the context of Bayesian statistical analysis, elicitation is the process of formulating a prior density f(·) about one or more uncertain quantities to represent a person’s knowledge and beliefs. Several different methods of eliciting prior distributions for one unknown parameter have been proposed. However, there are relatively few methods for specifying a multivariate prior distribution and most are just applicable to specific classes of problems and/or based on restrictive conditions, such as independence of variables. Besides, many of these procedures require the elicitation of variances and correlations, and sometimes elicitation of hyperparameters which are difficult for experts to specify in practice. Garthwaite, Kadane and O’Hagan (2005) discuss the different methods proposed in the literature and the difficulties of eliciting multivariate prior distributions. We describe a flexible method of eliciting multivariate prior distributions applicable to a wide class of practical problems. Our approach does not assume a parametric form for the unknown prior density f(·), instead we use nonparametric Bayesian inference, modelling f(·) by a Gaussian process prior distribution. The expert is then asked to specify certain summaries of his/her distribution, such as the mean, mode, marginal quantiles and a small number of joint probabilities. The analyst receives that information, treating it as a data set D with which to update his/her prior beliefs to obtain the posterior distribution for f(·). Theoretical properties of joint and marginal priors are derived and numerical illustrations to demonstrate our approach are given.
Elicitation for Bayesian Phylogeny
, 2007
"... Phylogeny provides important information for evolutionary relationships and is useful in applications. Bayesian phylogenetic inference developed quickly because of its computational feasibility. However, the vague priors usually used in Bayesian phylogenetics do not capture knowledge available from ..."
Abstract
- Add to MetaCart
Phylogeny provides important information for evolutionary relationships and is useful in applications. Bayesian phylogenetic inference developed quickly because of its computational feasibility. However, the vague priors usually used in Bayesian phylogenetics do not capture knowledge available from non-genetic sources such as studies of physiology and development. This thesis gives a practical way to elicit a biologist’s 1.1 Phylogeny A phylogeny, sometimes called evolutionary tree, “is a branching tree diagram showing the course of evolution in a group of organisms ” (Felsenstein, 1983, p. 246). It studies the species relationships by analyzing and finding the time the species split from their most
Characterization of Proper and Strictly Proper Scoring Rules for
, 2012
"... We give necessary and sufficient conditions for a scoring rule to be proper (or strictly proper) for a quantile if utility is linear, and the distribution is unrestricted. We also give results when the set of distributions is limited, for example, to distributions that have first moments. Keywords: ..."
Abstract
- Add to MetaCart
We give necessary and sufficient conditions for a scoring rule to be proper (or strictly proper) for a quantile if utility is linear, and the distribution is unrestricted. We also give results when the set of distributions is limited, for example, to distributions that have first moments. Keywords: Elicitation.

