Results 1 - 10
of
16
Strictly Proper Scoring Rules, Prediction, and Estimation
, 2007
"... Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper if the forecaster maximizes the expected score for an observation drawn from the distribution F if he ..."
Abstract
-
Cited by 86 (13 self)
- Add to MetaCart
Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the predictive distribution and on the event or value that materializes. A scoring rule is proper if the forecaster maximizes the expected score for an observation drawn from the distribution F if he or she issues the probabilistic forecast F, rather than G ̸ = F. It is strictly proper if the maximum is unique. In prediction problems, proper scoring rules encourage the forecaster to make careful assessments and to be honest. In estimation problems, strictly proper scoring rules provide attractive loss and utility functions that can be tailored to the problem at hand. This article reviews and develops the theory of proper scoring rules on general probability spaces, and proposes and discusses examples thereof. Proper scoring rules derive from convex functions and relate to information measures, entropy functions, and Bregman divergences. In the case of categorical variables, we prove a rigorous version of the Savage representation. Examples of scoring rules for probabilistic forecasts in the form of predictive densities include the logarithmic, spherical, pseudospherical, and quadratic scores. The continuous ranked probability score applies to probabilistic forecasts that take the form of predictive cumulative distribution functions. It generalizes the absolute error and forms a special case of a new and very general type of score, the energy score. Like many other scoring rules, the energy score admits a kernel representation in terms of negative definite functions, with links to inequalities of Hoeffding type, in both univariate and multivariate settings. Proper scoring rules for quantile and interval forecasts are also discussed. We relate proper scoring rules to Bayes factors and to cross-validation, and propose a novel form of cross-validation known as random-fold cross-validation. A case study on probabilistic weather forecasts in the North American Pacific Northwest illustrates the importance of propriety. We note optimum score approaches to point and quantile
Using Bayesian model averaging to calibrate forecast ensembles. Monthly Weather Review 133
, 2005
"... Ensembles used for probabilistic weather forecasting often exhibit a spread-error correlation, but they tend to be underdispersive. This paper proposes a statistical method for postprocessing ensembles based on Bayesian model averaging (BMA), which is a standard method for combining predictive distr ..."
Abstract
-
Cited by 43 (22 self)
- Add to MetaCart
Ensembles used for probabilistic weather forecasting often exhibit a spread-error correlation, but they tend to be underdispersive. This paper proposes a statistical method for postprocessing ensembles based on Bayesian model averaging (BMA), which is a standard method for combining predictive distributions from different sources. The BMA predictive probability density function (PDF) of any quantity of interest is a weighted average of PDFs centered on the individual bias-corrected forecasts, where the weights are equal to posterior probabilities of the models generating the forecasts and reflect the models ’ relative contributions to predictive skill over the training period. The BMA weights can be used to assess the usefulness of ensemble members, and this can be used as a basis for selecting ensemble members; this can be useful given the cost of running large ensembles. The BMA PDF can be represented as an unweighted ensemble of any desired size, by simulating from the BMA predictive distribution. The BMA predictive variance can be decomposed into two components, one corresponding to the between-forecast variability, and the second to the within-forecast variability. Predictive PDFs or intervals based solely on the ensemble spread incorporate the first component but not the second. Thus BMA provides a theoretical explanation of the tendency of ensembles to exhibit a spread-error correlation but yet
Probabilistic forecasts, calibration and sharpness
- Journal of the Royal Statistical Society Series B
, 2007
"... Summary. Probabilistic forecasts of continuous variables take the form of predictive densities or predictive cumulative distribution functions. We propose a diagnostic approach to the evaluation of predictive performance that is based on the paradigm of maximizing the sharpness of the predictive dis ..."
Abstract
-
Cited by 24 (11 self)
- Add to MetaCart
Summary. Probabilistic forecasts of continuous variables take the form of predictive densities or predictive cumulative distribution functions. We propose a diagnostic approach to the evaluation of predictive performance that is based on the paradigm of maximizing the sharpness of the predictive distributions subject to calibration. Calibration refers to the statistical consistency between the distributional forecasts and the observations and is a joint property of the predictions and the events that materialize. Sharpness refers to the concentration of the predictive distributions and is a property of the forecasts only. A simple theoretical framework allows us to distinguish between probabilistic calibration, exceedance calibration and marginal calibration. We propose and study tools for checking calibration and sharpness, among them the probability integral transform histogram, marginal calibration plots, the sharpness diagram and proper scoring rules. The diagnostic approach is illustrated by an assessment and ranking of probabilistic forecasts of wind speed at the Stateline wind energy centre in the US Pacific Northwest. In combination with cross-validation or in the time series context, our proposal provides very general, nonparametric alternatives to the use of information criteria for model diagnostics and model selection.
Calibrated probabilistic forecasting at the Stateline wind energy center: The regime-switching space-time (RST) method
- Journal of the American Statistical Association
, 2004
"... With the global proliferation of wind power, accurate short-term forecasts of wind resources at wind energy sites are becoming paramount. Regime-switching space-time (RST) models merge meteorological and statistical expertise to obtain accurate and calibrated, fully probabilistic forecasts of wind s ..."
Abstract
-
Cited by 14 (10 self)
- Add to MetaCart
With the global proliferation of wind power, accurate short-term forecasts of wind resources at wind energy sites are becoming paramount. Regime-switching space-time (RST) models merge meteorological and statistical expertise to obtain accurate and calibrated, fully probabilistic forecasts of wind speed and wind power. The model formulation is parsimonious, yet takes account of all the salient features of wind speed: alternating atmospheric regimes, temporal and spatial correlation, diurnal and seasonal non-stationarity, conditional heteroscedasticity, and non-Gaussianity. The RST method identifies forecast regimes at the wind energy site and fits a conditional predictive model for each regime. Geographically dispersed meteorological observations in the vicinity of the wind farm are used as off-site predictors. The RST technique was applied to 2-hour ahead forecasts of hourly average wind speed at the Stateline wind farm in the US Pacific Northwest. In July 2003, for instance, the RST forecasts had root-mean-square error (RMSE) 28.6 % less than the persistence forecasts. For each month in the test period, the RST forecasts had lower RMSE than forecasts using state-of-the-art vector time series techniques. The RST method provides probabilistic forecasts in the form of
Calibrated probabilistic forecasting using ensemble model output statistics and minimum CRPS estimation
- Monthly Weather Review
, 2005
"... Ensemble prediction systems typically show positive spread-error correlation, but they are subject to forecast bias and underdispersion, and therefore uncalibrated. This work proposes the use of ensemble model output statistics (EMOS), an easy to imple-ment post-processing technique that addresses b ..."
Abstract
-
Cited by 13 (8 self)
- Add to MetaCart
Ensemble prediction systems typically show positive spread-error correlation, but they are subject to forecast bias and underdispersion, and therefore uncalibrated. This work proposes the use of ensemble model output statistics (EMOS), an easy to imple-ment post-processing technique that addresses both forecast bias and underdispersion and takes account of the spread-skill relationship. The technique is based on multiple lin-ear regression and akin to the superensemble approach that has traditionally been used for deterministic-style forecasts. The EMOS technique yields probabilistic forecasts that take the form of Gaussian predictive probability density functions (PDFs) for continuous weather variables, and can be applied to gridded model output. The EMOS predictive mean is an optimal, bias-corrected weighted average of the ensemble member forecasts, with coefficients that are constrained to be nonnegative and associated with the member model skill. The EMOS predictive mean provides a highly accurate deterministic-style forecast. The EMOS predictive variance is a linear function of the ensemble spread. For fitting the EMOS coefficients, the method of minimum CRPS estimation is introduced.
Geostatistical Space-Time Models, Stationarity, Separability and Full Symmetry
"... Geostatistical approaches to modeling spatio-temporal data rely on parametric covariance models and rather stringent assumptions, such as stationarity, separability and full symmetry. This paper reviews recent advances in the literature on space-time covariance functions in light of the aforemention ..."
Abstract
-
Cited by 10 (2 self)
- Add to MetaCart
Geostatistical approaches to modeling spatio-temporal data rely on parametric covariance models and rather stringent assumptions, such as stationarity, separability and full symmetry. This paper reviews recent advances in the literature on space-time covariance functions in light of the aforementioned notions, which are illustrated using wind data from Ireland. Experiments with time-forward kriging predictors suggest that the use of more complex and more realistic covariance models results in improved predictive performance.
Quantifying Dynamical Predictability: the Pseudo-Ensemble Approach ∗∗∗∗
"... Abstract The ensemble technique has been widely used in numerical weather prediction and extended-range forecasting. Current approaches to evaluate the predictability using the ensemble technique can be divided into two major groups. One is dynamical, including generating Lyapunov vectors, bred vect ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
Abstract The ensemble technique has been widely used in numerical weather prediction and extended-range forecasting. Current approaches to evaluate the predictability using the ensemble technique can be divided into two major groups. One is dynamical, including generating Lyapunov vectors, bred vectors, and singular vectors, sampling the fastest error-growing directions of the phase space, and examining the dependence of prediction efficiency on ensemble size. The other is statistical, including distributional analysis and quantifying prediction utility by the Shannon entropy and the relative entropy. Currently, with simple models, one could choose as many ensembles as possible, with each ensemble containing a large number of members. When the forecast models become increasingly complicated, however, one would only be able to afford a small number of ensembles, each with limited number of members, thus sacrificing estimation accuracy of the forecast errors. To uncover connections between different information theoretic approaches and between dynamical and statistical approaches, we propose an (ǫ, τ)-entropy and scale-dependent Lyapunov exponent—based general theoretical framework to quantify information loss in ensemble forecasting. More importantly, to tremendously expedite computations, reduce data storage, and improve forecasting accuracy, we propose a technique for constructing a large number of “pseudo ” ensembles from one single solution or scalar dataset. This pseudo-ensemble technique appears to be applicable under rather general conditions, one important situation being that observational data are available but the exact dynamical model is unknown.
Complexity and predictability of daily precipitation in a semi-arid
"... Abstract. Complexity and predictability of daily precipitation in a tropical semi-arid region (Ceará State, Brazil) is assessed by applying entropy concepts. Precipitation regimes in that region depend on several dynamical forcings, the most important being the displacement and activity of the Inter ..."
Abstract
- Add to MetaCart
Abstract. Complexity and predictability of daily precipitation in a tropical semi-arid region (Ceará State, Brazil) is assessed by applying entropy concepts. Precipitation regimes in that region depend on several dynamical forcings, the most important being the displacement and activity of the Intertropical Convergence Zone in the Atlantic Ocean. Topography is another important factor that influences the spatial distribution of rainfall in the region. A hierarchical approach based on sequences of events of different lengths is used to estimate complexity of daily precipitation records. It is shown that precipitation in Ceará exhibit more random than periodic sequences, which indicates a large degree of complexity. Nevertheless, there is indication of potentially inherent rules in the precipitation time-series that could ultimately improve prediction on time-scales between 9–11 days. It is suggested that synoptic-scale disturbances (1–8 days) represent important sources of rules in the precipitation regimes in this region. 1
A New Framework for the Assessment and Calibration of
- J. Climate
, 2002
"... We present a new framework for the assessment and calibration of medium range ensemble temperature forecasts. The method is based on maximising the likelihood of a simple parametric model for the temperature distribution, and leads to some new insights into the predictability of uncertainty. ..."
Abstract
- Add to MetaCart
We present a new framework for the assessment and calibration of medium range ensemble temperature forecasts. The method is based on maximising the likelihood of a simple parametric model for the temperature distribution, and leads to some new insights into the predictability of uncertainty.
1098 MONTHLY WEATHER REVIEW VOLUME 133 Calibrated Probabilistic Forecasting Using Ensemble Model Output Statistics and Minimum CRPS Estimation
, 2004
"... Ensemble prediction systems typically show positive spread-error correlation, but they are subject to forecast bias and dispersion errors, and are therefore uncalibrated. This work proposes the use of ensemble model output statistics (EMOS), an easy-to-implement postprocessing technique that address ..."
Abstract
- Add to MetaCart
Ensemble prediction systems typically show positive spread-error correlation, but they are subject to forecast bias and dispersion errors, and are therefore uncalibrated. This work proposes the use of ensemble model output statistics (EMOS), an easy-to-implement postprocessing technique that addresses both forecast bias and underdispersion and takes into account the spread-skill relationship. The technique is based on multiple linear regression and is akin to the superensemble approach that has traditionally been used for deterministic-style forecasts. The EMOS technique yields probabilistic forecasts that take the form of Gaussian predictive probability density functions (PDFs) for continuous weather variables and can be applied to gridded model output. The EMOS predictive mean is a bias-corrected weighted average of the ensemble member forecasts, with coefficients that can be interpreted in terms of the relative contributions of the member models to the ensemble, and provides a highly competitive deterministic-style forecast. The EMOS predictive variance is a linear function of the ensemble variance. For fitting the EMOS coefficients, the method of minimum continuous ranked probability score (CRPS) estimation is introduced. This technique finds the coefficient values that optimize the CRPS for the training data. The EMOS technique was applied to 48-h forecasts of sea level pressure and surface temperature over the North American Pacific Northwest in spring 2000, using the University of Washington mesoscale ensemble. When compared to the bias-corrected ensemble, deterministic-style EMOS forecasts of sea level pressure had root-mean-square error 9 % less and mean absolute error 7 % less. The EMOS predictive PDFs were sharp, and much better calibrated than the raw ensemble or the bias-corrected ensemble. 1.

