Results 1 -
6 of
6
Logistic Regression in Rare Events Data
, 1999
"... We study rare events data, binary dependent variables with dozens to thousands of times fewer ones (events, such as wars, vetoes, cases of political activism, or epidemiological infections) than zeros (“nonevents”). In many literatures, these variables have proven difficult to explain and predict, a ..."
Abstract
-
Cited by 33 (4 self)
- Add to MetaCart
We study rare events data, binary dependent variables with dozens to thousands of times fewer ones (events, such as wars, vetoes, cases of political activism, or epidemiological infections) than zeros (“nonevents”). In many literatures, these variables have proven difficult to explain and predict, a problem that seems to have at least two sources. First, popular statistical procedures, such as logistic regression, can sharply underestimate the probability of rare events. We recommend corrections that outperform existing methods and change the estimates of absolute and relative risks by as much as some estimated effects reported in the literature. Second, commonly used data collection strategies are grossly inefficient for rare events data. The fear of collecting data with too few events has led to data collections with huge numbers of observations but relatively few, and poorly measured, explanatory variables, such as in international conflict data with more than a quarter-million dyads, only a few of which are at war. As it turns out, more efficient sampling designs exist for making valid inferences, such as sampling all available events (e.g., wars) and a tiny fraction of nonevents (peace). This enables scholars to save as much as 99 % of their (nonfixed) data collection costs or to collect much more meaningful explanatory
Financial options and statistical prediction intervals
- ANN. STATIST
, 2003
"... The paper shows how to convert statistical prediction sets into worst case hedging strategies for derivative securities. The prediction sets can, in particular, be ones for volatilities and correlations of the underlying securities, and for interest rates. This permits a transfer of statistical conc ..."
Abstract
-
Cited by 9 (5 self)
- Add to MetaCart
The paper shows how to convert statistical prediction sets into worst case hedging strategies for derivative securities. The prediction sets can, in particular, be ones for volatilities and correlations of the underlying securities, and for interest rates. This permits a transfer of statistical conclusions into prices for options and similar financial instruments. A prime feature of our results is that one can construct the trading strategy as if the prediction set had a 100 % probability. If, in fact, the set has probability 1−α, the hedging strategy will work with at least the same probability. Different types of prediction regions are considered. The starting value A0 for the trading strategy corresponding to the 1 − α prediction region is a form of long term value at risk. At the same time, A0 is coherent.
Bayesian analysis of extreme values by mixture modeling
- Extremes
, 2003
"... Modeling of extreme values in the presence of heterogeneity is still a relatively unexplored area. We consider losses pertaining to several related categories. For each category, we view exceedances over a given threshold as generated by a Poisson process whose intensity is regulated by a specific l ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
Modeling of extreme values in the presence of heterogeneity is still a relatively unexplored area. We consider losses pertaining to several related categories. For each category, we view exceedances over a given threshold as generated by a Poisson process whose intensity is regulated by a specific location, shape and scale parameter. Using a Bayesian approach, we develop a hierarchical mixture prior, with an unknown number of components, for each of the above parameters. Computations are performed using Reversible Jump MCMC. Our model accounts for possible grouping effects and takes advantage of the similarity across categories, both for estimation and prediction purposes. Some guidance on the specification of the prior distribution is provided, together with an assessment of inferential robustness. The method is illustrated throughout using a data set on large claims against a well-known insurance company over a 15-year period.
Measuring Risk With Extreme Value Theory
- Extremes and Integrated Risk Management
, 2000
"... this paper, but taking account of the variation in t estimated for the GARCH process. In another recent paper, Tsay (1999) has used methods similar to those of the present paper, but allowing the extreme value parameters to depend on daily interest rates. ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
this paper, but taking account of the variation in t estimated for the GARCH process. In another recent paper, Tsay (1999) has used methods similar to those of the present paper, but allowing the extreme value parameters to depend on daily interest rates.
Reference analysis
- In Handbook of Statistics 25
, 2005
"... This chapter describes reference analysis, a method to produce Bayesian inferential statements which only depend on the assumed model and the available data. Statistical information theory is used to define the reference prior function as a mathematical description of that situation where data would ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
This chapter describes reference analysis, a method to produce Bayesian inferential statements which only depend on the assumed model and the available data. Statistical information theory is used to define the reference prior function as a mathematical description of that situation where data would best dominate prior knowledge about the quantity of interest. Reference priors are not descriptions of personal beliefs; they are proposed as formal consensus prior functions to be used as standards for scientific communication. Reference posteriors are obtained by formal use of Bayes theorem with a reference prior. Reference prediction is achieved by integration with a reference posterior. Reference decisions are derived by minimizing a reference posterior expected loss. An information theory based loss function, the intrinsic discrepancy, may be used to derive reference procedures for conventional inference problems in scientific investigation, such as point estimation, region estimation and hypothesis testing.
Explaining Rare Events in International Relations
, 2000
"... Some of the most important phenomena in international conflict are coded as "rare events data," binary dependent variables with dozens to thousands of times fewer events, such as wars, coups, etc., than "nonevents". Unfortunately, rare events data are difficult to explain and predict, a problem that ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
Some of the most important phenomena in international conflict are coded as "rare events data," binary dependent variables with dozens to thousands of times fewer events, such as wars, coups, etc., than "nonevents". Unfortunately, rare events data are difficult to explain and predict, a problem that seems to have at least two sources. First, and most importantly, the data collection strategies used in international conflict are grossly inefficient. The fear of collecting data with too few events has led to data collections with huge numbers of observations but relatively few, and poorly measured, explanatory variables. As it turns out, more efficient sampling designs exist for making valid inferences, such as sampling all available events (e.g., wars) and a tiny fraction of non-events (peace). This enables scholars to save as much as 99% of their (non-fixed) data collection costs, or to collect much more meaningful explanatory variables. Second, logistic regression, and other commonly ...

