Results 11 - 20
of
32
Causal inference in statistics: An overview
- Statistics Surveys
"... Abstract: This review presents empirical researchers with recent advances in causal inference, and stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivariate data. Special emphasis is placed on the assumptions that unde ..."
Abstract
-
Cited by 12 (8 self)
- Add to MetaCart
Abstract: This review presents empirical researchers with recent advances in causal inference, and stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivariate data. Special emphasis is placed on the assumptions that underly all causal inferences, the languages used in formulating those assumptions, the conditional nature of all causal and counterfactual claims, and the methods that have been developed for the assessment of such claims. These advances are illustrated using a general theory of causation based on the Structural Causal Model (SCM) described in Pearl (2000a), which subsumes and unifies other approaches to causation, and provides a coherent mathematical foundation for the analysis of causes and counterfactuals. In particular, the paper surveys the development of mathematical tools for inferring (from a combination of data and assumptions) answers to three types of causal queries: (1) queries about the effects of potential interventions, (also called “causal effects ” or “policy evaluation”) (2) queries about probabilities of counterfactuals, (including assessment of “regret, ” “attribution” or “causes of effects”) and (3) queries about direct and indirect effects (also known as “mediation”). Finally, the paper defines the formal and conceptual relationships between the structural and potential-outcome frameworks and presents tools for a symbiosis analysis that uses the strong features of both.
A new algorithm for maximum likelihood estimation in Gaussian graphical models for marginal independence
- In U. Kjærulff and C. Meek (Eds.), Proceedings of the 19th Conference on Uncertainty in Artificial Intelligence
, 2003
"... Graphical models with bi-directed edges (↔) represent marginal independence: the absence of an edge between two vertices indicates that the corresponding variables are marginally independent. In this paper, we consider maximum likelihood estimation in the case of continuous variables with a Gaussian ..."
Abstract
-
Cited by 12 (6 self)
- Add to MetaCart
Graphical models with bi-directed edges (↔) represent marginal independence: the absence of an edge between two vertices indicates that the corresponding variables are marginally independent. In this paper, we consider maximum likelihood estimation in the case of continuous variables with a Gaussian joint distribution, sometimes termed a covariance graph model. We present a new fitting algorithm which exploits standard regression techniques and establish its convergence properties. Moreover, we contrast our procedure to existing estimation algorithms. 1
Statistics and Causal Inference: A Review
, 2003
"... This paper aims at assisting empirical researchers benefit from recent advances in causal inference. The paper stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivariate data. Special emphasis is placed on the assump ..."
Abstract
-
Cited by 11 (6 self)
- Add to MetaCart
This paper aims at assisting empirical researchers benefit from recent advances in causal inference. The paper stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivariate data. Special emphasis is placed on the assumptions that underly all causal inferences, the languages used in formulating those assumptions, and the conditional nature of causal claims inferred from nonexperimental studies. These emphases are illustrated through a brief survey of recent results, including the control of confounding, the assessment of causal effects, the interpretation of counterfactuals, and a symbiosis between counterfactual and graphical methods of analysis.
Multiple testing and error control in Gaussian graphical model selection
- Statistical Science
"... Abstract. Graphical models provide a framework for exploration of multivariate dependence patterns. The connection between graph and statistical model is made by identifying the vertices of the graph with the observed variables and translating the pattern of edges in the graph into a pattern of cond ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
Abstract. Graphical models provide a framework for exploration of multivariate dependence patterns. The connection between graph and statistical model is made by identifying the vertices of the graph with the observed variables and translating the pattern of edges in the graph into a pattern of conditional independences that is imposed on the variables ’ joint distribution. Focusing on Gaussian models, we review classical graphical models. For these models the defining conditional independences are equivalent to vanishing of certain (partial) correlation coefficients associated with individual edges that are absent from the graph. Hence, Gaussian graphical model selection can be performed by multiple testing of hypotheses about vanishing (partial) correlation coefficients. We show and exemplify how this approach allows one to perform model selection while controlling error rates for incorrect edge inclusion. Key words and phrases: Acyclic directed graph, Bayesian network, bidirected graph, chain graph, concentration graph, covariance graph, DAG, graphical model, multiple testing, undirected graph. 1.
Term Dependence: A Basis for Luhn and Zipf Models
, 2001
"... There are regularities in the statistical information provided by natural language terms about neighboring terms. We find that when phrase rank increases, moving from common to less common phrases, the value of the expected mutual information measure (EMIM) between the terms regularly decreases. ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
There are regularities in the statistical information provided by natural language terms about neighboring terms. We find that when phrase rank increases, moving from common to less common phrases, the value of the expected mutual information measure (EMIM) between the terms regularly decreases. Luhn's model suggests that mid-range terms are the best index terms and relevance discriminators. We suggest reasons for this principle based on the empirical relationships shown here between the rank of terms within phrases and the average mutual information between terms, which we refer to as the Inverse Representation--EMIM principle. We also suggest an Inverse EMIM term weight for indexing or retrieval applications that is The authors wishes to acknowledge the helpful suggestions on earlier drafts of this manuscript by Miles Efron and an anonymous referee. 1 consistent with Luhn's distribution. An information theoretic interpretation of Zipf's Law is provided. Using the regularity noted above, we suggest that Zipf's Law is a consequence of the statistical dependencies that exist between terms, described here using information theoretic concepts. 1
Graphical Models for Multivariate Time Series from Intensive Care Monitoring
, 2000
"... In critical care extremely high dimensional time series are generated by clinical information systems. This yields new perspectives of data recording and also causes a new challenge for statistical methodology. Recently graphical correlation models have been developed for analysing the partial assoc ..."
Abstract
-
Cited by 6 (3 self)
- Add to MetaCart
In critical care extremely high dimensional time series are generated by clinical information systems. This yields new perspectives of data recording and also causes a new challenge for statistical methodology. Recently graphical correlation models have been developed for analysing the partial associations between the components of multivariate time series. We apply this technique to the hemodynamic system of critically ill patients monitored in intensive care. We appraise the practical value of the procedure by reidentifying known associations between the variables. From separate analyses for different pathophysiological states we conclude that distinct clinical states can be characterised by distinct partial correlation structures.
Causal Inference in the Health Sciences: A Conceptual Introduction
- Health Services and Outcomes Research Methodology
, 2001
"... This paper provides a conceptual introduction to causal inference, aimed to assist health services researchers benefit from recent advances in this area. The paper stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivari ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
This paper provides a conceptual introduction to causal inference, aimed to assist health services researchers benefit from recent advances in this area. The paper stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivariate data. Special emphasis is placed on the assumptions that underlie all causal inferences, the languages used in formulating those assumptions, and the conditional nature of causal claims inferred from nonexperimental studies. These emphases are illustrated through a brief survey of recent results, including the control of confounding, corrections for noncompliance, and a symbiosis between counterfactual and graphical methods of analysis.
The Foundations of Causal Inference
- SUBMITTED TO SOCIOLOGICAL METHODOLOGY.
, 2010
"... This paper reviews recent advances in the foundations of causal inference and introduces a systematic methodology for defining, estimating and testing causal claims in experimental and observational studies. It is based on non-parametric structural equation models (SEM) – a natural generalization of ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
This paper reviews recent advances in the foundations of causal inference and introduces a systematic methodology for defining, estimating and testing causal claims in experimental and observational studies. It is based on non-parametric structural equation models (SEM) – a natural generalization of those used by econometricians and social scientists in the 1950-60s, and provides a coherent mathematical foundation for the analysis of causes and counterfactuals. In particular, the paper surveys the development of mathematical tools for inferring the effects of potential interventions (also called “causal effects” or “policy evaluation”), as well as direct and indirect effects (also known as “mediation”), in both linear and non-linear systems. Finally, the paper clarifies the role of propensity score matching in causal analysis, defines the relationships between the structural and
A SINful Approach to Model Selection for Gaussian Concentration Graphs
, 2003
"... A multivariate Gaussian graphical Markov model for an undirected graph G, also called a covariance selection model or concentration graph model, is defined in terms of the Markov properties, i.e., conditional independences associated with G, which in turn are equivalent to specified zeroes among t ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
A multivariate Gaussian graphical Markov model for an undirected graph G, also called a covariance selection model or concentration graph model, is defined in terms of the Markov properties, i.e., conditional independences associated with G, which in turn are equivalent to specified zeroes among the set of pairwise partial correlation coe#cients. By means of Fisher's z-transformation and Sidak's correlation inequality, conservative simultaneous confidence intervals for the entire set of partial correlations can be obtained, leading to a simple method for model selection that controls the overall error rate for incorrect edge inclusion. The simultaneous p-values corresponding to the partial correlations are partitioned into three disjoint sets, a significant set S, an indeterminate set I, and a non-significant set N. Our SIN model selection method selects two graphs, a graph GSI whose edges correspond to the set I, and a more conservative graph GS whose edges correspond to S only. Prior information about the presence and/or absence of particular edges can be incorporated readily. Similar considerations apply to covariance graph models, which are defined in terms of marginal independence rather than conditional independence. 1.
Equivalence in Non-Recursive Structural Equation Models
- Proceedings:Compstat 94 , Physica Verlag
, 1994
"... In the last decade, there has been considerable progress in understanding a certain class of statistical models, known as directed acyclic graph (DAG) models, which encode independence, and conditional independence constraints. (See Pearl, 1988). This research has had fruitful results in many areas: ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
In the last decade, there has been considerable progress in understanding a certain class of statistical models, known as directed acyclic graph (DAG) models, which encode independence, and conditional independence constraints. (See Pearl, 1988). This research has had fruitful results in many areas: there is now a relatively clear causal interpretation

