ABayesian network is a graphical model that encodes probabilistic relationships among variablesofinterest. When used in conjunction with statistical techniques, the graphical model has several advantages for data analysis. One, because the model encodes dependencies among all variables, it readily handles situations where some data entries are missing. Two, a Bayesian network can be used to learn causal relationships, and hence can be used to gain understanding about a problem domain and to predict the consequences of intervention. Three, because the model has both a causal and probabilistic semantics, it is an ideal representation for combining prior knowledge (which often comes in causal form) and data. Four, Bayesian statistical methods in conjunction with Bayesian networks offer an efficient and principled approach for avoiding the overfitting of data. In this paper, we discuss methods for constructing Bayesian networks from prior knowledge and summarize Bayesian statistical methods for using data to improve these models. With regard to the latter task, we describe methods for learning both the parameters and structure of a Bayesian network, including techniques for learning with incomplete data. In addition, werelateBayesian-network methods for learning to techniques for supervised and unsupervised learning. We illustrate the graphical-modeling approach using a real-world case study.
|
4735
|
Maximum Likelihood from incomplete data via the EM algorithm
– Dempster, Laird, et al.
- 1977
|
|
2439
|
Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images
– Geman, Geman
- 1984
|
|
1058
|
Density Estimation for Statistics and Data Analysis
– Silverman
- 1986
|
|
971
|
Estimating the dimension of a model
– Schwarz
- 1978
|
|
936
|
Local Computations with Probabilities on Graphical Structures and Their Applications to Expert Systems
– Lauritzen, Spigelholter
- 1988
|
|
726
|
A bayesian method for the induction of probabilistic networks from data
– Cooper, Herskovits
- 1992
|
|
626
|
The Foundations of Statistics
– Savage
- 1972
|
|
619
|
Introduction to Bayesian Networks
– Jensen
- 1996
|
|
615
|
Learning Bayesian networks: The combination of knowledge and statistical data
– Heckerman, Geiger, et al.
- 1995
|
|
600
|
Bayesian Theory
– Bernardo, Smith
- 1994
|
|
506
|
Bayes factors
– Kaas, Raftery
- 1995
|
|
442
|
Exploratory Data Analysis
– Tukey
- 1977
|
|
437
|
The computational complexity of probabilistic inference using Bayesian belief networks
– Cooper
- 1990
|
|
353
|
Bayesian interpolation
– MacKay
- 1992
|
|
344
|
Probabilistic inference using Markov chain Monte Carlo methods
– Neal
- 1993
|
|
332
|
Kahneman D. Judgment under uncertainty: heuristics and biases
– Tversky
|
|
308
|
Markov Chain Monte Carlo in Practice
– Gilks, Richardson, et al.
- 1996
|
|
303
|
A practical Bayesian framework for backpropagation networks
– MacKay
- 1992
|
|
297
|
Probability and Statistics
– Groot, Schervish
- 2001
|
|
271
|
Causation, Prediction and Search
– Spirtes, Glymour, et al.
- 2000
|
|
266
|
Graphical Models in Applied Multivariate Statistics
– Whittaker
- 1990
|
|
250
|
Influence diagrams
– Howard, Matheson
- 1981
|
|
246
|
Fusion, propagation, and structuring in belief networks
– Pearl
|
|
196
|
Approximating probabilistic inference in bayesian belief networks
– Dagum, Chavez
- 1993
|
|
183
|
Model selection and accounting for model uncertainty in graphical models using Occam’s window
– Madigan, Raftery
- 1994
|
|
175
|
The alarm monitoring system: A case study with two probabilistic inference techniques for belief networks
– Beinlich, Suermondt, et al.
- 1992
|
|
167
|
Sequential updating of conditional probabilities on directed graphical structures, Networks
– Spiegelhalter, Lauritzen
- 1990
|
|
164
|
Bayesian analysis in expert systems
– Spiegelhalter, Dawid, et al.
- 1993
|
|
151
|
A Theory of Inferred Causation
– Pearl, Verma
- 1991
|
|
135
|
Equivalence and synthesis of causal models
– Verma, Pearl
- 1990
|
|
128
|
Bayesian graphical models for discrete data
– Madigan, York
- 1995
|
|
125
|
Efficient approximations for the marginal likelihood of Bayesian networks with hidden variables
– Chickering, Heckerman
- 1997
|
|
123
|
Marginal likelihood from the Gibbs output
– Chib
- 1995
|
|
119
|
Bayesian updating in recursive graphical models by local computation
– Jensen, Lauritzen, et al.
- 1990
|
|
114
|
Propagation of Probabilities, Means and Variances in Mixed Graphical Association Models
– Lauritzen
- 1992
|
|
109
|
Approximations in Bayesian belief universes for knowledge based systems
– Jensen, Andersen
- 1990
|
|
108
|
Bayesian model selection in social research (with discussion
– Raftery
- 1995
|
|
99
|
Statistical theory: The prequential approach
– Dawid
- 1984
|
|
95
|
Causal Diagrams for Empirical Research
– Pearl
- 1995
|
|
90
|
Probability, frequency and reasonable expectations
– Cox
- 1946
|
|
89
|
Truth and Probability
– Ramsey
- 1926
|
|
78
|
Improving the convergence of back propagation learning with second-order methods
– Becker
- 1988
|
|
78
|
Learning equivalence classes of Bayesiannetwork structures
– Chickering
- 2002
|
|
74
|
Application of a general propagation algorithm for probabilistic expert systems
– Dawid
- 1992
|
|
70
|
The chain graph Markov property
– Frydenberg
- 1990
|
|
63
|
Theory of Probability
– Finetti
- 1974
|
|
63
|
Local learning in probabilistic networks with hidden variables
– Russell, Binder, et al.
- 1995
|
|
59
|
A transformational characterization of equivalent Bayesian network structures
– Chickering
- 1995
|
|
59
|
Probability and the Weighing of Evidence
– Good
- 1950
|
|
58
|
BUGS: A program to perform Bayesian inference using Gibbs sampling
– Thomas, Spiegelhalter, et al.
- 1992
|