Abstract:
. Mining for association rules in market basket data has proved a fruitful area of research. Measures such as conditional probability (confidence) and correlation have been used to infer rules of the form "the existence of item A implies the existence of item B." However, such rules indicate only a statistical relationship between A and B. They do not specify the nature of the relationship: whether the presence of A causes the presence of B, or the converse, or some other attribute or phenomenon causes both to appear together. In applications, knowing such causal relationships is extremely useful for enhancing understanding and effecting change. While distinguishing causality from correlation is a truly difficult problem, recent work in statistics and Bayesian learning provide some avenues of attack. In these fields, the goal has generally been to learn complete causal models, which are essentially impossible to learn in large-scale data mining applications with a large number of varia...
Citations
|
1521
|
Mining association rules between sets of items in large databases
– Agrawal, Imielinski, et al.
- 1993
|
|
615
|
Learning Bayesian networks: The combination of knowledge and statistical data
– Heckerman, Geiger, et al.
- 1995
|
|
500
|
Categorical Data Analysis
– Agresti
- 1990
|
|
395
|
Fast algorithms for mining association rules in large databases
– Agrawal, Srikant
- 1994
|
|
347
|
Dynamic itemset counting and implication rules for market basket data
– Brin, Motwani, et al.
- 1997
|
|
343
|
Beyond market baskets: Generalizing association rules to correlations
– Brin, Motwani, et al.
- 1997
|
|
342
|
Fast discovery of association rules
– Agrawal, Mannil, et al.
- 1996
|
|
342
|
Mining generalized association rules
– Srikant, Agrawal
- 1995
|
|
215
|
Database Mining: A Performance Perspective
– Agrawal, Imielinski, et al.
- 1993
|
|
151
|
A Theory of Inferred Causation
– Pearl, Verma
- 1991
|
|
95
|
Causal Diagrams for Empirical Research
– Pearl
- 1995
|
|
59
|
An algorithm for fast recovery of sparse causal graphs
– Spirtes, Glymour
- 1991
|
|
55
|
Bayesian networks for data mining
– Heckerman
- 1997
|
|
48
|
A Bayesian approach to causal discovery
– Heckerman, Meek, et al.
- 1999
|
|
45
|
A Bayesian approach for learning causal networks
– Heckerman
- 1995
|
|
35
|
Decision-Theoretic Foundations for Causal Reasoning
– Heckerman, Shachter
- 1995
|
|
31
|
Mathematical Statistics with Applications
– Mendenhall, Scheaffer
- 1973
|
|
21
|
Causal inference in the presence of latent variables and selection bias
– Spirtes, Meek, et al.
- 1995
|
|
15
|
Graphical Models for Probabilistic and Causal Reasoning
– Pearl
- 1998
|
|
14
|
A simple constraint-based algorithm for efficiently mining observational databases for causal relationships
– Cooper
- 1997
|
|
13
|
From Bayesian networks to causal networks
– Pearl
- 1993
|
|
13
|
A definition and graphical representation of causality
– Heckerman, Shachter
- 1995
|
|
13
|
Sampling large databases for finding association rules
– Toivonen
- 1996
|
|
8
|
Learning bayesian networks: The combination of knowledge and statistical data
– Chickering
- 1995
|
|
7
|
A survey of exact inference for contingency tables. Statistical science
– Agresti
- 1992
|
|
4
|
received his B.S. degree in mathematics and computer science from the University of Maryland at College Park in
– Brin
- 1993
|
|
1
|
Data Mining and Knowledge Discovery, 1(1997): 79119. [HGC94
– Heckerman, Geiger, et al.
|
|
1
|
A Bayesian method for the Induction of Probabilistic Networks from Data
– Mining, Discovery
|
|
1
|
Craig Silverstein obtained an A.B. degree in Computer Science from Harvard University and is a Ph.D. candidate (on leave) in Computer Science at Stanford University. He is a recipient of a National Defense Science and Engineering Graduate fellowship and a
– Causation, Springer-Verlag
- 1993
|
|
1
|
is the Stanford W. Ascherman Professor of Engineering in the Department of Computer Science at Stanford. He received the B.S. degree from Columbia University
– Ullman
- 1966
|
|
1
|
to his appointment at Stanford in 1979, he was a member of the technical staff of Bell Laboratories from 1966-1969, and on the faculty of Princeton University between 1969 and 1979. From 1990-1994, he was chair of the Stanford Computer Science Department.
– Prior
|
|
1
|
was elected to the National Academy of Engineering in 1989 and has held Guggenheim and Einstein Fellowships. He is the 1996 winner of the Sigmod Contributions Award and the 1998 winner of the Karl V. Karlstrom Outstanding Educator Award. He is the author
– Ullman
|