Abstract:
Mining frequent patterns in transaction databases, timeseries databases, and many other kinds of databases has been studied popularly in data mining research. Most of the previous studies adopt an Apriori-like candidate set generation-and-test approach. However, candidate set generation is still costly, especially when there exist proli#c patterns and#or long patterns. In this study,we propose a novel frequent pattern tree #FP-tree# structure, which is an extended pre#xtree structure for storing compressed, crucial information about frequent patterns, and develop an e#cient FP-tree- based mining method, FP-growth, for mining the complete set of frequent patterns by pattern fragment growth. E#ciency of mining is achieved with three techniques: #1# a large database is compressed into a highly condensed, much smaller data structure, whichavoids costly, repeated database scans, #2# our FP-tree-based mining adopts a pattern fragment growth method to avoid the costly generation of a large n...
Citations
|
1735
|
Fast Algorithms for Mining Association Rules
– Agrawal, Srikant
- 1994
|
|
707
|
Mining sequential patterns
– Agrawal, Srikant
|
|
343
|
Beyond market baskets: Generalizing association rules to correlations
– Brin, Motwani, et al.
- 1997
|
|
342
|
Fast discovery of association rules
– Agrawal, Mannil, et al.
- 1996
|
|
337
|
Mining sequential patterns: Generalizations and performance improvements
– Srikant, Agrawal
- 1996
|
|
291
|
An Efficient Algorithm for Mining Association Rules in Large Databases
– Savasere, Omiecinski, et al.
- 1995
|
|
268
|
Efficiently Mining Long Patterns from Databases
– Bayardo, J
- 1998
|
|
214
|
Exploratory mining and pruning optimizations of constrained associations rules
– Ng, Lakshmanan, et al.
|
|
210
|
Discovering frequent closed itemsets for association rules
– Pasquier, Bastide, et al.
- 1999
|
|
189
|
Discovery of Frequent Episodes in Event Sequences
– Mannila, Toivonen, et al.
- 1995
|
|
182
|
Mining association rules with item constraints
– Srikant, Vu, et al.
|
|
174
|
An effective hash-based algorithm for mining association rules
– Park, Chen, et al.
- 1995
|
|
169
|
2000, ‘CLOSET: An Efficient Algorithm for Mining Frequent Closed Itemsets
– Pei, Han, et al.
|
|
154
|
Efficient algorithms for discovering association rules
– Mannila, Toivonen, et al.
- 1994
|
|
150
|
C: CHARM: An efficient algorithm for Closed Itemset Mining
– MJ, Hsiao
|
|
134
|
Efficient mining of emerging patterns: Discovering trends and differences
– Dong, Li
- 1999
|
|
110
|
PrefixSpan: Mining sequential patterns efficiently by prefixprojected pattern growth
– Han, Pei, et al.
- 2001
|
|
103
|
A Tree Projection Algorithm for Generation of Frequent Itemsets
– Agarwal, Aggarwal, et al.
- 1999
|
|
93
|
Integrating association rule mining with relational database systems: Alternatives and implications
– Sarawagi, Thomas, et al.
|
|
85
|
Clustering association rules
– Lent, Swami, et al.
- 1997
|
|
83
|
Efficient mining of partial periodic patterns in time series database
– Han, Dong, et al.
|
|
72
|
Mining frequent item sets with convertible constraints
– Pei, Han, et al.
- 2001
|
|
65
|
Scalable techniques for mining causal structures
– Silverstein, Brin, et al.
- 1998
|
|
51
|
Metaruleguided mining of multi-dimensional association rules using data cubes
– Kamber, Hah, et al.
- 1997
|
|
40
|
H-Mine: Hyper-Structure Mining of Frequent
– Pei, Han, et al.
- 2001
|
|
35
|
Efficient mining of constrained correlated sets
– Grahne, Lakshmanan, et al.
|
|
7
|
is a professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. He has been working on research into data mining, data warehousing, stream data mining, spatiotemporal and multimedia data mining, biological data mini
– Han
- 2005
|
|
1
|
Data Engineering (ICDE’01
– Conf
|
|
1
|
received his M.Sc. degree in Computing Science at Simon Fraser University in 2001 and has been working as a software engineering in B.C
– Yin
|