Data mining is an interdisciplinary research area spanning severals disciplines such as database systems, machine learning, intelligent information systems, statistics, and expert systems. Data mining has evolved into an important and active area of research because of theoretical challenges and practical applications associated with the problem of discovering (or extracting) interesting and previously unknown knowledge from very large real-world databases. Many aspects of data mining have been investigated in several related fields. A unique but important aspect of the problem lies in the significance of needs to extend these studies to include the nature of the contents of the real-world databases. In this chapter, we discuss the theory and foundational issues in data mining, describe data mining methods and algorithms, and review data mining applications. Since a major focus of this book is on rough sets and its applications to database mining, one full section is devoted to summari...
|
4713
|
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
– Pearl
- 1988
|
|
3018
|
Pattern Classification and Scene Analysis
– Duda, Hart
- 1973
|
|
2537
|
Induction of decision trees
– Quinlan
- 1986
|
|
1213
|
An Introduction to the Bootstrap
– Efron, Robert
- 1993
|
|
483
|
Generalization as search
– Mitchell
- 1982
|
|
326
|
Restructuring lattice theory: an approach based on hierarchies of concepts
– Wille
- 1982
|
|
305
|
Computer Systems that Learn
– Weiss, Kulikowski
- 1991
|
|
259
|
Knowledge Discovery in Databases: An Overview, introductory chapter of Knowledge Discovery
– Frawley, Piatetsky-Shapiro, et al.
- 1991
|
|
215
|
Database Mining: A Performance Perspective
– Agrawal, Imielinski, et al.
- 1993
|
|
169
|
Learning with many irrelevant features
– Almuallim, Dietterich
- 1991
|
|
161
|
The management of probabilistic data
– Barbará, Garcia-Molina, et al.
- 1992
|
|
157
|
Projection pursuit
– Huber
- 1985
|
|
143
|
Subset Selection in Regression
– Miller
- 1990
|
|
140
|
A Branch and Bound Algorithm for Feature Subset Selection
– Narendra, Fukunaga
- 1997
|
|
134
|
An empirical comparison of selection measures for decision-tree induction
– Mingers
- 1989
|
|
128
|
Knowledge discovery in databases: An attribute-oriented approach
– Han, Cai, et al.
- 1992
|
|
123
|
Concept Learning and the Problem of Small Disjuncts
– Holte, Acker, et al.
- 1989
|
|
101
|
Database systems: Achievements and opportunities
– Silberschatz, Stonebraker, et al.
- 1991
|
|
100
|
An interval classifier for database mining applications
– Agrawal, Ghosh, et al.
- 1992
|
|
95
|
Unknown attribute values in induction
– Quinlan
- 1989
|
|
94
|
The e ect of noise on concept learning
– Quinlan
- 1986
|
|
83
|
Systems for knowledge discovery in databases
– Matheus, Chan, et al.
- 1993
|
|
82
|
The Attribute Selection Problem in Decision Tree Generation
– Fayyad, Irani
- 1992
|
|
69
|
Classification Algorithms
– James
- 1985
|
|
41
|
Knowledge Discovery from Telecommunication Network Alarm Databases
– Hotanen, Klemettinen, et al.
- 1996
|
|
38
|
Class-dependent discretization for inductive learning from continuous and mixed-mode data
– Ching, Wong, et al.
- 1995
|
|
38
|
Compression, Significance and Accuracy
– Muggleton, Srinivasan, et al.
- 1992
|
|
37
|
The Discovery, Analysis, and Representation of Data Dependencies in Databases
– ZIARKO
- 1991
|
|
37
|
Feature selection using rough sets theory
– Modrzejewski
- 1993
|
|
34
|
Rough Classification
– Pawlak
- 1984
|
|
33
|
A statistical technique for extracting classificatory knowledge from databases
– CHAN, WONG
- 1991
|
|
27
|
Effects of sample size in classifier design
– Fukunaga, Hayes
- 1989
|
|
26
|
A decision theoretic framework for approximating concepts
– Yao, Wong
- 1992
|
|
25
|
A support system for interpreting statistical data
– Hoschka, Klosgen
- 1991
|
|
23
|
A method for attribute selection in inductive learning systems
– Baim
- 1988
|
|
21
|
Learning useful rules from inconclusive data
– Uthurusamy, Fayyad, et al.
- 1991
|
|
21
|
Learning with nested generalized exemplars
– Salzberg
- 1990
|
|
20
|
An extended Relational Database Model for Uncertain and Imprecise Information
– Lee
- 1992
|
|
18
|
Accelerated quantification of Bayesian networks with incomplete data
– Thiesson
- 1995
|
|
16
|
Knowledge discovery workbench for exploring business databases
– Piatetsky-Shapiro, Matheus
- 1992
|
|
16
|
Comparison of the probabilistic approximate classification and the fuzzy set model, Fuzzy Sets and Systems
– Wong, Ziarko
- 1987
|
|
15
|
A fuzzy model for relational databases
– Buckles, Petry
- 1982
|
|
13
|
Multi-interval discretization of continuous attributes as preprocessing for classification learning
– Fayyad, Irani
- 1993
|
|
12
|
SPOTLIGHT: A Data Explanation System
– Anand, Kahn
- 1992
|
|
11
|
Comparison of rough-set and statistical methods in inductive learning
– Wong, Ziarko, et al.
- 1986
|
|
11
|
Rough classification of patients after highly selective vagotomy for duodenal ulcer
– Pawlak, Slowinski, et al.
- 1986
|
|
11
|
Dynamic programming as applied to feature subset selection in a pattern recognition system
– Chang
- 1973
|
|
11
|
Rough classification with valued closeness relation
– Slowinski, Stefanowiski
- 1995
|
|
11
|
KDD-R: A comprehensive system for knowledge discovery in databases using rough sets
– Ziarko, Shan
- 1994
|
|
11
|
The rule induction system LERS - a version for personal computers
– Chmielewski, Grzymala-Busse, et al.
- 1993
|