Abstract:
This paper discusses our research overview and problems in discovering useful knowledge from a structured database. This research is an extended work of a prototype system, MASSON that discovers discriminant rules in the form of database query for a set of objects using genetic programming. In order to discover useful discriminant rules in a database, the basic approach that is employed in this research is first cluster the given database into clusters that contain a set of similar objects, then the set of objects in each cluster is provided to the discovery system, MASSON. We also discuss several research problems in clustering a structured database. Introduction Discriminant rules are rules that exclusively describe the characteristics for a set of objects in a given data set. In this framework, the rules cover only the specific set of objects without covering any other objects in a given data set. Discovery (or unsupervised learning) systems can discover such rules by themselves wh...
Citations
|
3356
|
C4.5: Programs for Machine Learning
– Quinlan
- 1993
|
|
1921
|
Genetic Programming I : On the Programming of Computers by Means of Natural Selection
– Koza
- 1992
|
|
1478
|
Algorithms for Clustering Data
– Jain, Dubes
- 1988
|
|
523
|
Knowledge Acquisition via Incremental Concept Formation
– Fisher
- 1987
|
|
410
|
Cluster Analysis for Applications
– Anderberg
- 1973
|
|
385
|
Implementing data cubes efficiently
– Harinarayan, Rajaraman, et al.
- 1996
|
|
278
|
Learning efficient classification procedures and their application to chess end games
– Quinlan
- 1983
|
|
258
|
Knowledge discovery in databases: An overview
– Frawley, Piatetsky-Shapiro, et al.
- 1992
|
|
207
|
Learning from observations: conceptual clustering
– MICHALSKI, STEPP
- 1983
|
|
57
|
A polynomial approach to the constructive induction of structural knowledge
– Kietz
- 1994
|
|
29
|
Conceptual Clustering in a First Order Logic Representation
– Bisson
- 1992
|
|
28
|
Substructure discovery in the SUBDUE system
– Holder, Cook, et al.
- 1994
|
|
24
|
Knowledge Discovery From Multiple Databases
– Ribeiro, Kaufman, et al.
- 1995
|
|
21
|
Restructuring databases for knowledge discovery by consolidation and link formation
– Goldberg, Senator
- 1995
|
|
12
|
Induction of decision trees from complex structured data
– Manago, Kodratoff
- 1991
|
|
11
|
Clustering sequences of complex objects
– Ketterlin
- 1997
|
|
5
|
Deriving queries from results using genetic programming
– Ryu, Eick
- 1996
|
|
1
|
A Unified Similarity Measure for Mixed-type Multi-valued Attributes
– Ryu, Eick
- 1998
|