Large Datasets Lead to Overly Complex Models: An Explanation and a Solution (1998)

by Tim Oates , David Jensen
Citations:44 - 4 self

Documents Related by Co-Citation

4905 C4.5: Programs for Machine Learning – J R Quinlan - 1993
3335 Induction of Decision Trees – J. R. Quinlan - 1986
2863 UCI Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html – C L Blake, C J Merz - 1998
3874 Classification and Regression Trees – L Breiman, J H Friedman, R A Olshen, C J Stone - 1984
66 The Effects of Training Set Size on Decision Tree Complexity – Oates - 1997
167 An empirical comparison of pruning methods for decision tree induction – J Mingers - 1989
293 Inferring Decision Trees Using the Minimum Description Length Principle – J R Quinlan, R L Rivest - 1989
156 An Exploratory Technique for Investigating Large Quantities of Categorical Data – G V Kass - 1980
74 Multiple Comparisons in Induction Algorithms – David Jensen, Paul R. Cohen - 1998
2479 Bagging Predictors – Leo Breiman, Leo Breiman - 1996
16 Adjusting for multiple comparisons in decision tree pruning – David Jensen, Matt Schmill - 1997
969 Fast Effective Rule Induction – William W. Cohen - 1995
85 A Survey of Methods for Scaling Up Inductive Algorithms – Foster Provost, Venkateswarlu Kolluri - 1999
122 Cached Sufficient Statistics for Efficient Machine Learning with Large Datasets – Andrew Moore, Mary Soon Lee - 1997
17 Using a Permutation Test for Attribute Selection in Decision Trees – Eibe Frank, Ian H. Witten - 1998
1696 A Theory of the Learnable – L. G. Valiant - 1984
199 Improved Use of Continuous Attributes in C4.5 – J. R. Quinlan - 1996
143 Concept learning and the problem of small disjuncts – Robert C. Holte, Liane E. Acker, Bruce W. Porter - 1989
38 Simplifying Decision Trees: A Survey – Leonard A. Breslow, David W. Aha - 1996