Large Datasets Lead to Overly Complex Models: An Explanation and a Solution (1998)

by Tim Oates , David Jensen
Citations:44 - 4 self

Active Bibliography

74 Multiple Comparisons in Induction Algorithms – David Jensen, Paul R. Cohen - 1998
c ○ 2000 Kluwer Academic Publishers. Printed in The Netherlands. Multiple Comparisons in Induction Algorithms – David D. Jensen, Paul R. Cohen
78 The role of Occam’s Razor in knowledge discovery – Pedro Domingos - 1999
13 Pruning decision trees and lists – Eibe Frank - 2000
19 Overfitting Explained – Paul R. Cohen , David Jensen - 1997
6 Adjusting for multiple testing in decision tree pruning – David Jensen - 1997
16 Adjusting for multiple comparisons in decision tree pruning – David Jensen, Matt Schmill - 1997
62 Tree induction vs. logistic regression: A learning-curve analysis – Claudia Perlich, Foster Provost, Jeffrey S. Simonoff - 2001
66 The Effects of Training Set Size on Decision Tree Complexity – Oates - 1997
Statistical Challenges to Inductive Inference in Linked Data – David Jensen Experimental, David Jensen - 1999
22 Statistical Challenges to Inductive Inference in Linked Data – David Jensen - 1999
RIDE: Rule-Learning in a Distributed Environment – Nitesh Chawla - 2000
78 Relational Learning Techniques for Natural Language Information Extraction – Mary Elaine Califf - 1998
114 Machine-Learning Research -- Four Current Directions – Thomas G. Dietterich
Structure and Majority Classes in Decision Tree Learning – Ray J. Hickey, Greg Ridgeway
119 Overfitting Avoidance as Bias – Cullen Schaffer - 1992
4 Impact of learning set quality and size on decision tree performances – M. Sebban, R. Nock, J. H. Chauchat, R. Rakotomalala - 2000
13 Best-first decision tree learning – Haijian Shi - 2007
6 Reduced-Error Pruning With Significance Tests – Eibe Frank, Ian H. Witten - 1998