MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Pessimistic Decision Tree Pruning Based on Tree Size (1997) [11 citations — 1 self]

by Yishay Mansour
Proceedings of the Fourteenth International Conference on Machine Learning
Add To MetaCart

Abstract:

In this work we develop a new criteria to perform pessimistic decision tree pruning. Our method is theoretically sound and is based on theoretical concepts such as uniform convergence and the Vapnik-Chervonenkis dimension. We show that our criteria is very well motivated, from the theory side, and performs very well in practice. The accuracy of the new criteria is comparable to that of the current method used in C4.5. 1 Introduction The phenomena of overfitting the data is well known in machine learning, and refers to the case that the learned hypothesis is so closely related to the training examples such that its generalization capabilities would be penalized. Overfitting would usually occur when the class of hypotheses used is as complex as the given training sample. For this reason we would like, in many cases, to limit the hypothesis we generate to be "less complex" than the training sample. In decision trees the overfitting phenomena can occur when the size of the tree is too lar...

Citations

3356 C4.5: Programs for Machine Learning – Quinlan - 1993
2573 Classification and Regression Trees – Breiman, Friedman, et al. - 1984
2526 Induction of decision trees – Quinlan - 1986
679 On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and its Applications – Vapnik, Chervonenkis
638 UCI repository of machine learning databases. For information contact ml-repository@ics.uci.edu – Murphy, Aha - 1994
324 Approximate statistical test for comparing supervised classification learning algorithms – Dietterich - 1998
144 An empirical comparison of pruning methods for decision-tree induction – Mingers - 1989
106 Scaling up the accuracy of naive-Bayes classifiers: a decision-tree hybrid – Kohavi - 1996
59 Constructing decision trees in noisy domains – Niblett - 1987
41 Trading accuracy for simplicity in decision trees – Bohanec, Bratko - 1994
15 Expert systems - rule induction with statistical data – Mingers - 1987
1 Theory and applications of agnostic PAC-leaning with small decision trees – Auer, Holte, et al. - 1995
1 Comparative testing and evaluation of statistical and logical learning algorithms for large-scale applications in classification, prediction and control. ftp:ftp.ncc.up.pt/pub/statlog /datasets, (See also – StatLog