MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Stacked Generalization (1992) [380 citations — 7 self]

Abstract:

: This paper introduces stacked generalization, a scheme for minimizing the generalization error rate of one or more generalizers. Stacked generalization works by deducing the biases of the generalizer(s) with respect to a provided learning set. This deduction proceeds by generalizing in a second space whose inputs are (for example) the guesses of the original generalizers when taught with part of the learning set and trying to guess the rest of it, and whose output is (for example) the correct guess. When used with multiple generalizers, stacked generalization can be seen as a more sophisticated version of cross-validation, exploiting a strategy more sophisticated than cross-validation 's crude winner-takes-all for combining the individual generalizers. When used with a single generalizer, stacked generalization is a scheme for estimating (and then correcting for) the error of a generalizer which has been trained on a particular learning set and then asked a particular question. After...

Citations

2533 Induction of Decision Trees – Quinlan - 1986
1943 Adaptation in natural and artificial systems. The – Holland - 1975
1330 A theory of a learnable – Valiant - 1984
400 Towards memory-based reasoning – Stanfill, Waltz - 1986
185 Stochastic complexity and modeling – Rissanen - 1986
116 Nonlinear prediction of chaotic time series – Casdagli - 1989
116 NETtalk: A parallel network that learns to read aloud – Rosenberg, Sejnowski - 1986
95 of solving incorrectly posed problems – Morozov - 1984
74 Efficient Algorithms with Neural Network Behavior – Omohundro - 1987
72 Exploiting chaos to predict the future and reduce noise – Farmer, Sidorowich - 1988
55 How neural nets work – Lapedes, Farber - 1988
28 Asymptotics for and against cross-validation – Stone - 1977
26 Computers and the Theory of Statistics: Thinking the Unthinkable – Efron - 1993
15 Constructing a generalizer superior to NETtalk via mathematical theory of generalization. Neural Networks – Wolpert - 1989
14 From Stein’s Unbiased Risk Estimates to the Method of Generalized Cross-Validation – Li - 1985
10 Mit progress in understanding images – Poggio - 1988
6 The relationship between Occam's razor and convergent guessing – Wolpert - 1990
4 Comparison of predicted and experimentally determined structure of adenyl kinase," Nature 250 – Schulz - 1974
4 A mathematical theory of generalization: part – Wolpert - 1990
3 Machine learning. Annual review of computer science – Dietterich - 1990
3 A benchmark for how well neural nets generalize – Wolpert - 1989
3 A mathematical theory of generalization: part II – Wolpert - 1990
3 Improving the performance of generalizers via time-series-like pre-processing of the learning set. Los Alamos Laboratory Report LA-UR-91-350 – Wolpert - 1991
2 Hierarchical training of neural networks and prediction of chaotic time series, Phys. Lett. A 158 – Deppisch, Bauer, et al. - 1991
1 On the ability of neural networks to perform generalization by induction – Anshelevich - 1989