|
6698
|
Statistical Learning Theory
– V N Vapnik
- 1998
|
|
1325
|
Experiments with a New Boosting Algorithm
– Yoav Freund, Robert E. Schapire
- 1996
|
|
3910
|
Neural Networks for Pattern Recognition
– C M Bishop
- 1995
|
|
6234
|
Maximum likelihood from incomplete data via the EM algorithm
– A. P. Dempster, N. M. Laird, D. B. Rubin
- 1977
|
|
6517
|
Elements of information theory
– T Cover, J Thomas
- 1991
|
|
689
|
Estimation of dependencies based on empirical data
– V Vapnik
- 1982
|
|
15
|
Characterizing the Generalization Performance of Model Selection Strategies
– Dale Schuurmans, Lyle H. Ungar, Dean P. Foster
- 1997
|
|
946
|
Combining labeled and unlabeled data with co-training
– Avrim Blum, Tom Mitchell
- 1998
|
|
220
|
Stochastic complexity and modeling
– J Rissanen
- 1986
|
|
1998
|
Bagging Predictors
– Leo Breiman, Leo Breiman
- 1996
|
|
1368
|
Text Categorization with Support Vector Machines: Learning with Many Relevant Features
– Thorsten Joachims
- 1998
|
|
8
|
Comparison of VC method with classical methods for model selection
– V Cherkassky, F Mulier, V Vapnik
- 1997
|
|
159
|
The Effective Number of Parameters: An Analysis of Generalization and Regularization in Nonlinear Learning Systems
– John E. Moody
- 1992
|
|
101
|
An experimental and theoretical comparison of model selection methods. Machine Learning 27
– Michael Kearns, Yishay Mansour Y, Andrew Y. Ng, Dana Ron Z
- 1997
|
|
116
|
Overfitting Avoidance as Bias
– Cullen Schaffer
- 1992
|
|
82
|
The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter
– V Castelli, T M Cover
- 1996
|
|
537
|
Neural networks and the bias/variance dilemma
– S Geman, E Bienenstock, R Doursat
- 1992
|
|
775
|
Wrappers for feature subset selection
– Ron Kohavi , George H. John
- 1997
|
|
29
|
Preventing "Overfitting" of Cross-Validation Data
– Andrew Y. Ng
- 1997
|