Abstract:
This paper addresses the problem of improving the accuracy of an hypothesis output by a learning algorithm in the distribution-free (PAC) learning modol, A coucept class is learntble (or strongly learnable) if, given access to a source of examples of the unknown concept, the learner with high probability is able to output an hypothesis that is correct on all but aa arbitrarily small h'action of the instances. The concept class is weakly learmtble if tbe learner can preduce an hypothesis that performs only slightly better than madam guessing, In this paper, it is shown that these two notions of learnability are equivalent. A method is described for converting a weak learning algorithm into one flat achieves arbitrarily high accuracy, This construction may have practical applications as a tool for efficiently converting a mediocre learning algorithm into one that per foms extremely well. In addition, the onstroction has some interesting theoretical consequences, including a set of general upper bounds on the complexity of any strong learning algorithm as a function of the allowed error e Keywords. Machine learning, learning from examples, learnability theory, PAC learning, polynomial-time identification, 1.
Citations
|
781
|
Probability inequalities for sums of bounded random variables
– Hoeffding
- 1963
|
|
525
|
Learnability and the Vapnik-Chervonenkis dimension
– Blumer, Ehrenfeucht, et al.
- 1989
|
|
498
|
Queries and concept learning
– Angluin
- 1988
|
|
310
|
Learning decision lists
– Rivest
- 1987
|
|
242
|
Cryptographic limitations on learning boolean formulae and finite automata
– Kearns, Valiant
- 1994
|
|
213
|
Fast probabilistic algorithms for hamiltonian circuits and matchings
– Angluin, Valiant
- 1979
|
|
177
|
Finding patterns common to a set of strings
– Angluin
- 1980
|
|
169
|
Computational limitations on learning from examples
– Pitt, Valiant
- 1988
|
|
140
|
On the learnability of Boolean formulae
– Kearns, Li, et al.
- 1987
|
|
83
|
Equivalence of models for polynomial learnability
– Haussler, Kearns, et al.
- 1991
|
|
53
|
Learning decision trees from random examples
– Ehrenfeucht, Haussler
- 1989
|
|
37
|
On the necessity of occam algorithms
– Board, Pitt
- 1990
|
|
34
|
Learning nested differences of intersection-closedclasses
– Helmbold, Sloan, et al.
- 1989
|
|
32
|
Predicting 0,1-functions on randomly drawn points
– Haussler, Littlestone, et al.
- 1990
|
|
20
|
Learning Boolean formulae or finite automata is as hard as factoring
– Kearns, Valiant
- 1988
|
|
16
|
On Space-bounded Learning and the Vapnik-Chervonenkis Dimension
– Floyd
- 1989
|
|
4
|
Some remarks about space-complexity of learning, and circuit complexity of recognizing
– BOUCHERON, SALLANTIN
- 1988
|
|
4
|
Expected mistake bounds for on-line learning algorithms. Unpublished manuscript
– Haussler, Littlestone, et al.
- 1988
|
|
2
|
The Computational Complexity of Machine Learning. Doctoral dissertation
– Kearns
- 1989
|
|
1
|
On learning n union of half spaces. Unpublished manuscript
– Baum
- 1989
|
|
1
|
Space efficient learning algorithms (Technical Report UCSC-CRL-88-2
– Haussler
- 1988
|
|
1
|
Pattern languages are not learnable, Unpublished manuscript
– Schapire, E
- 1989
|
|
1
|
Occam's razor, Infotwtation Ptwcessing Letters
– Blumer, Ehmnfeucht, et al.
- 1987
|
|
1
|
Some remarks about space-complexity of learning, and circuit complexity of recognizing
– Bouchemn, Sallantln
- 1988
|
|
1
|
Learning decision trees from random exmnples, hfimnatiou and putation
– Ehrenfeucht, Haussler
- 1989
|
|
1
|
Predicting {0, l}-functions on randomly drawn points
– Haussler, Littlestone, et al.
- 1988
|
|
1
|
Probability inequalities for sums of bounded randent variables
– Hoeffdiug
- 1963
|