Abstract:
The active selection of instances can significantly improve the generalisation performance of a learning machine. Large margin classifiers such as support vector machines classify data using the most informative instances (the support vectors). This makes them natural candidates for instance selection strategies. In this paper we propose an algorithm for the training of support vector machines using instance selection. We give a theoretical justification for the strategy and experimental results on real and artificial data demonstrating its effectiveness. The technique is most efficient when the data set can be learnt using few support vectors. 1. Introduction The labour-intensive task of labelling data is a serious bottleneck for many data mining tasks. Often cost or time constraints mean that only a fraction of the available instances can be labeled. For this reason there has been increasing interest in the problem of handling partially labeled data sets. One approac...
Citations
|
5044
|
Statistical Learning Theory
– Vapnik
- 1998
|
|
2227
|
UCI repository of machine learning databases
– Blake, Merz
|
|
1205
|
Schapire, “Decision-theoretic generalization of on-line learning and application to boosting
– Freund, E
- 1997
|
|
1073
|
A.J.: Learning with Kernels
– Schölkopf, Smola
- 2002
|
|
781
|
Probability inequalities for sums of bounded random variables
– Hoeffding
- 1963
|
|
597
|
2000), An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, Cambridge Univ
– Cristianini, Shawe-Taylor
|
|
498
|
Queries and concept learning
– Angluin
- 1988
|
|
221
|
Backpropagation Applied to Handwritten Zip Code Recognition
– LeCun, Boser, et al.
- 1989
|
|
179
|
Selective sampling using the query by committee algorithm
– Freund, Seung, et al.
- 1997
|
|
108
|
Simplifying neural networks by soft weight sharing
– Nowlan, Hinton
- 1992
|
|
58
|
On a kernel-based method for pattern recognition, regression, approximation and operator inversion
– Smola, Scholkopf
- 1998
|
|
38
|
Shrinking the tube: a new support vector regression algorithm
– Schölkopf, Bartlett, et al.
- 1999
|
|
25
|
Further results on the margin distribution
– Shawe-Taylor, Cristianini
- 1999
|
|
24
|
Regression estimation with support vector learning machines
– Smola
- 1996
|
|
21
|
On the sample complexity of pac-learning using random and chosen examples
– Eisenberg
- 1990
|
|
17
|
Sparse kernel feature analysis
– Smola, Mangasarian, et al.
- 1999
|
|
1
|
Neural networks which learn in polynomial time from examples and queries
– Baum
- 1991
|