Results 1 -
7 of
7
Bagging Predictors
- Machine Learning
, 1996
"... Bagging predictors is a method for generating multiple versions of a predictor and using these to get an aggregated predictor. The aggregation averages over the versions when predicting a numerical outcome and does a plurality vote when predicting a class. The multiple versions are formed by making ..."
Abstract
-
Cited by 1998 (1 self)
- Add to MetaCart
Bagging predictors is a method for generating multiple versions of a predictor and using these to get an aggregated predictor. The aggregation averages over the versions when predicting a numerical outcome and does a plurality vote when predicting a class. The multiple versions are formed by making bootstrap replicates of the learning set and using these as new learning sets. Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy. The vital element is the instability of the prediction method. If perturbing the learning set can cause significant changes in the predictor constructed, then bagging can improve accuracy. 1. Introduction A learning set of L consists of data f(y n ; x n ), n = 1; : : : ; Ng where the y's are either class labels or a numerical response. We have a procedure for using this learning set to form a predictor '(x; L) --- if the input is x we ...
A System for Induction of Oblique Decision Trees
- Journal of Artificial Intelligence Research
, 1994
"... This article describes a new system for induction of oblique decision trees. This system, OC1, combines deterministic hill-climbing with two forms of randomization to find a good oblique split (in the form of a hyperplane) at each node of a decision tree. Oblique decision tree methods are tuned espe ..."
Abstract
-
Cited by 222 (11 self)
- Add to MetaCart
This article describes a new system for induction of oblique decision trees. This system, OC1, combines deterministic hill-climbing with two forms of randomization to find a good oblique split (in the form of a hyperplane) at each node of a decision tree. Oblique decision tree methods are tuned especially for domains in which the attributes are numeric, although they can be adapted to symbolic or mixed symbolic/numeric attributes. We present extensive empirical studies, using both real and artificial data, that analyze OC1's ability to construct oblique trees that are smaller and more accurate than their axis-parallel counterparts. We also examine the benefits of randomization for the construction of oblique decision trees. 1. Introduction Current data collection technology provides a unique challenge and opportunity for automated machine learning techniques. The advent of major scientific projects such as the Human Genome Project, the Hubble Space Telescope, and the human brain mappi...
Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey
- Data Mining and Knowledge Discovery
, 1997
"... Decision trees have proved to be valuable tools for the description, classification and generalization of data. Work on constructing decision trees from data exists in multiple disciplines such as statistics, pattern recognition, decision theory, signal processing, machine learning and artificial ne ..."
Abstract
-
Cited by 122 (1 self)
- Add to MetaCart
Decision trees have proved to be valuable tools for the description, classification and generalization of data. Work on constructing decision trees from data exists in multiple disciplines such as statistics, pattern recognition, decision theory, signal processing, machine learning and artificial neural networks. Researchers in these disciplines, sometimes working on quite different problems, identified similar issues and heuristics for decision tree construction. This paper surveys existing work on decision tree construction, attempting to identify the important issues involved, directions the work has taken and the current state of the art. Keywords: classification, tree-structured classifiers, data compaction 1. Introduction Advances in data collection methods, storage and processing technology are providing a unique challenge and opportunity for automated data exploration techniques. Enormous amounts of data are being collected daily from major scientific projects e.g., Human Genome...
Improving Classification Methods via Feature Selection
- Machine Learning
, 1992
"... We have been experimenting with methods for improving the speed and accuracy of machine learning programs on large data sets, especially those in which the data objects have large numbers of features. The development of automated solutions to this problem is crucial for the success of future data co ..."
Abstract
-
Cited by 10 (0 self)
- Add to MetaCart
We have been experimenting with methods for improving the speed and accuracy of machine learning programs on large data sets, especially those in which the data objects have large numbers of features. The development of automated solutions to this problem is crucial for the success of future data collection efforts, in which hundreds of millions of objects will need to be classified on-line. Accuracies must be very high in order to ensure that objects are not stored with the wrong labels or in the wrong databases. In addition, methods should be able to identify the most relevant features to use for a particular classification task. We have developed feature selection methods and classification algorithms for application on large, real-world databases. Our feature selection algorithm searches a small fraction of the possible subsets of features, and it often finds optimal or near-optimal classifiers. By combining this algorithm with machine learning methods, we have been able to elimina...
Decision trees: an overview and their use in medicine
- Journal of Medical Systems
, 2002
"... In medical decision making (classification, diagnosing, etc.) there are many situations where decision must be made effectively and reliably. Conceptual simple decision making models with the possibility of automatic learning are the most appropriate for performing such tasks. Decision trees are a r ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
In medical decision making (classification, diagnosing, etc.) there are many situations where decision must be made effectively and reliably. Conceptual simple decision making models with the possibility of automatic learning are the most appropriate for performing such tasks. Decision trees are a reliable and effective decision making technique that provide high classification accuracy with a simple representation of gathered knowledge and they have been used in different areas of medical decision making. In the paper we present the basic characteristics of decision trees and the successful alternatives to the traditional induction approach with the emphasis on existing and possible future applications in medicine. Key words: decision trees, classification, decision making, machine learning 1.
Bagging in Computer Vision
- CVPR
, 1998
"... Previous research has shown that aggregatedpredictors improve the performance of non-parametric function approximation techniques. This paper presents the results of applying aggregatedpredictors to a computer vision problem, and shows that the method of bagging signi#cantly improves performance. In ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
Previous research has shown that aggregatedpredictors improve the performance of non-parametric function approximation techniques. This paper presents the results of applying aggregatedpredictors to a computer vision problem, and shows that the method of bagging signi#cantly improves performance. In fact, the results arebetter than those previously reportedon other domains. This paper explains this performance in terms of the variance and bias. 1
Bagging Predictors
"... Abstract. Bagging predictors is a method for generating multiple versions of a predictor and using these to get an aggregated predictor. The aggregation averages over the versions when predicting a numerical outcome and does a plurality vote when predicting a class. The multiple versions are formed ..."
Abstract
- Add to MetaCart
Abstract. Bagging predictors is a method for generating multiple versions of a predictor and using these to get an aggregated predictor. The aggregation averages over the versions when predicting a numerical outcome and does a plurality vote when predicting a class. The multiple versions are formed by making bootstrap replicates of the learning set and using these as new learning sets. Tests on real and simulated data sets using classification and regression trees and subset selection in linear regression show that bagging can give substantial gains in accuracy. The vital element is the instability of the prediction method. If perturbing the learning set can cause significant changes in the predictor constructed, then bagging can improve accuracy.

