Results 1 - 10
of
49
Statistical Comparisons of Classifiers over Multiple Data Sets
, 2006
"... While methods for comparing two learning algorithms on a single data set have been scrutinized for quite some time already, the issue of statistical tests for comparisons of more algorithms on multiple data sets, which is even more essential to typical machine learning studies, has been all but igno ..."
Abstract
-
Cited by 120 (0 self)
- Add to MetaCart
While methods for comparing two learning algorithms on a single data set have been scrutinized for quite some time already, the issue of statistical tests for comparisons of more algorithms on multiple data sets, which is even more essential to typical machine learning studies, has been all but ignored. This article reviews the current practice and then theoretically and empirically examines several suitable tests. Based on that, we recommend a set of simple, yet safe and robust non-parametric tests for statistical comparisons of classifiers: the Wilcoxon signed ranks test for comparison of two classifiers and the Friedman test with the corresponding post-hoc tests for comparison of more classifiers over multiple data sets. Results of the latter can also be neatly presented with the newly introduced CD (critical difference) diagrams.
KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems ⋆
"... be inserted by the editor) ..."
Properties of Machine Learning Applications for Use in Metamorphic Testing
"... It is challenging to test machine learning (ML) applications, which are intended to learn properties of data sets where the correct answers are not already known. In the absence of a test oracle, one approach to testing these applications is to use metamorphic testing, in which properties of the app ..."
Abstract
-
Cited by 8 (6 self)
- Add to MetaCart
It is challenging to test machine learning (ML) applications, which are intended to learn properties of data sets where the correct answers are not already known. In the absence of a test oracle, one approach to testing these applications is to use metamorphic testing, in which properties of the application are exploited to define transformation functions on the input, such that the new output will be unchanged or can easily be predicted based on the original output; if the output is not as expected, then a defect must exist in the application. Here, we seek to enumerate and classify the metamorphic properties of some machine learning algorithms, and demonstrate how these can be applied to reveal defects in the applications of interest. In addition to the results of our testing, we present a set of properties that can be used to define these metamorphic relationships so that metamorphic testing can be used as a general approach to testing machine learning applications. 1
Weighted clustering ensembles
- In Proceedings of The 6th SIAM International Conference on Data Mining
, 2006
"... Cluster ensembles offer a solution to challenges inherent to clustering arising from its ill-posed nature. Cluster ensembles can provide robust and stable solutions by leveraging the consensus across multiple clustering results, while averaging out emergent spurious structures that arise due to the ..."
Abstract
-
Cited by 7 (2 self)
- Add to MetaCart
Cluster ensembles offer a solution to challenges inherent to clustering arising from its ill-posed nature. Cluster ensembles can provide robust and stable solutions by leveraging the consensus across multiple clustering results, while averaging out emergent spurious structures that arise due to the various biases to which each participating algorithm is tuned. In this paper, we address the problem of combining multiple weighted clusters which belong to different subspaces of the input space. We leverage the diversity of the input clusterings in order to generate a consensus partition that is superior to the participating ones. Since we are dealing with weighted clusters, our consensus function makes use of the weight vectors associated with the clusters. The experimental results show that our ensemble technique is capable of producing a partition that is as good as or better than the best individual clustering. 1
Microarray data mining with visual programming
- Bioinformatics
, 2005
"... Summary: Visual programming offers an intuitive means of combining known analysis and visualization methods into powerful applications. The system presented here enables users who are not programmers to manage microarray and genomic data flow and to customize their analysis by combining common data ..."
Abstract
-
Cited by 5 (2 self)
- Add to MetaCart
Summary: Visual programming offers an intuitive means of combining known analysis and visualization methods into powerful applications. The system presented here enables users who are not programmers to manage microarray and genomic data flow and to customize their analysis by combining common data analysis tools to fit their needs.
Grounding Spatial Prepositions for Video Search
"... Spatial language video retrieval is an important real-world problem that forms a test bed for evaluating semantic structures for natural language descriptions of motion on naturalistic data. Video search by natural language query requires that linguistic input be converted into structures that opera ..."
Abstract
-
Cited by 4 (3 self)
- Add to MetaCart
Spatial language video retrieval is an important real-world problem that forms a test bed for evaluating semantic structures for natural language descriptions of motion on naturalistic data. Video search by natural language query requires that linguistic input be converted into structures that operate on video in order to find clips that match a query. This paper describes a framework for grounding the meaning of spatial prepositions in video. We present a library of features that can be used to automatically classify a video clip based on whether it matches a natural language query. To evaluate these features, we collected a corpus of natural language descriptions about the motion of people in video clips. We characterize the language used in the corpus, and use it to train and test models for the meanings of the spatial prepositions “to, ” “across, ” “through, ” “out, ” “along, ” “towards,” and “around. ” The classifiers can be used to build a spatial language video retrieval system that finds clips matching queries such as “across the kitchen.”
Reducing complex attribute interaction through non-algebraic feature construction
- In Proc. of the IASTED-AIA
, 2007
"... The importance of preprocessing data before looking for patterns is greatest when data representation is primitive. If lack of domain experts prevents the use of highly informative attributes, patterns are hard to uncover due to complex attribute interactions. Feature construction intends to create ..."
Abstract
-
Cited by 2 (2 self)
- Add to MetaCart
The importance of preprocessing data before looking for patterns is greatest when data representation is primitive. If lack of domain experts prevents the use of highly informative attributes, patterns are hard to uncover due to complex attribute interactions. Feature construction intends to create new features that encapsulate and highlight the hidden interactions. However, its success often relies on the appropriateness of a given set of algebraic operators for expressing the relevant combination of attributes in the current domain. When lacking prior knowledge of appropriate operators, systems use non-algebraic feature construction techniques to extract features directly from training data. The paper analyzes two such systems, MFE2/GA and HINT, concluding that their different design components suggest complementary functionalities. This is supported by an empirical system comparison using synthetic and real-world data where attribute interaction prevails.
Algorithms for Feature Selection in Rank-Order Spaces
, 2005
"... The problem of feature selection in supervised learning situations is considered, where all features are drawn from a common domain and are best interpreted via ordinal comparisons with other features, rather than as numerical values. In particular, each instance is a member of a space of ranked fea ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
The problem of feature selection in supervised learning situations is considered, where all features are drawn from a common domain and are best interpreted via ordinal comparisons with other features, rather than as numerical values. In particular, each instance is a member of a space of ranked features. This problem is pertinent in electoral, financial, and bioinformatics contexts, where features denote assessments in terms of counts, ratings, or rankings. Four algorithms for feature selection in such rank-order spaces are presented; two are information-theoretic, and two are order-theoretic. These algorithms are empirically evaluated against both synthetic and real world datasets. The main results of this paper are (i) characterization of relationships and equivalences between different feature selection strategies with respect to the spaces in which they operate, and the distributions they seek to approximate; (ii) identification of computationally simple and efficient strategies that perform surprisingly well; and (iii) a feasibility study of order-theoretic feature selection for large scale datasets. 1

