• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

ROC ’n’ Rule Learning – Towards a Better Understanding of Covering Algorithms (0)

by J Fürnkranz, P Flach
Venue:Machine Learning
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 32
Next 10 →

An Ensemble Classifier for Drifting Concepts

by Martin Scholz, Ralf Klinkenberg - In Proceedings of the Second International Workshop on Knowledge Discovery in Data Streams , 2005
"... This paper proposes a boosting-like method to train a classifier ensemble from data streams. It naturally adapts to concept drift and allows to quantify the drift in terms of its base learners. The algorithm is empirically shown to outperform learning algorithms that ignore concept drift. It per ..."
Abstract - Cited by 15 (1 self) - Add to MetaCart
This paper proposes a boosting-like method to train a classifier ensemble from data streams. It naturally adapts to concept drift and allows to quantify the drift in terms of its base learners. The algorithm is empirically shown to outperform learning algorithms that ignore concept drift. It performs no worse than advanced adaptive time window and example selection strategies that store all the data and are thus not suited for mining massive streams.

Pattern teams

by Arno J. Knobbe, Eric K. Y. Ho - Proceedings of the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD-06 , 2006
"... Abstract Pattern discovery algorithms typically produce many interesting patterns. In most cases, patterns are reported based on their individual merits, and little attention is given to the interestingness of a pattern in the context of other patterns reported. In this paper, we propose filtering t ..."
Abstract - Cited by 11 (2 self) - Add to MetaCart
Abstract Pattern discovery algorithms typically produce many interesting patterns. In most cases, patterns are reported based on their individual merits, and little attention is given to the interestingness of a pattern in the context of other patterns reported. In this paper, we propose filtering the returned set of patterns based on a number of quality measures for pattern sets. We refer to a small subset of patterns that optimises such a measure as a pattern team. A number of quality measures, both supervised and unsupervised, is proposed. We analyse to what extent each of the measures captures a number of ‘intuitions ’ users may have concerning effective and informative pattern teams. Such intuitions involve qualities such as independence of patterns, low overlap, and combined predictiveness. 1

An Empirical Quest for Optimal Rule Learning Heuristics

by Frederik Janssen, Johannes Fürnkranz , 2008
"... The primary goal of the research reported in this paper is to identify what criteria are responsible for the good performance of a heuristic rule evaluation function in a greedy topdown covering algorithm. We first argue that search heuristics for inductive rule learning algorithms typically trade o ..."
Abstract - Cited by 11 (7 self) - Add to MetaCart
The primary goal of the research reported in this paper is to identify what criteria are responsible for the good performance of a heuristic rule evaluation function in a greedy topdown covering algorithm. We first argue that search heuristics for inductive rule learning algorithms typically trade off consistency and coverage, and we investigate this trade-off by determining optimal parameter settings for five different parametrized heuristics. In order to avoid biasing our study by known functional families, we also investigate the potential of using meta-learning for obtaining alternative rule learning heuristics. The key results of this experimental study are not only practical default values for commonly used heuristics and a broad comparative evaluation of known and novel rule learning heuristics, but we also gain theoretical insights into factors that are responsible for a good performance. For example, we observe that consistency should be weighed more heavily than coverage, presumably because a lack of coverage can later be corrected by learning additional rules.

Sampling-Based Sequential Subgroup Mining

by Martin Scholz , 2005
"... Subgroup discovery is a learning task that aims at finding interesting rules from classified examples. The search is guided by a utility function, trading o# the coverage of rules against their statistical unusualness. One shortcoming of existing approaches is that they do not incorporate prior know ..."
Abstract - Cited by 10 (5 self) - Add to MetaCart
Subgroup discovery is a learning task that aims at finding interesting rules from classified examples. The search is guided by a utility function, trading o# the coverage of rules against their statistical unusualness. One shortcoming of existing approaches is that they do not incorporate prior knowledge. To this end a novel generic sampling strategy is proposed. It allows to turn pattern mining into an iterative process. In each iteration the focus of subgroup discovery lies on those patterns that are unexpected with respect to prior knowledge and previously discovered patterns. The result of this technique is a small diverse set of understandable rules that characterise a specified property of interest. As another contribution this article derives a simple connection between subgroup discovery and classifier induction. For a popular utility function this connection allows to apply any standard rule induction algorithm to the task of subgroup discovery after a step of stratified resampling. The proposed techniques are empirically compared to state of the art subgroup discovery algorithms.

An Experimental Comparison of Performance Measures for Classification

by C. Ferri, J. Hernández-Orallo, R. Modroiu , 2007
"... Performance metrics in classification are fundamental to assess the quality of learning methods and learned models. However, many different measures have been defined in the literature with the aim of making better choices in general or for a specific application area. Choices made by one metric are ..."
Abstract - Cited by 8 (4 self) - Add to MetaCart
Performance metrics in classification are fundamental to assess the quality of learning methods and learned models. However, many different measures have been defined in the literature with the aim of making better choices in general or for a specific application area. Choices made by one metric are claimed to be different from choices made by other metrics. In this work we analyse experimentally the behaviour of 18 different performance metrics in several scenarios, identifying clusters and relationships between measures. We also perform a sensitivity analysis for all of them in terms of several traits: class threshold choice, separability/ranking quality, calibration performance and sensitivity to changes in prior class distribution. From the definitions and the experiments, we give a comprehensive analysis on the relationships between metrics, and a taxonomy and arrangement of them according to the previous traits. This can be useful to choose the most adequate measure (or set of measures) for a specific application. Additionally, the study also highlights some niches in which new measures might be defined and also shows that some supposedly innovative measures make the same choices (or almost) than existing ones. Finally, this work can also be used as a reference for comparing experimental results in the pattern recognition and machine learning literature, when using different measures.

Gleaner: Creating Ensembles of Firstorder Clauses to Improve Recall-Precision Curves

by Mark Goadrich, Louis Oliphant, Jude Shavlik - Machine Learning , 2006
"... Abstract. Many domains in the field of Inductive Logic Programming (ILP) involve highly unbalanced data. A common way to measure performance in these domains is to use precision and recall instead of simply using accuracy. The goal of our research is to find new approaches within ILP particularly su ..."
Abstract - Cited by 6 (5 self) - Add to MetaCart
Abstract. Many domains in the field of Inductive Logic Programming (ILP) involve highly unbalanced data. A common way to measure performance in these domains is to use precision and recall instead of simply using accuracy. The goal of our research is to find new approaches within ILP particularly suited for large, highly-skewed domains. We propose Gleaner, a randomized search method that collects good clauses from a broad spectrum of points along the recall dimension in recall-precision curves and employs an “at least L of these K clauses ” thresholding method to combine sets of selected clauses. Our research focuses on Multi-Slot Information Extraction (IE), a task that typically involves many more negative examples than positive examples. We formulate this problem into a relational domain, using two large testbeds involving the extraction of important relations from the abstracts of biomedical journal articles. We compare Gleaner to ensembles of standard theories learned by Aleph, finding that Gleaner produces comparable testset results in a fraction of the training time.

From Local Patterns to Global Models: The LeGo Approach to Data Mining

by Arno Knobbe, Bruno Crémilleux, Johannes Fürnkranz, Martin Scholz
"... Abstract. In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of ..."
Abstract - Cited by 6 (2 self) - Add to MetaCart
Abstract. In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of which is formulated in terms of different formal constraints. It starts with a phase of mining patterns that are individually promising. Later phases establish the context given by the global data mining task by selecting groups of diverse and highly informative patterns, which are finally combined to one or more global models that address the overall data mining task(s). The paper discusses the connection to various learning techniques, and illustrates that our framework is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams, multiview learning, and several other popular algorithms. The Safarii learning toolbox serves as a proof-of-concept of its high potential for practical data mining applications. Finally, we point out several challenging open research questions that naturally emerge in a constraint-based local-to-global pattern mining, selection, and combination framework. 1

Boosting Classifiers for Drifting Concepts

by Martin Scholz, Ralf Klinkenberg - Intelligent Data Analysis (IDA), Special Issue on Knowledge Discovery from Data Streams , 2006
"... This paper proposes a boosting-like method to train a classifier ensemble from data streams. It naturally adapts to concept drift and allows to quantify the drift in terms of its base learners. The algorithm is empirically shown to outperform learning algorithms that ignore concept drift. It perform ..."
Abstract - Cited by 5 (1 self) - Add to MetaCart
This paper proposes a boosting-like method to train a classifier ensemble from data streams. It naturally adapts to concept drift and allows to quantify the drift in terms of its base learners. The algorithm is empirically shown to outperform learning algorithms that ignore concept drift. It performs no worse than advanced adaptive time window and example selection strategies that store all the data and are thus not suited for mining massive streams.

ROCCER: A ROC convex hull rule learning algorithm

by Ronaldo C. Prati, Peter A. Flach - Proceedings of the ECML/PKDD Workshop on Advances in Inductive Rule Learning , 2004
"... In this paper we propose a method to construct rule sets that have a convex hull in ROC space. We introduce a rule selection algorithm called ROCCER, which operates by selecting rules from a larger set of rules in order to optimise Area Under the ROC Curve (AUC). Compared with set covering algor ..."
Abstract - Cited by 4 (0 self) - Add to MetaCart
In this paper we propose a method to construct rule sets that have a convex hull in ROC space. We introduce a rule selection algorithm called ROCCER, which operates by selecting rules from a larger set of rules in order to optimise Area Under the ROC Curve (AUC). Compared with set covering algorithms, our method is less dependent on the previously induced rules. Experimental results on three UCI datasets show significant improvements on two of these.

An analysis of stopping and filtering criteria for rule learning

by Johannes Fürnkranz, Peter Flach - In Boulicaut, J.-F , 2004
"... Abstract. In this paper, we investigate the properties of commonly used prepruning heuristics for rule learning by visualizing them in PN-space. PN-space is a variant of ROC-space, which is particularly suited for visualizing the behavior of rule learning and its heuristics. On the one hand, we thin ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
Abstract. In this paper, we investigate the properties of commonly used prepruning heuristics for rule learning by visualizing them in PN-space. PN-space is a variant of ROC-space, which is particularly suited for visualizing the behavior of rule learning and its heuristics. On the one hand, we think that our results lead to a better understanding of the effects of stopping and filtering criteria, and hence to a better understanding of rule learning algorithms in general. On the other hand, we uncover a few shortcomings of commonly used heuristics, thereby hopefully motivating additional work in this area. 1
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University