Results 1 - 10
of
2,903
DOI 10.1007/s10270-008-0106-z REGULAR PAPER Process mining: a two-step approach to balance between underfitting and overfitting
"... © The Author(s) 2008. This article is published with open access at Springerlink.com Abstract Process mining includes the automated discovery of processes from event logs. Based on observed events (e.g., activities being executed or messages being exchanged) a process model is constructed. One of th ..."
Abstract
- Add to MetaCart
© The Author(s) 2008. This article is published with open access at Springerlink.com Abstract Process mining includes the automated discovery of processes from event logs. Based on observed events (e.g., activities being executed or messages being exchanged) a process model is constructed. One
The CN2 Induction Algorithm
- MACHINE LEARNING
, 1989
"... Systems for inducing concept descriptions from examples are valuable tools for assisting in the task of knowledge acquisition for expert systems. This paper presents a description and empirical evaluation of a new induction system, cn2, designed for the efficient induction of simple, comprehensib ..."
Abstract
-
Cited by 884 (6 self)
- Add to MetaCart
Systems for inducing concept descriptions from examples are valuable tools for assisting in the task of knowledge acquisition for expert systems. This paper presents a description and empirical evaluation of a new induction system, cn2, designed for the efficient induction of simple
Lag length selection and the construction of unit root tests with good size and power
- Econometrica
, 2001
"... It is widely known that when there are errors with a moving-average root close to −1, a high order augmented autoregression is necessary for unit root tests to have good size, but that information criteria such as the AIC and the BIC tend to select a truncation lag (k) that is very small. We conside ..."
Abstract
-
Cited by 534 (14 self)
- Add to MetaCart
It is widely known that when there are errors with a moving-average root close to −1, a high order augmented autoregression is necessary for unit root tests to have good size, but that information criteria such as the AIC and the BIC tend to select a truncation lag (k) that is very small. We
Regression Shrinkage and Selection Via the Lasso
- Journal of the Royal Statistical Society, Series B
, 1994
"... We propose a new method for estimation in linear models. The "lasso" minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant. Because of the nature of this constraint it tends to produce some coefficients that are exactl ..."
Abstract
-
Cited by 4055 (51 self)
- Add to MetaCart
We propose a new method for estimation in linear models. The "lasso" minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant. Because of the nature of this constraint it tends to produce some coefficients
New Support Vector Algorithms
, 2000
"... this article with the regression case. To explain this, we will introduce a suitable definition of a margin that is maximized in both cases ..."
Abstract
-
Cited by 461 (42 self)
- Add to MetaCart
this article with the regression case. To explain this, we will introduce a suitable definition of a margin that is maximized in both cases
Process Mining: A Two-Step Approach using Transition Systems and Regions
- BPM Center Report BPM-06-30, BPM Center
, 2006
"... Abstract. More and more information about processes is recorded by information systems in the form of so-called “event logs”. Despite the omnipresence and richness of these event logs, most software vendors have been focusing on relatively simple questions under the assumption that the process is fi ..."
Abstract
-
Cited by 19 (9 self)
- Add to MetaCart
in literature. Unfortunately, these techniques have problems when discovering processes with complicated dependencies. This paper proposes a new two-step approach. First, a transition system is constructed which can be modified to avoid over-fitting. Then, using the “theory of regions”, the model is synthesized
On the underfitting and overfitting sets of models chosen by order selection criteria
- J. Multivariate Anal
, 1999
"... For a general class of order selection criteria, we establish analytic and nonasymptotic evaluations of both the underfitting and overfitting sets of selected models. These evaluations are further specified in various situations including regressions and autoregressions with finite or infinite varia ..."
Abstract
-
Cited by 18 (0 self)
- Add to MetaCart
For a general class of order selection criteria, we establish analytic and nonasymptotic evaluations of both the underfitting and overfitting sets of selected models. These evaluations are further specified in various situations including regressions and autoregressions with finite or infinite
Overfitting Avoidance as Bias
, 1992
"... Strategies for increasing predictive accuracy through selective pruning have been widely adopted by researchers in decision tree induction. It is easy to get the impression from research reports that there are statistical reasons for believing that these overfitting avoidance strategies do increase ..."
Abstract
-
Cited by 136 (2 self)
- Add to MetaCart
accuracy and that, as a research community, we are making progress toward developing powerful, general methods for guarding against overfitting in inducing decision trees. In fact, any overfitting avoidance strategy amounts to a form of bias and, as such, may degrade performance instead of improving it
Process Mining: On the Balance Between
"... Abstract. Process mining techniques attempt to extract non-trivial and useful information from event logs. One aspect of process mining is control-flow discovery, i.e., automatically constructing a process model (e.g., a Petri net) describing the causal dependencies between activities. One of the es ..."
Abstract
- Add to MetaCart
Abstract. Process mining techniques attempt to extract non-trivial and useful information from event logs. One aspect of process mining is control-flow discovery, i.e., automatically constructing a process model (e.g., a Petri net) describing the causal dependencies between activities. One
Results 1 - 10
of
2,903