Results 1 - 10
of
43
Flexible smoothing with B-splines and penalties
- Statistical Science
, 1996
"... B-splines are attractive for nonparametric modelling, but choosing the optimal number and positions of knots is a complex task. Equidistant knots can be used, but their small and discrete number allows only limited control over smoothness and fit. We propose to use a relatively large number of knots ..."
Abstract
-
Cited by 112 (2 self)
- Add to MetaCart
B-splines are attractive for nonparametric modelling, but choosing the optimal number and positions of knots is a complex task. Equidistant knots can be used, but their small and discrete number allows only limited control over smoothness and fit. We propose to use a relatively large number of knots and a difference penalty on coefficients of adjacent B-splines. We show connections to the familiar spline penalty on the integral of the squared second derivative. A short overview of B-splines, their construction, and penalized likelihood is presented. We discuss properties of penalized B-splines and propose various criteria for the choice of an optimal penalty parameter. Nonparametric logistic regression, density estimation and scatterplot smoothing are used as examples. Some details of the computations are presented. Keywords: Generalized linear models, smoothing, nonparametric models, splines, density estimation. Address for correspondence: DCMR Milieudienst Rijnmond, 's-Gravelandse...
Key Concepts in Model Selection: Performance and Generalizability
- Journal of Mathematical Psychology
, 2000
"... methods of model selection, and how do they work? Which methods perform better than others, and in what circumstances? These questions rest on a number of key concepts in a relatively underdeveloped field. The aim of this essay is to explain some background concepts, highlight some of the results in ..."
Abstract
-
Cited by 26 (11 self)
- Add to MetaCart
methods of model selection, and how do they work? Which methods perform better than others, and in what circumstances? These questions rest on a number of key concepts in a relatively underdeveloped field. The aim of this essay is to explain some background concepts, highlight some of the results in this special issue, and to add my own. The standard methods of model selection include classical hypothesis testing, maximum likelihood, Bayes method, minimum description length, cross-validation and Akaike’s information criterion. They all provide an implementation of Occam’s razor, in which parsimony or simplicity is balanced against goodness-of-fit. These methods primarily take account of the sampling errors in parameter estimation, although their relative success at this task depends on the circumstances. However, the aim of model selection should also include the ability of a model to generalize to predictions in a different domain. Errors of extrapolation, or generalization, are different from errors of parameter estimation. So, it seems that simplicity and parsimony may be an additional factor in managing these errors, in which case the standard methods of model selection are incomplete implementations of Occam’s razor. 1. WHAT IS MODEL SELECTION? William of Ockham (1285- 1347/49) will always be remembered for his famous postulations of Ockham’s razor (also spelled ‘Occam’), which states that entities are not to be multiplied beyond necessity. In a similar vein, Sir Isaac Newton’s first rule of hypothesizing instructs us that we are to admit no more causes of natural things than such as are both true and sufficient to explain their appearances. While they This paper is derived from a presentation at the Methods of Model Selection symposium at Indiana University
Construction and use of linear regression models for processor performance analysis
- In Proc. 12th IEEE Symposium on High Performance Computer Architecture
, 2006
"... Processor architects have a challenging task of evaluating a large design space consisting of several interacting parameters and optimizations. In order to assist architects in making crucial design decisions, we build linear regression models that relate processor performance to micro-architectural ..."
Abstract
-
Cited by 23 (2 self)
- Add to MetaCart
Processor architects have a challenging task of evaluating a large design space consisting of several interacting parameters and optimizations. In order to assist architects in making crucial design decisions, we build linear regression models that relate processor performance to micro-architectural parameters, using simulation based experiments. We obtain good approximate models using an iterative process in which Akaike’s information criteria is used to extract a good linear model from a small set of simulations, and limited further simulation is guided by the model using D-optimal experimental designs. The iterative process is repeated until desired error bounds are achieved. We used this procedure to establish the relationship of the CPI performance response to 26 key micro-architectural parameters using a detailed cycle-by-cycle superscalar processor simulator. The resulting models provide a significance ordering on all micro-architectural parameters and their interactions, and explain the performance variations of micro-architectural techniques. 1.
Understanding Long-Range Correlations in DNA Sequences
- Physica D
, 1994
"... . In this paper, we review the literature on statistical long-range correlation in DNA sequences. We examine the current evidence for these correlations, and conclude that a mixture of many length scales #including some relatively long ones# in DNA sequences is responsible for the observed 1=f - ..."
Abstract
-
Cited by 14 (6 self)
- Add to MetaCart
. In this paper, we review the literature on statistical long-range correlation in DNA sequences. We examine the current evidence for these correlations, and conclude that a mixture of many length scales #including some relatively long ones# in DNA sequences is responsible for the observed 1=f -like spectral component. We note the complexity of the correlation structure in DNA sequences. The observed complexity often makes it hard, or impossible, to decompose the sequence into a few statistically stationary regions. We suggest that, based on the complexityof DNA sequences, a fruitful approach to understand long-range correlation is to model duplication, and other rearrangement processes, in DNA sequences. One model, called #expansion-modi#cation system", contains only point duplication and point mutation. Though simplistic, this model is able to generate sequences with 1=f spectra. We emphasize the importance of DNA duplication in its contribution to the observed long-rang...
Testing the Hypothesis of Common Ancestry
, 2002
"... this paper, we assess the arguments that have been made in the biological literature and discuss a methodology that has not been applied to this problem before ..."
Abstract
-
Cited by 14 (3 self)
- Add to MetaCart
this paper, we assess the arguments that have been made in the biological literature and discuss a methodology that has not been applied to this problem before
Saccade target selection in frontal eye field of macaque. I. Visual and premovement activation
- The Journal of Neuroscience
, 1995
"... We investigated how the brain selects the targets for eye movements, a process in which the outcome of visual pro-cessing is converted into guided action. Macaque monkeys were trained to make a saccade to fixate a salient target presented either alone or with multiple distracters during visual searc ..."
Abstract
-
Cited by 14 (1 self)
- Add to MetaCart
We investigated how the brain selects the targets for eye movements, a process in which the outcome of visual pro-cessing is converted into guided action. Macaque monkeys were trained to make a saccade to fixate a salient target presented either alone or with multiple distracters during visual search. Neural activity was recorded in the frontal eye field, a cortical area at the interface of visual process-ing and eye movement production. Neurons discharging after stimulus presentation and before saccade initiation were analyzed. The initial visual response of frontal eye field neurons was modulated by the presence of multiple stimuli and by whether a saccade was going to be pro-duced, but the initial visual response did not discriminate the target of the search array from the distracters. In the latent period before saccade initiation, the activity of most
Kinky Tomographic Reconstruction
, 1996
"... We address the issue of how to make decisions about the degree of smoothness demanded of a flexible contour used to model the boundary of a 2D object. We demonstrate the use of a Bayesian approach to set the strength of the smoothness prior for a tomographic reconstruction problem. The Akaike Inform ..."
Abstract
-
Cited by 13 (10 self)
- Add to MetaCart
We address the issue of how to make decisions about the degree of smoothness demanded of a flexible contour used to model the boundary of a 2D object. We demonstrate the use of a Bayesian approach to set the strength of the smoothness prior for a tomographic reconstruction problem. The Akaike Information Criterion is used to determine whether to allow a kink in the contour.
Loosening the constraints on illusory conjunctions: the role of exposure duration and attention
- Journal of Experimental Psychology: Human Perception and Performance
, 1995
"... Illusory conjunctions are the incorrect combination of correctly perceived features, such as color and shape. They have been found to occur using a brief exposure (under 200 ms) and a dual task designed to divert attention. The present study investigated the roles of exposure duration and attention ..."
Abstract
-
Cited by 13 (4 self)
- Add to MetaCart
Illusory conjunctions are the incorrect combination of correctly perceived features, such as color and shape. They have been found to occur using a brief exposure (under 200 ms) and a dual task designed to divert attention. The present study investigated the roles of exposure duration and attention in obtaining illusory conjunctions. Several mathematical models of the feature integration task were also assessed. Experiment 1 tested participants ' accuracy at combining features using a long exposure and an attention-diverting task. Experiment 2 compared performance with and without the attention-diverting task. The final experiment compared performance using a brief (0.15 s) and a long (1.5 s) exposure duration without an attention-diverting task. Neither attention nor exposure duration had a significant effect on feature integration. According to traditional theories of visual recognition, people identify an object through an analysis of the various visual features of that object. Proposed visual features include shape primitives (e.g., Selfridge, 1959), volumetric solids (Biederman, 1985), spatial frequencies (DeValois &
Theoretical Analyses of Cross-Validation Error and Voting in Instance-Based Learning
, 1993
"... This paper begins with a general theory of error in cross-validation testing of algorithms for supervised learning from examples. It is assumed that the examples are described by attribute-value pairs, where the values are symbolic. Cross-validation requires a set of training examples and a set of t ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
This paper begins with a general theory of error in cross-validation testing of algorithms for supervised learning from examples. It is assumed that the examples are described by attribute-value pairs, where the values are symbolic. Cross-validation requires a set of training examples and a set of testing examples. The value of the attribute that is to be predicted is known to the learner in the training set, but unknown in the testing set. The theory demonstrates that cross-validation error has two components: error on the training set (inaccuracy) and sensitivity to noise (instability). This general theory is then applied to voting in instance-based learning. Given an example in the testing set, a typical instance-based learning algorithm predicts the designated attribute by voting among the k nearest neighbors (the k most similar examples) to the testing example in the training set. Voting is intended to increase the stability (resistance to noise) of instance-based learning, but a ...
A Theory of Cross-Validation Error
, 1993
"... This paper presents a theory of error in cross-validation testing of algorithms for predicting real-valued attributes. The theory justifies the claim that predicting real-valued attributes requires balancing the conflicting demands of simplicity and accuracy. Furthermore, the theory indicates precis ..."
Abstract
-
Cited by 7 (0 self)
- Add to MetaCart
This paper presents a theory of error in cross-validation testing of algorithms for predicting real-valued attributes. The theory justifies the claim that predicting real-valued attributes requires balancing the conflicting demands of simplicity and accuracy. Furthermore, the theory indicates precisely how these conflicting demands must be balanced, in order to minimize cross-validation error. A general theory is presented, then it is developed in detail for linear regression and instance-based learning.

