Results 1  10
of
198
Genetic Network Inference: From CoExpression Clustering To Reverse Engineering
, 2000
"... motivation: Advances in molecular biological, analytical and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using highthroughput gene expression assays, we are able to measure the output of the ge ..."
Abstract

Cited by 210 (0 self)
 Add to MetaCart
motivation: Advances in molecular biological, analytical and computational technologies are enabling us to systematically investigate the complex molecular processes underlying biological systems. In particular, using highthroughput gene expression assays, we are able to measure the output of the gene regulatory network. We aim here to review datamining and modeling approaches for conceptualizing and unraveling the functional relationships implicit in these datasets. Clustering of coexpression profiles allows us to infer shared regulatory inputs and functional pathways. We discuss various aspects of clustering, ranging from distance measures to clustering algorithms and multiplecluster memberships. More advanced analysis aims to infer causal connections between genes directly, i.e. who is regulating whom and how. We discuss several approaches to the problem of reverse engineering of genetic networks, from discrete Boolean networks, to continuous linear and nonlinear models. We conclude that the combination of predictive modeling with systematic experimental verification will be required to gain a deeper insight into living organisms, therapeutic targeting and bioengineering.
Recognizing People by Their Gait: The Shape of Motion
, 1996
"... > y)). Scaleindependent scalar features of each flow, based on moments of the moving point weighted by u, v,or(u, v), characterize the spatial distribution of the flow. We then analyze the periodic structure of these sequences of scalars. The scalar sequences for an image sequence h ..."
Abstract

Cited by 131 (8 self)
 Add to MetaCart
> y)). Scaleindependent scalar features of each flow, based on moments of the moving point weighted by u, v,or(u, v), characterize the spatial distribution of the flow. We then analyze the periodic structure of these sequences of scalars. The scalar sequences for an image sequence have the same fundamental period but differ in phase, which is a phase feature for each signal. Some phase features are consistent for one person and show significant statistical variation among persons. We use the phase feature vectors to recognize individuals by the shape of their motion. As few as three features out of the full set of twelve lead to excellent discrimination. Keywords: action recognition, gait recognition, motion features, optic flow, motion energy, spatial frequency, analysis Recognizing People by Their Gait: The Shape of Moti
General Notions of Statistical Depth Function
, 2000
"... Statistical depth functions are being formulated ad hoc with increasing popularity in nonparametric inference for multivariate data. Here we introduce several general structures for depth functions, classify many existing examples as special cases, and establish results on the possession, or lack th ..."
Abstract

Cited by 52 (24 self)
 Add to MetaCart
Statistical depth functions are being formulated ad hoc with increasing popularity in nonparametric inference for multivariate data. Here we introduce several general structures for depth functions, classify many existing examples as special cases, and establish results on the possession, or lack thereof, of four key properties desirable for depth functions in general. Roughly speaking, these properties may be described as: affine invariance, maximality at center, monotonicity relative to deepest point, and vanishing at infinity. This provides a more systematic basis for selection of a depth function. In particular, from these and other considerations it is found that the halfspace depth behaves very well overall in comparison with various competitors.
ADE4: a multivariate analysis and graphical display software
 Stat. Comput
, 1997
"... e searching, zooming, selection of points, and display of data values on factor maps. The user interface is simple and homogeneous among all the programs; this contributes to making the use of ADE4 very easy for nonspecialists in statistics, data analysis or computer science. Keywords: Multivar ..."
Abstract

Cited by 45 (8 self)
 Add to MetaCart
e searching, zooming, selection of points, and display of data values on factor maps. The user interface is simple and homogeneous among all the programs; this contributes to making the use of ADE4 very easy for nonspecialists in statistics, data analysis or computer science. Keywords: Multivariate analysis, principal component analysis, correspondence analysis, instrumental variables, canonical correspondence analysis, partial least squares regression, coinertia analysis, graphics, multivariate graphics, interactive graphics, Macintosh, HyperCard, Windows 95 1. Introduction ADE4 is a multivariate analysis and graphical display software for Apple Macintosh and Windows 95 microcomputers. It is made up of several standalone applications, called modules, that feature a wide range of multivariate analysis methods, from simple onetable analysis to threeway table analysis and twotable coupling methods. It also provides many possibilitie
On clustering of fMRI time series
, 1997
"... Introduction. The spatiotemporal fMRI signal is a combination of several interacting components: The locally correlated hemodynamic response, the network of neuronal activations, and global components such as the cardiac cycle, breathing etc. A priori this implies that the signal is correlated in t ..."
Abstract

Cited by 44 (3 self)
 Add to MetaCart
Introduction. The spatiotemporal fMRI signal is a combination of several interacting components: The locally correlated hemodynamic response, the network of neuronal activations, and global components such as the cardiac cycle, breathing etc. A priori this implies that the signal is correlated in time and space, and that these correlations have both short and long range components. Clustering is a classical nonparametric approach to explorative analysis data. By clustering we can group signals according to a given objective function. Clustering of waveforms has already been used in fMRI signal analysis, see e.g. (1). Clustering of stochastic data, however, is hard optimization problem with many potential pitfalls. The "optimal" cluster configuration depends on the particular choice of clustering scheme (e.g. kmeans, kmedians, hierachical clustering) examples are legio (2), but just as importantly on the choice of distance metr
Texture Analysis of SAR Sea Ice Imagery using Gray Level Cooccurrence Matrices
 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
, 1999
"... This paper presents a preliminary study for mapping sea ice patterns (texture) with 100m ERS1 synthetic aperture radar (SAR) imagery. We used graylevel cooccurrence matrices (GLCM) to quantitatively evaluate textural parameters and representations and to determine which parameter values and rep ..."
Abstract

Cited by 39 (2 self)
 Add to MetaCart
This paper presents a preliminary study for mapping sea ice patterns (texture) with 100m ERS1 synthetic aperture radar (SAR) imagery. We used graylevel cooccurrence matrices (GLCM) to quantitatively evaluate textural parameters and representations and to determine which parameter values and representations are best for mapping sea ice texture. We conducted experiments on the quantization levels of the image and the displacement and orientation values of the GLCM by examining the effects textural descriptors such as entropy have in the representation of different sea ice textures. We showed that a complete graylevel representation of the image is not necessary for texture mapping, an eightlevel quantization representation is undesirable for textural representation, and the displacement factor in texture measurements is more important than orientation. In addition, we developed three GLCM implementations and
Structural Properties and Convergence Results for Contours of Sample Statistical Depth Functions
, 2000
"... Statistical depth functions have become increasingly used in nonparametric inference for multivariate data. Here the contours of such functions are studied. Structural properties of the regions enclosed by contours, such as affine equivariance, nestedness, connectedness, and compactness, and almost ..."
Abstract

Cited by 22 (12 self)
 Add to MetaCart
Statistical depth functions have become increasingly used in nonparametric inference for multivariate data. Here the contours of such functions are studied. Structural properties of the regions enclosed by contours, such as affine equivariance, nestedness, connectedness, and compactness, and almost sure convergence results for sample depth contours, are established. Also, specialized results are established for some popular depth functions, including halfspace depth, and for the case of elliptical distributions. Finally, some needed foundational results on almost sure convergence of sample depth functions are provided.
ℓpnorm multiple kernel learning
 Journal of Machine Learning Research
, 2011
"... Learning linear combinations of multiple kernels is an appealing strategy when the right choice of features is unknown. Previous approaches to multiple kernel learning (MKL) promote sparse kernel combinations to support interpretability and scalability. Unfortunately, thisℓ1norm MKL is rarely obser ..."
Abstract

Cited by 22 (3 self)
 Add to MetaCart
Learning linear combinations of multiple kernels is an appealing strategy when the right choice of features is unknown. Previous approaches to multiple kernel learning (MKL) promote sparse kernel combinations to support interpretability and scalability. Unfortunately, thisℓ1norm MKL is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we extend MKL to arbitrary norms. We devise new insights on the connection between several existing MKL formulations and develop two efficient interleaved optimization strategies for arbitrary norms, that isℓpnorms with p≥1. This interleaved optimization is much faster than the commonly used wrapper approaches, as demonstrated on several data sets. A theoretical analysis and an experiment on controlled artificial data shed light on the appropriateness of sparse, nonsparse and ℓ∞norm MKL in various scenarios. Importantly, empirical applications of ℓpnorm MKL to three realworld problems from computational biology show that nonsparse MKL achieves accuracies that surpass the stateoftheart. Data sets, source code to reproduce the experiments, implementations of the algorithms, and
Environmental Determinants of Lexical Processing Effort
, 2000
"... A central concern of psycholinguistic research is explaining the relative ease or difficulty involved in processing words. In this thesis, we explore the connection between lexical processing effort and measurable properties of the linguistic environment. Distributional information (information abou ..."
Abstract

Cited by 21 (3 self)
 Add to MetaCart
A central concern of psycholinguistic research is explaining the relative ease or difficulty involved in processing words. In this thesis, we explore the connection between lexical processing effort and measurable properties of the linguistic environment. Distributional information (information about a word's contexts of use) is easily extracted from large language corpora in the form of cooccurrence statistics. We claim that such simple distributional statistics can form the basis of a parsimonious model of lexical processing effort.
Bayesian Statistics
 in WWW', Computing Science and Statistics
, 1989
"... ∗ Signatures are on file in the Graduate School. This dissertation presents two topics from opposite disciplines: one is from a parametric realm and the other is based on nonparametric methods. The first topic is a jackknife maximum likelihood approach to statistical model selection and the second o ..."
Abstract

Cited by 20 (0 self)
 Add to MetaCart
∗ Signatures are on file in the Graduate School. This dissertation presents two topics from opposite disciplines: one is from a parametric realm and the other is based on nonparametric methods. The first topic is a jackknife maximum likelihood approach to statistical model selection and the second one is a convex hull peeling depth approach to nonparametric massive multivariate data analysis. The second topic includes simulations and applications on massive astronomical data. First, we present a model selection criterion, minimizing the KullbackLeibler distance by using the jackknife method. Various model selection methods have been developed to choose a model of minimum KullbackLiebler distance to the true model, such as Akaike information criterion (AIC), Bayesian information criterion (BIC), Minimum description length (MDL), and Bootstrap information criterion. Likewise, the jackknife method chooses a model of minimum KullbackLeibler distance through bias reduction. This bias, which is inevitable in model