Results 1 - 10
of
51
Principal Curves Revisited
- Statistics and Computing
, 1992
"... A principal curve (Hastie and Stuetzle, 1989) is a smooth curve passing through the "middle" of a distribution or data cloud, and is a generalization of linear principal components. We give an alternative definition of a principal curve, based on a mixture model. Estimation is carried out through an ..."
Abstract
-
Cited by 46 (0 self)
- Add to MetaCart
A principal curve (Hastie and Stuetzle, 1989) is a smooth curve passing through the "middle" of a distribution or data cloud, and is a generalization of linear principal components. We give an alternative definition of a principal curve, based on a mixture model. Estimation is carried out through an EM algorithm. Some comparisons are made to the Hastie- Stuetzle definition.
Efficient greedy learning of Gaussian mixture models
- Neural Computation
, 2003
"... This paper concerns the greedy learning of Gaussian mixtures. In the greedy approach, mixture components are inserted into the mixture one after the other. We propose a heuristic for searching for the optimal component to insert. In a randomized manner a set of candidate new components is generated. ..."
Abstract
-
Cited by 40 (7 self)
- Add to MetaCart
This paper concerns the greedy learning of Gaussian mixtures. In the greedy approach, mixture components are inserted into the mixture one after the other. We propose a heuristic for searching for the optimal component to insert. In a randomized manner a set of candidate new components is generated. For each of these candidates we find the locally optimal new component. The best local optimum is then inserted into the existing mixture. The resulting algorithm resolves the sensitivity to initialization of state-of-the-art methods, like EM, and has running time linear in the number of data points and quadratic in the (final) number of mixture components. Due to its greedy nature the algorithm can be particularly useful when the optimal number of mixture components is unknown. Experimental results comparing the proposed algorithm to other methods on density estimation and texture segmentation are provided.
A greedy EM algorithm for Gaussian mixture learning
- Neural Processing Letters
, 2000
"... Learning a Gaussian mixture with a local algorithm like EM can be difficult because (i) the true number of mixing components is usually unknown, (ii) there is no generally accepted method for parameter initialization, and (iii) the algorithm can get stuck in one of the many local maxima of the likel ..."
Abstract
-
Cited by 29 (9 self)
- Add to MetaCart
Learning a Gaussian mixture with a local algorithm like EM can be difficult because (i) the true number of mixing components is usually unknown, (ii) there is no generally accepted method for parameter initialization, and (iii) the algorithm can get stuck in one of the many local maxima of the likelihood function. In this paper we propose a greedy algorithm for learning a Gaussian mixture which tries to overcome these limitations. In particular, starting with a single component and adding components sequentially until a maximum number $k$, the algorithm is capable of achieving solutions superior to EM with $k$ components in terms of the likelihood of a test set. The algorithm is based on recent theoretical results on incremental mixture density estimation, and uses a combination of global and local search each time a new component is added to the mixture.
Managing uncertainty in call centers using Poisson mixtures
- Applied Stochastic Models in Business and Industry
, 2001
"... We model a call center as a queueing model with Poisson arrivals having an unknown varying arrival rate. We show how to compute prediction intervals for the arrival rate, and use the Erlang formula for the waiting time to compute the consequences for the occupancy level of the call center. We compar ..."
Abstract
-
Cited by 25 (4 self)
- Add to MetaCart
We model a call center as a queueing model with Poisson arrivals having an unknown varying arrival rate. We show how to compute prediction intervals for the arrival rate, and use the Erlang formula for the waiting time to compute the consequences for the occupancy level of the call center. We compare it to the current practice of using a point estimate of the arrival rate (assumed constant) as forecast.
The support reduction algorithm for computing nonparametric function estimates in mixture models
, 2003
"... ABSTRACT. In this paper, we study an algorithm (which we call the support reduction algorithm) that can be used to compute non-parametric M-estimators in mixture models. The algorithm is compared with natural competitors in the context of convex regression and the ‘Aspect problem ’ in quantum physic ..."
Abstract
-
Cited by 15 (6 self)
- Add to MetaCart
ABSTRACT. In this paper, we study an algorithm (which we call the support reduction algorithm) that can be used to compute non-parametric M-estimators in mixture models. The algorithm is compared with natural competitors in the context of convex regression and the ‘Aspect problem ’ in quantum physics.
Bayesian Model Selection in Finite Mixtures by Marginal Density Decompositions
- JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
, 2001
"... ..."
Does unemployment compensation affect unemployment duration
- The Economic Journal
, 2003
"... We use a flexible hazard rate model with unrestricted spell duration and calendar time effects to analyse a dataset including all Norwegian unemployment spells during the 1990’s. The dataset provides a unique access to conditionally independent variation in unemployment compensation. We find that a ..."
Abstract
-
Cited by 8 (3 self)
- Add to MetaCart
We use a flexible hazard rate model with unrestricted spell duration and calendar time effects to analyse a dataset including all Norwegian unemployment spells during the 1990’s. The dataset provides a unique access to conditionally independent variation in unemployment compensation. We find that a marginal increase in compensation reduces the escape rate from unemployment significantly, irrespective of business cycle conditions and spell duration. The escape rate rises sharply in the months just prior to benefit exhaustion. While men are more responsive than women with respect to marginal changes in compensation, women are most responsive with respect to benefit exhaustion.
Semiparametric estimation of a two-component mixture model
- Annals of Statistics
, 2006
"... Suppose that univariate data are drawn from a mixture of two distributions that are equal up to a shift parameter. Such a model is known to be nonidentifiable from a nonparametric viewpoint. However, if we assume that the unknown mixed distribution is symmetric, we obtain the identifiability of this ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
Suppose that univariate data are drawn from a mixture of two distributions that are equal up to a shift parameter. Such a model is known to be nonidentifiable from a nonparametric viewpoint. However, if we assume that the unknown mixed distribution is symmetric, we obtain the identifiability of this model, which is then defined by four unknown parameters: the mixing proportion, two location parameters and the cumulative distribution function of the symmetric mixed distribution. We propose estimators for these four parameters when no training data is available. Our estimators are shown to be strongly consistent under mild regularity assumptions and their convergence rates are studied. Their finite-sample properties are illustrated by a Monte Carlo study and our method is applied to real data.
Semi-parametric exponential family PCA
- Advances in Neural Information Processing 17 (NIPS
, 2004
"... We present a semi-parametric latent variable model based technique for density modelling, dimensionality reduction and visualization. Unlike previous methods, we estimate the latent distribution non-parametrically which enables us to model data generated by an underlying low dimensional, multimodal ..."
Abstract
-
Cited by 7 (1 self)
- Add to MetaCart
We present a semi-parametric latent variable model based technique for density modelling, dimensionality reduction and visualization. Unlike previous methods, we estimate the latent distribution non-parametrically which enables us to model data generated by an underlying low dimensional, multimodal distribution. In addition, we allow the components of latent variable models to be drawn from the exponential family which makes the method suitable for special data types, for example binary or count data. Simulations on real valued, binary and count data show favorable comparison to other related schemes both in terms of separating different populations and generalization to unseen samples. 1

