Results 1  10
of
56
Generalized weighted Chinese restaurant processes for species sampling mixture models
 Statistica Sinica
, 2003
"... Abstract: The class of species sampling mixture models is introduced as an extension of semiparametric models based on the Dirichlet process to models based on the general class of species sampling priors, or equivalently the class of all exchangeable urn distributions. Using Fubini calculus in conj ..."
Abstract

Cited by 85 (10 self)
 Add to MetaCart
Abstract: The class of species sampling mixture models is introduced as an extension of semiparametric models based on the Dirichlet process to models based on the general class of species sampling priors, or equivalently the class of all exchangeable urn distributions. Using Fubini calculus in conjunction with Pitman (1995, 1996), we derive characterizations of the posterior distribution in terms of a posterior partition distribution that extend the results of Lo (1984) for the Dirichlet process. These results provide a better understanding of models and have both theoretical and practical applications. To facilitate the use of our models we generalize the work in Brunner, Chan, James and Lo (2001) by extending their weighted Chinese restaurant (WCR) Monte Carlo procedure, an i.i.d. sequential importance sampling (SIS) procedure for approximating posterior mean functionals based on the Dirichlet process, to the case of approximation of mean functionals and additionally their posterior laws in species sampling mixture models. We also discuss collapsed Gibbs sampling, Pólya urn Gibbs sampling and a Pólya urn SIS scheme. Our framework allows for numerous applications, including multiplicative counting process models subject to weighted gamma processes, as well as nonparametric and semiparametric hierarchical models based on the Dirichlet process, its twoparameter extension, the PitmanYor process and finite dimensional Dirichlet priors. Key words and phrases: Dirichlet process, exchangeable partition, finite dimensional Dirichlet prior, twoparameter PoissonDirichlet process, prediction rule, random probability measure, species sampling sequence.
Bayesian nonparametric estimator derived from conditional Gibbs structures. Annals of Applied Probability
 J. Phys. A: Math. Gen
, 2008
"... We consider discrete nonparametric priors which induce Gibbstype exchangeable random partitions and investigate their posterior behavior in detail. In particular, we deduce conditional distributions and the corresponding Bayesian nonparametric estimators, which can be readily exploited for predictin ..."
Abstract

Cited by 34 (8 self)
 Add to MetaCart
(Show Context)
We consider discrete nonparametric priors which induce Gibbstype exchangeable random partitions and investigate their posterior behavior in detail. In particular, we deduce conditional distributions and the corresponding Bayesian nonparametric estimators, which can be readily exploited for predicting various features of additional samples. The results provide useful tools for genomic applications where prediction of future outcomes is required. 1. Introduction. Random
PoissonKingman Partitions
 of Lecture NotesMonograph Series
, 2002
"... This paper presents some general formulas for random partitions of a finite set derived by Kingman's model of random sampling from an interval partition generated by subintervals whose lengths are the points of a Poisson point process. These lengths can be also interpreted as the jumps of a sub ..."
Abstract

Cited by 27 (3 self)
 Add to MetaCart
(Show Context)
This paper presents some general formulas for random partitions of a finite set derived by Kingman's model of random sampling from an interval partition generated by subintervals whose lengths are the points of a Poisson point process. These lengths can be also interpreted as the jumps of a subordinator, that is an increasing process with stationary independent increments. Examples include the twoparameter family of PoissonDirichlet models derived from the Poisson process of jumps of a stable subordinator. Applications are made to the random partition generated by the lengths of excursions of a Brownian motion or Brownian bridge conditioned on its local time at zero.
Bayesian modelbased clustering procedures
 Journal of Computational and Graphical Statistics
"... This article establishes a general formulation for Bayesian modelbased clustering, in which subset labels are exchangeable, and items are also exchangeable, possibly up to covariate effects. The notational framework is rich enough to encompass a variety of existing procedures, including some recent ..."
Abstract

Cited by 24 (2 self)
 Add to MetaCart
This article establishes a general formulation for Bayesian modelbased clustering, in which subset labels are exchangeable, and items are also exchangeable, possibly up to covariate effects. The notational framework is rich enough to encompass a variety of existing procedures, including some recently discussed methods involving stochastic search or hierarchical clustering, but more importantly allows the formulation of clustering procedures that are optimal with respect to a specified loss function. Our focus is on loss functions based on pairwise coincidences, that is, whether pairs of items are clustered into the same subset or not. Optimization of the posterior expected loss function can be formulated as a binary integer programming problem, which can be readily solved by standard software when clustering a modest number of items, but quickly becomes impractical as problem scale increases. To combat this, a new heuristic itemswapping algorithm is introduced. This performs well in our numerical experiments, on both simulated and real data examples. The article includes a comparison of the statistical performance of the (approximate) optimal clustering with earlier methods that are modelbased but ad hoc in their detailed definition.
MCMC for normalized random measure mixture models Statistical Science 28
, 2013
"... Abstract. This paper concerns the use of Markov chain Monte Carlo methods for posterior sampling in Bayesian nonparametric mixture models with normalized random measure priors. Making use of some recent posterior characterizations for the class of normalized random measures, we propose novel Markov ..."
Abstract

Cited by 20 (9 self)
 Add to MetaCart
(Show Context)
Abstract. This paper concerns the use of Markov chain Monte Carlo methods for posterior sampling in Bayesian nonparametric mixture models with normalized random measure priors. Making use of some recent posterior characterizations for the class of normalized random measures, we propose novel Markov chain Monte Carlo methods of both marginal type and conditional type. The proposed marginal samplers are generalizations of Neal’s wellregarded Algorithm 8 for Dirichlet process mixture models, whereas the conditional sampler is a variation of those recently introduced in the literature. For both the marginal and conditional methods, we consider as a running example a mixture model with an underlying normalized generalized Gamma process prior, and describe comparative simulation results demonstrating the efficacies of the proposed methods. Key words and phrases: Bayesian nonparametrics, hierarchical mixture model, completely random measure, normalized random measure, Dirichlet process, normalized generalized Gamma process, MCMC posterior sampling method, marginalized sampler, Algorithm 8, conditional sampler, slice sampling. 1.
Poisson calculus for spatial neutral to the right processes
, 2003
"... In this paper we consider classes of nonparametric priors on spaces of distribution functions and cumulative hazard measures that are based on extensions of the neutral to the right (NTR) concept. In particular, spatial neutral to the right processes that extend the NTR concept from priors on the cl ..."
Abstract

Cited by 16 (4 self)
 Add to MetaCart
(Show Context)
In this paper we consider classes of nonparametric priors on spaces of distribution functions and cumulative hazard measures that are based on extensions of the neutral to the right (NTR) concept. In particular, spatial neutral to the right processes that extend the NTR concept from priors on the class of distributions on the real line to classes of distributions on general spaces are discussed. Representations of the posterior distribution of the spatial NTR processes are given. A different type of calculus than traditionally employed in the Bayesian literature, based on Poisson process partition calculus methods described in James (2002), is provided which offers a streamlined proof of posterior results for NTR models and its spatial extension. The techniques are applied to progressively more complex models ranging from the complete data case to semiparametric multiplicative intensity models. Refinements are then given which describes the underlying properties of spatial NTR processes analogous to those developed for the Dirichlet process. The analysis yields accessible moment formulae and characterizations of the posterior distribution and relevant marginal distributions. An EPPF formula and additionally a distribution related to the risk and death sets is computed. In the homogeneous case, these distributions turn out to be connected and overlap with recent work on regenerative compositions defined by suitable discretisation of subordinators. The formulae we develop for the marginal distribution of spatial NTR models provide clues on how to sample posterior distributions in complex settings. In addition the spatial NTR is further extended to the mixture model setting which allows for applicability of such processes to much more complex data structures. A description of a species sampling model derived from a spatial NTR model is also given.
Linear and quadratic functionals of random hazard rates: an asymptotic analysis
 Ann. Appl. Probab
, 2008
"... A popular Bayesian nonparametric approach to survival analysis consists in modeling hazard rates as kernel mixtures driven by a completely random measure. In this paper we derive asymptotic results for linear and quadratic functionals of such random hazard rates. In particular, we prove central limi ..."
Abstract

Cited by 12 (9 self)
 Add to MetaCart
(Show Context)
A popular Bayesian nonparametric approach to survival analysis consists in modeling hazard rates as kernel mixtures driven by a completely random measure. In this paper we derive asymptotic results for linear and quadratic functionals of such random hazard rates. In particular, we prove central limit theorems for the cumulative hazard function and for the pathsecond moment and pathvariance of the hazard rate. Our techniques are based on recently established criteria for the weak convergence of single and double stochastic integrals with respect to Poisson random measures. The findings are illustrated by considering specific models involving kernels and random measures commonly exploited in practice. Our abstract results are of independent theoretical interest and can be applied to other areas dealing with Lévy moving average processes. The strictly Bayesian analysis is further explored in a companion paper, where our results are extended to accommodate posterior analysis. 1. Introduction. Survival
Distributions of functionals of the two parameter PoissonDirichlet process
, 2006
"... The present paper provides exact expressions for the probability distribution of linear functionals of the two–parameter Poisson–Dirichlet process PD(α, θ). Distributional results that follow from the application of an inversion formula for a (generalized) Cauchy– Stieltjes transform are achieved. ..."
Abstract

Cited by 11 (10 self)
 Add to MetaCart
(Show Context)
The present paper provides exact expressions for the probability distribution of linear functionals of the two–parameter Poisson–Dirichlet process PD(α, θ). Distributional results that follow from the application of an inversion formula for a (generalized) Cauchy– Stieltjes transform are achieved. Moreover, several interesting integral identities are obtained by exploiting a correspondence between the mean functional of a Poisson–Dirichlet process and the mean functional of a suitable Dirichlet process. Finally, some distributional characterizations in terms of mixture representations are illustrated. Our formulae are relevant to occupation time phenomena connected with Brownian motion and more general Bessel processes, as well as to models arising in Bayesian nonparametric statistics.