Results 1 - 10
of
44
The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator.
, 1995
"... The two-parameter Poisson-Dirichlet distribution, denoted pd(ff; `), is a distribution on the set of decreasing positive sequences with sum 1. The usual Poisson-Dirichlet distribution with a single parameter `, introduced by Kingman, is pd(0; `). Known properties of pd(0; `), including the Markov ..."
Abstract
-
Cited by 162 (36 self)
- Add to MetaCart
The two-parameter Poisson-Dirichlet distribution, denoted pd(ff; `), is a distribution on the set of decreasing positive sequences with sum 1. The usual Poisson-Dirichlet distribution with a single parameter `, introduced by Kingman, is pd(0; `). Known properties of pd(0; `), including the Markov chain description due to Vershik-Shmidt-Ignatov, are generalized to the two-parameter case. The size-biased random permutation of pd(ff; `) is a simple residual allocation model proposed by Engen in the context of species diversity, and rediscovered by Perman and the authors in the study of excursions of Brownian motion and Bessel processes. For 0 ! ff ! 1, pd(ff; 0) is the asymptotic distribution of ranked lengths of excursions of a Markov chain away from a state whose recurrence time distribution is in the domain of attraction of a stable law of index ff. Formulae in this case trace back to work of Darling, Lamperti and Wendel in the 1950's and 60's. The distribution of ranked lengths of e...
Generalized weighted Chinese restaurant processes for species sampling mixture models
- Statistica Sinica
, 2003
"... Abstract: The class of species sampling mixture models is introduced as an extension of semiparametric models based on the Dirichlet process to models based on the general class of species sampling priors, or equivalently the class of all exchangeable urn distributions. Using Fubini calculus in conj ..."
Abstract
-
Cited by 36 (8 self)
- Add to MetaCart
Abstract: The class of species sampling mixture models is introduced as an extension of semiparametric models based on the Dirichlet process to models based on the general class of species sampling priors, or equivalently the class of all exchangeable urn distributions. Using Fubini calculus in conjunction with Pitman (1995, 1996), we derive characterizations of the posterior distribution in terms of a posterior partition distribution that extend the results of Lo (1984) for the Dirichlet process. These results provide a better understanding of models and have both theoretical and practical applications. To facilitate the use of our models we generalize the work in Brunner, Chan, James and Lo (2001) by extending their weighted Chinese restaurant (WCR) Monte Carlo procedure, an i.i.d. sequential importance sampling (SIS) procedure for approximating posterior mean functionals based on the Dirichlet process, to the case of approximation of mean functionals and additionally their posterior laws in species sampling mixture models. We also discuss collapsed Gibbs sampling, Pólya urn Gibbs sampling and a Pólya urn SIS scheme. Our framework allows for numerous applications, including multiplicative counting process models subject to weighted gamma processes, as well as nonparametric and semiparametric hierarchical models based on the Dirichlet process, its two-parameter extension, the Pitman-Yor process and finite dimensional Dirichlet priors. Key words and phrases: Dirichlet process, exchangeable partition, finite dimensional Dirichlet prior, two-parameter Poisson-Dirichlet process, prediction rule, random probability measure, species sampling sequence.
Variational methods for the Dirichlet process
- In Proceedings of the 21st International Conference on Machine Learning
, 2004
"... Variational inference methods, including mean field methods and loopy belief propagation, have been widely used for approximate probabilistic inference in graphical models. While often less accurate than MCMC, variational methods provide a fast deterministic approximation to marginal and conditional ..."
Abstract
-
Cited by 33 (4 self)
- Add to MetaCart
Variational inference methods, including mean field methods and loopy belief propagation, have been widely used for approximate probabilistic inference in graphical models. While often less accurate than MCMC, variational methods provide a fast deterministic approximation to marginal and conditional probabilities. Such approximations can be particularly useful in high dimensional problems where sampling methods are too slow to be effective. A limitation of current methods, however, is that they are restricted to parametric probabilistic models. MCMC does not have such a limitation; indeed, MCMC samplers have been developed for the Dirichlet process (DP), a nonparametric distribution on distributions (Ferguson, 1973) that is the cornerstone of Bayesian nonparametric statistics (Escobar & West, 1995; Neal, 2000). In this paper, we develop a meanfield variational approach to approximate inference for the Dirichlet process, where the approximate posterior is based on the truncated stick-breaking construction (Ishwaran & James, 2001). We compare our approach to DP samplers for Gaussian DP mixture models. 1.
Probabilistic bounds on the coefficients of polynomials with only real zeros
- J. Combin. Theory Ser. A
, 1997
"... The work of Harper and subsequent authors has shown that nite sequences (a 0;;an) arising from combinatorial problems are often such that the polynomial A(z): = P n k=0 akz k has only real zeros. Basic examples include rows from the arrays of binomial coe cients, Stir-ling numbers of the rst and sec ..."
Abstract
-
Cited by 17 (0 self)
- Add to MetaCart
The work of Harper and subsequent authors has shown that nite sequences (a 0;;an) arising from combinatorial problems are often such that the polynomial A(z): = P n k=0 akz k has only real zeros. Basic examples include rows from the arrays of binomial coe cients, Stir-ling numbers of the rst and second kinds, and Eulerian numbers. Assuming the ak are non-negative, A(1)> 0 and that A(z) is not constant, it is known that A(z) has only real zeros i the normalized sequence (a 0=A(1);;an=A(1)) is the probability distribution of the Research supported in part by N.S.F. Grant MCS9404345 1 number of successes in n independent trials for some sequence of suc-cess probabilities. Such sequences (a 0;;an) are also known to be characterized by total positivity of the in nite matrix (ai,j) indexed by non-negative integers i and j. This papers reviews inequalities and approximations for such sequences, called Polya frequency sequences which follow from their probabilistic representation. In combinatorial examples these inequalities yield a number of improvements of known estimates.
Random Walks on Trees and Matchings
- Electron. J. Probab
, 2002
"... We give sharp rates of convergence for a natural Markov chain on the space of phylogenetic trees and dually for the natural random walk on the set of perfect matchings in the complete graph on 2n vertices. Roughly, the results show that n log n steps are necessary and su#ce to achieve randomness. ..."
Abstract
-
Cited by 15 (2 self)
- Add to MetaCart
We give sharp rates of convergence for a natural Markov chain on the space of phylogenetic trees and dually for the natural random walk on the set of perfect matchings in the complete graph on 2n vertices. Roughly, the results show that n log n steps are necessary and su#ce to achieve randomness. The proof depends on the representation theory of the symmetric group and a bijection between trees and matchings.
Gibbs distributions for random partitions generated by a fragmentation process
, 2006
"... process ..."
Beta-coalescents and continuous stable random trees
, 2006
"... Coalescents with multiple collisions, also known as Λ-coalescents, were introduced by Pitman and Sagitov in 1999. These processes describe the evolution of particles that undergo stochastic coagulation in such a way that several blocks can merge at the same time to form a single block. In the case t ..."
Abstract
-
Cited by 11 (5 self)
- Add to MetaCart
Coalescents with multiple collisions, also known as Λ-coalescents, were introduced by Pitman and Sagitov in 1999. These processes describe the evolution of particles that undergo stochastic coagulation in such a way that several blocks can merge at the same time to form a single block. In the case that the measure Λ is the Beta(2 − α, α) distribution, they are also known to describe the genealogies of large populations where a single individual can produce a large number of offspring. Here we use a recent result of Birkner et al. to prove that Beta-coalescents can be embedded in continuous stable random trees, about which much is known due to recent progress of Duquesne and Le Gall. Our proof is based on a construction of the Donnelly-Kurtz lookdown process using continuous random trees which is of independent interest. This produces a number of results concerning the small-time behavior of Beta-coalescents. Most notably, we recover an almost sure limit theorem of the authors for the number of blocks at small times, and give the multifractal spectrum corresponding to the emergence of blocks with atypical size. Also, we are able to find exact asymptotics for sampling formulae corresponding to the site frequency spectrum and allele frequency spectrum associated with mutations in the context of population genetics.
Asymptotics of Poisson approximation to random discrete distributions: an analytic approach
- Advances in Applied Probability
, 1998
"... this paper, we shall describe the asymptotic behaviors of several distances of Poisson approximation to a wide class of discrete distributions covering many examples from number theory, combinatorics and arithmetic semigroups. Our aim is to show that whenever (analytic) generating functions of the r ..."
Abstract
-
Cited by 9 (9 self)
- Add to MetaCart
this paper, we shall describe the asymptotic behaviors of several distances of Poisson approximation to a wide class of discrete distributions covering many examples from number theory, combinatorics and arithmetic semigroups. Our aim is to show that whenever (analytic) generating functions of the random variables in question are available, complex-analytic methods can be used to derive precise asymptotic results for the five distances above. Actually, we shall consider the following generalized distances: let ff ? 0 be a fixed positive number, (X; Y ) = FM (X; Y ) = (X; Y ) = sup K (X; Y ) = sup M (X; Y ) = jP(X = j) \Gamma P(Y = j) Note that d TV = d M . Besides the case ff = 1 (and ff = 1=2 for d M ), only the case d TV was previously studied by Franken [39] for Poisson approximation to the sum of independent but not identically distributed Bernoulli random variables. We take these quantities as our measures of degree of nearness of Poisson approximation, some of which may be interpreted as certain norms in suitable space as many authors did (cf. [12, 22, 23, 74, 96]). For a large class of discrete distributions, we shall derive an asymptotic main term together with an error estimate for each of these distances. Our results are thus "approximation theorems" rather than "limit theorems". The common form of the underlying structure of these distributions suggests the study of an analytic scheme as we did previously for normal approximation and large deviations (cf. [53, 54]). Many concrete examples from probabilistic number theory and combinatorial structures will justify the study of this scheme. Our treatment being completely general, many extensions can be further pursued with essentially the same line of methods. We shall di...
Exchangeable partitions derived from Markovian coalescents
- Adv. Appl. Probab
, 2006
"... Kingman derived the Ewens sampling formula for random partitions from the genealogy model defined by a Poisson process of mutations along lines of descent governed by a simple coalescent process. Möhle described the recursion which determines the generalization of the Ewens sampling formula when the ..."
Abstract
-
Cited by 8 (2 self)
- Add to MetaCart
Kingman derived the Ewens sampling formula for random partitions from the genealogy model defined by a Poisson process of mutations along lines of descent governed by a simple coalescent process. Möhle described the recursion which determines the generalization of the Ewens sampling formula when the lines of descent are governed by a coalescent with multiple collisions. In [7] authors exploit an analogy with the theory of regenerative composition and partition structures, and provide various characterizations of the associated exchangeable random partitions. This paper gives parallel results for the further generalized model with lines of descent following a coalescent with simultaneous multiple collisions. 1

