• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Latent dirichlet allocation (2003)

Cached

  • Download as a PDF

Download Links

  • [www.cs.colorado.edu]
  • [www-2.cs.cmu.edu]
  • [www.cs.berkeley.edu]
  • [www.cs.berkeley.edu]
  • [www.jmlr.org]
  • [www.cs.princeton.edu]
  • [jmlr.csail.mit.edu]
  • [faculty.cs.byu.edu]
  • [www.inf.ed.ac.uk]
  • [www.cs.ucsd.edu]
  • [www.cs.princeton.edu:80]
  • [www.ai.mit.edu]
  • [www.cs.utah.edu]
  • [www.cs.utah.edu]
  • [www.cs.berkeley.edu]
  • [www.robotics.stanford.edu]
  • [ai.stanford.edu]

  • Other Repositories/Bibliography

  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by David M. Blei , Andrew Y. Ng , Michael I. Jordan , John Lafferty
Venue:Journal of Machine Learning Research
Citations:1370 - 48 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@ARTICLE{Blei03latentdirichlet,
    author = {David M. Blei and Andrew Y. Ng and Michael I. Jordan and John Lafferty},
    title = {Latent dirichlet allocation},
    journal = {Journal of Machine Learning Research},
    year = {2003},
    volume = {3},
    pages = {2003}
}

Years of Citing Articles

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian model, in which each item of a collection is modeled as a finite mixture over an underlying set of topics. Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities. In the context of text modeling, the topic probabilities provide an explicit representation of a document. We present efficient approximate inference techniques based on variational methods and an EM algorithm for empirical Bayes parameter estimation. We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model. 1.

Citations

2168 Indexing by latent semantic analysis - Deerwester, Dumais, et al. - 1990
1927 Modern Information Retrieval - Baeza-Yates, Ribeiro-Neto - 1999
1086 Making large-scale SVM learning practical - Joachims - 1999
887 Bayesian Data Analysis - Gelman, Carlin, et al. - 2004
656 An Introduction to Variational Methods for Graphical Models - Jordan, Ghahramani, et al. - 1999
632 Text classification from labeled and unlabeled documents using - Nigram, McCallum, et al.
612 Statistical Methods for Speech Recognition - Jelinek - 1997
545 Probabilistic latent semantic indexing - Hofmann - 1999
469 editor. Learning in Graphical Models - Jordan - 1998
241 Modeling annotated data - Blei, Jordan - 2003
210 Latent semantic indexing: A probabilistic analysis - Papadimitriou, Tamaki, et al. - 1998
207 Using maximum entropy for text classification - Nigam - 1999
168 The first text retrieval conference - Harman - 1993
131 A variational Bayesian framework for graphical models - Attias - 2003
112 S.: Probabilistic models for unified collaborative and content-based recommendation in sparse-data environments - Popescul, Ungar, et al.
102 Estimating a Dirichlet distribution - Minka - 2003
84 Expectation-propagation for the generative aspect model - Minka, Lafferty - 2002
67 An experimental comparison of several clustering methods, Microsoft Research Report MSR-TR-98-06 - Meila, Heckerman
61 Parametric empirical Bayes inference: theory and applications (with discussion - Morris - 1983
58 Improving multiclass text classification with the support vector machine - Rennie, Rifkin - 2001
56 Approximate Bayesian Inference in Conditionally Independent Hierarchical Models (Parametric Empirical Bayes Models - Kass, Steffey - 1989
48 A probabilistic approach to semantic representation. Paper presented at the - Griffiths, Steyvers - 2004
24 Recent progress on de Finetti’s notions of exchangeability - Diaconis - 1988
7 Bayesian methods for censored categorical data - Dickey, Jiang, et al. - 1987
6 Finetti, Theory of Probability, Vols - de - 1974
5 Exchangeability and related topics. École d’ Été de Probabilités de Saint-Flour XIII - Aldous - 1985
2 Caenorrhabditis genetic center bibliography - Avery - 2002
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University