• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Topic-Based Language Models Using EM (1999)

Cached

  • Download as a PDF
  •  
  • Download as a PS

Download Links

  • [www.icsi.berkeley.edu]
  • [www.cs.brown.edu]
  • [www.cs.brown.edu]
  • [ftp.icsi.berkeley.edu]
  • [http.icsi.berkeley.edu]
  • [ftp.icsi.berkeley.edu]
  • [www.icsi.berkeley.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Daniel Gildea , Thomas Hofmann
Venue:IN PROCEEDINGS OF EUROSPEECH
Citations:35 - 1 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Gildea99topic-basedlanguage,
    author = {Daniel Gildea and Thomas Hofmann},
    title = {Topic-Based Language Models Using EM},
    booktitle = {IN PROCEEDINGS OF EUROSPEECH},
    year = {1999},
    pages = {2167--2170},
    publisher = {}
}

Years of Citing Articles

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

In this paper, we propose a novel statistical language model to capture topic-related long-range dependencies. Topics are modeled in a latent variable framework in which we also derive an EM algorithm to perform a topic factor decomposition based on a segmented training corpus. The topic model is combined with a standard language model to be used for on-line word prediction. Perplexity results indicate an improvement over previously proposed topic models, which unfortunately has not translated into lower word error.

Citations

6232 Maximum likelihood from incomplete data via the EM algorithm - Dempster, Laird, et al. - 1977
2168 Indexing by latent semantic analysis - Deerwester, Dumais, et al. - 1990
612 A view of the EM algorithm that justifies incremental, sparse, and other variants - Neal, Hinton - 1998
545 Probabilistic latent semantic indexing - Hofmann - 1999
375 Probabilistic latent semantic analysis - Hofmann - 1999
355 Generalized Iterative Scaling for Log-Linear Models - Darroch, Ratcliff - 1972
201 A maximum entropy approach to adaptive statistical language modelling - Rosenfeld - 1996
106 Exploiting syntactic structure for language modeling - Chelba, Jelinek - 1998
77 Modeling long distance dependence in language: Topic mixtures vs. dynamic cache models - Iyer, Ostendorf - 1996
67 Language model adaptation using mixtures and an exponentially decaying cache - Clarkson, Robinson - 1997
37 Towards better integration of semantic predictors in statistical language modeling - Coccaro, Jurafsky - 1998
26 Using a stochastic context-free grammar as a language model for speech recognition - JURAFKSY, WOOTERS, et al. - 1995
16 A cache based natural language model for speech recognition - Kuhn, Mori - 1992
15 Beyond word n-grams - Pereira, Singer - 1995
14 The SPRACH system for the transcription of broadcast news - Cook, Christie, et al. - 1999
13 A latent semantic analysis framework for large-span language modeling - Bellegarda - 1997
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University