• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Improvements to the Sequence Memoizer

Cached

  • Download as a PDF

Download Links

  • [www.cs.berkeley.edu]
  • [www.gatsby.ucl.ac.uk]
  • [www.eecs.berkeley.edu]
  • [books.nips.cc]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Yee Whye Teh
Citations:2 - 2 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Teh_improvementsto,
    author = {Yee Whye Teh},
    title = {Improvements to the Sequence Memoizer},
    year = {}
}

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

The sequence memoizer is a model for sequence data with state-of-the-art performance on language modeling and compression. We propose a number of improvements to the model and inference algorithm, including an enlarged range of hyperparameters, a memory-efficient representation, and inference algorithms operating on the new representation. Our derivations are based on precise definitions of the various processes that will also allow us to provide an elementary proof of the “mysterious ” coagulation and fragmentation properties used in the original paper on the sequence memoizer by Wood et al. (2009). We present some experimental results supporting our improvements. 1

Citations

328 Hierarchical Dirichlet processes - Teh, Jordan, et al. - 2006
162 The two-parameter PoissonDirichlet distribution derived from a stable subordinator. Annals of Probability - Pitman, Yor - 1997
160 Gibbs sampling methods for stick–breaking priors - Ishwaran, James - 2001
81 A neural probabilistic language model - Bengio, Ducharme, et al.
71 Coalescents with multiple collisions - Pitman - 1999
48 A hierarchical Bayesian language model based on Pitman-Yor processes - Teh - 2006
15 A unified approach to generalized Stirling numbers - Hsu, Shiue - 1998
8 A Bayesian interpretation of interpolated KneserNey - Teh - 2006
7 A stochastic memoizer for sequence data - Wood, Archambeau, et al. - 2009
7 Lossless compression based on the sequence memoizer - Gasthaus, Wood, et al. - 2010
3 Coagulation fragmentation laws induced by general coagulations of two-parameter Poisson-Dirichlet processes - Ho, James, et al. - 2006
1 A note on the implementation of hierarchical Dirichlet processes - Blunsom, Cohn, et al.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University