• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

DMCA

Two space saving tricks for linear time LCP computation (2004)

Cached

  • Download as a PDF

Download Links

  • [people.unipmn.it]
  • [www.mfn.unipmn.it]
  • [www.mfn.unipmn.it]
  • [people.unipmn.it]
  • [people.unipmn.it]
  • [people.unipmn.it]
  • [people.unipmn.it]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Giovanni Manzini
Citations:44 - 2 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@TECHREPORT{Manzini04twospace,
    author = {Giovanni Manzini},
    title = {Two space saving tricks for linear time LCP computation},
    institution = {},
    year = {2004}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

Abstract. In this paper we consider the linear time algorithm of Kasai et al. [6] for the computation of the Longest Common Prefix (LCP) array given the text and the suffix array. We show that this algorithm can be implemented without any auxiliary array in addition to the ones required for the input (the text and the suffix array) and the output (the LCP array). Thus, for a text of length n, we reduce the space occupancy of this algorithm from 13n bytes to 9n bytes. We also consider the problem of computing the LCP array by “overwriting” the suffix array. For this problem we propose an algorithm whose space occupancy can be bounded in terms of the empirical entropy of the input text. Experiments show that for linguistic texts our algorithm uses roughly 7n bytes. Our algorithm makes use of the Burrows-Wheeler Transform even if it does not represent any data in compressed form. To our knowledge this is the first application of the Burrows-Wheeler Transform outside the domain of data compression. The source code for the algorithms described in this paper has been included in the lightweight suffix sorting package [13] which is freely available under the GNU GPL. 1

Keyphrases

suffix array    linear time lcp computation    lcp array    burrows-wheeler transform    space occupancy    gnu gpl    source code    longest common prefix    linear time algorithm    data compression    empirical entropy    linguistic text    lightweight suffix    compressed form    input text    first application    auxiliary array   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University