## Asymptotic Behavior of the Lempel-Ziv Parsing Scheme and Digital Search Trees (1995)

Venue: | Theoretical Computer Science |

Citations: | 64 - 30 self |

### BibTeX

@INPROCEEDINGS{Jacquet95asymptoticbehavior,

author = {Philippe Jacquet and Wojciech Szpankowski},

title = {Asymptotic Behavior of the Lempel-Ziv Parsing Scheme and Digital Search Trees},

booktitle = {Theoretical Computer Science},

year = {1995},

pages = {161--197},

publisher = {}

}

### Years of Citing Articles

### OpenURL

### Abstract

The Lempel-Ziv parsing scheme finds a wide range of applications, most notably in data compression and algorithms on words. It partitions a sequence of length n into variable phrases such that a new phrase is the shortest substring not seen in the past as a phrase. The parameter of interest is the number M n of phrases that one can construct from a sequence of length n. In this paper, for the memoryless source with unequal probabilities of symbols generation we derive the limiting distribution of M n which turns out to be normal. This proves a long standing open problem. In fact, to obtain this result we solved another open problem, namely, that of establishing the limiting distribution of the internal path length in a digital search tree. The latter is a consequence of an asymptotic solution of a multiplicative differential-functional equation often arising in the analysis of algorithms on words. Interestingly enough, our findings are proved by a combination of probabilistic techniques such as renewal equation and uniform integrability, and analytical techniques such as Mellin transform, differential-functional equations, de-Poissonization, and so forth. In concluding remarks we indicate a possibility of extending our results to Markovian models.