MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Block Sorting Text Compression - Final Report (1996) [16 citations — 3 self]

Abstract:

A recent development in text compression is a "block sorting" algorithm which permutes the input text according to a special sort procedure and then processes the permuted text with Move-to-Front and a final statistical compressor. The technique combines good speed with excellent compression performance. This report investigates the block sorting compression algorithm, in particular trying to understand its operation and limitations. Various approaches are investigated in an attempt to improve the compression with block sorting, most of which involve a hierarchy of coding models to allow fast adaptation to local contexts. The best technique involves a new "structured" coding model, especially designed for compressing data with skew symbol distributions. Block sorting compression is found to be related to work by Shannon in 1951 on the prediction of English text. The work confirms block-sorting as a good text compression technique, with a compression approaching that of the currently be...

Citations

593 TC: Managing Gigabytes: compressing and indexing documents and images – IA, Moffat, et al. - 1999
536 Text Compression – Bell, Cleary, et al. - 1990
517 Arithmetic coding for data compression – Witten, Neal, et al. - 1987
340 A block sorting lossless data compression algorithm – Burrows, Wheeler - 1994
252 Data compression using adaptive coding and partial string matching – Cleary, Witten - 1984
221 Prediction and entropy of printed english – Shannon - 1951
107 Arithmetic Coding Revisited – Moffat, Neal, et al. - 1995
92 Implementing the PPM Data Compression Scheme – Moffat - 1990
83 Unbounded Length Contexts for PPM – Cleary, Teahan, et al. - 1995
22 Block sorting text compression – Fenwick - 1996
11 private communication – Burrows
11 Arithmetic coding and statistical modelling – Nelson - 1991
10 A locally adaptive data compression algorithm – Bentley, Sleator, et al. - 1986
9 Improvements to the Block Sorting Text Compression Algorithm – FENWICK - 1995
6 Experiments with a Block-Sorting Text Compression Algorithm", The – Fenwick - 1995
5 The Structure of DMC – Bunton - 1995
3 private communication – Teahan
1 A New Technique for Self Organising List Searches", Computer Journal, pp 450--454 – Fenwick - 1991
1 private communication. (Oct '95) Tech Rep 130 Block Sorting Text Compression 23 – Wheeler - 1996