#### DMCA

## Searching BWT compressed text with the Boyer-Moore algorithm and binary search (2002)

### Cached

### Download Links

- [corpus.canterbury.ac.nz]
- [www.corpus.canterbury.ac.nz]
- DBLP

### Other Repositories/Bibliography

Venue: | Proceedings, IEEE Data Compression Conference, 2002 |

Citations: | 13 - 7 self |

### Citations

1522 | A Universal Algorithm for Sequential Data Compression",
- Ziv, Lempel
- 1977
(Show Context)
Citation Context ...jeroh & Mukherjee (2001) outlines several techniques for online compressed-domain pattern matching in both text and images. Many of these techniques are based on the LZ family of compression systems (=-=Ziv & Lempel 1977-=-, Ziv & Lempel 1978), but others include methods for Huffman-coded text and run-length encoding. Little work has been done with the Burrows-Wheeler transform, although some research has been undertake... |

825 | Fast pattern matching in strings. - Knuth, Morris, et al. - 1977 |

809 | D.J.: A BlockSorting Lossless Data Compression Algorithm. Digital Systems Research Center, Research Report 124,
- Burrows, Wheeler
- 1994
(Show Context)
Citation Context ...eries, and binary search is much faster even for large numbers of queries. 1 Introduction This paper investigates on-line exact pattern matching in files compressed with the BurrowsWheeler transform (=-=Burrows & Wheeler 1994-=-). By ‘on-line’ pattern matching, we refer to methods that do not require a pre-computed index—all the work of pattern matching is done at query time. They are particularly suitable for texts that are... |

761 |
A Fast String Searching Algorithm,”
- Boyer, S
- 1994
(Show Context)
Citation Context ...ues for on-line exact pattern matching in files that have been compressed using the Burrows-Wheeler transform. We investigate two approaches. The first is an application of the Boyer-Moore algorithm (=-=Boyer & Moore 1977-=-) to a transformed string. The second approach is based on the observation that the transform effectively contains a sorted list of all substrings of the original text, which can be exploited for very... |

296 | Opportunistic data structures with applications.
- Ferragina, Manzini
- 2000
(Show Context)
Citation Context ...thods for Huffman-coded text and run-length encoding. Little work has been done with the Burrows-Wheeler transform, although some research has been undertaken in the area of offline pattern matching (=-=Ferragina & Manzini 2000-=-, Ferragina & Manzini 2001, Sadakane & Imai 1999, Sadakane 2000). Throughout this paper we will refer to the pattern matching problem in terms of searching for a pattern P of length m in a text T of l... |

173 | A locally adaptive data compression scheme,”
- Bentley, Sleator, et al.
- 1986
(Show Context)
Citation Context ... compression program, bsmp, was developed. bsmp uses a four-stage compression system: 1. a Burrows-Wheeler transform, with the block size set to the size of the entire file, 2. a move-to-front coder (=-=Bentley et al. 1986-=-), which takes advantage of the high level of local repetition in the BWT output, 3. a run-length coder, to remove the long sequences of zeroes in the MTF output, and 4. an order-0 arithmetic coder. N... |

13 | Pattern matching in compressed texts and images - Adjeroh, Bell, et al. - 2013 |

13 | Processing Truncated Terms in Document Retrieval Systems', Information Processing and Management - Choueka - 1982 |

9 | A cooperative distributed text database management method unifying search and compression based on
- Sadakane, Imai
- 1999
(Show Context)
Citation Context .... Little work has been done with the Burrows-Wheeler transform, although some research has been undertaken in the area of offline pattern matching (Ferragina & Manzini 2000, Ferragina & Manzini 2001, =-=Sadakane & Imai 1999-=-, Sadakane 2000). Throughout this paper we will refer to the pattern matching problem in terms of searching for a pattern P of length m in a text T of length n. The input alphabet will be referred to ... |

7 | Block sorting text compression–final report - Fenwick - 1996 |

2 |
An experimental study of a compressed index. Part of this work appeared
- Ferragina
- 2001
(Show Context)
Citation Context ...xt and run-length encoding. Little work has been done with the Burrows-Wheeler transform, although some research has been undertaken in the area of offline pattern matching (Ferragina & Manzini 2000, =-=Ferragina & Manzini 2001-=-, Sadakane & Imai 1999, Sadakane 2000). Throughout this paper we will refer to the pattern matching problem in terms of searching for a pattern P of length m in a text T of length n. The input alphabe... |

2 |
Unifying Text Search and Compression—Suffix Sorting, Block Sorting and Suffix Arrays
- Sadakane
- 2000
(Show Context)
Citation Context ... done with the Burrows-Wheeler transform, although some research has been undertaken in the area of offline pattern matching (Ferragina & Manzini 2000, Ferragina & Manzini 2001, Sadakane & Imai 1999, =-=Sadakane 2000-=-). Throughout this paper we will refer to the pattern matching problem in terms of searching for a pattern P of length m in a text T of length n. The input alphabet will be referred to as Σ; similarly... |

2 | Managing Gigabytes, second edn - Witten, Moffatt - 1999 |