The String B-Tree: A New Data Structure for String Search in External Memory and its Applications. (1998)
| Venue: | Journal of the ACM |
| Citations: | 110 - 11 self |
BibTeX
@ARTICLE{Ferragina98thestring,
author = {Paolo Ferragina and Roberto Grossi},
title = {The String B-Tree: A New Data Structure for String Search in External Memory and its Applications.},
journal = {Journal of the ACM},
year = {1998},
volume = {46},
pages = {236--280}
}
Years of Citing Articles
OpenURL
Abstract
We introduce a new text-indexing data structure, the String B-Tree, that can be seen as a link between some traditional external-memory and string-matching data structures. In a short phrase, it is a combination of B-trees and Patricia tries for internal-node indices that is made more effective by adding extra pointers to speed up search and update operations. Consequently, the String B-Tree overcomes the theoretical limitations of inverted files, B-trees, prefix B-trees, suffix arrays, compacted tries and suffix trees. String B-trees have the same worst-case performance as B-trees but they manage unbounded-length strings and perform much more powerful search operations such as the ones supported by suffix trees. String B-trees are also effective in main memory (RAM model) because they improve the online suffix tree search on a dynamic set of strings. They also can be successfully applied to database indexing and software duplication.







