@MISC{Krishna01sphinx:schema-conscious, author = {Leela Krishna and Poola Jayant and R. Haritsa}, title = {SphinX: Schema-conscious XML Indexing by}, year = {2001} }
Share
OpenURL
Abstract
User queries on XML documents are typically expressed as regular path expres-sions. A variety of novel indexing techniques for efficiently retrieving the results to such queries have been proposed in the recent literature. While these techniques are generic to documents that are completely schema-less, in practice XML documents of-ten adhere to a DTD-specified schema. In this paper, we propose SphinX, a new XML indexing scheme that utilizes the DTD to significantly speed up the search process. SphinX implements a persistent index structure that seamlessly combines the schema information with standard B-tree technology, resulting in a simple and scalable solu-tion. A performance evaluation over a variety of XML documents, including the Xmark benchmark, indicates significant benefits over previous approaches with regard to both index construction and index access. 1 1