Results 1 - 10 of 3,284
Distance-based indexing for high-dimensional metric spaces
- In Proc. ACM SIGMOD International Conference on Management of Data, 1997
"... In many database applications, one of the common queries is to find approximate matches to a given query item from a collection of data items. For example, given an image database, one may want to retrieve all images that are similar to a given query image. Distance based index structures are propos ..."
Cited by 129 (3 self)
Measuring the difficulty of distance-based indexing
- In Proceedings of the 12th International Conference on String Processing and Information Retrieval, ser. SPIRE’05, 2005
"... Data structures for similarity search are commonly evaluated on data in vector spaces, but distance-based data structures are also applicable to non-vector spaces with no natural concept of dimensionality. The intrinsic dimensionality statistic of Chávez and Navarro provides a way to co ..."
Cited by 4 (1 self)
Multilevel distance-based index structure for multivariate time series
- in Proceedings of 12th International Symposium on Temporal Representation and Reasoning (TIME 2005), June 2005. © 2006 ACADEMY PUBLISHER
"... Multivariate time series (MTS) datasets are common in various multimedia, medical and financial applications. In previous work, we introduced a similarity measure for MTS datasets, termed Eros (Extended Frobenius norm), which is based on the Frobenius Norm and Principal Component Analysis (PCA). Ero ..."
Cited by 8 (0 self)
In this paper, we propose a distance-based index structure, Muse (Multilevel distance-based index structure for Eros), for efficient retrieval of MTS items using Eros. Muse constructs each level as a distance-based index structure without using the weights, up to z levels. At query time, Muse combines the z
Distance Based Indexing for String Proximity Search
- In ICDE, 2003
"... In many database applications involving string data, it is common to have near neighbor queries (asking for strings that are similar to a query string) or nearest neighbor queries (asking for strings that are most similar to a query string). The similarity between strings is defined in terms of a di ..."
Cited by 37 (0 self)
distance function determined by the application domain. The most popular string distance measures are based on (a weighted) count of (i) character edit or (ii) block edit operations to transform one string into the other. Examples include the Levenshtein edit distance and the recently introduced
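As a rough illustration of the character-edit measure named in the abstract above, the sketch below computes the unit-cost Levenshtein distance with the standard dynamic-programming recurrence. It is not taken from the paper: the function name `levenshtein` and the unit costs are assumptions made for this example, and the weighted and block-edit variants the abstract mentions are not covered.

```python
def levenshtein(a: str, b: str) -> int:
    """Unit-cost edit distance (insert, delete, substitute) between a and b."""
    if len(a) < len(b):
        a, b = b, a  # iterate rows over the longer string so each row is short

    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # transforming a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            substitution = previous[j - 1] + (0 if ca == cb else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]
```

With unit costs this function is a metric (non-negative, symmetric, and it satisfies the triangle inequality), which is the property distance-based index structures rely on.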
Analysis of Distance Based Indexing Methods for Similarity Search
In this paper we investigate data structures for performing similarity search queries in metric spaces. We have selected six methods proposed in three different communities: two theoretically analyzed structures of Clarkson, GNAT and SA-tree from databases, and AESA and LAESA which are used for pattern recognition applications. We have implemented all these methods and compared their efficiency in practice. We propose an improvement for the GNAT data structure that reduces its space requirement. We show this improvement does not degrade the query time by much.
DSIM: A Distance-based Indexing Method for Genomic Sequences
- In Proceedings of the IEEE International Conference on Bioinformatics and Bioengineering (BIBE), 2005
"... In this paper, we propose a Distance-based Sequence Indexing Method (DSIM) for indexing and searching genome databases. Borrowing the idea of video compression, we compress the genomic sequence database around a set of automatically selected reference words, formed from high-frequency data substrin ..."
Cited by 3 (0 self)
Case Study: Distance-Based Image Retrieval in the MoBIoS DBMS
- In The 5th International Conference on Computer and Information Technology (CIT), 2005
"... Similarity search leveraging distance-based index structures is increasingly being used for complex data types. It has been shown that for high dimensional uniform vectors with similarity norms, any clustering and partitioning index method is outperformed by sequential scan. However, intrinsic clust ..."
Cited by 1 (0 self)
On Optimizing Distance-Based Similarity Search for Biological Databases
- Stanford University, 2005
"... Similarity search leveraging distance-based index structures is increasingly being used for both multimedia and biological database applications. We consider distance-based indexing for three important biological data types, protein k-mers with the metric PAM model, DNA k-mers with Hamming distance ..."
Cited by 10 (4 self)
Index-driven similarity search in metric spaces
- ACM Transactions on Database Systems, 2003
"... Similarity search is a very important operation in multimedia databases and other database applications involving complex objects, and involves finding objects in a data set S similar to a query object q, based on some similarity measure. In this article, we focus on methods for similarity search th ..."
Cited by 192 (8 self)
that make the general assumption that similarity is represented with a distance metric d. Existing methods for handling similarity search in this setting typically fall into one of two classes. The first directly indexes the objects based on distances (distance-based indexing), while the second is based
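The first class described above, distance-based indexing, can be illustrated with a pivot table. This is a minimal sketch similar in spirit to pivot-based methods, not the article's own algorithm. It assumes d is a true metric, and the names `build_pivot_table` and `range_query` are hypothetical. By the triangle inequality, |d(q, p) - d(o, p)| is a lower bound on d(q, o) for any pivot p, so an object whose bound exceeds the query radius r can be discarded without computing d(q, o).

```python
from typing import Callable, List, Sequence, Tuple, TypeVar

T = TypeVar("T")
Metric = Callable[[T, T], float]


def build_pivot_table(objects: Sequence[T], pivots: Sequence[T], d: Metric) -> List[List[float]]:
    """Precompute d(o, p) for every object o and pivot p (done once, at index time)."""
    return [[d(o, p) for p in pivots] for o in objects]


def range_query(
    q: T,
    r: float,
    objects: Sequence[T],
    pivots: Sequence[T],
    table: List[List[float]],
    d: Metric,
) -> Tuple[List[T], int]:
    """Return all objects within distance r of q, plus the number of
    object distances that actually had to be computed."""
    q_to_pivots = [d(q, p) for p in pivots]
    hits: List[T] = []
    computed = 0
    for o, row in zip(objects, table):
        # Tightest lower bound on d(q, o) over all pivots.
        lower = max((abs(qp - op) for qp, op in zip(q_to_pivots, row)), default=0.0)
        if lower > r:
            continue  # pruned: o cannot be within r of q
        computed += 1
        if d(q, o) <= r:
            hits.append(o)
    return hits, computed
```

The trade-off in this sketch is space for time: the table stores one distance per object and pivot, and in exchange each query pays for one distance per pivot plus one for each object that survives the filter.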