• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Similarity search in high dimensions via hashing (1999)

Cached

  • Download as a PDF

Download Links

  • [www.cs.ust.hk]
  • [www.cs.princeton.edu]
  • [theory.stanford.edu]
  • [theory.stanford.edu]
  • [theory.stanford.edu]
  • [www.cs.princeton.edu]
  • [nichol.as]
  • [www.cs.princeton.edu]
  • [www.wisdom.weizmann.ac.il]

  • Other Repositories/Bibliography

  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Aristides Gionis , Piotr Indyk , Rajeev Motwani
Citations:275 - 11 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Gionis99similaritysearch,
    author = {Aristides Gionis and Piotr Indyk and Rajeev Motwani},
    title = {Similarity search in high dimensions via hashing},
    booktitle = {},
    year = {1999},
    pages = {518--529}
}

Years of Citing Articles

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

The nearest- or near-neighbor query problems arise in a large variety of database applications, usually in the context of similarity searching. Of late, there has been increasing interest in building search/index structures for performing similarity search over high-dimensional data, e.g., image databases, document collections, time-series databases, and genome databases. Unfortunately, all known techniques for solving this problem fall prey to the \curse of dimensionality. " That is, the data structures scale poorly with data dimensionality; in fact, if the number of dimensions exceeds 10 to 20, searching in k-d trees and related structures involves the inspection of a large fraction of the database, thereby doing no better than brute-force linear search. It has been suggested that since the selection of features and the choice of a distance metric in typical applications is rather heuristic, determining an approximate nearest neighbor should su ce for most practical purposes. In this paper, we examine a novel scheme for approximate similarity search based on hashing. The basic idea is to hash the points

Citations

3341 Pattern Classification and Scene Analysis - Duda, Hart - 1973
2699 MJ: Introduction to Modern Information Retrieval - Salton, McGill - 1986
2168 Indexing by latent semantic analysis - Deerwester, Dumais, et al. - 1990
1567 Randomized Algorithms - Motwani, Raghavan - 1995
1084 The Design and Analysis of Spatial Data Structures - Samet - 1990
911 Multidimensional binary search trees used for associative searching - Bentley - 1975
769 P.: Query by image and video content: the QBIC system - Flickner, Sawhney, et al.
687 Nearest neighbor pattern classification - Cover, Hart - 1967
643 Algorithms in combinatorial geometry - Edelsbrunner - 1987
634 An optimal algorithm for approximate nearest neighbor searching fixed dimensions - Arya, Mount, et al. - 1998
533 Approximate nearest neighbors: Towards removing the curse of dimensionality - Indyk, Motwani - 1998
488 an algorithm for finding best matches in logarithmic expected time - Friedman, Bentley, et al. - 1977
413 S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces - Weber, Schek, et al. - 1998
393 Efficient and effective querying by image content - Faloutsos - 1994
376 The geometry of graphs and some of its algorithmic applications - Linial, London, et al. - 1995
375 Analysis of a complex of statistical variables into principal components - Hotelling - 1933
360 Color Science: Concepts and Methods, Quantitative Data and Formulae - Wyszecki, Stiles - 1982
355 Photobook: Tools for content-based manipulation of image databases - Pentland, Sclaroff
342 The SR-tree: an index structure for high-dimensional nearest neighbor queries - Katayama, Satoh - 1997
249 A weighted nearest neighbor algorithm for learning with symbolic features - Cost, Salzberg - 1993
211 R.: Discriminant adaptive nearest neighbor classification - Hastie, Tibshirani - 1996
191 Wavelet-based histograms for selectivity estimation - Matias, Vitter, et al. - 1998
153 Visually searching the web for content - Smith, Chang - 1997
117 Integrated spatial and feature image system: Retrieval, analysis and compression - Smith - 1997
111 Finding interesting associations without support pruning - Cohen, Datar, et al.
102 Approximate medians and other quantiles in one pass and with limited memory - Manku, Rajagopalan, et al. - 1998
91 Random sampling for histogram construction: how much is enough - Chaudhuri, Motwani, et al. - 1998
88 Dimensionaltry reduction for similarity searching in dynamic databases - Kanth, Agrawal, et al. - 1999
82 A survey of information retrieval and filtering methods”, Technical Report on A survey of information retrieval and filtering methods - Faloutsos, Oard - 1995
59 The dimensions of almost spherical sections of convex bodies - Figiel, Lindenstrauss, et al. - 1977
51 Approximate nearest neighbor queries revisited - Chan - 1997
43 A cost model for similarity queries in metric spaces - Ciaccia, Patella, et al.
33 Accounting for boundary effects in nearest neighbor searching - Arya, Mount, et al. - 1995
20 Nearest neighbor methods in discrimination - Devroye, Wagner - 1982
20 Y.: Texture Features for Browsing and Retrieval of Large Image Data - Manjunath, Ma - 1996
18 Multidimensional access methods. Trees Have Grown Everywhere - Sellis, Roussopoulos, et al. - 1997
15 Approximate Matching with High Dimensionality R-trees", M.Sc. scholarly paper - Otterman - 1992
11 Finding pirated video sequences on the internet - Indyk, Iyengar, et al. - 1999
10 Vector Quantization and Data Compression - Gersho, Gray - 1991
9 Uber lineare Methoden in der - Karhunen - 1947
8 High-dimensional Index Structures - Berchtold, Keim - 1998
7 Nearest neighbor filters for multivariate data - Koivune, Kassam - 1995
5 Fonctions aleastoires de second ordere. Processus Stochastiques et mouvement Brownian - Lo'eve - 1948
3 Airphoto dataset - Manjunath - 2000
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University