Efficient clustering and matching for object class recognition, in: BMVC (2006)

by B Leibe, K Mikolajczyk, B Schiele
Results 1 - 10 of 37

Fast approximate nearest neighbors with automatic algorithm configuration

by Marius Muja, David G. Lowe - In VISAPP International Conference on Computer Vision Theory and Applications, 2009
Abstract - Cited by 455 (2 self)
Keywords: nearest-neighbors search, randomized kd-trees, hierarchical k-means tree, clustering. For many computer vision problems, the most time-consuming component consists of nearest neighbor matching in high-dimensional spaces. There are no known exact algorithms for solving these high-dimensional problems that are faster than linear search. Approximate algorithms are known to provide large speedups with only minor loss in accuracy, but many such algorithms have been published with only minimal guidance on selecting an algorithm and its parameters for any given problem. In this paper, we describe a system that answers the question, “What is the fastest approximate nearest-neighbor algorithm for my data?” Our system will take any given dataset and desired degree of precision and use these to automatically determine the best algorithm and parameter values. We also describe a new algorithm that applies priority search on hierarchical k-means trees, which we have found to provide the best known performance on many datasets. After testing a range of alternatives, we have found that multiple randomized k-d trees provide the best performance for other datasets. We are releasing public domain code that implements these approaches. This library provides about one order of magnitude improvement in query time over the best previously available software and provides fully automated parameter selection.

Citation Context

...ree level, so that the darkest values (ratio≈1) fall near the boundaries between k-means regions. cal k-means tree similar to that proposed by Fukunaga and Narendra (Fukunaga and Narendra, 1975). In (Leibe et al., 2006) the authors propose an efficient method for clustering and matching features in large datasets. They compare several clustering methods: k-means clustering, agglomerative clustering, and a combined ...

Vector quantizing feature space with a regular lattice

by Tinne Tuytelaars - In ICCV, 2007
Abstract - Cited by 69 (3 self)
Most recent class-level object recognition systems work with visual words, i.e., vector quantized local descriptors. In this paper we examine the feasibility of a data-independent approach to construct such a visual vocabulary, where the feature space is discretized using a regular lattice. Using hashing techniques, only non-empty bins are stored, and fine-grained grids become possible in spite of the high dimensionality of typical feature spaces. Based on this representation, we can explore the structure of the feature space, and obtain state-of-the-art pixelwise classification results. In the case of image classification, we introduce a class-specific feature selection step, which takes the spatial structure of SIFT-like descriptors into account. Results are reported on the Graz02 dataset.

Citation Context

...e – there do not exist separate clusters. Existing solutions which improve over standard k-means include the use of mean-shift based clustering [9], hierarchical k-means [21], agglomerative clustering [14], and randomized trees [20]. They all present a compromise between accuracy and speed. In this paper we introduce an accurate and fast solution which differs significantly from existing solutions, as ...
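
The data-independent quantization described in the abstract above can be pictured with a minimal sketch. It assumes a plain cubic grid with a hypothetical `cell_size` parameter and a Python dictionary as the hash table; the lattice and hashing scheme actually used in the paper may differ.

```python
import math
from collections import defaultdict


def lattice_cell(descriptor, cell_size):
    """Return the integer grid coordinates of the cell containing a descriptor."""
    return tuple(math.floor(x / cell_size) for x in descriptor)


def build_vocabulary(descriptors, cell_size):
    """Hash descriptors into grid cells; empty cells are never materialized."""
    bins = defaultdict(list)
    for index, descriptor in enumerate(descriptors):
        bins[lattice_cell(descriptor, cell_size)].append(index)
    return dict(bins)


# Toy 2-D "descriptors" (illustrative data, not from the paper).
descriptors = [(0.1, 0.2), (0.15, 0.22), (0.9, 0.8), (-0.3, 0.4)]
vocab = build_vocabulary(descriptors, cell_size=0.25)
```

Because only occupied cells become dictionary keys, memory grows with the number of distinct cells hit by the data rather than with the total number of cells in the grid.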

Localizing Objects with Smart Dictionaries

by Brian Fulkerson, Andrea Vedaldi, Stefano Soatto - Proceedings of European Conference on Computer Vision
Abstract - Cited by 62 (3 self)
We present an approach to determine the category and location of objects in images. It performs very fast categorization of each pixel in an image, a brute-force approach made feasible by three key developments: First, our method reduces the size of a large generic dictionary (on the order of ten thousand words) to the low hundreds while increasing classification performance compared to k-means. This is achieved by creating a discriminative dictionary tailored to the task by following the information bottleneck principle. Second, we perform feature-based categorization efficiently on a dense grid by extending the concept of integral images to the computation of local histograms. Third, we compute SIFT descriptors densely in linear time. We compare our method to the state of the art and find that it excels in accuracy and simplicity, performing better while assuming less.

Citation Context

...sely in linear time. Related work. Lazebnik et al. [8] also perform discriminative learning to optimize k-means, but are limited to small dictionaries and visual words which are Voronoi cells. Leibe et al. [9] also perform compression, but not in a discriminative sense. Finally, Winn et al. [10] do discriminative compression in a similar fashion, but we show that we perform better and can scale to larger dictionaries. F...
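
The abstract above mentions extending integral images to local histograms. A minimal sketch of that general idea follows, assuming each pixel already carries a visual-word label; the function names and the toy label map are illustrative and not taken from the paper.

```python
def integral_histogram(labels, num_words):
    """ih[y][x][k] counts occurrences of word k in rows 0..y-1, columns 0..x-1."""
    height, width = len(labels), len(labels[0])
    ih = [[[0] * num_words for _ in range(width + 1)] for _ in range(height + 1)]
    for y in range(height):
        for x in range(width):
            cell = ih[y + 1][x + 1]
            for k in range(num_words):
                # Standard integral-image recurrence, applied per histogram bin.
                cell[k] = ih[y][x + 1][k] + ih[y + 1][x][k] - ih[y][x][k]
            cell[labels[y][x]] += 1
    return ih


def window_histogram(ih, top, left, bottom, right):
    """Histogram of rows top..bottom-1 and columns left..right-1 from four lookups."""
    return [
        a - b - c + d
        for a, b, c, d in zip(
            ih[bottom][right], ih[top][right], ih[bottom][left], ih[top][left]
        )
    ]


# Toy 3x3 map of visual-word labels (illustrative data).
labels = [[0, 1, 1],
          [2, 0, 1],
          [1, 1, 0]]
ih = integral_histogram(labels, num_words=3)
full = window_histogram(ih, 0, 0, 3, 3)
corner = window_histogram(ih, 1, 1, 3, 3)
```

After the one-time pass over the image, the histogram of any axis-aligned window costs four lookups per bin, independent of the window size.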

Feature tracking and motion compensation for action recognition

by H. Uemura, S. Ishikawa, K. Mikolajczyk - In BMVC, 2008
Abstract - Cited by 29 (2 self)
This paper discusses an approach to human action recognition via local feature tracking and robust estimation of background motion. The main contribution is a robust feature extraction algorithm based on the KLT tracker and SIFT as well as a method for estimating dominant planes in the scene. Multiple interest point detectors are used to provide a large number of features for every frame. The motion vectors for the features are estimated using optical flow and SIFT-based matching. The features are combined with image segmentation to estimate dominant homographies, and then separated into static and moving ones regardless of the camera motion. The action recognition approach can handle camera motion, zoom, human appearance variations, background clutter and occlusion. The motion compensation shows very good accuracy on a number of test sequences. The recognition system is extensively compared to state-of-the-art action recognition methods and the results are improved.

Citation Context

...ing. The SIFT descriptors are extracted from the training data as described in sections 2 and 3. The vocabulary trees are then constructed with the agglomerative clustering similar to the one used in [9]. Initially each descriptor forms a cluster. Two nearest clusters in the whole set are merged at each iteration based on their Euclidean distance. We continue merging until the number of clusters is 1...
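
The merging procedure described in this citation context can be sketched naively as follows. Measuring the distance between cluster centroids and stopping at a hypothetical `target_clusters` count are assumptions made for illustration; this brute-force version is far slower than the efficient schemes the cited work is concerned with.

```python
import math


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    dim = len(points[0])
    return tuple(sum(p[i] for p in points) / len(points) for i in range(dim))


def agglomerate(descriptors, target_clusters):
    """Repeatedly merge the two nearest clusters until target_clusters remain."""
    if target_clusters < 1:
        raise ValueError("target_clusters must be at least 1")
    # Initially each descriptor forms its own cluster.
    clusters = [[tuple(d)] for d in descriptors]
    while len(clusters) > target_clusters:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Assumption: cluster distance is the Euclidean centroid distance.
                dist = math.dist(centroid(clusters[i]), centroid(clusters[j]))
                if best is None or dist < best[0]:
                    best = (dist, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Two well-separated toy groups (illustrative data).
points = [(0, 0), (0, 1), (10, 10), (10, 11)]
result = agglomerate(points, target_clusters=2)
```

Recording the order of merges instead of only the final clusters would give the tree structure that a vocabulary tree needs.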

Efficient Sequential Correspondence Selection by Cosegmentation

by Jan Čech et al., 2009
Abstract - Cited by 27 (7 self)
In many retrieval, object recognition and wide baseline stereo methods, correspondences of interest points (distinguished regions) are commonly established by matching compact descriptors such as SIFTs. We show that a subsequent cosegmentation process coupled with a quasi-optimal sequential decision process leads to a correspondence verification procedure that (i) has high precision (is highly discriminative), (ii) has good recall and (iii) is fast. The sequential decision on the correctness of a correspondence is based on simple statistics of a modified dense stereo matching algorithm. The statistics are projected on a prominent discriminative direction by SVM. Wald’s sequential probability ratio test is performed on the SVM projection computed on progressively larger cosegmented regions. We show experimentally that the proposed Sequential Correspondence Verification (SCV) algorithm significantly outperforms the standard correspondence selection method based on SIFT distance ratios on challenging matching problems.

Citation Context

...is to describe regions with a compact descriptor such as SIFT [9] or some discretization of it (e.g. “visual words” [17]) and to store database image representations in a search tree (k-d [9], metric [7], k-means [13, 16, 3]). (Footnote 1: The terms “viewpoint invariant features”, “interest points”, “patches”, “distinguished regions” also appear in the literature.) A better estimate of correspondence quality (a pr...
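
The abstract above relies on Wald's sequential probability ratio test. A generic sketch of that test is given below. It assumes the per-step evidence is already available as log-likelihood ratios and uses Wald's usual approximate thresholds; how the paper derives its statistic from the SVM projection is not reproduced here, and the error rates `alpha` and `beta` are illustrative defaults.

```python
import math


def sprt(log_likelihood_ratios, alpha=0.05, beta=0.05):
    """Accumulate evidence and stop as soon as either threshold is crossed.

    alpha: tolerated false-accept rate, beta: tolerated false-reject rate.
    Returns (decision, number_of_observations_used).
    """
    upper = math.log((1 - beta) / alpha)  # at or above: accept the correspondence
    lower = math.log(beta / (1 - alpha))  # at or below: reject the correspondence
    total = 0.0
    used = 0
    for llr in log_likelihood_ratios:
        total += llr
        used += 1
        if total >= upper:
            return "accept", used
        if total <= lower:
            return "reject", used
    return "undecided", used


# Illustrative evidence streams (made-up numbers).
strong_match = sprt([1.0, 1.0, 1.0, 1.0])
strong_mismatch = sprt([-1.5, -1.5])
ambiguous = sprt([0.1, -0.1])
```

The appeal of the sequential form is that clear cases stop after a few observations, so most of the computation is spent only on ambiguous correspondences.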

Learning Linear Discriminant Projections for Dimensionality Reduction of Image Descriptors

by Hongping Cai, Krystian Mikolajczyk, Jiri Matas
Abstract - Cited by 20 (5 self)
This paper proposes a general method for improving image descriptors using discriminant projections. Two methods based on Linear Discriminant Analysis have been recently introduced in [3, 11] to improve matching performance of local descriptors and to reduce their dimensionality. These methods require a large training set with ground truth of accurate point-to-point correspondences, which limits their applicability. We demonstrate the theoretical equivalence of these methods and provide a means to derive projection vectors on data without available ground truth. This makes it possible to apply this technique and improve the performance of any combination of interest point detectors and descriptors. We conduct an extensive evaluation of the discriminative projection methods in various application scenarios. The results validate the proposed method in viewpoint invariant matching and category recognition.

Citation Context

...ormation to estimate the covariance. It is a strong assumption but it may lead to better results in particular when used in combination with the simulated data. We employ the agglomerative clustering [6] to obtain the feature clusters. It is a bottom up clustering approach which iteratively finds the nearest features in the set and merges them into clusters. In our implementation features whose dista...

Improving Descriptors for Fast Tree Matching by Optimal Linear Projection

by Krystian Mikolajczyk
Abstract - Cited by 17 (1 self)
In this paper we propose to transform an image descriptor so that nearest neighbor (NN) search for correspondences becomes the optimal matching strategy under the assumption that inter-image deviations of corresponding descriptors have a Gaussian distribution. The Euclidean NN in the transformed domain corresponds to the NN according to a truncated Mahalanobis metric in the original descriptor space. We provide theoretical justification for the proposed approach and show experimentally that the transformation allows a significant dimensionality reduction and improves matching performance of a state-of-the-art SIFT descriptor. We observe consistent improvement in precision-recall and speed of fast matching in tree structures at the expense of little overhead for projecting the descriptors into transformed space. In the context of SIFT vs. transformed M-SIFT comparison, tree search structures are evaluated according to different criteria and query types. All search tree experiments confirm that transformed M-SIFT performs better than the original SIFT.

Citation Context

... of large scale recognition methods [27]. The possibility to match SIFT-like descriptors by sublinear tree-based search methods is one of its key properties and has been explored in different systems [18, 21, 23, 27]. We therefore carefully compare SIFT and M-SIFT performance with different search structures, variants of kd-trees and metric (ball) trees. Besides demonstrating that M-SIFT outperforms SIFT, valuabl...
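
The abstract above states that Euclidean distance in the transformed domain corresponds to a Mahalanobis metric in the original space. The sketch below illustrates that general relationship with a Cholesky-based whitening transform; the truncation step and the way the paper estimates its covariance are omitted, and the 2x2 covariance matrix is made-up example data.

```python
import math


def cholesky(cov):
    """Lower-triangular L with L * L^T == cov, for a symmetric positive-definite cov."""
    n = len(cov)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                lower[i][j] = math.sqrt(cov[i][i] - s)
            else:
                lower[i][j] = (cov[i][j] - s) / lower[j][j]
    return lower


def whiten(x, lower):
    """Solve lower * y = x by forward substitution, i.e. y = lower^{-1} x."""
    y = []
    for i in range(len(x)):
        s = sum(lower[i][k] * y[k] for k in range(i))
        y.append((x[i] - s) / lower[i][i])
    return y


# Illustrative covariance of descriptor deviations (assumed, not from the paper).
cov = [[4.0, 2.0],
       [2.0, 3.0]]
lower = cholesky(cov)
a, b = (3.0, 1.0), (1.0, 1.0)
wa, wb = whiten(a, lower), whiten(b, lower)
# Squared Euclidean distance after whitening ...
squared_euclidean = sum((p - q) ** 2 for p, q in zip(wa, wb))
# ... equals the squared Mahalanobis distance (a-b)^T cov^{-1} (a-b) = 1.5 here.
```

Since the transform is applied once per descriptor, any standard Euclidean search tree can then be used unchanged on the whitened vectors.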

Action recognition with appearance-motion features and fast search trees

by K. Mikolajczyk, H. Uemura - Comput. Vis. Image Underst., 2011
Abstract - Cited by 7 (0 self)
In this paper we propose an approach for action recognition based on a vocabulary of local motion-appearance features and fast approximate search in a large number of trees. Large numbers of features with associated motion vectors are extracted from video data and are represented by many trees. Multiple interest point detectors are used to provide features for every frame. The motion vectors for the features are estimated using optical flow and descriptor-based matching. The features are combined with image segmentation to estimate dominant homographies, and then separated into static and moving ones despite the camera motion. Features from a query sequence are matched to the trees and vote for action categories and their locations. The large number of trees makes the process efficient and robust. The system is capable of simultaneous categorisation and localisation of actions using only a few frames per sequence. The approach obtains excellent performance on standard action recognition sequences. We perform large scale experiments on 17 challenging real action categories from various sport disciplines. We demonstrate the robustness of our method to appearance variations, camera motion, scale change, asymmetric actions, background clutter and occlusion.

Fast codebook generation by sequential data analysis for object classification

by Ra Teynor, Hans Burkhardt - in Proc. of the 3rd Int. Symp. on Visual Computing (ISVC), 2007
Abstract - Cited by 7 (1 self)
In this work, we present a novel, fast clustering scheme for codebook generation from local features for object class recognition. It relies on a sequential data analysis and creates compact clusters with low variance. We compare our algorithm to other commonly used algorithms with respect to cluster statistics and classification performance. It turns out that our algorithm is the fastest for codebook generation, without loss in classification performance, when using the right matching scheme. In this context, we propose a well-suited matching scheme for assigning data entries to cluster centers based on the sigmoid function.

Citation Context

...hical clustering (divisive clustering [9] as well as agglomerative clustering), clustering based on function optimization (e.g. expectation maximization (EM) type clustering [15]) or mixed techniques [8]. A general overview about clustering algorithms can be found in [15]. The two most commonly used techniques for codebook generation are agglomerative hierarchical clustering as well as K-means cluste...

Learning tree-structured descriptor quantizers for image categorization

by Josip Krapac, Jakob Verbeek, Frédéric Jurie - In BMVC, 2011
Cited by 6 (1 self)

Citation Context

...s k-means focus on clustering the most frequent descriptors, that are in general less discriminative. An algorithm similar to mean-shift was proposed to overcome this limitation. For similar reasons, [12] suggests to use agglomerative clustering which gives more compact clusters, while being more robust to outliers. Hierarchical clustering is also very efficient in terms of the computational cost to a...
