Results 1 -
2 of
2
Beyond spatial pyramids: A new feature extraction framework with dense spatial sampling for image classification
- in Proc. Eur. Conf. Comput. Vis., 2012
"... Abstract. We introduce a new framework for image classification that extends beyond the window sampling of fixed spatial pyramids to include a comprehensive set of windows densely sampled over location, size and aspect ratio. To effectively deal with this large set of windows, we derive a concise hi ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
(Show Context)
Abstract. We introduce a new framework for image classification that extends beyond the window sampling of fixed spatial pyramids to include a comprehensive set of windows densely sampled over location, size and aspect ratio. To effectively deal with this large set of windows, we derive a concise high-level image feature using a two-level extraction method. At the first level, window-based features are computed from local descriptors (e.g., SIFT, spatial HOG, LBP) in a process similar to standard feature extractors. Then at the second level, the new image feature is determined from the window-based features in a manner analogous to the first level. This higher level of abstraction offers both efficient handling of dense samples and reduced sensitivity to misalignment. More importantly, our simple yet effective framework can readily accommodate a large number of existing pooling/coding methods, allowing them to extract features beyond the spatial pyramid representation. To effectively fuse the second level feature with a standard first level image feature for classification, we additionally propose a new learning algorithm, called Generalized Adaptive ℓp-norm Multiple Kernel Learning (GA-MKL), to learn an adapted robust classifier based on multiple base kernels constructed from image features and multiple sets of prelearned classifiers of all the classes. Extensive evaluation on the object recognition (Caltech256) and scene recognition (15Scenes) benchmark datasets demonstrates that the proposed method outperforms state-ofthe-art image classification algorithms under a broad range of settings.
Multimedia Systems (accepted) (will be inserted by the editor) Tag Relevance Fusion for Social Image Retrieval
"... Abstract Due to the subjective nature of social tag-ging, measuring the relevance of social tags with respect to the visual content is crucial for retrieving the increas-ing amounts of social-networked images. Witnessing the limit of a single measurement of tag relevance, we intro-duce in this paper ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract Due to the subjective nature of social tag-ging, measuring the relevance of social tags with respect to the visual content is crucial for retrieving the increas-ing amounts of social-networked images. Witnessing the limit of a single measurement of tag relevance, we intro-duce in this paper tag relevance fusion as an extension to methods for tag relevance estimation. We present a systematic study, covering tag relevance fusion in early and late stages, and in supervised and unsupervised set-tings. Experiments on a large present-day benchmark set show that tag relevance fusion leads to better image retrieval. Moreover, unsupervised tag relevance fusion is found to be practically as effective as supervised tag relevance fusion, but without the need of any training efforts. This finding suggests the potential of tag rele-vance fusion for real-world deployment.