• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Performance Evaluation in Content-Based Image Retrieval : Overview and Proposals (1999)

by H Muller, V Müler, McG Squire D
Add To MetaCart

Tools

Sorted by:
Results 11 - 20 of 35
Next 10 →

Episode-constrained cross-validation in video concept retrieval

by Jan C. Van Gemert, Cor J. Veenman, Jan-mark Geusebroek Member - IEEE Trans. Multimedia
"... Abstract—Whereas video tells a narrative by a composition of shots, current video retrieval methods focus mainly on single shots. In retrieval performance estimation, similar shots in a narrative may result in performance over-estimation. We propose an episode-based version of cross-validation leadi ..."
Abstract - Cited by 4 (2 self) - Add to MetaCart
Abstract—Whereas video tells a narrative by a composition of shots, current video retrieval methods focus mainly on single shots. In retrieval performance estimation, similar shots in a narrative may result in performance over-estimation. We propose an episode-based version of cross-validation leading up to 14% classification improvement over shot based cross-validation. I.

MRML: A Communication Protocol for Content-Based Image Retrieval

by Wolfgang Müller, Henning Müller, Stéphane Marchand-Maillet, Thierry Pun, David McG. Squire, Zoran Pecenovic, Christoph Giess, Arjen P. de Vries , 2000
"... In this paper we introduce and describe the Multimedia Retrieval Markup Language (MRML). This XML-based markup language is the basis for an open communication protocol for content-based image retrieval systems (CBIRSs). MRML was initially designed as a means of separating CBIR engines from their use ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
In this paper we introduce and describe the Multimedia Retrieval Markup Language (MRML). This XML-based markup language is the basis for an open communication protocol for content-based image retrieval systems (CBIRSs). MRML was initially designed as a means of separating CBIR engines from their user interfaces. It is, however, also extensible as the basis for standardised performance evaluation procedures. Such a tool is essential for the formulation and implementation of common benchmarks for CBIR. A common protocol can also bring new dynamics to the CBIR field -- it makes the development of new systems faster and more efficient, and opens the door of the CBIR research field to other disciplines such as Human-Computer Interaction. The MRML specifications, as well as the first MRML-compliant applications, are freely available and are introduced in this paper.

An Open Framework for Distributed Multimedia Retrieval

by Henning Müller , Wolfgang Müller, David McG. Squire, Zoran Pecenović, Stéphane Marchand-Maillet, Thierry Pun , 2000
"... This article describes a framework for distributed multimedia retrieval which permits the connection of compliant user interfaces with a variety of multimedia retrieval engines via an open communication protocol, MRML (Multi Media Retrieval Markup Language). It allows the choice of image collecti ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
This article describes a framework for distributed multimedia retrieval which permits the connection of compliant user interfaces with a variety of multimedia retrieval engines via an open communication protocol, MRML (Multi Media Retrieval Markup Language). It allows the choice of image collection, feature set and query algorithm during run{time, permitting multiple users to query a system adapted to their needs, using the query paradigm adapted to their problem such as query by example (QBE), browsing queries, or query by annotation. User interaction is implemented over several levels and in diverse ways. Relevance feedback is implemented using positive and negative example images that can be used for a best{match QBE query. In contrast, browsing methods try to approach the searched image by giving overviews of the entire collection and by successive renement. In addition to these query methods, Long term o line learning is implemented. It allows feature preferences per ...

Performance metrics and evaluation issues for continuous activity recognition,” in Performance Metrics for Intelligent Systems

by David Minnen, Tracy Westeyn, Thad Starner , 2006
"... Abstract — In this paper we examine several factors that influence the evaluation of multi-class, continuous activity recognition. Currently, there is no standard metric for evaluating and comparing such systems, although many possible error formulations and performance metrics could be adapted from ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
Abstract — In this paper we examine several factors that influence the evaluation of multi-class, continuous activity recognition. Currently, there is no standard metric for evaluating and comparing such systems, although many possible error formulations and performance metrics could be adapted from other domains. In order to make progress toward a standard metric appropriate for evaluating activity recognition, we outline the sources of errors in such systems, present different methods for detecting and labeling these errors, and compare existing metrics with a more nuanced performance visualization. We conclude with a discussion concerning the interpretation of the visualization for comparing recognition systems in different domains. I.

MRML: An Extensible Communication Protocol for Interoperability and Benchmarking of Multimedia Information Retrieval Systems

by Wolfgang Müller, Henning Müller, Stéphane Marchand-Maillet, Thierry Pun, David McG. Squire, Zoran Pecenovic, Christoph Giess, Arjen P. De Vries - PATTERN RECOGNITION LETTERS, SPECIAL ISSUE ON IMAGE/VIDEO INDEXING AND RETRIEVAL , 2000
"... While in the area of relational databases interoperability is ensured by common communication protocols (e.g. ODBC/JDBC using SQL), Content Based Image Retrieval Systems (CBIRS) and other multimedia retrieval systems are lacking both a common query language and a common communication protocol. Besi ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
While in the area of relational databases interoperability is ensured by common communication protocols (e.g. ODBC/JDBC using SQL), Content Based Image Retrieval Systems (CBIRS) and other multimedia retrieval systems are lacking both a common query language and a common communication protocol. Besides its obvious short term convenience, interoperability of systems is crucial for the exchange and analysis of user data. In this paper, we present and describe an extensible XML-based query markup language, called MRML (Multimedia Retrieval Markup Language). MRML is primarily designed so as to ensure interoperability between dierent content{based multimedia retrieval systems. Further, MRML allows researchers to preserve their freedom in extending their system as needed. MRML encapsulates multimedia queries in a way that enables multimedia (MM) query languages, MM content descriptions, MM query engines, and MM user interfaces to grow independently from each other, reaching a maximum of in...

A Performance Evaluation Protocol for Content-Based Image Retrieval Algorithms/Systems

by Liu Wenyin, Zhong Su, Stan Li, Yan-Feng Sun, Hongjiang Zhang - In Empirical Evaluation Methods in Computer Vision , 2001
"... In this paper, we address an important yet controversial issue in content-based image retrieval (CBIR): performance evaluation (PE). The goal is to develop an objective and comprehensive protocol for quantitative evaluation and comparison of CBIR performances among different approaches. The protocol ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
In this paper, we address an important yet controversial issue in content-based image retrieval (CBIR): performance evaluation (PE). The goal is to develop an objective and comprehensive protocol for quantitative evaluation and comparison of CBIR performances among different approaches. The protocol consists of two elements: acquisition of appropriate ground truth data, and definition of quantitative performance metrics. A large image data set with human-label ground truths has been built as part of the protocol and made available for public use. An application of this protocol to our MiAlbum system is also presented, which shows suitability of such a protocol.

The multimodal neighborhood signature for modeling object color appearance and aplications in object recognition and image retrieval

by J. Matas, D. Koubaroulis, J. Kittler - Computer Vision and Image Understanding , 2002
"... We propose a general-purpose color-based object model called the Multimodal Neighborhood Signature (MNS) with applications in object recognition and image retrieval. Object modeling is example-based and can cope with many appearance variations due to the image formation/rendering process. The local ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
We propose a general-purpose color-based object model called the Multimodal Neighborhood Signature (MNS) with applications in object recognition and image retrieval. Object modeling is example-based and can cope with many appearance variations due to the image formation/rendering process. The local nature of the color representation facilitates robustness to occlusion and clutter. Unlike other methods, neither segmentation nor edge detection is required and the area of homogeneously colored regions is not used. The algorithm is simple to implement and has low storage requirements. In the reported experiments, eight recognition and two other retrieval methods are reviewed and systematically compared with MNS. Results show good and fast performance under severe scale, viewpoint, occlusion, and background change using a single image for object modeling. Although spatial information was not used and its default internal parameters were used, MNS outperformed most compared methods. c ○ 2002 Elsevier Science (USA) Key Words: multimodal neighborhood signature; color; image retrieval; object recognition; mean shift; mns.

Abstract Evaluation and comparison of texture descriptors proposed in MPEG-7

by Feng Xu, Yu-jin Zhang , 2005
"... Texture description contributes as one of the most important low-level features in content-based image retrieval. In MPEG-7, homogeneous texture descriptor (HTD), texture browsing descriptor (TBD), and edge histogram descriptor (EHD) have been proposed as texture descriptors. However, no comprehensi ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
Texture description contributes as one of the most important low-level features in content-based image retrieval. In MPEG-7, homogeneous texture descriptor (HTD), texture browsing descriptor (TBD), and edge histogram descriptor (EHD) have been proposed as texture descriptors. However, no comprehensive evaluation and comparison of these three descriptors have been made. In this paper, we propose a comprehensive evaluation and comparison benchmark for feature descriptors, especially for visual descriptors in MPEG-7. In the proposed benchmark, three texture descriptors in MPEG-7 are evaluated and compared. First, the descriptors are analyzed according to the standard criteria. Second, experiments are implemented on the Brodatz texture image database. Analysis of the experimental results shows that each descriptor has some specific characteristics and performs better than the other two in certain applications. The applicability is also summarized for each descriptor. The survey as well as performance evaluation and comparison in this paper provide several guidelines for using these descriptors in image retrieval and other applications.

PATSEEK: Content Based Image Retrieval System for Patent Database

by Avinash Tiwari, Veena Bansal
"... A patent always contains some images along with the text. Many text based systems have been developed to search the patent database. In this paper, we describe PATSEEK that is an image based search system for US patent database. The objective is to let the user check the similarity of his query imag ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
A patent always contains some images along with the text. Many text based systems have been developed to search the patent database. In this paper, we describe PATSEEK that is an image based search system for US patent database. The objective is to let the user check the similarity of his query image with the images that exist in US patents. The user can specify a set of key words that must exist in the text of the patents whose images will be searched for similarity. PATSEEK automatically grabs images from the US patent database on the request of the user and represents them through an edge orientation autocorrelogram. L1 and L2 distance measures are used to compute the distance between the images. A recall rate of 100 % for 61 % of query images and an average 32 % recall rate for rest of the images has been observed.

Efficient indexing for strongly similar subimage retrieval,” 2006, submitted to

by Gerhard Roth, William Scott - CIVR2006-International Conference on Image and Video Retrieval
"... Strongly similar subimages contain different views of the same object. In subimage search, the user selects an image region and the retrieval system attempts to find matching subimages in an image database that are strongly similar. Solutions have been proposed using salient features or “interest po ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Strongly similar subimages contain different views of the same object. In subimage search, the user selects an image region and the retrieval system attempts to find matching subimages in an image database that are strongly similar. Solutions have been proposed using salient features or “interest points ” that have associated descriptor vectors. However, searching large image databases by exhaustive comparison of interest point descriptors is not feasible. To solve this problem, we propose a novel off-line indexing scheme based on the most significant bits (MSBs) of these descriptors. On-line search uses this index file to limit the search to interest points whose descriptors have the same MSB value, a process up to three orders of magnitude faster than exhaustive search. It is also incremental, since the index file for a union of a group of images can be created by merging the index files of the individual image groups. The effectiveness of the approach is demonstrated experimentally on a variety of image databases. 1
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University