Results 1 - 10
of
41
Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach
- IEEE Transactions on Pattern Analysis and Machine Intelligence
, 2003
"... Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds ..."
Abstract
-
Cited by 171 (22 self)
- Add to MetaCart
Automatic linguistic indexing of pictures is an important but highly challenging problem for researchers in computer vision and content-based image retrieval. In this paper, we introduce a statistical modeling approach to this problem. Categorized images are used to train a dictionary of hundreds of statistical models each representing a concept. Images of any given concept are regarded as instances of a stochastic process that characterizes the concept. To measure the extent of association between an image and the textual description of a concept, the likelihood of the occurrence of the image based on the characterizing stochastic process is computed. A high likelihood indicates a strong association. In our experimental implementation, we focus on a particular group of stochastic processes, that is, the two-dimensional multiresolution hidden Markov models (2D MHMMs). We implemented and tested our ALIP (Automatic Linguistic Indexing of Pictures) system on a photographic image database of 600 different concepts, each with about 40 training images. The system is evaluated quantitatively using more than 4,600 images outside the training database and compared with a random annotation scheme. Experiments have demonstrated the good accuracy of the system and its high potential in linguistic indexing of photographic images.
Image retrieval: ideas, influences, and trends of the new age
- ACM COMPUTING SURVEYS
, 2008
"... We have witnessed great interest and a wealth of promise in content-based image retrieval as an emerging technology. While the last decade laid foundation to such promise, it also paved the way for a large number of new techniques and systems, got many new people involved, and triggered stronger ass ..."
Abstract
-
Cited by 157 (3 self)
- Add to MetaCart
We have witnessed great interest and a wealth of promise in content-based image retrieval as an emerging technology. While the last decade laid foundation to such promise, it also paved the way for a large number of new techniques and systems, got many new people involved, and triggered stronger association of weakly related fields. In this article, we survey almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation, and in the process discuss the spawning of related subfields. We also discuss significant challenges involved in the adaptation of existing image retrieval techniques to build systems that can be useful in the real world. In retrospect of what has been achieved so far, we also conjecture what the future may hold for image retrieval research.
Hidden Markov processes
- IEEE Trans. Inform. Theory
, 2002
"... Abstract—An overview of statistical and information-theoretic aspects of hidden Markov processes (HMPs) is presented. An HMP is a discrete-time finite-state homogeneous Markov chain observed through a discrete-time memoryless invariant channel. In recent years, the work of Baum and Petrie on finite- ..."
Abstract
-
Cited by 93 (2 self)
- Add to MetaCart
Abstract—An overview of statistical and information-theoretic aspects of hidden Markov processes (HMPs) is presented. An HMP is a discrete-time finite-state homogeneous Markov chain observed through a discrete-time memoryless invariant channel. In recent years, the work of Baum and Petrie on finite-state finite-alphabet HMPs was expanded to HMPs with finite as well as continuous state spaces and a general alphabet. In particular, statistical properties and ergodic theorems for relative entropy densities of HMPs were developed. Consistency and asymptotic normality of the maximum-likelihood (ML) parameter estimator were proved under some mild conditions. Similar results were established for switching autoregressive processes. These processes generalize HMPs. New algorithms were developed for estimating the state, parameter, and order of an HMP, for universal coding and classification of HMPs, and for universal decoding of hidden Markov channels. These and other related topics are reviewed in this paper. Index Terms—Baum–Petrie algorithm, entropy ergodic theorems, finite-state channels, hidden Markov models, identifiability, Kalman filter, maximum-likelihood (ML) estimation, order estimation, recursive parameter estimation, switching autoregressive processes, Ziv inequality. I.
Multiresolution image classification by hierarchical modeling with two dimensional hidden Markov models
- IEEE TRANS. INFORMATION THEORY
, 2000
"... This paper treats a multiresolution hidden Markov model for classifying images. Each image is represented by feature vectors at several resolutions, which are statistically dependent as modeled by the underlying state process, a multiscale Markov mesh. Unknowns in the model are estimated by maximum ..."
Abstract
-
Cited by 39 (8 self)
- Add to MetaCart
This paper treats a multiresolution hidden Markov model for classifying images. Each image is represented by feature vectors at several resolutions, which are statistically dependent as modeled by the underlying state process, a multiscale Markov mesh. Unknowns in the model are estimated by maximum likelihood, in particular by employing the expectation-maximization algorithm. An image is classified by finding the optimal set of states with maximum a posteriori probability. States are then mapped into classes. The multiresolution model enables multiscale information about context to be incorporated into classification. Suboptimal algorithms based on the model provide progressive classification that is much faster than the algorithm based on single-resolution hidden Markov models.
Signal and Image Segmentation Using Pairwise Markov Chains
, 2003
"... The aim of this paper is to apply the recent pairwise Markov chain model, which generalizes the hidden Markov chain one, to the unsupervised restoration of hidden data. The main novelty is an original parameter estimation method, valid in a general setting where the form of the possibly correlated n ..."
Abstract
-
Cited by 21 (13 self)
- Add to MetaCart
The aim of this paper is to apply the recent pairwise Markov chain model, which generalizes the hidden Markov chain one, to the unsupervised restoration of hidden data. The main novelty is an original parameter estimation method, valid in a general setting where the form of the possibly correlated noise is not known. Several experimental results are presented in both Gaussian and generalized mixture contexts. They show the advantages of the pairwise Markov chain model with respect to classical hidden Markov chain one for supervised and unsupervised restorations.
2D conditional random fields for web information extraction
- Proc. of ICML
, 2005
"... The Web contains an abundance of useful semistructured information about real world objects, and our empirical study shows that strong sequence characteristics exist for Web information about objects of the same type across different Web sites. Conditional Random Fields (CRFs) are the state of the a ..."
Abstract
-
Cited by 16 (3 self)
- Add to MetaCart
The Web contains an abundance of useful semistructured information about real world objects, and our empirical study shows that strong sequence characteristics exist for Web information about objects of the same type across different Web sites. Conditional Random Fields (CRFs) are the state of the art approaches taking the sequence characteristics to do better labeling. However, as the information on a Web page is two-dimensionally laid out, previous linear-chain CRFs have their limitations for Web information extraction. To better incorporate the two-dimensional neighborhood interactions, this paper presents a two-dimensional CRF model to automatically extract object information from the Web. We empirically compare the proposed model with existing linear-chain CRF models for product information extraction, and the results show the effectiveness of our model. 1.
Wavelet-based texture analysis and synthesis using hidden Markov models
- IEEE Trans. Circuits Syst. I
, 2003
"... Wavelet-domain hidden Markov models (HMMs), in particular hidden Markov tree (HMT), were recently proposed and applied to image processing, where it was usually assumed that three subbands of the 2-D discrete wavelet transform (DWT), i.e. HL, LH, and HH, are independent. In this paper, we study wave ..."
Abstract
-
Cited by 11 (1 self)
- Add to MetaCart
Wavelet-domain hidden Markov models (HMMs), in particular hidden Markov tree (HMT), were recently proposed and applied to image processing, where it was usually assumed that three subbands of the 2-D discrete wavelet transform (DWT), i.e. HL, LH, and HH, are independent. In this paper, we study wavelet-based texture analysis and synthesis using HMMs. Particularly, we develop a new HMM, called HMT-3S, for statistical texture characterization in the wavelet-domain. In addition to the joint statistics captured by HMT, the new HMT-3S can also exploit the crosscorrelation across DWT subbands. Meanwhile, HMT-3S can be characterized by using the graphical grouping technique, and has the same tree structure as HMT. The proposed HMT-3S is applied to texture analysis, including classification and segmentation, and texture synthesis with improved performance over HMT. Specifically, for texture classification, we study four wavelet-based methods, and experimental results show that HMT-3S provides the highest percentage of correct classification of over 95 % upon a set of 55 Brodatz textures. For texture segmentation, we demonstrate that more accurate texture characterization from HMT-3S allows the significant improvements in terms of both classification accuracy and boundary localization. For texture synthesis, we develop an iterative
Gaussian Mixture Models Of Texture And Colour For Image Database
- in Proc . ICASSP
, 2003
"... We introduce Gaussian mixture models of `structure' and colour features in order to classify coloured textures in images, with a view to the retrieval of textured colour images from databases. Classifications are performed separately using structure and colour and then combined using a confidence cr ..."
Abstract
-
Cited by 9 (0 self)
- Add to MetaCart
We introduce Gaussian mixture models of `structure' and colour features in order to classify coloured textures in images, with a view to the retrieval of textured colour images from databases. Classifications are performed separately using structure and colour and then combined using a confidence criterion. We apply the models to the VisTex database and to the classification of man-made and natural areas in aerial images. We compare these models with others in the literature, and show an overall improvement in performance.

