• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Supervised mid-level features for word image representation (0)

by A Gordo
Venue:In CVPR
Add To MetaCart

Tools

Sorted by:
Results 1 - 1 of 1

LEWIS: Latent Embeddings for Word Images and their Semantics

by Albert Gordo, Naila Murray, Florent Perronnin
"... The goal of this work is to bring semantics into the tasks of text recognition and retrieval in natural images. Al-though text recognition and retrieval have received a lot of attention in recent years, previous works have focused on recognizing or retrieving exactly the same word used as a query, w ..."
Abstract - Add to MetaCart
The goal of this work is to bring semantics into the tasks of text recognition and retrieval in natural images. Al-though text recognition and retrieval have received a lot of attention in recent years, previous works have focused on recognizing or retrieving exactly the same word used as a query, without taking the semantics into consideration. In this paper, we ask the following question: can we pre-dict semantic concepts directly from a word image, with-out explicitly trying to transcribe the word image or its characters at any point? For this goal we propose a con-volutional neural network (CNN) with a weighted ranking loss objective that ensures that the concepts relevant to the query image are ranked ahead of those that are not rele-vant. This can also be interpreted as learning a Euclidean space where word images and concepts are jointly embed-ded. This model is learned in an end-to-end manner, from image pixels to semantic concepts, using a dataset of syn-thetically generated word images and concepts mined from a lexical database (WordNet). Our results show that, de-spite the complexity of the task, word images and concepts can indeed be associated with a high degree of accuracy. 1.
(Show Context)

Citation Context

...ated to text recognition and retrieval in natural images [15, 27]. For example, given an image of a word, one may be interested in recognizing the word, either using a list of possible transcriptions =-=[4, 10, 27]-=- or in an unconstrained manner [6, 12]. There has also been a growing interest in word image retrieval: given a query, which can be either a text string or another word image, one tries to retrieve th...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University