Results 1 - 10
of
38
Automatic Thumbnail Cropping and its Effectiveness
, 2003
"... Thumbnail images provide users of image retrieval and browsing systems with a method for quickly scanning large numbers of images. Recognizing the objects in an image is important in many retrieval tasks, but thumbnails generated by shrinking the original image often render objects illegible. We stu ..."
Abstract
-
Cited by 114 (11 self)
- Add to MetaCart
(Show Context)
Thumbnail images provide users of image retrieval and browsing systems with a method for quickly scanning large numbers of images. Recognizing the objects in an image is important in many retrieval tasks, but thumbnails generated by shrinking the original image often render objects illegible. We study the ability of computer vision systems to detect key components of images so that intelligent cropping, prior to shrinking, can render objects more recognizable. We evaluate automatic cropping techniques l) based on a method that detects salient portions of general images, and 2) based on automatic face detection. Our user study shows that these methods result in small thumbnails that are substantially more recognizable and easier to find in the context of visual search.
Visual attention detection in video sequences using spatiotemporal cues
- ACM MM
"... Human vision system actively seeks interesting regions in images to reduce the search effort in tasks, such as object detection and recognition. Similarly, prominent actions in video sequences are more likely to attract human’s first sight than their surrounding neighbors. In this paper, we propose ..."
Abstract
-
Cited by 98 (1 self)
- Add to MetaCart
(Show Context)
Human vision system actively seeks interesting regions in images to reduce the search effort in tasks, such as object detection and recognition. Similarly, prominent actions in video sequences are more likely to attract human’s first sight than their surrounding neighbors. In this paper, we propose a spatiotemporal video attention detection technique for detecting the attended regions that correspond to both interesting objects and actions in video sequences. Both spatial and temporal saliency maps are constructed and further fused in a dynamic fashion to produce the overall spatiotemporal attention model. In the temporal attention model, motion contrast is computed based on the planar motions (homography) between images, which is estimated by applying RANSAC on point correspondences in the scene. To compensate the non-uniformity of spatial distribution of interest-points, spanning areas of motion segments are incorporated in the motion contrast computation. In the spatial attention model, we have developed a fast method for computing pixel-level saliency maps using color histograms of images. A hierarchical spatial attention representation is established to reveal the interesting points in images as well as the interesting regions. Finally, a dynamic fusion technique is applied to combine both the temporal and spatial saliency maps, where temporal attention is dominant over the spatial model when large motion contrast exists, and vice versa. The proposed spatiotemporal attention framework has been extensively applied on several video sequences, and attended regions are detected to highlight interesting objects and motions present in the sequences with very high user satisfaction rate.
Computational visual attention systems and their cognitive foundations: A survey
- ACM Trans. on Applied Perception
"... Based on concepts of the human visual system, computational visual attention systems aim to detect regions of interest in images. Psychologists, neurobiologists, and computer scientists have investigated visual attention thoroughly during the last decades and profited considerably from each other. H ..."
Abstract
-
Cited by 67 (4 self)
- Add to MetaCart
(Show Context)
Based on concepts of the human visual system, computational visual attention systems aim to detect regions of interest in images. Psychologists, neurobiologists, and computer scientists have investigated visual attention thoroughly during the last decades and profited considerably from each other. However, the interdisciplinarity of the topic holds not only benefits but also difficulties: concepts of other fields are usually hard to access due to differences in vocabulary and lack of knowledge of the relevant literature. This paper aims to bridge this gap and bring together concepts and ideas from the different research areas. It provides an extensive survey of the grounding psychological and biological research on visual attention as well as the current state of the art of computational systems. Furthermore, it presents a broad range of applications of computational attention systems in fields like computer vision, cognitive systems and mobile robotics. We conclude with a discussion on the limitations and open questions in the field.
Modelling spatio-temporal saliency to predict gaze direction for short videos,” Int
- Journal of Computer Vision
, 2009
"... The original publication is available at www.springerlink.com ..."
Abstract
-
Cited by 35 (4 self)
- Add to MetaCart
(Show Context)
The original publication is available at www.springerlink.com
The emergence of attention by population-based inference and its role in distributed processing and cognitive control of vision
, 2005
"... ..."
Interactions of visual attention and object recognition: computational modeling, algorithms, and psychophysics
, 2006
"... iii iv I would like to thank my advisor, Dr. Christof Koch, for his guidance and patience throughout the work that led to this thesis. He and the other members of my advisory committee, Dr. Pietro Perona, Dr. Laurent Itti, Dr. Shinsuke Shimojo, and Dr. Richard Andersen, helped me to stay focused whe ..."
Abstract
-
Cited by 21 (0 self)
- Add to MetaCart
iii iv I would like to thank my advisor, Dr. Christof Koch, for his guidance and patience throughout the work that led to this thesis. He and the other members of my advisory committee, Dr. Pietro Perona, Dr. Laurent Itti, Dr. Shinsuke Shimojo, and Dr. Richard Andersen, helped me to stay focused when I was about to embark on yet another project. It was an honor and pleasure to collaborate with Ueli Rutishauser and Dr. Fei-Fei Li at Caltech;
Attention in hierarchical models of object recognition
- Prog. Brain Res
, 2007
"... Object recognition and visual attention are tightly linked processes in human perception. Over the last three decades, many models have been suggested to explain these two processes and their interactions, and in some cases these models appear to contradict each other. We suggest a unifying framewor ..."
Abstract
-
Cited by 13 (0 self)
- Add to MetaCart
Object recognition and visual attention are tightly linked processes in human perception. Over the last three decades, many models have been suggested to explain these two processes and their interactions, and in some cases these models appear to contradict each other. We suggest a unifying framework for object recognition and attention and review the existing modeling literature in this context. Furthermore, we demonstrate a proof-of-concept implementation for sharing complex features between recognition and attention as a mode of top-down attention to particular objects or object categories. “At first he’d most easily make out the shadows; and after that the phantoms of the human beings and the other things in water; and, later, the things them-selves. ” — Socrates describing the visual experience of a man exposed to the richness of the visual world outside his cave for the first time (Plato, The Re-public). 1 1
Two Selection Stages Provide Efficient Object-Based Attentional Control for Dynamic Vision
- In International Workshop on Attention and Performance in Computer Vision
, 2003
"... In this paper, we introduce semiattentive computations as the result of replacing the usual single selection stage of visual attention by two consecutive selection stages. They are motivated by shortcomings of conventional attention models and correlate well to findings in human attention. The first ..."
Abstract
-
Cited by 12 (0 self)
- Add to MetaCart
In this paper, we introduce semiattentive computations as the result of replacing the usual single selection stage of visual attention by two consecutive selection stages. They are motivated by shortcomings of conventional attention models and correlate well to findings in human attention. The first selection stage employs preattentive saliency computations for the complete available input, and selects a small number of discrete items. These are subject to the semiattentive processes of tracking and information accumulation. The second stage selects a single element from the result of the first selection stage for the conventional focus of attention. The implementation and efficiency of this scheme is demonstrated in this paper. Its main advantage is the efficient selection and inhibition of objects in dynamic scenes. It allows the serialized accumulation of information for a changing environment and provides an up-to-date world model. The focus of this paper is on the quality of the computed world model and the object-related computations.
Statistical Structuring of Pictorial Databases for Content-Based Image Retrieval Systems
, 1996
"... This letter presents a two-stage statistical approach for "exploring and explaining" a pictorial database, for content-based image retrieval systems. First, we describe how correspondence analysis provides image classes, as well as facilitates the understanding of the role of image primiti ..."
Abstract
-
Cited by 12 (9 self)
- Add to MetaCart
This letter presents a two-stage statistical approach for "exploring and explaining" a pictorial database, for content-based image retrieval systems. First, we describe how correspondence analysis provides image classes, as well as facilitates the understanding of the role of image primitives and attributes used to index pictures. Such understanding allows an intelligent choice of features, and thus computational savings, to be made. Second, ascendent hierarchical classification permits the structuring of the database, in order to ease picture indexing and retrieval. Keywords Image databases, content-based image retrieval systems, exploratory statistics, correspondence analysis, ascendant hierarchical classification. Running head Statistical structuring of pictorial databases. Corresponding author: Thierry Pun, address above (phone: +41 (22) 705 7627, fax: +41 (22) 320 29 27). 1. This research is supported by the Swiss National Fund for Scientific Research, grant 20.40239.94. Part of...
Schelling Points on 3D Surface Meshes
"... This paper investigates “Schelling points ” on 3D meshes, feature points selected by people in a pure coordination game due to their salience. To collect data for this investigation, we designed an online experiment that asked people to select points on 3D surfaces that they expect will be selected ..."
Abstract
-
Cited by 11 (2 self)
- Add to MetaCart
This paper investigates “Schelling points ” on 3D meshes, feature points selected by people in a pure coordination game due to their salience. To collect data for this investigation, we designed an online experiment that asked people to select points on 3D surfaces that they expect will be selected by other people. We then analyzed properties of the selected points, finding that: 1) Schelling point sets are usually highly symmetric, and 2) local curvature properties (e.g., Gauss curvature) are most helpful for identifying obvious Schelling points (tips of protrusions), but 3) global properties (e.g., segment centeredness, proximity to a symmetry axis, etc.) are required to explain more subtle features. Based on these observations, we use regression analysis to combine multiple properties into an analytical model that predicts where Schelling points are likely to be on new meshes. We find that this model benefits from a variety of surface properties, particularly when training data comes from examples in the same object class.