Results 1 - 10
of
8,282
Human Face Detection in Visual Scenes
, 1995
"... We present a neural network-based face detection system. A retinally connected neural network examines small windows of an image, and decides whether each window contains a face. The system arbitrates between multiple networks to improve performance over a single network. We use a bootstrap algorith ..."
Abstract
-
Cited by 179 (3 self)
- Add to MetaCart
We present a neural network-based face detection system. A retinally connected neural network examines small windows of an image, and decides whether each window contains a face. The system arbitrates between multiple networks to improve performance over a single network. We use a bootstrap algorithm for training the networks, which adds false detections into the training set as training progresses. This eliminates the difficult task of manually selecting non-face training examples, which must be chosen to span the entire space of non-face images. Comparisons with other state-of-the-art face detection systems are presented; our system has better performance in terms of detection and false-positive rates. This work was partially supported by a grant from Siemens Corporate Research, Inc., by the Department of the Army, Army Research Office under grant number DAAH04-94-G-0006, and by the Office of Naval Research under grant number N00014-95-1-0591. This work was started while Shumeet Balu...
Visual Scene Perception
"... A visual scene is commonly defined as a view of an environment comprised of objects and surfaces organized in a meaningful way, like a kitchen, a street or a forest path. More broadly, the domain of scene perception encompasses any visual stimulus that contains multiple elements arranged in a spatia ..."
Abstract
- Add to MetaCart
A visual scene is commonly defined as a view of an environment comprised of objects and surfaces organized in a meaningful way, like a kitchen, a street or a forest path. More broadly, the domain of scene perception encompasses any visual stimulus that contains multiple elements arranged in a
Learning Visual Scene Attributes
"... Take a look around you. How would you describe your surroundings to best give an idea of what everything looks like to someone not there? Maybe you will give a category to the scene, say, ‘bedroom’. You might try to list some of the objects around you, like ‘bed’, ‘lamp’, and ‘desk’. Or perhaps you’ ..."
Abstract
- Add to MetaCart
’ll describe it with adjectives like ‘indoors’, ‘cozy’, and ‘cluttered’. In computer vision, (or more specifically, in scene understanding), the most effective way to describe a visual scene is also a major question. Of the these three ways of describing a scene, (commonly referred to as categorization, scene
Grounded semantic composition for visual scenes
- Journal of Artificial Intelligence Research
, 2004
"... We present a visually-grounded language understanding model based on a study of how people verbally describe objects in scenes. The emphasis of the model is on the combination of individual word meanings to produce meanings for complex referring expressions. The model has been implemented, and it is ..."
Abstract
-
Cited by 105 (24 self)
- Add to MetaCart
We present a visually-grounded language understanding model based on a study of how people verbally describe objects in scenes. The emphasis of the model is on the combination of individual word meanings to produce meanings for complex referring expressions. The model has been implemented
Direct Methods for Visual Scene Reconstruction
- In IEEE Workshop on Representations of Visual Scenes
, 1995
"... There has been a lot of activity recently surrounding the reconstruction of photorealistic 3-D scenes and high-resolution images from video sequences. In this paper, we present some of our recent work in this area, which is based on the registration of multiple images (views) in a projective framewo ..."
Abstract
-
Cited by 63 (14 self)
- Add to MetaCart
planar and panoramic mosaics, and for projective depth recovery. We also speculate about the ultimate usefulness of projective approaches to visual scene reconstruction. 1 Introduction The recovery of 3-D scene information from multiple views has long been one of the central problems in computer vision
Methods for Volumetric Reconstruction of Visual Scenes
- International Journal of Computer Vision
, 2003
"... In this paper, we present methods for 3D volumetric reconstruction of visual scenes photographed by multiple calibrated cameras placed at arbitrary viewpoints. Our goal is to generate a 3D model that can be rendered to synthesize new photo-realistic views of the scene. We improve upon existing voxel ..."
Abstract
-
Cited by 44 (0 self)
- Add to MetaCart
In this paper, we present methods for 3D volumetric reconstruction of visual scenes photographed by multiple calibrated cameras placed at arbitrary viewpoints. Our goal is to generate a 3D model that can be rendered to synthesize new photo-realistic views of the scene. We improve upon existing
TAMING CROWDED VISUAL SCENES by
"... Computer vision algorithms have played a pivotal role in commercial video surveillance sys-tems for a number of years. However, a common weakness among these systems is their inability to handle crowded scenes. In this thesis, we have developed algorithms that overcome some of the challenges encount ..."
Abstract
- Add to MetaCart
Computer vision algorithms have played a pivotal role in commercial video surveillance sys-tems for a number of years. However, a common weakness among these systems is their inability to handle crowded scenes. In this thesis, we have developed algorithms that overcome some of the challenges
Describing Objects in Visual Scenes: Is Visual
, 2013
"... When describing an object in a visual scene (referring expression generation; (Krahmer & Deemter, 2012), a speaker must devise an expression which allows a listener to quickly and accurately locate the target. Speakers often do so by describing both the target itself and other objects (landmarks ..."
Abstract
- Add to MetaCart
When describing an object in a visual scene (referring expression generation; (Krahmer & Deemter, 2012), a speaker must devise an expression which allows a listener to quickly and accurately locate the target. Speakers often do so by describing both the target itself and other objects
Results 1 - 10
of
8,282