• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

3-D depth reconstruction from a single still image (2006)

Cached

  • Download as a PDF

Download Links

  • [ai.stanford.edu]
  • [www.cs.cornell.edu]
  • [ai.stanford.edu]
  • [www.robotics.stanford.edu]
  • [www.cs.cornell.edu]
  • [ai.stanford.edu]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by A. Saxena , S. H. Chung , A. Y. Ng
Citations:38 - 12 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Saxena063-ddepth,
    author = {A. Saxena and S. H. Chung and A. Y. Ng},
    title = {3-D depth reconstruction from a single still image },
    year = {2006}
}

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

We consider the task of 3-d depth estimation from a single still image. We take a supervised learning approach to this problem, in which we begin by collecting a training set of monocular images (of unstructured indoor and outdoor environments which include forests, sidewalks, trees, buildings, etc.) and their corresponding ground-truth depthmaps. Then, we apply supervised learning to predict the value of the depthmap as a function of the image. Depth estimation is a challenging problem, since local features alone are insufficient to estimate depth at a point, and one needs to consider the global context of the image. Our model uses a hierarchical, multiscale Markov Random Field (MRF) that incorporates multiscale local- and global-image features, and models the depths and the relation between depths at different points in the image. We show that, even on unstructured scenes, our algorithm is frequently able to recover fairly accurate depthmaps. We further propose a model that incorporates both monocular cues and stereo (triangulation) cues, to obtain significantly more accurate depth estimates than is possible using either monocular or stereo cues alone.

Citations

1548 BConditional random fields: Probabilistic models for segmenting and labeling sequence data - Lafferty, McCallum, et al.
869 Performance of optical flow techniques - Barron, Fleet, et al. - 1992
708 Taxonomy and evaluation of dense two-frame stereo correspondence algorithms - Scharstein, Szeliski
570 Face recognition: A literature survey - Zhao, Chellappa, et al. - 2003
533 Computer Vision: A Modern Approach - Forsyth, Ponce - 2002
427 Sparse coding with an overcomplete basis set: A strategy employed by V1? Vision Research - Olshausen, Field - 1997
258 Preattentive texture discrimination with early vision mechanisms - Malik, Perona - 1990
171 Bayesian modeling of uncertainty in low-level vision - Szeliski - 1990
161 Shape from Shading: A Survey - Zhang, Tsai, et al. - 1999
133 T (2005) Object recognition with features inspired by visual cortex - Serre, Wolf, et al.
128 Multiscale conditional random fields for image labeling - He, Zemel, et al.
121 High-accuracy stereo depth maps using structured light - Scharstein, Szeliski
120 Single view metrology - Criminisi, Reid, et al. - 2000
114 SCAPE: Shape Completion and Animation of People - Anguelov - 2005
111 Geometric context from a single image - Hoiem, Efros, et al.
106 Putting Objects in Perspective - Hoiem, Efros, et al. - 2008
105 Using the forest to see the trees: a graphical model relating features, objects and scenes - Murphy, Torralba, et al. - 1978
97 Foundations of Vision. Sinauer Associates - Wandell - 1995
90 Advances in computational stereo - Brown, Burschka, et al.
86 Discriminative fields for modeling spatial dependencies in natural images - Kumar, Hebert - 2004
83 Wavelet and Filter Banks - Strang, Nguyen - 1997
83 Multiresolution markov models for signal and image processing - Willsky - 2002
75 Automatic photo pop-up - Hoiem, Efros, et al. - 2005
70 ªComputing Local Surface Orientation and Shape from Texture for Curved Surfaces,º Int'l - Malik, Rosenholtz - 1997
66 Building the gist of a scene: the role of global image features in recognition - Oliva, Torralba - 2006
55 Learning depth from single monocular images - Saxena, Chung, et al. - 2005
49 Depth estimation from image structure - Torralba, Oliva
48 Example-based photometric stereo: Shape reconstruction with general, varying BRDFs - Hertzmann, Seitz - 2005
48 Statistics of range images - Huang, Lee, et al. - 2000
43 High speed obstacle avoidance using monocular vision and reinforcement learning - Michels, Saxena, et al. - 2005
42 Robotic grasping of novel objects - Saxena, Driemeyer, et al. - 2007
39 Constructing 3D City Models by Merging Ground-Based and Airborne Views - Frueh, Zakhor - 2003
36 Statistical cues for domain specific image segmentation with performance analysis, CVPR - Konishi, Yuille - 2000
34 Shape from texture from a multi-scale perspective - Lindeberg, Garding - 1993
31 A.: A dynamic Bayesian network model for autonomous 3D reconstruction from a single indoor image - Delage, Lee, et al. - 2006
31 Efficient Spatial-domain Implementation of Multiscale Image Representation Based on Gabor Functions - Nestares, Navarro, et al. - 1998
28 Shedding light on the weather - Narasimhan, Nayar - 2003
27 Learning3-dscenestructure from a single still image - Saxena, Ng - 2007
26 A.: Automatic non-rigid 3d modeling from video - Torresani, Hertzmann
25 Shape from Symmetry - Thrun, Wegbreit - 2005
24 Depth from familiar objects: A hierarchical model for 3D scenes - Sudderth, Torralba, et al.
22 Learning to grasp novel objects using vision - Saxena, Driemeyer, et al. - 2006
19 A SIFT descriptor with global context - Mortensen, Deng, et al.
17 Probabilistic fusion of stereo with color and contrast for bi-layer segmentation - Kolmogorov, Criminisi, et al. - 2006
15 Self-motion and the perception of stationary objects - Wexler, Panerai, et al. - 2001
14 Depth estimation using monocular and stereo cues - Saxena, Schulte, et al. - 2007
14 Perceiving distance accurately by a directional process of integrating ground information,” Nature - Wu, Ooi, et al. - 2004
13 Performance analysis of stereo, vergence, and focus as depth cues for active vision - Das, Ahuja - 1995
12 Geotensity: Combining motion and lighting for 3d surface reconstruction - Maki, Watanabe, et al.
12 Automatic single-image 3d reconstructions of indoor manhattan world scenes - Delage, Lee, et al. - 2005
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University