Results 1 -
3 of
3
SUN RGB-D: A RGBD scene understanding benchmark suite
- In CVPR
, 2015
"... Although RGB-D sensors have enabled major break-throughs for several vision tasks, such as 3D reconstruc-tion, we have not attained the same level of success in high-level scene understanding. Perhaps one of the main rea-sons is the lack of a large-scale benchmark with 3D anno-tations and 3D evaluat ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
(Show Context)
Although RGB-D sensors have enabled major break-throughs for several vision tasks, such as 3D reconstruc-tion, we have not attained the same level of success in high-level scene understanding. Perhaps one of the main rea-sons is the lack of a large-scale benchmark with 3D anno-tations and 3D evaluation metrics. In this paper, we intro-duce an RGB-D benchmark suite for the goal of advancing the state-of-the-arts in all major scene understanding tasks. Our dataset is captured by four different sensors and con-tains 10,335 RGB-D images, at a similar scale as PASCAL VOC. The whole dataset is densely annotated and includes 146,617 2D polygons and 64,595 3D bounding boxes with accurate object orientations, as well as a 3D room layout and scene category for each image. This dataset enables us to train data-hungry algorithms for scene-understanding tasks, evaluate them using meaningful 3D metrics, avoid overfitting to a small testing set, and study cross-sensor bias. 1.
Simultaneous Localization, Mapping, and Manipulation for Unsupervised Object Discovery
"... Abstract — We present an unsupervised framework for simul-taneous appearance-based object discovery, detection, tracking and reconstruction using RGBD cameras and a robot manipu-lator. The system performs dense 3D simultaneous localization and mapping concurrently with unsupervised object discovery. ..."
Abstract
- Add to MetaCart
Abstract — We present an unsupervised framework for simul-taneous appearance-based object discovery, detection, tracking and reconstruction using RGBD cameras and a robot manipu-lator. The system performs dense 3D simultaneous localization and mapping concurrently with unsupervised object discovery. Putative objects that are spatially and visually coherent are manipulated by the robot to gain additional motion-cues. The robot uses appearance alone, followed by structure and motion cues, to jointly discover, verify, learn and improve models of objects. Induced motion segmentation reinforces learned models which are represented implicitly as 2D and 3D level sets to capture both shape and appearance. We compare three different approaches for appearance-based object discovery and find that a novel form of spatio-temporal super-pixels gives the highest quality candidate object models in terms of precision and recall. Live experiments with a Baxter robot demonstrate a holistic pipeline capable of automatic discovery, verification, detection, tracking and reconstruction of unknown objects. I.
Unsupervised Discovery of Object Classes with a Mobile Robot
"... Abstract — Object detection and recognition are fundamental capabilities for a mobile robot. Objects are a powerful repre-sentation for a variety of tasks including mobile manipulation and inventory tracking. As a result, object-based world rep-resentations have seen a great deal of research interes ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract — Object detection and recognition are fundamental capabilities for a mobile robot. Objects are a powerful repre-sentation for a variety of tasks including mobile manipulation and inventory tracking. As a result, object-based world rep-resentations have seen a great deal of research interest in the last several years. However, these systems usually assume that object recognition is well-solved: they require that accurate recognition be available for every object they might encounter. Despite steady advances, object recognition remains a difficult, open problem. Existing object recognition algorithms rely on high-resolution three-dimensional object models or on extensive hand-labeled training data. The sheer variety of objects that occur in natural environments makes manually training a recognizer for every possible object infeasible. In this work, we present a robotic system for unsupervised object and class discovery, in which objects are first discovered, and then grouped into classes in an unsupervised fashion. At each step, we approach the problem as one of robotics, not disembodied computer vision. On a very large robotic dataset, we discover object classes with 98.7 % precision while achieving 71.8% recall. The scale and quality of these results demonstrate the merit of our approach, and prove the practicality of long-term large-scale object discovery. To our knowledge, no other authors have investigated robotic object discovery at this scale, making direct quantitative comparison impossible. We make our implementation and ground-truth labelings available, and evaluate our technique on a very large dataset. As a result, this work is a baseline against which future work can be compared. I.