Results 1 - 10
of
91
A Tutorial on Visual Servo Control
- IEEE Transactions on Robotics and Automation
, 1996
"... This paper provides a tutorial introduction to visual servo control of robotic manipulators. Since the topic spans many disciplines our goal is limited to providing a basic conceptual framework. We begin by reviewing the prerequisite topics from robotics and computer vision, including a brief review ..."
Abstract
-
Cited by 513 (17 self)
- Add to MetaCart
This paper provides a tutorial introduction to visual servo control of robotic manipulators. Since the topic spans many disciplines our goal is limited to providing a basic conceptual framework. We begin by reviewing the prerequisite topics from robotics and computer vision, including a brief review of coordinate transformations, velocity representation, and a description of the geometric aspects of the image formation process. We then present a taxonomy of visual servo control systems. The two major classes of systems, position-based and image-based systems, are then discussed. Since any visual servo system must be capable of tracking image features in a sequence of images, we include an overview of feature-based and correlation-based methods for tracking. We conclude the tutorial with a number of observations on the current directions of the research field of visual servo control. 1 Introduction Today there are over 800,000 robots in the world, mostly working in factory environment...
A Robust Technique for Matching Two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry
, 1994
"... ..."
Determining the Epipolar Geometry and its Uncertainty: A Review
- International Journal of Computer Vision
, 1998
"... Two images of a single scene/object are related by the epipolar geometry, which can be described by a 3×3 singular matrix called the essential matrix if images' internal parameters are known, or the fundamental matrix otherwise. It captures all geometric information contained in two images, an ..."
Abstract
-
Cited by 260 (7 self)
- Add to MetaCart
Two images of a single scene/object are related by the epipolar geometry, which can be described by a 3×3 singular matrix called the essential matrix if images' internal parameters are known, or the fundamental matrix otherwise. It captures all geometric information contained in two images, and its determination is very important in many applications such as scene modeling and vehicle navigation. This paper gives an introduction to the epipolar geometry, and provides a complete review of the current techniques for estimating the fundamental matrix and its uncertainty. A well-founded measure is proposed to compare these techniques. Projective reconstruction is also reviewed. The software which we have developed for this review is available on the Internet.
Augmented Reality Through Wearable Computing
, 1997
"... Wearable computing moves computation from the desktop to the user. We are forming a community of networked wearable computer users to explore, over a long period, the augmented realities that these systems can provide. By adapting its behavior to the user's changing environment, a body-worn computer ..."
Abstract
-
Cited by 109 (19 self)
- Add to MetaCart
Wearable computing moves computation from the desktop to the user. We are forming a community of networked wearable computer users to explore, over a long period, the augmented realities that these systems can provide. By adapting its behavior to the user's changing environment, a body-worn computer can assist the user more intelligently, consistently, and continuously than a desktop system. A text-based augmented reality, the Remembrance Agent, is presented to illustrate this approach. Video cameras are used both to warp the visual input (mediated reality) and to sense the user's world for graphical overlay. With a camera, the computer tracks the user's finger, which acts as the system's mouse; performs face recognition; and detects passive objects to overlay 2.5D and 3D graphics onto the real world. Additional apparatus such as audio systems, infrared beacons for sensing location, and biosensors for learning about the wearer's affect are described. Using the input from these interfac...
Recognition of Human Body Motion Using Phase Space Constraints
- In ICCV
, 1995
"... A new method for representing and recognizing human bodymovements is presented. Assuming the availability of Cartesian tracking data, we develop techniques for representation of movements basedon spacecurves in subspaces of a "phase space." The phase space has axes of joint angles and torso location ..."
Abstract
-
Cited by 107 (7 self)
- Add to MetaCart
A new method for representing and recognizing human bodymovements is presented. Assuming the availability of Cartesian tracking data, we develop techniques for representation of movements basedon spacecurves in subspaces of a "phase space." The phase space has axes of joint angles and torso location and attitude, and the axes of the subspaces are subsets of the axes of the phase space. Using this representation we develop a system for learning new movements from ground truth data by searching for constraints which are in effect during the movement to be learned, and not in effect during other movements. We then use the learned representation for recognizing movements in data. Prior approaches by other researchers used a small number of classification categories, which demanded less attention to representation. We train and test the system on nine fundamental movements from classical ballet performed by two dancers. The system learns and accurately recognizes the nine movements in an un...
Fast and Globally Convergent Pose Estimation From Video Images
, 1998
"... Determining the rigid transformation relating 2D images to known 3D geometry is a classical problem in photogrammetry and computer vision. Heretofore, the best methods for solving the problem have relied on iterative optimization methods which cannot be proven to converge and/or which do not effecti ..."
Abstract
-
Cited by 76 (3 self)
- Add to MetaCart
Determining the rigid transformation relating 2D images to known 3D geometry is a classical problem in photogrammetry and computer vision. Heretofore, the best methods for solving the problem have relied on iterative optimization methods which cannot be proven to converge and/or which do not effectively account for the orthonormal structure of rotation matrices. We show that the pose estimation problem can be formulated as that of minimizing an error metric based on collinearity in object (as opposed to image) space. Using object space collinearity error, we derive an iterative algorithm which directly computes orthogonal rotation matrices and which is globally convergent. Experimentally, we show that the method is computationally efficient, that it is no less accurate than the best currently employed optimization methods, and that it outperforms all tested methods in robustness to outliers. Chien-Ping Lu, Silicon Graphics Inc. cplu@engr.sgi.com y Greg Hager, Department of Computer...
Video Orbits of the Projective Group: A Simple Approach to Featureless Estimation of Parameters
- IEEE TRANSACTIONS ON IMAGE PROCESSING
, 1997
"... We present direct featureless methods for estimating the eight parameters of an "exact" projective (homographic) coordinate transformation to register pairs of images, together with the application of seamlessly combining a plurality of images of the same scene, resulting in a single image (or new i ..."
Abstract
-
Cited by 72 (8 self)
- Add to MetaCart
We present direct featureless methods for estimating the eight parameters of an "exact" projective (homographic) coordinate transformation to register pairs of images, together with the application of seamlessly combining a plurality of images of the same scene, resulting in a single image (or new image sequence) of greater resolution or spatial extent. The approach is "exact" for two cases of static scenes: 1) images taken from the same location of an arbitrary three-dimensional (3-D) scene, with a camera that is free to pan, tilt, rotate about its optical axis, and zoom, or 2) images of a flat scene taken from arbitrary locations. The featureless projective approach generalizes interframe camera motion estimation methods that have previously used an affine model (which lacks the degrees of freedom to "exactly" characterize such phenomena as camera pan and tilt) and/or which have relied upon finding points of correspondence between the image frames. The featureless projective approach...
Capturing Articulated Human Hand Motion: A Divide-and-Conquer Approach
- In Proc. 7th Int. Conf. on Computer Vision, volume I
, 1999
"... The use of human hand as a natural interface device serves as a motivating force for research in the modeling, analyzing and capturing of the motion of articulated hand. Model-based hand motion capturing can be formulated as a large nonlinear programming problem, but this approach is plagued by loca ..."
Abstract
-
Cited by 40 (13 self)
- Add to MetaCart
The use of human hand as a natural interface device serves as a motivating force for research in the modeling, analyzing and capturing of the motion of articulated hand. Model-based hand motion capturing can be formulated as a large nonlinear programming problem, but this approach is plagued by local minima. An alternative way is to use analysis-by-synthesis by searching a huge space, but the results are rough and the computation expensive. In this paper, articulated hand motion is decoupled, a new two-step iterative model-based algorithm is proposed to capture articulated human hand motion, and a proof of convergence of this iterative algorithm is also given. In our proposed work, the decoupled global hand motion and local finger motion are parameterized by 3D hand pose and the state of the hand respectively. Hand pose determination is formulated as a least median of squares (LMS) problem rather than the non-robust least squares ...
3D Structure from 2D Motion
- IEEE Signal Processing Magazine
, 1999
"... this paper to delve into this formalism, further reading can be found in [41] [45]. In the following, we shall discuss its practical implementation and implications in the SfM techniques that have adopted it. ..."
Abstract
-
Cited by 38 (1 self)
- Add to MetaCart
this paper to delve into this formalism, further reading can be found in [41] [45]. In the following, we shall discuss its practical implementation and implications in the SfM techniques that have adopted it.
Uncalibrated Euclidean reconstruction: a review
, 1999
"... This paper provides a review on techniques for computing a three-dimensional model of a scene from a single moving camera, with unconstrained motion and unknown parameters. In the classical approach, called autocalibration or self-calibration, camera motion and parameters are recovered first, using ..."
Abstract
-
Cited by 29 (8 self)
- Add to MetaCart
This paper provides a review on techniques for computing a three-dimensional model of a scene from a single moving camera, with unconstrained motion and unknown parameters. In the classical approach, called autocalibration or self-calibration, camera motion and parameters are recovered first, using rigidity; then structure is easily computed. Recently, new methods based on the idea of stratification have been proposed. They upgrade the projective structure, achievable from correspondences only, to the Euclidean structure, by exploiting all the available constraints.

