Results 1 - 10
of
13
Slant from texture and disparity cues: Optimal cue combination
- Journal of Vision
"... How does the visual system combine information from different depth cues to estimate three-dimensional scene parameters? We tested a maximum-likelihood estimation (MLE) model of cue combination for perspective (texture) and binocular disparity cues to surface slant. By factoring the reliability of e ..."
Abstract
-
Cited by 14 (2 self)
- Add to MetaCart
How does the visual system combine information from different depth cues to estimate three-dimensional scene parameters? We tested a maximum-likelihood estimation (MLE) model of cue combination for perspective (texture) and binocular disparity cues to surface slant. By factoring the reliability of each cue into the combination process, MLE provides more reliable estimates of slant than would be available from either cue alone. We measured the reliability of each cue in isolation across a range of slants and distances using a slant-discrimination task. The reliability of the texture cue increases as |slant | increases and does not change with distance. The reliability of the disparity cue decreases as distance increases and varies with slant in a way that also depends on viewing distance. The trends in the single-cue data can be understood in terms of the information available in the retinal images and issues related to solving the binocular correspondence problem. To test the MLE model, we measured perceived slant of two-cue stimuli when disparity and texture were in conflict and the reliability of slant estimation when both cues were available. Results from the two-cue study indicate, consistent with the MLE model, that observers weight each cue according to its relative reliability: Disparity weight decreased as distance and |slant | increased. We also observed the expected improvement in slant estimation when both cues were available. With few discrepancies, our data indicate that observers combine cues in a statistically optimal fashion and thereby reduce the variance of slant estimates below that which could be achieved from either cue alone. These results are consistent with other studies that quantitatively examined the MLE model of cue combination.
Cyclopean geometry of binocular vision
"... The geometry of binocular projection is analyzed in relation to the primate visual system. An oculomotor parameterization that includes the classical vergence and version angles is defined. It is shown that the epipolar geometry of the system is constrained by binocular coordination of the eyes. A l ..."
Abstract
-
Cited by 10 (8 self)
- Add to MetaCart
The geometry of binocular projection is analyzed in relation to the primate visual system. An oculomotor parameterization that includes the classical vergence and version angles is defined. It is shown that the epipolar geometry of the system is constrained by binocular coordination of the eyes. A local model of the scene is adopted in which depth is measured relative to a plane containing the fixation point. These constructions lead to an explicit parameterization of the binocular disparity field involving the gaze angles as well as the scene structure. The representation of visual direction and depth is discussed with reference to the relevant psychophysical and neurophysiological literature. © 2008 Optical Society of America OCIS codes: 330.1400, 330.2210. 1.
Viewing Geometry Determines How Vision and Haptics Combine in Size Perception
- Curr Biol
, 2003
"... this article online for further analysis.) study were more likely to use commonplace rather than Figure 4C shows the predicted and observed JNDs ad hoc strategies. The fact that nearly optimal cue inte- for small or zero conflicts for each observer and each gration was observed in all three studies ..."
Abstract
-
Cited by 8 (2 self)
- Add to MetaCart
this article online for further analysis.) study were more likely to use commonplace rather than Figure 4C shows the predicted and observed JNDs ad hoc strategies. The fact that nearly optimal cue inte- for small or zero conflicts for each observer and each gration was observed in all three studies suggests that stimulus orientation. The good agreement between prethe phenomenon is pervasive. dicted and observed shows that individual differences The observed and predicted PSEs in our experiment in intermodal discrimination can be largely explained by behavior in the within-modality experiments. were very similar (Figure 3D), but the observed and pre- 487 were otherwise transparent. Because element size and density were dicted JNDs differed consistently (Figures 4A and 4B). randomized, they were not a reliable cue to intersurface distance
Perception can influence the vergence responses associated with open-loop gaze shifts in 3D
, 2003
"... ..."
Surgical navigation by autostereoscopic image overlay of integral videography
- IEEE Trans. Inform. Technol. Biomed
, 2004
"... Abstract—This paper describes an autostereoscopic image overlay technique that is integrated into a surgical navigation system to superimpose a real three-dimensional (3-D) image onto the patient via a half-silvered mirror. The images are created by employing a modified version of integral videograp ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Abstract—This paper describes an autostereoscopic image overlay technique that is integrated into a surgical navigation system to superimpose a real three-dimensional (3-D) image onto the patient via a half-silvered mirror. The images are created by employing a modified version of integral videography (IV), which is an animated extension of integral photography. IV records and reproduces 3-D images using a microconvex lens array and flat display; it can display geometrically accurate 3-D autostereoscopic images and reproduce motion parallax without the need for special devices. The use of semitransparent display devices makes it appear that the 3-D image is inside the patient’s body. This is the first report of applying an autostereoscopic display with an image overlay system in surgical navigation. Experiments demonstrated that the fast IV rendering technique and patient-image registration method produce an average registration accuracy of 1.13 mm. Experiments using a target in phantom agar showed that the system can guide a needle toward a target with an average error of 2.6 mm. Improvement in the quality of the IV display will make this system practical and its use will increase surgical accuracy and reduce invasiveness. Index Terms—Image overlay, integral photography (IP), integral videography (IV), registration, surgical navigation, threedimensional (3-D) image. I.
Perceptual Metamers in Stereoscopic Vision
, 2001
"... Theories of cue combination suggest the possibility of constructing visual stimuli that evoke different patterns of neural activity in sensory areas of the brain, but that cannot be distinguished by any behavioral measure of perception. Such stimuli, if they exist, would be interesting for two reaso ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Theories of cue combination suggest the possibility of constructing visual stimuli that evoke different patterns of neural activity in sensory areas of the brain, but that cannot be distinguished by any behavioral measure of perception. Such stimuli, if they exist, would be interesting for two reasons. First, one could know that none of the differences between the stimuli survive past the computations used to build the percepts. Second, it can be difficult to distinguish stimulus-driven components of measured neural activity from top-down components (such as those due to the interestingness of the stimuli). Changing the stimulus without changing the percept could be exploited to measure the stimulusdriven activity. Here we describe stimuli in which vertical and horizontal disparities trade during the construction of percepts of slanted surfaces, yielding stimulus equivalence classes. Equivalence class membership changed after a change of vergence eye posture alone, without changes to the retinal images. A formal correspondence can be drawn between these "perceptual metamers" and more familiar "sensory metamers" such as color metamers.
Noise causes slant underestimation in stereo and motion
- Vision Research
"... This paper discusses a problem, which is inherent in the estimation of 3D shape (surface normals) from multiple views. Noise in the image signal causes bias, which may result in substantial errors in the parameter estimation. The bias predicts the underestimation of slant found in psychophysical and ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
This paper discusses a problem, which is inherent in the estimation of 3D shape (surface normals) from multiple views. Noise in the image signal causes bias, which may result in substantial errors in the parameter estimation. The bias predicts the underestimation of slant found in psychophysical and computational experiments. Specifically, we analyze the estimation of 3D shape from motion and stereo using orientation disparity. For the case of stereo, we show that bias predicts the anisotropy in the perception of horizontal and vertical slant. For the case of 3D motion we demonstrate the bias by means of a new illusory display. Finally, we discuss statistically optimal strategies for the problem and suggest possible avenues for visual systems to deal with the bias.
How Vertical Disparities Assist Judgements of Distance
, 2001
"... The ratio of the vertical sizes of corresponding features in the two eyes' retinal images depends both on the associated object's distance and on its horizontal direction relative to the head (eccentricity). It is known that manipulations of vertical size ratio can affect perceived distance, size, d ..."
Abstract
- Add to MetaCart
The ratio of the vertical sizes of corresponding features in the two eyes' retinal images depends both on the associated object's distance and on its horizontal direction relative to the head (eccentricity). It is known that manipulations of vertical size ratio can affect perceived distance, size, depth and shape. We examined how observers use the vertical size ratio to determine the viewing distance. Do they use the horizontal gradient of vertical size ratio, or do they combine the vertical size ratio itself with the eccentricity at which it is found? Distance scaling (as measured by having subjects set an ellipsoid's size and shape to match a tennis ball) was no better when the judged object was 30 to the right of the head (where vertical size ratios vary considerably with distance) than when it was located straight ahead. Distance scaling improved when vertical disparities were presented within larger visual fields, irrespective of where this was relative to the head. Our results support the proposal that subjects use the horizontal gradient of vertical size ratio to estimate the distance of an object that they are looking at.
www.elsevier.com/locate/visres 3D after-effects are due to shape and not disparity adaptation
, 2001
"... There are a variety of stereoscopic after-effects in which exposure to a stimulus with a particular slant or curvature affects the perceived slant or curvature of a subsequently presented stimulus. These after-effects have been explained as a consequence of fatigue (a decrease in responsiveness) amo ..."
Abstract
- Add to MetaCart
There are a variety of stereoscopic after-effects in which exposure to a stimulus with a particular slant or curvature affects the perceived slant or curvature of a subsequently presented stimulus. These after-effects have been explained as a consequence of fatigue (a decrease in responsiveness) among neural mechanisms that are tuned to particular disparities or patterns of disparity. In fact, a given disparity pattern is consistent with numerous slants or curvatures; to determine slant or curvature, the visual system must take the viewing distance into account. We took advantage of this property to examine whether the mechanisms underlying the stereoscopic curvature after-effect are tuned to particular disparity patterns or to some other property such as surface curvature. The results clearly support the second hypothesis. Thus, 3D after-effects appear to be caused by adaptation among mechanisms specifying surface shape rather than among mechanisms signaling the disparity pattern. © 2001 Elsevier
Eye movements facilitate stereo-slant discrimination when horizontal disparity is noisy
"... Conditions in which saccadic gaze shifts within planar surfaces facilitate stereo-slant discrimination for slant about the horizontal and vertical axis were investigated. When horizontal disparity noise was added, large gaze shifts in the direction of the slant lowered stereo-slant discrimination th ..."
Abstract
- Add to MetaCart
Conditions in which saccadic gaze shifts within planar surfaces facilitate stereo-slant discrimination for slant about the horizontal and vertical axis were investigated. When horizontal disparity noise was added, large gaze shifts in the direction of the slant lowered stereo-slant discrimination thresholds compared to thresholds measured with steady central fixation, whereas eye movements orthogonal to the slant orientation did not lower slant-discrimination thresholds. When no horizontal noise was added, performance was the same with and without gaze shifts. These results suggest that slant is recovered from depth differences between target edges when horizontal disparity signals are variable and that foveal fixation improves the measures of disparity. Eye movements did not lower slant thresholds by providing multiple foveal samples of slant at different target locations that were averaged to reduce disparity noise levels, because eye movements only lowered the thresholds when there was a depth difference between the fixation points. To study which signals for azimuth are used when slant is recovered from the difference in depth between target edges, vertical disparity noise was added and stimulus height was reduced. Both methods elevated slant-discrimination thresholds when horizontal disparity noise was present, suggesting that vertical disparity is used as a cue for azimuth.

