Elsevier

Progress in Neurobiology

Volume 55, Issue 3, 27 June 1998, Pages 191-224
Progress in Neurobiology

Neural mechanisms underlying stereoscopic vision

https://doi.org/10.1016/S0301-0082(98)00012-4Get rights and content

Abstract

The progressive frontalization of both eyes in mammals causes overlap of the left and right visual fields, having as a consequence a region of binocular field with single vision and stereopsis. The horizontal separation of the eyes makes the retinal images of the objects lying in this binocular field have slight horizontal and vertical differences, termed disparities. Horizontal disparities are the main cue for stereopsis. In the past decades numerous physiological studies made on monkeys, which have in many aspects a similar visual system to humans, showed that a population of visual cells are capable of encoding the amplitude and sign of horizontal disparity. Such disparity detectors were found in cortical visual areas V1, V2, V3, V3A, VP, MT (V5) and MST of monkeys and in the superior colliculus of the cat and opossum. According to their disparity tuning function, these cells were first grouped into tuned excitatory, tuned inhibitory, near and far sub-groups. Subsequent studies added two more categories, tuned near and tuned far cells. Asymmetries between left and right receptive field position, on and off regions, and intra-receptive field wiring are believed to be the neural mechanisms of disparity detection. Because horizontal disparity alone is insufficient to compute reliable stereopsis, additional information about fixation distance and angle of gaze is required. Thus, while there is unequivocal evidence of cells capable of detecting horizontal disparities, it is not known how horizontal disparity is calibrated. Sensitivity to vertical disparity and information about the vergence angle or eye position may be the source of this additional information.

Introduction

There has been among the mammalian vertebrates a constant tendency to enlarge the binocular area of the visual field at expense of overlapping both monocular fields and reducing the total space covered by both eyes together. This was achieved by a progressive eye frontalization. In sub-mammalian vertebrates there is a total decussation of the optic nerve fibers at the chiasma, and therefore both monocular inputs remain largely segregated. On the contrary, in higher mammals a complete decussation of each nasal hemiretina makes possible an early interchange of left and right eye inputs. This decussation is an efficient arrangement to bring together corresponding retinal regions of both eyes and permit a point by point matching of the visual field. Among the mammals only the carnivores, lower primates and man have developed true binocular vision. Within this group, the amount of crossing optic fibers at the chiasma proceeds parallel with the increasingly frontalization of the eyes. A recent review on depth vision in animals has been made by Pettigrew (1991).

Normal binocular vision implies fusion, a process that consists of blending together left and right images to be perceived as a single image. This seemly effortless task involves a series of complicated neural processes that are not yet well understood. Curiously, singleness of binocular vision does not confer relevant benefits, on the contrary, it represents an additional problem to solve. Thus, with the exception of stereopsis, seeing with two eyes is not much better than seeing with one.

Depth perception and stereopsis differ in that the first is based on both monocular and binocular cues, whereas the second is based on interocular retinal image differences. Depth perception is thus a broader concept than stereopsis, stereopsis being only one aspect of depth perception. The term `stereopsis' comes from the greek `stereos', which means `solid', to refer the perception of the three dimensions of the visual scene.

Under monocular vision depth can be perceived but stereopsis is not possible. Monocular form and motion alone contain robust depth cues in normal viewing conditions. There is a variety of monocular cues that provide powerful information for perception of real depth (Helmholz, 1925; Ogle, 1962b; Reading, 1983). Among the most relevant are the relative size of the objects, interposition, aerial perspective, shading, geometrical perspective, relative velocity, motion parallax and blurring. Cells of the visual system have mechanisms such as contrast sensitivity, spatial frequency filtering, motion direction and orientation sensitivity that make effective use of these cues and make them available to achieve depth perception. Although depth perception under monocular viewing can reach remarkable accuracy, it is the contribution of stereopsis which makes depth perception so highly effective in mammals.

Stereopsis is based on the interocular differences that appear between the images projected on the left and right retinas because of the horizontal separation of the two eyes. These differences are known as retinal disparities. Retinal disparities have a broad range of amplitude, and the visual system probably processes small and large disparities in different ways. Based on this idea, in 1950 Ogle postulated two kinds of stereopsis. One was termed `quantitative' or `fine stereopsis' and was related to small ranges of retinal disparities that required steady visual fixation. The other one was termed `qualitative' or `coarse stereopsis', related to large disparities and required small scanning eye movements (Bishop and Henry, 1971). Fine stereopsis requires fusion and single vision, gives a precise estimate of depth and usually operates for disparities not exceeding 2 degrees of arc (Mitchell, 1966; Fender and Julesz, 1967). Coarse stereopsis on the contrary, is based on large disparities that produce double images (Bishop and Henry, 1971). Disparities of up to 7 degrees for crossed disparities and up to 12 degrees for uncrossed disparities are believed to produce such depth information (Westheimer and Tanzman, 1956; Blakemore, 1970).

Static positional disparity is not the only difference between left and right retinal images. Velocity and temporal disparity between both retinal images may also occur. For instance, an object moving across the visual field towards the observer outside of the vertical meridian has images with different velocities of movement in the two eyes. These differences can be described as dynamic retinal disparities. On the other hand, temporal delays between the left and right information reaching the central nervous system are able to elicit depth perception similar to that induced by static and dynamic disparities. Although first described by Mach and Dvorak (1872)and later, in 1920, by the astronomer Max Wolf (see Howard and Rogers, 1995, pp. 535–536), this phenomenon was studied in detail in 1922 by Carl Pulfrich (Pulfrich, 1922). This effect, known as `Pulfrich effect', appears when an observer views a pendulum swinging in a frontal plane and one eye is covered with a filter that causes a delay of the visual stimulus. Under this conditions the pendulum appears to move in depth in an horizontal elliptical path.

Since retinal disparities may be static or dynamic, it has been suggested that stereopsis be static or dynamic, each having a number of properties that make them different.

Static stereopsis operates within a range of horizontal disparities of less that 0.33 degrees, its latency is large, about 250 msec, improves for targets located on the fovea, improves with exposure time, increases as the spatial frequency of the stimulus increases, improves when the horopter is determined and persists in presence of chromatic equiluminance (Ogle, 1962a; Ogle and Weil, 1958; Tyler and Torres, 1972; Regan and Beverley, 1973; Julesz et al., 1980; Mayer et al., 1982; Schor et al., 1984; Tyler and Cavanagh, 1989; Tyler, 1983, Tyler, 1990).

On the other hand, dynamic stereopsis operates over a broader range of disparities than static stereopsis, ranging from 0.1 to about 10 degrees, its latency is short (about 130 msec) with locations off the fovea producing less impairment in motion than in static stereopsis. The optimal temporal frequency is for targets that move a rate up of 3–10 cycles/sec, the spatial frequencies extend to 0.1 cycles/deg, and disparity sensitivity decreases when targets have equal red–green luminances (Ogle, 1932; Ogle, 1962a; Regan and Spekreijse, 1970; Tyler and Torres, 1972; Regan and Beverley, 1973; Norcia and Tyler, 1984; Schor et al., 1984; Norcia et al., 1985; Tyler and Cavanagh, 1989).

Experiments aimed to elucidate the neurophysiological basis of stereopsis are numerous. Barlow et al. (1967)reported for the first time the finding that some cortical units in cats were optimally excited by objects lying at different distances. Since then numerous studies were made in cat (Nikara et al., 1968; Pettigrew et al., 1968; Henry et al., 1969; Joshua and Bishop, 1970; Bishop et al., 1971; von der Heydt et al., 1978; Fischer and Krüger, 1979; Ferster, 1981), monkey (Hubel and Wiesel, 1970; Poggio and Fischer, 1977; Maunsell and Van Essen, 1983; Poggio and Talbot, 1981; Poggio et al., 1985, Poggio et al., 1988; Burkhalter and Van Essen, 1986; Hubel and Livingstone, 1987, Hubel and Livingstone, 1990; Felleman and Van Essen, 1987; Roy et al., 1992; Gonzalez et al., 1993a, Gonzalez et al., 1993b) and sheep (Clarke and Whitteridge, 1974, Clarke and Whitteridge, 1976; Clarke et al., 1976; Ramachandran et al., 1977). These studies demonstrated that cells in certain structures of the visual system have responses that are dependent on the binocular disparity of the stimuli.

Macaque monkeys seem to have a visual system very similar to that of humans (Cowey and Ellis, 1967; Farrer and Graham, 1967; DeValois and Jacobs, 1968; Harwerth et al., 1995). Behavioral studies have shown a remarkable similarity in both the oculomotor and sensory aspects of binocular vision in both species, such as sensitivity to binocular disparity (Sarmiento, 1975), binocular fusion (Harwerth et al., 1995) and ability to distinguish random dot stereograms at different apparent depths in absence of monocular cues (Bough, 1970). These observations suggest that the operating characteristics of vergence and stereopsis are virtually identical in macaque monkeys and humans.

This review will focus on the neural mechanisms which are believed to be responsible for the detection of retinal disparities, which are the basis of stereopsis. In this review, the term `disparity' indicates positional static horizontal disparity. In other instances the full term will be used.

As we shall see, observations made in the last decades, mostly from single cells recordings in cortical visual areas revealed neural mechanisms that undoubtedly underlay the visual processes leading to stereopsis. Since investigations on macaque monkeys have provided much of our knowledge of these mechanisms, most of the data referred to here comes from studies carried out on these animals. Although fluctuations in ocular position have important consequences on many aspects of binocular vision and stereopsis, the relationship between ocular movements and binocular vision is beyond the scope of this review and will not be considered in detail.

Section snippets

Geometry of binocular vision

In adult humans the eyes are about 65 mm apart and therefore they see objects from slightly different points. This causes the retinal images of three-dimensional objects to differ in both vertical and horizontal dimensions. Objects located outside of the horopter produce retinal images horizontally separated. When looking straight ahead with the eyes in symmetric convergence, objects located outside the vertical and horizontal meridians produce images vertically separated. Vertical disparity

Disparity tuning function of visual cells

Sensitivity to stimulus disparity is the key factor in stereopsis. It is therefore clear that most studies on stereopsis are aimed to elucidate the neural substrate of disparity sensitivity. These studies were carried out mostly in monkeys and cats, in the first case because of the similarity with the human visual system (Fig. 5) and in the second case because of the overwhelming amount of earlier information on the visual system that we have available. These studies use different methodologies

Development of binocular disparity sensitivity

In monkeys, as in humans, many visual functions appear or improve progressively after birth. Visual tracking of small objects, visually guided reaching, and visual object discrimination appear between 1.5 and 2 months of age (Boothe et al., 1985; Goldman et al., 1974). Stereopsis emerges at about 4 weeks of age in these animals (O'Dell et al., 1991; Birch, 1993; Held, 1993; Shimojo, 1993).

Both functional and anatomical studies have been performed to analyze the postnatal development of the visual

Disparity sensitivity in the visual system

Sensitivity to retinal disparities has been observed in many visual areas. However, the earliest stage in the visual system where the cells clearly signal retinal disparity is V1 (Fig. 12). There is some evidence of binocular interaction in the lateral geniculate nucleus, although as yet no conclusive data has been reported indicating that this structure initiates disparity sensitivity in the visual pathway.

The magno and parvocellular systems

Recent physiological studies on the primate visual system suggest the existence of at least two parallel processing streams, the magnocellular and the parvocellular system. The parvocellular system further segregates in V1 in two subdivisions, the blob and the interblob systems. These streams were defined by phsychophysical, physiological and histological techniques. When studied in detail, they differ in a number of features (Livingstone and Hubel, 1988). Depth perception and movement seems to

Neural connectivity, receptive field structure and sensitivity to disparity

Since stereopsis is believed to be initiated in the visual cortex, most efforts to understand how sensitivity to retinal disparity is achieved have been made in this area. Unfortunately the fine details of the neural mechanisms subserving this process are not fully understood. It is clear that detection of interocular disparities must take place at a level of the visual pathway where both ocular inputs converge. Although some type of ocular interaction may occur in the LGN, the first structure

Ocular dominance and disparity sensitivity

Since an essential requirement to compute disparity is to have available information from both eyes, ocular dominance studies related to disparity sensitivity have been present in almost every report. Indeed, there are findings showing that manipulations that alter ocular dominance columns also disrupt stereopsis (Blakemore et al., 1978; Movshon and Van Sluyters, 1981; Boothe et al., 1985) and that during development stereopsis and ocular dominance columns appear and mature with similar

Scaling horizontal retinal disparities

The most relevant disparities for stereopsis are horizontal disparities. Random dot stereograms show that horizontal disparities are capable of eliciting vivid depth perception. As it has been shown in this review, cortical visual cells in several areas are capable of encoding the magnitude and polarity of horizontal disparities. However, even if horizontal disparity is successfully computed, stereoscopic depth must still be derived from it. This is not a simple task because horizontal

Conclusions

In the last decades experiments made on monkeys have provided unequivocal evidence that retinal disparities are encoded by the cells of the visual system. Since the first finding by Hubel and Wiesel (1970)of sensitivity to retinal disparity in cells from area V2 of monkeys, other studies reported similar results in other visual areas and subcortical structures. Poggio and Fischer (1977)made a relevant contribution to the understanding of stereopsis by characterizing the four main categories of

Acknowledgements

This work was supported by the Xunta de Galicia (Grant no. XUGA20814B96). We are grateful to Dr K. L. Grieve for his fruitful comments on this manuscript.

References (264)

  • R.S. Harwerth et al.

    Behavioral studies of local stereopsis and disparity vergence in monkeys

    Vision Res.

    (1995)
  • G.H. Henry et al.

    Inhibition and sub-liminal excitatory receptive fields of simple units in cat striate cortex

    Vision Res.

    (1969)
  • W. Adams et al.

    Pooling of vertical disparities by the human visual system

    Perception

    (1996)
  • A. Anzai et al.

    Neural mechanisms underlying binocular fusion and stereopsis: position vs. phase

    Proc. Natl. Acad. Sci. U.S.A.

    (1997)
  • C. Aschenbrenner

    Problems in getting information into and out of air photographs

    Photogramm. Engng

    (1954)
  • Aslin, R. N. (1993) Infant accommodation and convergence. In: Early Visual Development, Normal and Abnormal, pp. 30–38....
  • J. Atkinson et al.

    Stereoscopic discrimination in infants

    Perception

    (1976)
  • R.A. Andersen

    Encoding of intention and spatial location in the posterior parietal cortex

    Cereb. Cortex

    (1995)
  • J.S. Baizer

    Receptive field properties of V3 neurons in monkey

    Invest. Ophthalmol.

    (1982)
  • H.B. Barlow et al.

    The neural mechanism of binocular depth discrimination

    J. Physiol., Lond.

    (1967)
  • N. Berman et al.

    Binocular interactions in the cat's superior colliculus

    J. Physiol., Lond.

    (1975)
  • Birch, E. E. (1993) Stereopsis in infants and its developmental relationship to visual acuity. In: Early Visual...
  • P.O. Bishop et al.

    Spatial vision

    A. Rev. Psychol.

    (1971)
  • P.O. Bishop et al.

    Binocular interaction fields of single units in the cat striate cortex

    J. Physiol., Lond.

    (1971)
  • Bishop, P. O. (1973) Neurophysiology of binocular single vision and stereopsis. In: Handbook of Sensory Physiology....
  • C. Blakemore

    The range and scope of binocular depth discrimination in man

    J. Physiol., Lond.

    (1970)
  • C. Blakemore et al.

    A second neural mechanism of binocular depth discrimination

    J. Physiol., Lond.

    (1972)
  • C. Blakemore et al.

    The physiological effects of monocular deprivation and their reversal in monkey's visual cortex

    J. Physiol., Lond.

    (1978)
  • C. Blakemore et al.

    Recovery from monocular deprivation in the monkey. I. Reversal of physiological effects in the visual cortex

    Proc. R. Soc., Lond.

    (1981)
  • G.G. Blasdel et al.

    Termination of afferent axons in macaque striate cortex

    J. Neurosci.

    (1983)
  • R.G. Boothe et al.

    Postnatal development of vision in human and nonhuman primates

    A. Rev. Neurosci.

    (1985)
  • E.W. Bough

    Stereoscopic vision in the Macaque monkey: a behavioral demonstration

    Nature

    (1970)
  • J.P. Bourgeois et al.

    Changes of synaptic density in the primary visual cortex of the macaque monkey from fetal to adult stage

    J. Neurosci.

    (1993)
  • J.P. Bourgeois et al.

    Synaptogenesis in the occipital cortex of macaque monkey devoid of retinal input from early embryonic stages

    Eur. J. Neurosci.

    (1996)
  • D.C. Bradley et al.

    Integration of motion and stereopsis in middle temporal cortical area of macaques

    Nature

    (1995)
  • G.A. Brecher

    Form und Ausdehnung der Pnumschen Areale bei fovealem Sehen

    Pflügers Arch. Ges. Physiol.

    (1942)
  • F. Bremmer et al.

    Eye position effects in monkey cortex. II. Pursuit- and fixation-related activity in posterior parietal areas LIP and 7A

    J. Neurophysiol.

    (1997)
  • A. Burkhalter et al.

    The connections of the ventral posterior area (VP) in the macaque monkey

    Soc. Neurosci. Abstr.

    (1983)
  • A. Burkhalter et al.

    Processing of color, form and disparity information in visual areas VP and V2 of ventral extrastriate cortex in the macaque monkey

    J. Neurosci.

    (1986)
  • Y.M. Chino et al.

    Postnatal development for binocular disparity sensitivity in neurons of the primate visual cortex

    J. Neurosci.

    (1997)
  • P.G.H. Clarke et al.

    Spatial arrangement of cells sensitive to binocular depth in the secondary visual area of the sheep

    J. Physiol., Lond.

    (1974)
  • P.G.H. Clarke et al.

    The cortical visual areas of the sheep

    J. Physiol., Lond.

    (1976)
  • P.G.H. Clarke et al.

    Binocular visual mechanism in cortical areas I and II of the sheep

    J. Physiol., Lond.

    (1976)
  • Collewijn, H., Erkelens, C. J. and Regan, D. (1986) Absolute and relative disparity: a re-evaluation of their...
  • M.L. Cooper et al.

    Neurogenetic gradients in the superior and inferior colliculus of the rhesus monkey

    J. Comp. Neurol.

    (1981)
  • A. Cowey et al.

    Visual acuity of rhesus and squirrel monkeys

    J. Comp. Physiol. Psychol.

    (1967)
  • B.G. Cumming et al.

    Responses of primary visual cortical neurons to binocular disparity without depth perception

    Nature

    (1997)
  • B.G. Cumming et al.

    Vertical disparities and perception of three-dimensional shape

    Nature

    (1991)
  • P.M. Daniel et al.

    The representation of the visual field on the cerebral cortex in monkeys

    J. Physiol., Lond.

    (1961)
  • G.C. DeAngelis et al.

    Neuronal mechanisms underlying stereopsis: how do simple cells in the visual cortex encode binocular disparity?

    Perception

    (1995)
  • Cited by (114)

    • How the brain gets a roaring campfire: Input and output functions

      2020, The Interdisciplinary Handbook of Perceptual Control Theory: Living Control Systems IV
    • How the brain gets a roaring campfire: Input and output functions

      2020, The Interdisciplinary Handbook of Perceptual Control Theory: Living Control Systems IV
    • Dissociating neural activity associated with the subjective phenomenology of monocular stereopsis: An EEG study

      2019, Neuropsychologia
      Citation Excerpt :

      In the event of a significant interaction effect, the Bonferroni method was used to adjust the p value of post hoc pairwise comparisons. We focused our analysis a-priori based on the previous literature, which suggested that depth representation processes would be a bottom-up feedforward processing occurring within the visual cortex and/or parietal cortex (Cumming and DeAngelis, 2001; Gonzalez and Perez, 1998; Orban, 2011; Parker, 2007; Welchman, 2016; Welchman and Kourtzi, 2013). We defined activity associated with the derivation of depth or 3D form as those that could be isolated by subtracting brain responses to 2D form images from those to 3D form images (3D-2D contrast).

    • The role of somatosensory input in target localization during binocular and monocular viewing while performing a high precision reaching and placement task

      2019, Experimental Eye Research
      Citation Excerpt :

      Even when somatosensory input was available, the placement task was performed significantly slower during monocular in comparison to binocular viewing, further demonstrating the superiority of binocular depth cues. Viewing with both eyes provides the CNS with horizontal and vertical disparities, which are not available when input is restricted to one eye (Blake and Wilson, 2011; Gonzalez and Perez, 1998; Poggio, 1995). In addition, ocular vergence provides a less reliable depth cue during monocular viewing due to phoria (Ono and Gonda, 1978; Ono and Weber, 1981).

    View all citing articles on Scopus
    View full text