Abstract
Interocular differences in orientation occur during binocular viewing of a surface slanted in depth. These orientation disparities could be exploited by the visual system to provide information about surface slant, but gradients of positional disparity provide an equally effective means to the same end. We examined the encoding of orientation disparities in V1 neurons that were recorded from two awake fixating monkeys. Monocular orientation selectivity was measured separately in each eye. Although the preferred monocular orientation in the left and right eyes was highly correlated (r = 0.98), 19 of 61 cells showed a significant interocular difference in preferred orientation (IDPO). By itself, an IDPO does not imply a specific binocular selectivity for orientation differences. We therefore examined the response to 25 binocular combinations of orientations by pairing each of five orientations in one eye with five in the other. Forty-four of 64 neurons showed responses that reflected the monocular orientation tuning selectivity; the preferred orientation disparity changed when the monocular orientation was changed in either eye. The remaining third (20 of 64) responded to a consistent orientation disparity in a way that was not simply predictable from monocular orientation selectivity. However, nearly all of these neurons were selective for positional disparity, and several characteristics of the responses suggest that the apparent selectivity for orientation disparities was just a consequence of the positional disparity sensitivity. Neither the data presented here nor previous data from the cat (Blakemore et al., 1972; Nelson et al., 1977) support the idea that a population of neurons early in the visual system has a separate encoding scheme for orientation disparities.
When a frontoparallel plane is viewed with both eyes, lines on the plane project to images with the same orientation in both eyes. However, when the plane is tilted, the orientations of the elements are no longer matched in the two eyes; there is an interocular orientation difference (orientation disparity).Wheatstone (1838) found that, when observers were presented with lines of different orientation in the two eyes, the three-dimensional percept was a line slanted in depth. (Slant is used here to mean rotation out of the frontoparallel plane about a horizontal axis.) This observation does not provide any insight into the underlying mechanism for perceiving slant that might exploit the orientation disparity or the gradient of positional disparity along the extent of the lines in each eye.
Several psychophysical studies have attempted to determine whether human observers use these orientation disparities rather than positional disparities to detect surface slant (von der Heydt, 1978;Ninio, 1985; Mitchison and McKee, 1990; Cagenello and Rogers, 1993). Three methods have been used to try to distinguish between positional and orientation disparity mechanisms: (1) putting the two types of disparity into conflict (Ninio, 1985), (2) exploiting the fact that line elements of different orientations contain different amounts of orientation disparity (Mitchison and McKee, 1990; Cagenello and Rogers, 1993), and (3) attempting to introduce orientation disparities in the absence of consistent positional disparities (von der Heydt et al., 1980). No study has been able to manipulate orientation and position disparities independently (the geometrical relationship between them makes this impossible). For this reason, these psychophysical studies have not led to any clear consensus concerning the role of orientation disparities in the perception of slant.
Approaching the problem from a neurophysiological perspective allows a direct investigation of the underlying neural mechanism. Two distinct populations of neurons early in the visual system, one selective for positional differences and one selective for orientation disparities, would suggest that there are two different mechanisms for detecting slant. Blakemore et al. (1972) demonstrated that some binocular neurons in area 17 of the cat possess different preferred orientations in the two eyes. We refer to this neuronal property as an interocular difference in preferred orientation (IDPO), to distinguish it from orientation disparities, which are a stimulus attribute. Blakemore et al. (1972) suggested that neurons with IDPOs were selective for orientation disparities, and hence formed a “second neural mechanism for depth perception.” Although Hubel and Wiesel (1973) disputed the experimental observations, they have subsequently been replicated both in area 17 (Nelson et al., 1977) and area 21a (Wieniawa-Narkiewicz et al., 1992) of the cat.
All of the above studies were performed with anesthetized animals, in which it is possible that some of the results merely reflected torsional rotation of the eyes under anesthetic. We therefore investigated orientation tuning of binocular neurons in the awake, fixating monkey.
Even if IDPOs do occur, they are insufficient to encode orientation disparities. A binocular interaction that depends on the orientation disparity is also required. Consider a binocular neuron that linearly sums activity from the input of the left and right eyes. Such a sum contains no information at all about the relationship between left and right images, so it is of no more value for depth perception than purely monocular signals. As Wieniawa-Narkiewicz et al. (1992) pointed out, all of the existing data on IDPOs are compatible with such an additive interaction; it is possible that binocular responses reflect the monocular orientation preferences without encoding any differences.
Figure 1 illustrates this distinction. It shows idealized responses to many combinations of stimulus orientation in the two eyes. The left panel is a simple summation of left and right responses, with no further binocular interaction. This produces a left–right separable response pattern (it can be separated into a function describing the responses of the left eye and the right eye. Adding these functions then describes the binocular responses.) The right panel shows an interaction that is specific to binocular orientation differences; the preferred orientation for one eye depends on the stimulus orientation in the other eye, and so it is left–right inseparable. Previous studies of binocular responses (Blakemore et al., 1972; Nelson et al., 1977;Wieniawa-Narkiewicz et al., 1992) have only manipulated stimulus orientation for one eye, keeping the orientation in the other eye constant. Such data are single cross sections through the surfaces shown in Figure 1 and so cannot distinguish the two response patterns.
Other properties of neuronal responses have been considered in an attempt to determine the usefulness of these signals for slant perception. Nelson et al. (1977) pointed out that the activity of most neurons was more strongly modulated by positional disparities than by orientation differences and that the bandwidth for tuning to orientation disparities was no narrower than that for monocular orientation. Although these observations suggest that IDPOs may be of limited value in encoding orientation disparities, they are compatible with interactions that are either left–right separable or left–right inseparable. Thus, the possible contribution of IDPOs to the physiological encoding of depth remains unclear. We therefore investigated the extent to which neurons in V1 of awake fixating monkeys encode orientation disparities. First, orientation tuning was examined in each eye monocularly. Then, binocular responses to a range of combinations of left and right orientations were measured to determine whether or not the binocular interaction between them was left–right separable.
MATERIALS AND METHODS
Animal training. Neurons were recorded from three hemispheres of two adult monkeys (Macaca mulatta), one male (Hg) and one female (Rb). All of the procedures that were performed on the animals complied with the United Kingdom Home Office regulations on animal experimentation. The animals were implanted with a head fixation post and scleral eye coils under general anesthetic (Judge et al., 1980; Cumming and Parker, 1999) and trained to fixate a binocular target for fluid reward.
Stimulus presentation. Stimuli were generated on a Silicon Graphics Indigo workstation and displayed on two cathode ray tube monitors. Both monitors were gamma corrected to produce a linear relationship between luminance and the computer gray level settings. The mean luminance was 42 cd.m−2, contrast was 99%, and the frame rate was 72 Hz. The monitors were viewed by the animal through two small mirrors (diameter, 18 mm) ∼2 cm away from the eyes in a Wheatstone stereoscope configuration. The viewing distance to the monitors was 85 cm, and at this distance a pixel on the screen corresponded to 1.0 arc min of visual angle. The red signal was used to present stimuli to the left monitor, and the blue signal to the right monitor, which ensured accurate synchronization of the binocular images. The stimuli on both monitors were rendered in black–white monochrome (there was no actual color difference between the images presented to the two eyes).
The initial step in plotting receptive fields for the neurons was to find the preferred orientation using a moving bar. Minimum response fields were plotted using a flashing, high-contrast bar at this preferred orientation. The stimuli were centered on, but larger than, the minimum response field. Thus, our stimuli will have covered the receptive field even if our measures of minimum response field underestimated this.
The main stimulus used in these experiments was a circular patch of sine-wave grating that was drifted across the receptive field. When measuring disparity tuning, we also used random dot stereograms (RDS). Both types of stimulus were presented against a mid-gray background. The RDS consisted of equal numbers of black and white dots. In all experiments, the stimulus was presented in the receptive field for 2 sec, and spikes were counted over the entire duration. SeeCumming and Parker (1999) for more details.
Unit recording. Recording was performed using tungsten in glass microelectrodes (Merrill and Ainsworth, 1972). The electrode was advanced through the dura and left to rest for a few minutes. Neurons were then found by first withdrawing the electrode until gray matter was no longer heard and then advancing through the gray matter of primary visual cortex. We examined the relationship between receptive field location and the location of our electrode penetrations over a 10 × 10 mm2 area. There was a clear systematic map that agreed with the known topography of macaque primary visual cortex.
Signals from the electrode were amplified (Bak Electronics, Mount Airy, MD) and filtered (200 Hz to 5 kHz) before being digitized (32 kHz) and stored to disk. A Datawave Discovery System was used to store the spike waveforms and eye position signals, which were then reanalyzed off-line using software designed in our laboratory.
After plotting receptive fields, we measured orientation tuning with a binocular grating at zero disparity. A circular patch of grating extending from 0.5 to 1° beyond the minimum response field was used. [Full field gratings were avoided to reduce any stimulus-driven torsional eye movements (Howard et al., 1994).] The preferred spatial frequency was measured at the preferred orientation, and this spatial frequency was used for measuring monocular orientation tuning curves. The same set of seven orientations was presented to each eye, centered about the binocular preferred orientation (in six cases, only five orientations were used). Stimuli presented to the left and right eyes were interleaved, and each stimulus was presented a minimum of four times.
Once the monocular orientation preference had been established, binocular interactions were measured. A two-dimensional tuning curve was constructed in which left and right orientations were altered systematically and independently. Five orientations for each eye were chosen to span the range of orientations to which the neuron responded. Each of the 25 combinations was presented at least four times, and all conditions were interleaved.
Finally, selectivity for positional disparity was tested with one (or both) of two different techniques. First, a random dot stereogram was used in which disparity was added by shifting the horizontal position of the dots in opposite directions in the two eyes. A minimum of five different disparities, centered about zero, was used, and more values were used if necessary to characterize the tuning curve more thoroughly. Disparity tuning was also measured with sine-wave gratings, because this was the stimulus used in all of the other experiments. The position of the circular grating patch in both eyes was maintained, whereas the interocular phase difference was manipulated. Neurons were classified as disparity-selective if they were significant at the 5% level on a one-way ANOVA of firing rate with respect to disparity. In addition to this test for significance, a disparity discrimination index (DDI) was calculated using the following equation from Prince et al. (2001): Equation 1where Rmax is the largest response rate on the curve, Rmin is the minimum response, and RMSerror is square root of the residual variance across the whole tuning curve. The square root of the firing rate was used as the response measure in all of these tests (see next section).
Analysis. Regression analysis was used to fit equations to the data. However, spike count variance increases with the mean count, violating an important condition for regression analysis. Therefore, all fitting was performed on the square root of the firing rates, which largely removes this dependence (Cumming and Parker, 2000; Prince et al., 2001). (Because the interval over which spikes were counted was fixed, this has the same effect as taking the square root of the counts.) Only fits that accounted for at least 75% of the variance in the square root firing rate were used for subsequent analysis. Tuning curves for both orientation and disparity were considered to have “significant tuning” if the neuronal firing rate was consistently modulated by the changing stimulus parameter. Such modulation was quantified by performing a one-way ANOVA of firing rate with respect to the changing parameter.
Monocular orientation tuning curves were tested for significant tuning with two different tests: first, using a one-way ANOVA; and second, using a test to ensure that a significant component of the modulated data is smooth. The latter test exploited a sequential Ftest (described below) to test that the Gaussian curve was a better fit to the data than a horizontal straight line through the mean spike rate (averaged across all trials). Neurons were only included if they were significant at the 5% level on both the ANOVA and the sequentialF test. The analysis of monocular orientation differences required that the tuning in both eyes meet this criterion. Binocular interactions were examined in neurons that met the criterion for at least one eye.
The monocular tuning data were fit using a nonlinear least square algorithm (numerical algorithms groups) in two different ways. First, two independent Gaussians were fit, one for the left eye and one for the right eye: Equation 2 Equation 3where Al andAr are the amplitudes for the left and right eye, Bl andBr are the baseline firing rates, ωl and ωr are the preferred orientations in the left and right eyes, and ς and ς are the variances for the left and right tuning functions.
Second, the two sets of data were fit such that they were constrained to have the same preferred orientation, ω (i.e., ωl and ωr are both replaced with ω). The fitted curves in this condition had one less parameter overall. To test whether there was a significant difference in the preferred orientation in the two eyes, a sequential Ftest was performed to see whether allowing the two preferred orientations to be independent led to a significantly better fit.
Gaussian curves have been shown to be an adequate description of orientation tuning curves in primary visual cortex of both the cat and the monkey (Henry et al., 1978; Parker and Hawken, 1988). The majority of orientation tuning curves in this study could be fit with a Gaussian function, although a minority showed deviations from this consistent with those previously described by DeValois et al. (1982).
Fitting orientation disparity curves measured binocularly.If the monocular orientation tuning curves are well described by Gaussians and the responses to binocular combinations are left–right separable, these binocular responses can be described by the sum and product of two Gaussians: Equation 4where Alr determines the amplitude of the multiplicative component and D is the baseline activity.
A simple modification to this equation allows it to fit left–right inseparable responses, where this represents a consistent response to orientation disparities. This is done by adding a rotation term so that the fitted response surface can be elongated along a diagonal. In achieving this, Equation 4 becomes: Equation 5where Equation 6 Equation 7
Au, Av,Auv, and D are constants, ς and ς are the variances in the u and v directions, and φ is the angle through which the axes have been rotated about the point (θL, θR). Because this involves adding one parameter to Equation 4, a sequentialF test (Draper and Smith, 1998) can be used to give a statistical measure of the need for the rotation term.
Fitting binocular orientation disparity data with the energy equation. In some cases, we found that neurons responded to more than one orientation disparity, and Equation 5 produced a poor description of the data. The form of the responses looked very similar to those responses described previously in studies of the interaction between left and right stimulus positions (Ohzawa et al., 1997). This is not surprising if the underlying mechanism is like the energy model; the interaction depends only on the results of convolving the stimulus with the receptive field in each eye. If changes in orientation produced systematic changes in the value of this convolution in a way that is similar to the changes elicited by position changes, then one would expect the pattern of binocular interaction to be similar. For this reason, we chose to describe these responses with an equation of the same form as that used by Ohzawa et al. (1997): Equation 8 where Au,Av,Auv, k, and Dare constants, ωl and ωr are the preferred orientations in the left and right eyes, f is the spatial frequency of the cosine term, and φ is the phase of the cosine term. This cosine term is the main difference between this surface and the one described in Equation5, and it allows for multiple peaks or troughs in the surface, rather than the single peak afforded by the Gaussian. Both types of fitting to binocular orientation differences were fit using a Levenberg-Marquardt algorithm run in Matlab (MathWorks Inc., Natick, MA).
Use of sequential F tests. SequentialF tests are used to test whether adding one or more parameters to a linear regression significantly improves the fit of a curve to the data (Draper and Smith, 1998). The residuals from the two fits to be compared are used to calculate the F ratio, which is of the following form: Equation 9where r1 is the residual from the fit with a greater number of parameters (p1),r2 is the residual from the fit with fewer parameters (p2), andn is the total number of trials. For each situation in which these were used, a Monte Carlo simulation was performed to ensure that the frequency with which the null hypothesis was rejected was appropriate for the significance criterion.
RESULTS
A total of 141 V1 neurons were recorded in the two animals. One hundred seven of these were tested for orientation tuning in each eye. The remainder showed weak orientation selectivity to the binocular stimulus or were lost before the monocular data were collected. One hundred three of 107 neurons showed significant orientation selectivity in at least one eye. Seventy-seven of 103 showed significant tuning in both eyes and 26 of 103 were tuned only in one eye.
To compare the preferred orientation of each eye, it was necessary to restrict the analysis to neurons that were tuned in both eyes. Of these 77 neurons, 61 had tuning curves that were adequately described by Gaussian curves in both eyes. Of those neurons that could not be described by a Gaussian curve, the majority had multiple, smaller peaks away from the preferred orientation, as described previously byDeValois et al. (1982). Figure 2 shows monocular tuning curves for four neurons with the fitted Gaussian functions. The error bars show SEMs. A and B show IDPOs of 1.3 and 1.6°, respectively (neither significantly different from 0). C and D show significant IDPOs of 7.9 and 5.6°, respectively. Figure 3summarizes the comparison of left and right preferred monocular orientations for all 61 neurons. Although the preferred orientation in the two eyes is highly correlated (r = 0.985), there is a scatter of IDPOs (SD = 9.22). Nineteen of 61 of these cells have significantly different monocular orientations (sequential Ftest; p < 0.05).
Three neurons showed an unusual pattern of results in which the orientations that were excitatory in one eye were suppressive when presented to the other eye. An example is shown in Figure4A, along with its disparity tuning curve (B). Note that this shows suppression for near-zero disparities. This was true for the disparity tuning curves of all three neurons. This combination of response patterns suggests that the result of stimulation in one eye is inhibitory, whereas stimulation of the other eye is excitatory, but that the underlying orientation of the receptive field is in fact similar in the two eyes. For this reason, the fitted Gaussian was allowed to be inverted, and the cell shown in Figure 4 was deemed to have a small orientation difference.
Bandwidths were calculated by taking half width at half height of the fitted Gaussian curves. The average bandwidth for left and right eyes is 21.64 ± 13.09° and 19.92 ± 10.84°, respectively. As shown in Figure 5, bandwidths in the left and right eyes are significantly correlated (r = 0.33;p < 0.05), although not as highly correlated as left and right preferred orientation. The three neurons that were fit with an inverted Gaussian in one eye were excluded from this analysis, because the meaning of bandwidth in this case is not clear.
The determination of complex or simple receptive field types in the awake monkey is complicated by small eye movements. There is no consensus as to how best to make this distinction, and no method has yet been subject to the close scrutiny that has been applied to work using anesthetized animals (Skottun et al., 1991). We used the method of Cumming et al. (1999); responses to drifting gratings were analyzed, but stimulus cycles during which a saccade was made were discarded. Cells in which the F1:F0 harmonic ratio that was estimated using this method was greater than one were classified as simple, followingSkottun et al. (1991). However, cells with an F1:F0 harmonic ratio less than one could be either complex cells or simple cells in which the response has been confounded by a residual eye movement. Using this method, the mean bandwidth for the 15 simple cells was 17.94 ± 6.98° for the left eye and 19.04 ± 7.18° for the right eye. The corresponding values for the 43 complex cells were 21.33 ± 9.17° and 18.20 ± 8.13°.
It is worth noting that there was no significant correlation between the measured magnitude of the IDPO and the orientation bandwidth (r = 0.14; p > 0.05). This suggests that the IDPO measures do not simply reflect measurement error (which would produce larger differences when the bandwidth is larger).
Binocular responses to orientation disparities
The results in the preceding section confirm that interocular differences in receptive field (RF) orientation are present in monkey V1, as they are in the cat. This does not demonstrate that these IDPOs yield a useful binocular signal about orientation disparities. We examined this question with binocular stimuli containing orientation disparities. Binocular responses were examined in 81 of the 103 cells that showed significant monocular orientation tuning. The other 22 cells were lost after the monocular orientation tuning curves were measured in each eye, but before the binocular experiment could be completed. The analysis of binocular orientation disparity encoding depends on fitting the data with one of Equations 4, 5, or 8 (see Materials and Methods). For 17 of 81 cells, all of these fits were poor (<75% of variance explained by fit), so the analysis could not be applied to these cells. Many neurons for which the fits to binocular data were poor also showed irregular tuning to monocular stimuli; in 11 of the 17 cases, at least one of the monocular tuning curves was poorly fit by a Gaussian (<75% of variance explained by the fit). After exclusion of these 17 cells, there were 64 cells with adequate data that were well described by our equations.
The most common pattern of results is illustrated in Figure6, A and C. In these surface plots, the neuronal firing rate is represented by brightness; low firing rates are shown as dark patches, and the highest firing rate is shown as white. The responses are left–right separable. Although the response rate depends on the stimulus orientation in both eyes, the orientation that elicits the maximum response in one eye is independent of the stimulus orientation in the other eye. All of the neurons showing this pattern of results can be fit by a sum and product of Gaussians (Eq. 4). Adding the extra rotation term of Equation 5 did not produce a significant improvement (p > 0.05) in these neurons. Figure 6D shows an example in which there is an inhibitory binocular interaction, but the response is still left–right separable, so this is not specific to the orientation difference. Such a pattern was found in four cells. The three cells for which disparity tuning curves were available all showed a “tuned inhibitory” pattern of responses (Poggio and Fischer, 1977) to positional disparity (like that illustrated in Fig.4B). Both of these patterns can be well fit by Equation 4. The fit to Figure 6A is shown inB.
This common type of response that is left–right separable does not represent a consistent response to orientation disparities. This is highlighted in Figure 7B, which shows two cross sections through the two-dimensional surface plot in A and the fit to this surface using Equation 4. It is obvious that the maximum response depends on the monocular orientation tuning, not the orientation disparity. The maximum response occurs when the right eye receives 80° regardless of whether the left eye stimulus is at 90°, where it represents an orientation disparity of −10°, or 30°, an orientation disparity of +50°.
In addition to the cells described above, there is a further population that does appear to respond consistently to the same orientation disparity, regardless of the absolute orientation of the left and right stimuli; these responses were left–right inseparable. An example of such a cell is shown in Figure8A. In this example, the maximum response occurs when both eyes receive the same orientation; it appears to be selective for an orientation disparity of zero. This pattern of response is left–right inseparable, and the fit to these data (Fig. 8B) is achieved by adding a rotation term to Equation 4. The rotated Gaussian yields a significantly better fit (F test, p < 0.05) than the left–right separable Gaussian. The neuron response in Figure8C was also better fit (C) with a rotated Gaussian, and the preferred IDPO is ∼30°. A total of 20 neurons showed left–right inseparable response profiles, which fell into two groups. One group (10 neurons) were fit with the inseparable Gaussian model (Eq. 5, the rotated form of the separable response profile). These tended to respond maximally for orientation disparities near zero, as illustrated in Figure 8A. The second group of 10 cells tended to show a consistent response minimum for orientation disparities of 0°, with two peaks either side, like the example in Figure 8E. Such response profiles cannot be well described by the rotated Gaussian of Equation 5 (this model accounted for <75% of the response variance), and so Equation 8 was used, which provided a much better fit. This fit accounted for a substantially greater fraction of the response variance than the inseparable model or the separable model described by Equation 5.
To summarize, of the 64 neurons that could be well fit with one of the two surfaces, 44 showed the left–right separable response shown in Figure 6, and 20 were left–right inseparable. These left–right inseparable responses all indicate a tendency to respond consistently to a certain orientation disparity, regardless of the orientation of the stimulus in either eye alone. This suggests a specialization for signaling orientation disparities. However, several observations indicate that there may be an alternative explanation. If neurons are selective for orientation disparity, one would expect that the preferred orientation disparity measured with binocular stimuli would be similar to the difference in RF orientation determined from monocular measures. Figure 9 shows that this is not the case. The monocular measure was calculated from the difference in the peaks of the Gaussians fitted to monocular tuning curves. The binocular measure is simply the orientation disparity of the peak in the fitted response profile. These two measures were not significantly correlated (r = 0.26; p> 0.05; n = 45).
The effect of tuning for positional disparity
One factor that could produce discrepancies between monocular and binocular measures is disparity tuning. This may influence the shape of the binocular interaction, without affecting preferred orientation measured monocularly. Figure 10therefore plots the magnitude of the discrepancy against the extent of disparity tuning. It is clear that the largest discrepancies occur in disparity selective neurons. To quantify this, we measured the variance of the absolute value of the discrepancy between monocular and binocular IDPO. This was calculated separately for the group of disparity selective neurons (determined by one-way ANOVA;p < 0.05) and for disparity unselective neurons. This variance was significantly larger for the disparity-selective neurons (F test; p < 0.05).
This analysis is only possible in neurons that showed significant orientation tuning in both eyes. A similar analysis can be extended to the whole population by comparing monocular and binocular measures of the preferred stimulus orientation for each eye. The binocular measure is taken from the left and right orientations at the point of maximum binocular response, and the monocular measures are taken from the peaks of the Gaussian fit to the monocular tuning data. The relationship is examined in Figure 11, in which again it is clear that the largest discrepancies are associated with disparity selectivity. The correlation between the disparity discrimination index and the discrepancy in measures of preferred orientation is significant (r = 0.32; p< 0.05).
Figure 11 also shows that the neurons exhibiting inseparable interactions between left and right stimulus orientations (filled symbols) tend to show large discrepancies. The mean discrepancy between the monocular and binocular measures of preferred orientation is 4.8° in neurons that showed separable interactions and 15.6° in those that showed inseparable interactions.
It is also clear from Figures 10 and 11 that neurons exhibiting left–right inseparable responses tend to show disparity selectivity (significant at the 5% level on a one-way ANOVA for 15 of 17 cases). In contrast, only 22 of the 40 cells that show a left–right separable response to binocular orientation disparities are disparity tuned. Together, Figures 9-11 strongly suggest that left–right inseparable responses like those illustrated in Figure 8 are in some way a result of tuning for positional disparity. In a few neurons, we explored this further by measuring binocular responses to orientation disparities with different positional disparities. Figure12 shows one example in which two complete binocular interaction profiles were measured. Changing the stimulus disparity had a dramatic effect on the responses to orientation disparities, inverting the interaction profile. When the stimulus is at the preferred disparity of the neuron (−0.38°), the optimal orientation disparity is 0°. When the stimulus is at the positional disparity to which the neuron responds least (0°, the null disparity), the preferred orientation disparity is neither at 0° nor is it predicted by the monocularly measured IDPO. Instead, stimuli with zero orientation disparity produce a minimum in the response.
The phenomenon illustrated in Figure 12 appears to be general in disparity-tuned neurons. In all cases in which there was an inseparable response that was selective for zero orientation disparity (seven neurons), the stimulus disparity turned out to be near to the preferred disparity of the neurons. In all cases in which there was a response minimum near 0° orientation disparity, the disparity was near the null disparity of the neurons. We quantified this effect by calculating a simple index of how close a stimulus disparity (dstim) fell to the preferred disparity (dpref) as a proportion of the distance between preferred and null (dnull) disparities. This index, (disparitystim −disparitypref)/(disparitypref− disparitynull , had a strong negative correlation with the magnitude of preferred orientation disparity determined from our fits (r = −0.91; p ≪ 0.01). This correlation between preferred positional disparity and preferred orientation disparity suggests that the positional disparity tuning determines the orientation disparity to which each neuron appears selective.
The importance of where rotation is centered
All of these observations (binocular IDPO is poorly correlated with monocular IDPO; the discrepancies are largest in disparity-tuned neurons; and the preferred orientation disparity seems to depend on the positional disparity of the stimulus) suggest strongly that the left–right inseparable response comes about because of the positional disparity sensitivity of the neurons. One possible reason for this type of response is illustrated in Figure13, which shows the consequences of applying an orientation disparity around a center of rotation that is not centered in the RF. The boxes represent vertically oriented receptive fields, the dotted line is the stimulus to the left eye, and the solid line is the stimulus to the right eye. For illustrative purposes, lines are used rather than sine-wave gratings, and the orientation disparity is produced by rotating the stimulus shown to the right eye only. On the left panel of Figure 13, the rotation occurs about the center of the receptive field. However, on the right, the rotation is about a point at the bottom of the receptive field. This is equivalent to the on-center rotation plus a translation. Because this translation is only applied to one eye, it constitutes a change in horizontal disparity. The magnitude of the horizontal disparity (illustrated by the arrow) will increase as the angle of stimulus rotation increases. Thus, off-center rotation leads to a complex interaction between binocular phase and binocular orientation differences.
To evaluate the effects of this interaction on our binocular measures of IDPO, we ran a simulation using the energy model of Ohzawa et al. (1990) (H. Bridge, B. G. Cumming, and A. J. Parker, unpublished observations). Our implementation extended the model to two-dimensional monocular receptive fields, so that the effects of both orientation differences and positional differences could be assessed. The results of this simulation are shown in Figure14, in which two important features emerge. First, the combination of off-center rotation and disparity selectivity is sufficient to produce a response pattern that is left–right inseparable, in a way that closely resembles that of the single units. Second, the response pattern depends on the stimulus disparity. At the preferred disparity (A), the maximum response occurs when the stimulus is at the preferred orientation in each eye. When the stimulus is at the null disparity, the same combination of orientations produces maximum inhibition (B).
In summary, off-center rotation can, in principle, explain the left–right inseparable responses to binocularly measured IDPOs that we observed in V1 neurons. Furthermore, three specific characteristics of the data can be explained in the same way. First, maximum excitation at 0° orientation disparity only occurs when the experiment has been performed with the stimulus at the preferred positional disparity, and conversely, there is a trough at 0° orientation disparity when the stimulus is at the null disparity. Second, the discrepancy between the monocular and binocular measures of IDPO is greatest if the neurons are sensitive to positional disparity. Finally, the nature of the binocular tuning to orientation disparity changes with a change in the stimulus disparity as illustrated in Figures 12 (neuronal data) and 14 (energy model).
DISCUSSION
These experiments establish the presence of IDPOs in macaque V1 neurons using a statistical criterion. The use of an awake animal avoids the complication of torsional eye movements induced by anesthesia and paralysis. Nonetheless, we find a range of IDPOs similar to that reported previously from area 17 of the anesthetized cat (Blakemore et al., 1972; Nelson et al., 1977). Although doubts have been raised about the accuracy of the previous measures, this study provides unambiguous evidence of the presence and distribution of IDPOs.
The previous studies investigated only a limited range of binocular conditions. Here we examined responses to all combinations of left and right orientations, which enabled us to determine whether there was a specific binocular interaction that encoded the difference between the stimulus orientations in the two eyes. Previous studies (Blakemore et al., 1972; Nelson et al., 1977; Wieniawa-Narkiewicz et al., 1992) kept the stimulus orientation constant in one eye. Those data are also compatible with a response pattern that was produced simply from summing left and right responses, with no mutual binocular interaction. Consequently, these earlier studies do not demonstrate a binocular mechanism for signaling orientation disparities or surface slant.
Around two-thirds (44 of 64) of the V1 neurons recorded showed a left–right separable response to binocular orientation disparities, whereas the other third (20 of 64) showed an inseparable response. However, several characteristics of this latter group suggested that their inseparable responses are in fact a consequence of positional disparity selectivity: (1) there is a poor correlation between the IDPO determined from monocular and binocular measures; (2) these discrepancies were largest in disparity-tuned neurons; (3) neurons showing inseparable responses were disparity-selective; (4) the preferred orientation disparity depended on the position disparity of the stimulus, in relation to the preferred disparity of the neuron; and (5) a simple model of disparity-selective neurons is able to reproduce all of these phenomena. The modeling suggests that the inseparable responses occur when the center of rotation for the orientation disparities is not located exactly at the center of the RF. For these reasons, we conclude that even the small number of neurons that showed left–right inseparable responses do not represent a mechanism specialized for signaling orientation disparities.
Another feature of the data that suggests that IDPOs are not important in signaling orientation disparities is the fact they are not correlated with orientation bandwidth. Because orientation disparities are usually small, cells with narrow orientation bandwidths are best suited to signaling naturally occurring orientation disparities. Our modeling work suggests that V1 neurons are best able to signal slant through detection of positional disparities. Only neurons with the narrowest orientation bandwidths could usefully exploit orientation disparities [although even these neurons signal slant more reliably via positional disparity (H. Bridge, B. G. Cumming, and A. J. Parker, unpublished observations)]. It is very unlikely that neurons with IDPOs and wide orientation bandwidths contribute to the detection of slant via orientation disparities. Because we find IDPOs are not limited to neurons with narrow orientation bandwidths, it is clear that the presence of an IDPO does not, by itself, suggest that a neuron is specialized for detecting orientation disparities.
These findings also represent a useful test of the energy model of disparity-selective complex cells (Ohzawa et al., 1990). In a modeling study, we show that the energy model predicts a left–right separable interaction between orientations, very similar to that illustrated in Figure 1A. In the energy model, an output nonlinearity renders neurons sensitive to the results of multiplying the left and right images. By combining the outputs of several simple cells with different phase sensitivity, this model generates responses to left and right eye locations that are left–right inseparable. Despite this, the interaction of different orientations is left–right separable, just as we observe here for real neurons. It is possible that V1 neurons could have encoded orientation and position separately, allowing them to respond to orientation disparities in ways quite different from those predicted by the energy model. Indeed, the idea that position and orientation are encoded separately at the monocular stage is implicit in the suggestion that orientation disparities represent a mechanism that is separate from position disparities. We found no evidence of this kind; all of the responses observed to both position and orientation disparities can be explained by supposing that a single monocular calculation is performed, which depends on both position and orientation. A simple convolution of the image with a fixed monocular RF, as in the energy model, is sufficient to account for the data. This lends further support to the view that the energy model provides a good description of the mechanism of disparity tuning in V1 (Cumming and DeAngelis, 2001).
The role of cyclovergence
Cyclovergence is the rotation of the two eyes in opposite directions about an axis from front to back through the center of the eye. If the stimuli used here elicited substantial cyclovergence movements, it would complicate the interpretation of these data. There are several reasons to suppose that such movements did not occur. Because the stimuli were matched in size and orientation to the RF, the orientation disparities applied did not produce a rotation centered on the fovea. Thus, these rotations are not a stimulus to cyclovergence. Even when the rotation is centered on the fovea, cyclovergence movements require large stimuli; Howard et al. (1994) found that decreasing stimulus size from 80 to 5° led to a fivefold decrease in the gain of cyclovergence. Furthermore, they found that the central visual field in which our stimuli were placed made a small contribution to cyclovergence.
The relationship between orientation differences and slant
In considering the possible role of these neurons in signaling slant, it is important to recognize that on a real surface of fixed slant, the orientation difference is not uniform; it depends on the orientation of feature elements (Cagenello and Rogers, 1993). Vertical lines give rise to larger orientation differences than oblique lines. Nonetheless, a mechanism that signaled slant should still show left–right inseparability, with diagonal structure in a plot of responses to all left and right eye combinations. If this structure reflected real world geometry, the diagonal would not always be at 45°.
Alternative ways of coding for slant
Several recent studies in extrastriate cortex have reported neurons that appear to signal surface slant. Shikata et al. (1996)reported visual neurons in the parietal cortex that were selective for surface orientation in three dimensions. This same group later showed that cells in this area were selective not only for disparity gradient but also to texture gradient that corresponded to a similar surface orientation (Tsutsui et al., 1999). In addition to this data for the parietal cortex, Janssen et al. (1999) have described cells in the macaque inferior temporal cortex that are selective for the three-dimensional structure of surfaces. However, none of these studies addresses whether the binocular responses are the result of processing orientation disparities or the gradient of positional disparities. The neurophysiological data described here do not provide evidence for a mechanism that can signal orientation disparities. Rather, they suggest that the initial processing of disparity depends on the orientation and position of features in both eyes, in the way predicted by the energy model of Ohzawa et al. (1990). The proposal that there are two separate neural mechanisms early in visual processing (Blakemore et al., 1972), one that encodes orientation disparities and one that encodes position disparities, therefore has no physiological evidence to support it.
Footnotes
This work was supported by the Wellcome Trust. H.B. was a Christopher Welch Scholar. B.G.C. was a Royal Society University Research Fellow. We thank Andrew Parker for comments on this manuscript.
Correspondence should be addressed to Dr. H. Bridge, University Laboratory of Physiology, Parks Road, Oxford, OX1 3PT, UK. E-mail:holly.bridge{at}physiol.ox.ac.uk.