Abstract
Angles and junctions embedded within contours are important features to represent the shape of objects. To study the neuronal basis to extract these features, we conducted extracellular recordings while two macaque monkeys performed a fixation task. Angle stimuli were the combination of two straight half-lines larger than the size of the classical receptive fields (CRFs). Each line was drawn from the center to outside the CRFs in 1 of 12 directions, so that the stimuli passed through the CRFs and formed angles at the center of the CRFs. Of 114 neurons recorded from the superficial layer of area V2, 91 neurons showed selective responses to these angle stimuli. Of these, 41 neurons (36.0%) showed selective responses to wide angles between 60° and 150° that were distinct from responses to straight lines or sharp angles (30°). Responses were highly selective to a particular angle in approximately one-fourth of neurons. When we tested the selectivity of the same neurons to individual half-lines, the preferred direction was more or less consistent with one or two components of the optimal angle stimuli. These results suggest that the selectivity of the neurons depends on both the combination of two components and the responses to individual components. Angle-selective V2 neurons are unlikely to be specific angle detectors, because the magnitude of their responses to the optimal angle was indistinguishable from that to the optimal half-lines. We suggest that the extraction of information of angles embedded within contour stimuli may start in area V2.
Introduction
Distortion and disruption of contours provide important information enabling representation of the shape of objects as well as the contour lines themselves. These features are given by angles and junctions embedded within contours, and their partial replacement has a destructive effect on the appearance of objects. Although psychophysical studies have suggested the existence of special mechanisms sensitive to angles at relatively early visual stages (Chen and Levi, 1996; Heeley and Buchanan-Smith, 1996; Regan et al., 1996), the neuronal mechanisms responsible for extracting these features are not yet well understood. Our purpose in the present study was to clarify such neuronal mechanisms.
At higher visual stages, many neurons are selective for the complex shapes of objects (Gross, 1973; Desimone et al., 1984; Tanaka, 1993, 1996). In contrast, at area V1, the earliest visual stage, many neurons are highly selective for the orientation of line segments (Hubel and Wiesel, 1959, 1968; Hammond and Andrews, 1978). Their small classical receptive fields (CRFs) are quite appropriate for extracting local features of contour, such as fragments of short line segments. Thus, the neuronal mechanisms underlying object recognition might be a process of integrating this fragmentary information into a single object representation along the hierarchy of the visual pathways. At area V2, the next visual stage, many neurons are also selective for line segment orientation; moreover, Kobatake and Tanaka (1994) demonstrated that a few V2 neurons selectively respond to complex stimuli, including a sharp triangle, and suggested that the representation of complex stimulus features may begin to emerge in area V2. In that regard, Heider et al. (2000) showed that some single end-stop neurons showed selective responses to right corners, whereas Hegde and Van Essen (2000) showed that one-third of V2 neurons respond selectively to various complex stimuli, including angles. In these studies, however, only a small number of angle stimuli were used to evaluate response selectivity. Here, we systematically explored the response selectivity of V2 neurons in a large angle space.
Response selectivity has often been studied using small angle stimuli presented within CRFs as single complete objects (Kobatake and Tanaka, 1994; Hegde and Van Essen, 2000). Evidence suggests, however, that response selectivity is strongly influenced by contextual modulation from outside the CRF (for review, see Allman et al., 1985; Gilbert et al., 2000). In natural scenes, for instance, angles are presented as part of the contours of the objects and not as isolated visual stimuli. It therefore seems reasonable to take into account contextual modulation by contour lines to which angles are connected. We therefore prepared angle stimuli six to eight times larger than the size of the CRFs and report that many V2 neurons show angle selectivity that is dependent on the responses to the individual line components of the angle stimuli as well as to the combination of two line components. These results support the idea that signals encoding the orientation of individual line components converge in area V2, and this convergence is the first step to form the representation of angles within contour stimuli.
Materials and Methods
Two macaque monkeys (monkey CI: Macaca fuscata, female, weighing 3.9 kg; monkey CM: Macaca mulatta, male, weighing 5.2 kg) were trained to perform a fixation task and used for unit recordings. All procedures followed the National Institutes of Health Guide for the Care and Use of Laboratory Animals (revised 1996) and were approved by our institutional animal experimentation committee.
Training and preparation. We used three personal computers (PCs) to control the task sequence, to present visual stimuli, and to collect data. All program software was custom-made in our laboratory. Animals were first trained to sit in a monkey chair and perform a fixation task (rotation task; see below). They were then subjected to an initial surgical preparation: a stainless block for head fixation was attached to the top of the skull using cortical screws and dental cement, and scleral eye search coils (Judge et al., 1980) were implanted in both eyes to monitor eye movements (Enzanshi-Kogyo, Tokyo, Japan). All surgical procedures were performed under deep pentobarbital sodium anesthesia (20 mg/kg supplemented with 10 mg/kg when necessary) and under aseptic conditions. After recovery, animals were provided with additional training until they stably performed the task, even when contour stimuli were presented in parafoveal visual fields as in the recording sessions.
Monkeys were required to fixate on a mark that appeared on the screen of a CRT display (Clinton Monoray Monitor, Cambridge Research Systems, Rochester, UK). We set three virtual fixation windows of different size around the fixation mark. When an animal's eye position moved within an inner fixation window (0.5° × 0.5°), recording trials began. The animal had to maintain fixation within an outer fixation window (1.5° × 1.5°) throughout the trial; otherwise the trial was aborted, and a large cross indicating an error was presented for 500 msec. When a monkey's eye movements were maintained within a middle fixation window (1.0° × 1.0°), physiological data were accepted for further analysis. During the fixation period, a stimulus was presented for a period of 200 msec, and then the presentation was repeated with an interstimulus interval of 900 msec. The repetition number varied randomly from one to three. After the error trials, the repetition number was fixed at one until the animals responded correctly. This served to encourage the animals to continue the task by shortening the total fixation period and making the task easier. After a post-stimulus period varying from 200 to 1200 msec, a vertical dark line on the fixation mark rotated by 90°, and monkeys had to press a lever within a short interval (300 and 500 msec for each animal). The intertrial interval was 1 sec. Training or recording sessions were conducted 5 d/week, during which time water intake was restricted in the cage. Animals were given liquid reward for correctly performing the fixation task and were returned to their cages when they stopped performing the task. We monitored the animals' daily food intake and body weight and provided supplemental water if necessary.
Electrophysiological recordings. A metal recording chamber with an inner diameter of 22 mm was implanted over the left hemisphere in a position to allow access to areas V1 and V2 on the cortical surface. After training on the fixation task was completed, trephination was performed inside the chamber. Penetrations were then made through the dura matter using glass-coated platinum iridium microelectrodes (Wolbarsht et al., 1960) with impedances between 1.0 and 3.0 MΩ at 1 kHz. The electrodes were driven using a stepping motor microdriver (MO-95, Narishige, Tokyo), and successive penetrations were usually positioned 0.5 mm apart. Neuronal signals were amplified (10,000×, 300 Hz-5 kHz; Model 1800, A-M Systems, Sequim, WA) and fed to a PC at a sampling rate of 25 kHz (analog-to-digital converter; PCI-416, Datel, Mansfield, MA). We used spike sorting to isolate single unit recordings during off-line analysis. The recording chamber was filled with agarose during recording sessions to prevent the dura surface from drying and to reduce movement of cortical surfaces. Once the neuronal activity of either a single unit or a cluster of a few units was isolated, we determined the optimal orientation and optimal size of a bar stimulus by moving it manually and listening to the evoked discharges on an audio monitor. We determined the location and extent of the CRF for each recorded unit using the minimum response technique with an optimal bar stimulus (Barlow et al., 1967). We then began quantitative receptive field mapping and exploration of the response properties to angles.
After each recording session, the electrode and microdrive were removed, and the chamber was disinfected with Hibiten gluconate solution (0.05% chlorhexidine gluconate, Sumitomo Pharmaceuticals, Osaka, Japan) and filled with antibiotic (chloramphenicol–fradiomycin sulfate–prednisolone ointment, Sankyo, Tokyo, Japan) before being sealed. Connective tissue on the dura matter was periodically stripped under anesthesia (ketamine hydrochloride, 10 mg/kg, i.m.) to facilitate electrode penetration.
Visual stimuli. Visual stimuli were generated using a PC with a VSG2/3 graphics board (Cambridge Research Systems, Rochester, UK) controlled by custom software and displayed on a CRT display (Multisync Clinton Monoray; resolution, 800 × 600 pixels; refreshment rate, 60 Hz; Cambridge Research Systems). The CRT was placed 110 cm in front of the animals, and the display covered a visual angle of 16° × 12°. Stationary stimuli were presented against a background of 4.0 cd/m2 luminance. In most cases, bright contour stimuli (bars and angles of 0.06–0.10° width) were presented at high contrast (22.4 cd/m2); otherwise, dark stimuli were presented at high contrast (0.5 cd/m2). For quantitative CRF mapping, a short line segment (0.5–1.0° in length) of optimal orientation was presented within a 5 × 5 matrix with 0.5° intervals. To study the properties of the responses to angle stimuli, we prepared a standard stimulus set by combining two straight “half-lines” 3° or 4° in length (Fig. 1 A). Each half-line was drawn from the center to outside the CRF in 1 of 12 directions (every 30°), so that the stimulus passed through the CRF and formed an angle at the center. The example in Figure 1 A shows an angle stimulus composed of 0° and 120° half-lines. We made an angle space of all 66 combinations of two half-lines, which is shown as a 12 × 12 matrix in Figure 1 B. The bottom left half of the matrix (gray region) is a mirror image of the top right half (white region). We did not include single half-lines in the above set of angles, although they may be regarded as a special case in which the two half-lines overlap. This was because the total length and brightness of the single half-line are different from those of other angle stimuli. They are indicated by diagonal lines in the matrix and the response profiles (see below). As a part of the stimulus set, we also studied responses to individual long (3° or 4°) and short (0.3°) half-lines oriented in 12 directions and to short line segments (0.6°) presented at the centers of the CRFs in six orientations. Long line segments (6° or 8°) in six orientations were included in the stimulus set as angle stimuli of 180° width. We refer to the response selectivity for these half-lines as “direction selectivity” to distinguish it from ordinary “orientation selectivity.”
Data collection and analysis. Data were collected for 5–10 repetitions, with each stimulus presented in a pseudorandom block design. Each cycle of recording periods started –200 msec from onset of stimulus presentation and stopped 200 msec after offset of stimulus presentation. Eye positions were fed to PCs at 1 kHz, enabling us to analyze the relationship between eye positions and neuronal responses on a trial-by-trial basis. Spike occurrence during prestimulus periods provided the background firing rate for each neuron. Mean firing rates during stimulus presentation were measured from a time window 200 msec in width, taking into account the response latency, so that maximal rates were obtained. Response magnitude was given by the mean firing rate during stimulus presentation minus the mean firing rate of the background activity. The statistical significance of the response to each stimulus was evaluated using Kolmogorov–Smirnov tests; firing rates for one stimulus were compared with those for background activity. Values of p < 0.05 were considered significant.
To assess the distribution of the response activities of each unit within the angle space of 66 angle stimuli, mean firing rates were normalized to the maximal response and then plotted in a format similar to the stimulus matrix, which we will refer to as the “response profile.” To make response profiles, we assigned appropriate directions a value of 0°, so that a continuous distribution of response activities could be presented as a single area within the response profile.
To evaluate optimal angles, we smoothed the response profile using a 3 × 3 Gaussian filter with the following coefficients: (0.063, 0.125, 0.063), (0.125, 0.250, 0.125), (0.063, 0.125, 0.063). In this way we avoided picking up local peaks in the response profile. We defined a separate response area using the threshold of 80% of the maximal response and assigned peaks for each response area. Some neurons had a secondary response area, hence the secondary peak. It should be noted that the smoothed profiles were used only to obtain the optimal angles and that further analyses were done with the original response profiles.
To compare the maximal responses to angles and half-lines, an angle response index (IA) was computed using the following equation: IA = (RMA – RMHL)/(RMA + RMHL), where RMA is the maximal response to the angle set and RMHL is the maximal response to the half-lines. A value of 0.33 (or –0.33) means the response to the angle stimuli is two (or one-half) times as large as that to the half-lines.
To evaluate the specificity of the response selectivity to angle stimuli, we defined the “peak response area” as the extent of the continuous area of the response profile within which the response amplitude was ≥50% of the maximum. To characterize the distribution of responses in the response profile, we calculated the extent of the elongation along four axes centered at the optimal angle. Each axis represents a feature common to response-inducing angle stimuli, including the line components and the width and orientation of the angles (see Fig. 6). To compare the extent of elongation along three axes—primary (or secondary) line component (“line”), angle width (“wid”), and angle orientation (“ori”)—we plotted the barycenter () of the three parameters in a triangular area (see Fig. 7), as given by the following equation: where , , are vector representations of three corners of a triangular area, and Sline, Swid, and Sori are numbers of response-inducing stimuli equal to or larger than half the maximal response along the three axes, scaled by the maximal range (0.0–1.0).
Recording sites. We recorded from the opercular surface of the occipital cortex in the lower parafoveal region near the V1–V2 border. The border was determined from the retinotopical organization of CRF positions and the changes in CRF size when the recording tracks crossed the V1–V2 border. Mapping of the size and location of the CRFs assured that recordings were from area V2. Because both animals will be used for additional recordings, no histological confirmation was obtained; however, one monkey was scanned using magnetic resonance imaging (MRI) (3 T Allegra MRI scanner; Siemens, Erlangen, Germany) before surgery, after which the images were used to locate the lunate sulcus. Electrodes were kept near the position where spike activities were first encountered in a recording track. Characteristics such as high spontaneous activity and brisk on–off responses were taken to indicate that the electrode had entered layer 4 (von der Heydt and Peterhans, 1989; Snodderly and Gur, 1995). Using this strategy, we restricted recordings to the superficial cortical layers.
Results
The results described herein were on the basis of 114 single-unit records from V2 neurons, 64 from monkey CI and 50 from monkey CM, that responded to some of the 66 angle stimuli in the stimulus set. The significance of the responses was evaluated by comparing the response to each stimulus with the background discharge (Kolmogorov–Smirnov test; p < 0.05). In 91 neurons, there was significant variation among the responses to the various stimuli in the stimulus set (ANOVA; p < 0.05).
Response selectivity to angle stimuli in area V2
When we compared the response selectivity of V2 neurons in the angle space, we found that some were indeed selective for certain angle stimuli, as exemplified in Figure 2. Figures 2, A and B, shows the 66 angle stimuli in the stimulus set and the response of a V2 neuron (cell 1) to each stimulus in the format of a 12 × 12 matrix. This neuron showed statistically significant responses to several stimuli that are indicated by frames around the peristimulus histograms (Fig. 2B). Strong responses were elicited by an angle stimulus composed of 180° and 300° half-lines. In the response profile (Fig. 2C), the diameters of the disks indicate the magnitudes of the responses to the corresponding stimuli. To determine the optimal angles, we smoothed the profile to avoid picking up local maximums (Fig. 2D). For cell 1, two neighboring angle stimuli met the criteria of 80% of the maximal response, which we then used to determine the preferred angles of the neuron in the smoothed profile (see Materials and Methods). We thus judged that cell 1 had one optimal angle (Fig. 2D, open circles) formed by 180° and 300° half-lines.
To evaluate the specificity of the response selectivity to angle stimuli, we defined the peak response area (Fig. 2E, gray regions) (see Materials and Methods). The peak response area of cell 1 extended across two stimuli (4.5% of the stimuli in the set) around the response to the optimal angle (Fig. 2E, open circles). Figure 2F shows the actual angle stimuli that elicited responses equal to or larger than half the maximal response; the optimal angle is indicated by a circle. Apparently, cell 1 was sharply tuned to a particular combination of two line components.
Figure 2G shows that, in addition to angle stimuli, cell 1 had selectivity for the direction of the half-lines. This direction selectivity was broad for short half-lines (0.3° in length), whereas only 180° produced a strong response for long half-lines (4.0° in length). Interestingly, there was no response to a half-line of 300°, although it was a component of the optimal angle. This suggests that selectivity for angle stimuli cannot be explained simply by orientation selectivity for the half-lines comprising the angle stimuli. Cell 1 showed weak responses to both short (0.6° in length) and long (8.0° in length) line segments presented at the center of the CRF (Fig. 2H).
Figure 3 shows another example, which had an optimal angle stimulus composed of 120° and 210° half-lines (cell 2). The peak response area of cell 2 was a single continuous area, extending across five stimuli (Fig. 3E) (7.6% of the stimuli in the set). In contrast to cell 1, moderate responses were observed to angles made up of a combination of the 120° half-line and several other half-lines, and the peak response area was elongated along the horizontal–vertical axes representing the 120° half-line component (Fig. 3E). On the other hand, no elongation of the peak response area was observed along the horizontal–vertical axis representing the 210° half-line component. Apparently, cell 2 responded to the 120° line component when it was combined with downward line components (180–300°). The combination with half-line components oriented in other directions not only produced weak responses, but also reduced responses to the preferred line component. The response selectivity of this neuron may therefore be related to the responses to the individual line components, although simple linear summation of those responses could not explain the response selectivity of the neuron. More likely, this neuron has selectivity to a particular combination of two line components.
Figure 3G shows that cell 2 was also selective for particular half-lines. For short half-lines (0.3° in length), both 120° and 300° induced strong responses, whereas only 120° produced a strong response for long half-lines (4.0° in length). That the selectivity tended to be sharper for long half-lines probably indicates that they activated inhibitory mechanisms. Interestingly, there was no response to the 210° half-line, although it was a component of the optimal angle. Again, this suggests that selectivity to angle stimuli cannot be explained simply by the orientation selectivity for the half-lines comprising the angle stimuli. Cell 2 also responded to line segments presented at the center of the CRF, although the response amplitudes were smaller than the maximal response (Fig. 3H). Cell 2 showed significant orientation selectivity to both short (0.6° in length) and long (8.0° in length) line segments.
In total, we obtained 91 primary peaks and 25 secondary peaks (see Materials and Methods) from 91 angle-selective V2 neurons. We next investigated the properties of these neurons in more detail with the aim of obtaining clues about the neuronal mechanisms by which angle features are extracted.
Distribution of optimal angle stimuli in area V2
We observed diverse angle selectivity in area V2. Figure 4A shows the distribution of the optimal angles within the angle space, with the exact direction of each half-line given on the axes. We found no bias in the distribution of the optimal angles, and the orientation of the angle components for both the primary (circle) and secondary (diamond) peaks was distributed over a wide range. Figure 4B shows the distribution of widths of the optimal angles in the primary and secondary peaks. Of 116 optimal angles, 47 (40.5%) were “wide” angles, ranging between 60° and 150°. The broad distribution of optimal widths in this class indicates that V2 neurons represent a variety of angular components within contour stimuli. Fifty (43.1%) were “sharp” (30°) angles. Because the total length and brightness of the angle stimuli were different from those of the individual half-lines, responses elicited by each were not directly comparable. Consequently, we were unable to quantitatively determine whether the selectivity of these neurons for 30° angles reflected specific tuning to very sharp angles or broad tuning to single line-ends. The remaining 19 angles (16.3%) were long “bars” (180°), i.e., straight lines 6.0° or 8.0° in length that passed through the CRFs. Although these neurons were appropriate for conveying information about the orientation of straight contours passing through the CRF, they accounted for a surprisingly small fraction of V2 neurons. In 14 neurons, the primary and secondary peaks belonged to different classes of angles. Considering this, 41 (36.0%) neurons were selective to wide angles (60–150°), 45 (39.5%) to sharp angles (30°), and 19 (16.7%) to long bars. Notably, neurons selective for angle stimuli also responded to straight contours with submaximal responses, and 47 neurons (51.6%) showed significant orientation selectivity for the straight contours. This means that the responses to line segments might be overestimated if the response selectivity was studied with only straight lines.
As observed with cells 1 and 2, many V2 neurons showed clear responses to the half-lines that we used to form angle stimuli. We therefore compared the maximal responses to angle stimuli with those to half-lines (Fig. 4C). The angle response index (IA) fell between 0.33 and –0.33 in most cases, indicating that the maximal responses to angle stimuli and half-lines were generally comparable. Comparison of firing rates of the maximal responses to angle stimuli and those to half-lines showed that only four neurons exhibited significant differences (Kolmogorov–Smirnov test; p < 0.05), all of which preferred bars (Fig. 4C, filled triangles). Mean values of indices showed a weak tendency for the angle response index to be nearly zero for angles <120° in width and to be larger than zero for other angles and bars. For wide angles 120° and 150° in width and bars, the indices were significantly larger than that for sharp angles (t test; p < 0.05). To summarize, V2 neurons are sensitive not only to angle stimuli but also to half-line stimuli. Nevertheless, when contour stimuli pass through the receptive fields of neurons, these V2 neurons can differentiate the width and orientation of the angles. We conclude that a fairly large number of area V2 neurons are angle selective and involved in the representation of various angles in addition to straight line segments.
Peak response area in response profiles
We next examined the tuning width of the angle selectivity using the size of the peak response area as an index. The distribution of the size among 116 optimal angles ranged from 1 to 56 (Fig. 5). On average, the size was 8 of 66 stimuli (12.1%; median; n = 116) (Fig. 5A, downward arrow), indicating moderate tuning of the angle selectivity, although the size of the peak response area was <4 for 30 optimal angles (25.9%). Thus, a considerable number of neurons showed highly selective responses to a particular angle for both primary and secondary peaks (Fig. 5A). Among the three classes of optimal angle, broad tuning of the angle selectivity was mainly observed in the sharp and wide classes, whereas the bar class had relatively small peak response areas (Fig. 5B), which suggests that V2 neurons selective for a long straight bar are sensitive to orientation and small distortions in contour lines.
We also analyzed the spatial pattern of the distribution of the responses in the response profile. As shown by the responses of cell 3 (Fig. 6A), large peak response areas tended to have particular axes of elongation that provide information about angle features important for inducing selective responses. We assumed four situations. (1) If a neuron responded to each half-line of the optimal angle rather than to a particular combination of two components, it should respond to any stimulus containing either half-line having the optimal orientation, and the distribution of the responses would elongate along the horizontal–vertical axes representing the line components (Fig. 6C). (2) If a neuron was selective for angle width, regardless of the direction of the individual components, the response distribution would elongate along a diagonal axis from left top to right bottom (Fig. 6D). (3) If a neuron was selective for angle orientation, regardless of the direction of the individual line components, the response distribution would elongate along a diagonal axis from right top to left bottom (Fig. 6E). (4) Otherwise, the responses distribution would be evenly spread around the optimal angles, indicating that the selectivity was dependent mainly on the combination of the two line components. Here, we examined which of these patterns best reflected the selectivity of each neuron.
We first evaluated the extent of the elongation, i.e., the number of stimuli inducing responses equal to or larger than half the maximal response, along the four axes centered at the optimal angle, as shown in Figure 6C–E (gray regions). We then compared these numbers to characterize the distribution of the responses (Fig. 7). Because the response matrix was discontinuous at the diagonal line, which corresponds to angle stimuli 0° in width, we used an oblique version of the response profile (Fig. 6B–E, dotted rectangular area) consisting of the same set of angle stimuli. The maximum scale along the axis for the line components and the angle orientation was 11 and that for the angle width was 12. We will refer to the larger of the horizontal–vertical axes as the “primary axis” and to the smaller as the “secondary axis.” In the case of cell 3, the extent of the elongation was 8 along the horizontal axis representing the 120° half-line component, 11 along the vertical axis representing the 240° half-line component, 5 along the diagonal axis representing the angle width (120°), and 5 along the diagonal axis representing the angle orientation (0°). Figure 6F shows the distribution of the extent of the elongation along the four axes for the 116 optimal angles. The greatest elongation was observed mainly along the primary horizontal–vertical axes representing one line component of the optimal angles.
We next compared the extent of elongation along three axes: the axis for the line component (either primary or secondary axis), the axis for angle width, and the axis for angle orientation. In 29 of 116 angles (25.0%), the extent of elongation within the peak response area was less than three along all four axes, and their responses were too specific to evaluate elongation (e.g., cell 1) (Fig. 6F, gray column). These responses appear highly dependent on the combination of the two half-line components. For the remaining 87 angles, the relative extent of elongation along the three axes was characterized by computing the barycenter of the three values (Fig. 7) (see Materials and Methods). There was a clear tendency for the distribution of the responses to elongate along the primary axis for the line component.
When the extent of elongation along one axis was equal to or greater than twice those along the other two axes, we deemed there to be anisotropy and refer to this axis as the “axis of elongation.” In the case of cell 3, the vertical–horizontal axis representing the 240° half-line was the axis of elongation. Twentyseven angles (23.3%) had an axis of elongation, all of which represented one component of the optimal angles (Figs. 6F, first row, black column, 7A, open symbols). For four of these angles, all of which were bars, the secondary horizontal–vertical axis also met the criteria for an axis of elongation (Figs. 6F, second row, black column, 7B, open symbols). We observed no neuron that had a diagonal axis of elongation, suggesting that area V2 does not represent the width or orientation of an angle, per se. This means that the responses to angle stimuli were dependent to a large extent on the responses to one optimal half-line; however, the specific combination of two line components was also necessary to maximally activate these neurons. The remaining 60 optimal angles (51.7%) had no axis of elongation, although they had relatively large peak response areas, indicating that they are sensitive to a specific combination of two half-line components, but with relatively broad angle tuning.
Collectively, the results summarized above suggest that specific combinations of two half-line components are required to elicit optimal responses in most V2 neurons, regardless of tuning width (Fig. 7C). Most of the sharp (33 of 50; 66.0%) and wide (24 of 47; 51.1%) classes had no axis of elongation, although many had large peak response areas. One-fourth of the wide and sharp classes had an axis of elongation only along the primary horizontal–vertical axes. The bar class was split into two types: one sensitive to a long straight line, i.e., an angle 180° in width, that was assigned to either the “specific responses” or “no axis” class depending on the size of peak response area (11 of 19; 57.9%), and another sensitive to one or both of the half-line components (8 of 19; 42.1%).
Consistency between responses to each line component and optimal angles
To determine the contributions made by the individual line components, we examined the relationship between angle selectivity and the responses to the individual half-line components of the angle stimuli (“direction profiles”). Figure 8A shows the direction profiles of four neurons that had from one to three peaks for preferred directions. Figure 8B shows the relationships between the response amplitudes of the primary peaks and that of the secondary peaks. A secondary peak was assigned for further analysis if the amplitude was equal to or larger than half the maximum and was separated from the primary peak by responses smaller than half the maximum (Fig. 8B, filled symbols), and a third peak could be assigned in the same way. For nine neurons, response amplitudes next to the primary or secondary peaks were >95% of the peak response and regarded as parts of a broad preferred direction (e.g., secondary peak of cell 3). Peaks are indicated by black dots in Figure 8A. Of 91 neurons, 33 (36.3%) had a single peak, 33 (36.3%) had two peaks, and 15 (16.5%) had three peaks; half-lines did not induce significant responses in the remaining 10 neurons (8.9%). Direction profiles were then classified into four groups: (1) “bimodal/trimodal (180°)” selectivity with two or three peaks, including a pair 180° apart, which might be appropriate for representing straight contour lines; (2) bimodal/trimodal selectivity with two or three peaks without a pair 180° apart, which might be appropriate for representing angles; (3) “unimodal” selectivity with only one peak, which might be appropriate for representing line-ends; and (4) “no (significant) response” to half-lines. Interestingly, when neurons were classified according to their optimal angles, all four types of direction profile were observed in all three of the resultant groups (Fig. 8C). For example, some neurons had bimodal (180°) selectivity to individual line components and preferred a wide angle rather than straight lines. Consequently, the class of the direction profile was not sufficient to explain the angle selectivity.
In Figure 8A, arrows indicate the directions of two components of the optimal angle stimulus. Cells 3 and 6 had multiple peaks, which were consistent with the two components of the optimal angle. Because the difference between the two preferred directions was not 180° for each neuron, these preferred directions did not represent the orientation preference of the neurons but likely the two half-line components of the optimal angle. In contrast, cell 1 had only one preferred direction, which was consistent with one of the components of the optimal angle (Fig. 8A, black arrow); no response was observed for the other line component (white arrow). Cell 2 had two preferred directions: one consistent with a component of the optimal angle (black arrow) and the other not (white arrow). Figure 9A shows a chart of the consistency between the direction profile and the optimal angles for all neurons that preferred wide angles. In this chart, we compared the direction of each component of the optimal angle (downward arrowheads) and the peak (black thick line) or the peak regions (gray thick line) in which the responses were ≥50% of the maximum in the direction profile. We commonly observed that the direction of the line components of the optimal angle was consistent with the peaks (black arrowheads) or the peak regions (gray arrowheads), as indicated on each line. For the majority of those matching the peak region, the direction of the line component was next to the peak (15 of 19; 78.9%). We observed that in 15 cases (31.9%) both of the line components of the optimal angle were consistent with the peaks or the peak regions (black rightward arrows at the left), especially among the bimodal/trimodal classes; in 22 cases (46.8%), one line component of the optimal angle was consistent (gray rightward arrows at the left), mainly in the unimodal class.
The above results suggest that for most angle-selective V2 neurons, the response to the optimal angle reflects the specific combination of two line components. In many of these neurons, the response to the optimal angles also reflected the signal representing one half-line component, with the other acting as a modulator. This tendency was particularly clear in the wide class (Fig. 9B). As shown in Figures 4C and 8A, the amplitudes of the maximal responses to angles and half-lines were comparable with one another. The result also indicates that modulatory inputs are involved in the formation of angle selectivity and that simple summation of responses to each half-line component was not enough to explain the response selectivity of a neuron.
Stimulus sites and eye positions during stimulus presentation
It was essential in this study that stimuli form angles at the center of the CRFs. If angles were presented near the border of a CRF, a combination of the orientation selectivity and end-stop inhibition might have produced pseudo-response selectivity to angle stimuli. Indeed, Heider et al. (2000) reported that half of the end-stop neurons in area V2 were single end-stopped and that several of these responded to one right corner but not to its opposite. To avoid such pseudo-angle selectivity, we took several precautions when determining the centers of CRFs.
We first determined the borders of the CRFs using the minimum response method (Barlow et al. 1967) with an optimal short bar. Figure 10A shows the CRF of cell 2, the responses of which to angle stimuli are shown in Figure 3. A dot indicates the center of the CRF, where the angles were formed. We next confirmed the centers of the CRFs quantitatively by presenting a short line segment (0.5° in length) at the optimal orientation (0° in exact orientation for cell 2) in a 5 × 5 matrix of 0.5° intervals around the center of the CRF (Fig. 10B). Note that the center of the response matrix corresponded to the location of angle stimuli and should induce maximal responses. If the maximal response was not observed at the center of the response matrix, the CRF was determined again. We then studied the directional selectivity of half-lines short enough to be presented within the CRFs (0.3° in length). If the short half-line of either the optimal or opposite direction (0° and 180° for cell 2) (Fig. 3G) induced only a weak response, we assumed that stimuli were presented near the border of the CRF, so that one short line failed to activate the neuron or activated end-stop inhibition of the neuron. In such cases, we determined the CRF again.
The stability of the animals' fixation was another important factor, because the rotation task strongly required accurate fixation throughout. The mean and SD of the eye positions recorded during presentation of the angle stimuli for all 114 neurons (Fig. 10C,D) indicated that the animals' fixation was well restricted to within the middle fixation window (1.0° × 1.0°). The eccentricity of the CRFs ranged from 1.6° to 5.4°, and the square root of their extent was 0.94° ± 0.03° (mean ± SEM; 0.43–1.81°; n = 114). Considering the accuracy of our CRF mapping and the accuracy of the monkeys' fixation, we are confident that the angle stimuli used in the present study passed through the center of the CRFs, as we intended.
We also considered the fact that response amplitude may be affected by small variations in eye position during presentation of the angle stimuli. Figure 10E shows the eye positions during each of 528 stimulus presentations (66 stimuli × 8 repetitions) in a recording session from cell 2. When we divided these eye positions into two groups according to the magnitudes of the corresponding responses and then compared their means, we found little difference. When angle stimuli induced responses >50% of maximum, mean eye position was –0.039° ± 0.007° horizontally (n = 88) and –0.109° ± 0.008° vertically (n = 88); when they induced responses <50% of maximum, it was –0.049° ± 0.003° horizontally (n = 440) and –0.115° ± 0.004° vertically (n = 440). Figure 10F summarizes this analysis for all 114 neurons. That in most cases the difference was <0.05° for both the horizontal and vertical directions provides further evidence that angle selectivity in V2 neurons cannot be explained as pseudo-angle selectivity generated by asymmetric deviation of eye position and stimulus location relative to the CRF.
Discussion
In the present study, we systematically examined the responses of V2 neurons to a set of 66 angle stimuli. We found that a fairly large number of V2 neurons selectively responded to angles. Notably, one-fourth of them showed highly selective responses to a particular angle. For most angle-selective V2 neurons, the response to the optimal angle reflected the specific combination of the two line components. The present result is consistent with the idea that signals encoding the orientation and location of individual line components converge on V2 neurons and that extraction of angle features may start in area V2.
Response selectivity to angle stimuli in area V2
It has been believed that, like V1 neurons, V2 neurons are selective for the orientation of lines or edges; neurons selective for angle-like stimuli were thought to be rather rare (Kobatake and Tanaka, 1994; Hegde and Van Essen, 2000; Heider et al., 2000). We found, however, that a fairly large number of V2 neurons selectively responded to angles. There are several possible reasons for the discrepancy between the present findings and those of previous studies. First, the present study is the most detailed and systematic assessment of response selectivity to angle stimuli in area V2 reported so far. Previous studies used a smaller number of angle stimuli. Hegde and Van Essen (2000), for example, used 24 angles (4 orientation × 3 widths × 2 sizes) in their measurements. Considering the highly selective neurons observed in this study, such a discrete survey could easily miss finely tuned angle-selective responses and might therefore underestimate angle selectivity. Second, we used large angle stimuli that passed through the CRFs, whereas previous studies presented angle stimuli as a single object within the CRFs. Because angle selectivity was strongly related to the directional selectivity of the half-lines, which was more evident when using long half-lines, our use of large stimuli favored the revelation of angle selectivity. Third, one-half of our sample showed significant orientation selectivity for long bars, although the responses were submaximal. This means that if we studied only selectivity for straight lines, a large fraction of our sample would be classified as selective for line segment orientation. Our finding is therefore consistent with the previous results indicating that V2 neurons are orientation selective. Finally, we concentrated on sampling from the superficial layer to increase our chances of discovering the outcome of information processing within this area that in turn would be transmitted to higher cortical areas.
We also found that a small fraction of the V2 neurons studied preferred straight lines passing through the CRFs. They were very sensitive to orientation and distortion of line or edges, and their response selectivity was sharper than that of other angle-selective neurons. We suggest that these neurons are suitable for precisely differentiating straight contours from curvatures, angles, and junctions.
Our study shows that the amplitudes of the maximal responses to angles and half-lines are comparable with one another. Therefore, it is unlikely that angle-selective V2 neurons act as specific detectors of angle stimuli. Among the continuous and closed contour lines on the surfaces of objects, however, V2 neurons show a great potential to represent angle width and distinguish angles from straight contour lines.
Responses to angle stimuli in the primary visual cortex
So far, angle-selective responses have not been reported in area V1 of the monkey. In area 17 of the cat, some neurons selectively respond to curvatures, chevrons, and sharp angles (Heggelund and Hohmann, 1975; Hammond and Andrews, 1978; Dobbins et al., 1987, 1989; Versavel et al., 1990; Shevelev et al., 1998), and some useful mechanisms for extracting angles were evident. For instance, Hubel and Wiesel (1965) suggested that deformation of optimal bar stimuli that avoids activation of end-stop inhibition may result in strong responses to obtuse angles. Sillito et al. (1995) suggested that facilitation caused by recurrent excitation from orthogonal gratings in the surrounding annulus may represent discontinuity of line segments. Das and Gilbert (1999) suggested that inhibitory interaction between nearby neurons having orthogonal preferred orientation may differentiate T-junctions. Walker et al. (1999) suggested that nonuniform surround suppression around CRFs may be useful for representing angles. Similar mechanisms may exist in area V1 of the monkey (Jones et al., 2001). As exemplified by cells 1 and 2 (Figs. 2, 3), we often observed that asymmetric direction profiles, which are unimodal or bimodal without a pair 180° apart, are given by long half-lines but not by short half-lines. Although this tendency favors the contribution of the nonuniform surround suppression within area V2, additional study will be necessary to clarify the contribution of areas V1 and V2 to the formation of angle representation.
Neuronal mechanisms for extracting angles in area V2
Although we found that many V2 neurons are selective for angles, none specifically represented such features as angle width or angle orientation, regardless of the individual line components of angle stimuli. Instead, angle selectivity was closely related to the specific combination between the signals representing each line component. In one class of neurons, angle selectivity can be explained by an appropriate pattern of convergence of neural signals representing the individual line components of the angle stimulus, which may be accomplished by forward connection from area V1 or by lateral interactions within area V2. It is well accepted that at early visual stages neurons are tuned to a single optimal orientation, although some neurons reportedly show two optimal orientations in primate area V2 (Anzai and Van Essen, 2001, 2002) and in cat area 17 (Shevelev et al., 1994). Indeed, we observed that several neurons had bimodal or trimodal selectivity to half-line components not 180° apart, and their peaks were often consistent with the direction of the two line components of the optimal angle. We also found, however, that the presence of either line component alone was not sufficient to elicit a response. This is clearly demonstrated by the fact that for wide and sharp angles, no neuron exhibited two axes of elongation representing the two half-line components of the optimal angles. There thus appears to be another factor enabling fine selectivity, perhaps facilitatory interactions between the two preferred line components or suppression to nonpreferred line components.
In another class of neurons, such consistency was held only for one line component of the optimal angle. One candidate representing another line component is subthreshold excitatory interactions mediated by the long-range horizontal connections within area V2. Horizontal connections in the primary visual cortex are known to connect multiple cortical columns sharing the same orientation preference (Ts'o et al., 1986; Ts'o and Gilbert, 1988; Gilbert and Wiesel, 1989), but some V2 neurons having an orthogonal orientation preference show cross-correlation peaks near 0 msec, suggesting that horizontal connections may contribute to the formation of angle selectivity (Tamura et al., 1996). Complex integration beyond the CRFs has been observed in area V2 (von der Heydt and Peterhans, 1989; Bakin et al., 2000; Zhou et al., 2000). Another potential mechanism is inhibitory interactions with nonpreferred line components mediated by the nonuniform surround suppression. For instance, Heider et al. (2000) reported that some single end-stopped neurons showed selective responses to right corners. To explain the wide variety of angle selectivity observed in area V2, we also need to consider more complicated nonuniform surround suppression, as reported in other visual areas (Xiao et al., 1995; Walker et al., 1999; Jones et al., 2001).
Angle representation in area V2 and beyond
Fundamental to our understanding of the neuronal basis of object recognition is the idea that attribute-specific and location-specific information are gradually integrated into a single object representation along the hierarchy of visual pathways. The present results are in line with the idea that the representation of complex stimulus features begins to emerge in area V2 (Kobatake and Tanaka, 1994; Hegde and Van Essen, 2000; Mahon and De Valois, 2001). In area V4, the next stage along the ventral visual pathway after area V2, a larger number of neurons are selective for complex stimulus features, including angles (Gallant et al., 1993, 1996; Kobatake and Tanaka, 1994; Pasupathy and Connor, 1999, 2001, 2002; Hanazawa and Komatsu, 2001). Their selectivity is invariant across several locations within CRFs. Furthermore, some neurons are selective for the angle orientation regardless of the direction of individual line components (Pasupathy and Connor, 1999). In contrast, the responses of angle-selective V2 neurons are related to signals representing each line component of the angle stimuli. The relatively small CRFs of V2 neurons did not allow us to examine the positional invariance within the CRFs, but it seems likely that a small positional shift would have a large effect on the responses and that these neurons would not have positional invariance. Thus, there seems to be a large difference in the angle representation in areas V2 and V4. We believe that signals extracted in area V2 are integrated in area V4 to more elaborately represent angles and curvatures. These differences in the nature of angle selectivity also suggest that feedback connections from area V4 are not likely the main source of angle selectivity observed in area V2.
Footnotes
This work was supported by the Japanese Grant-in-Aid for Scientific Research (C) 11680796 and (B) 14380371. We thank T. Ogawa for software development and M. Togawa and N. Takahashi for technical assistance.
Correspondence should be addressed to Minami Ito, Division of Sensory and Cognitive Information, Department of Information Physiology, National Institute for Physiological Sciences, Myoudaiji-cho 38, Okazaki 444-8585, Japan. E-mail: minami{at}nips.ac.jp.
DOI:10.1523/JNEUROSCI.4364-03.2004
Copyright © 2004 Society for Neuroscience 0270-6474/04/243313-12$15.00/0