Abstract
Humans can perceive illusory or subjective contours in the absence of any real physical boundaries. We used an adaptation protocol to look for orientation-selective neural responses to illusory contours defined by phase-shifted abutting line gratings in the human visual cortex. We measured functional magnetic resonance imaging (fMRI) responses to illusory-contour test stimuli after adapting to an illusory-contour adapter stimulus that was oriented parallel or orthogonal to the test stimulus. We found orientation-selective adaptation to illusory contours in early (V1 and V2) and higher-tier visual areas (V3, hV4, VO1, V3A/B, V7, LO1, and LO2). That is, fMRI responses were smaller for test stimuli parallel to the adapter than for test stimuli orthogonal to the adapter. In two control experiments using spatially jittered and phase-randomized stimuli, we demonstrated that this adaptation was not just in response to differences in the distribution of spectral power in the stimuli. Orientation-selective adaptation to illusory contours increased from early to higher-tier visual areas. Thus, both early and higher-tier visual areas contain neurons selective for the orientation of this type of illusory contour.
Introduction
Illusory contours are perceived as sharp boundaries between two regions that do not differ in mean luminance or chromaticity. For example, two phase-shifted abutting line gratings (Fig. 1A) elicit a clear boundary percept in the absence of any average luminance difference across the boundary (Kanizsa, 1976; Soriano et al., 1996). In this paper, we report that regions of human visual cortex respond selectively to the orientation of such illusory contours.
Single-unit recordings have shown that V2 neurons selectively respond to the orientation of both real and illusory contours (von der Heydt and Peterhans, 1989; Lee and Nguyen, 2001). The existence of such “illusory-contour” cells in V1 is more controversial. Some authors claim that illusory-contour cells are absent in V1 (Peterhans and von der Heydt, 1989; von der Heydt and Peterhans, 1989), but others report that some V1 cells respond to illusory contours (Grosof et al., 1993; Sheth et al., 1996; Lee and Nguyen, 2001; Ramsden et al., 2001). Single-unit responses to illusory contours have not been measured beyond V2.
Several imaging studies of illusory contours have been done in humans (Hirsch et al., 1995; Ffytche and Zeki, 1996; Larsson et al., 1999; Mendola et al., 1999; Seghier et al., 2000; Stanley and Rubin, 2003; Maertens and Pollmann, 2005). As a whole, these studies reported responses to illusory contours in V1 and V2 as well as higher-tier visual areas, including the lateral occipital complex (LOC), but responses were stronger in higher-tier areas (e.g., V3A, LOC) than in early visual areas (V1 and V2).
It is difficult to compare the results of the neuroimaging and single-unit studies. Previous neurophysiological studies reported orientation-selective responses of neurons to illusory contours. In contrast, imaging studies compared brain activity in response to illusory-contour stimuli versus control stimuli that did not evoke illusory contour percepts. However, an overall net increase in neural activity within a cortical area to illusory contour stimuli may or may not be indicative of contour processing per se. First, any of a number of differences between the illusory-contour and control stimuli (e.g., differences in the orientation and spatial frequency content of the stimuli) might affect the overall level of activity and thereby confound the interpretation of the results. Second, illusory contours are compelling visual illusions such that attentional modulation of neural activity might confound the interpretation (Huk et al., 2001). Third, most models of visual processing hypothesize that a contour percept is encoded by a relative difference between subpopulations of neurons with different orientation preferences, not by an overall net increase in activity (Gilbert and Wiesel, 1990; Vogels, 1990; Pouget et al., 2000).
We used adaptation to measure orientation-selective responses to illusory contours in human visual cortex while observers performed a demanding foveal task to equate spatial attention across conditions (Larsson et al., 2006). We found orientation-selective adaptation to illusory contours in early (V1 and V2) and higher-tier (V3, V4, VO1, V3A/B, V7, LO1, and LO2) visual areas. Orientation-selective adaptation increased from early to higher-tier visual areas, implying that the proportion of neurons selective for illusory-contour orientation was larger in higher-tier visual areas. Our results are consistent with single-cell studies that have shown orientation-selective responses to illusory contours in V1 and V2 and also confirm imaging studies that have shown activation of higher-tier visual areas in response to illusory contours.
Materials and Methods
Observers and scanning sessions.
Four observers (two male and two female, 27–49 years of age) participated in the main experiment and the first control experiment. Two of the observers (one male and one female) also participated in the second control experiment. Each experiment (main and two control experiments) consisted of two scanning sessions. Observers gave informed consent to participate in accordance with the Helsinki convention and National Institutes of Health guidelines for experiments involving human subjects. The experimental protocol was approved by the New York University Committee on Activities Involving Human Subjects.
Visual stimuli.
Illusory-contour stimuli were presented in an annulus (inner radius, 1.5°; outer radius, 4.9°) around the center of fixation (Fig. 1A). Stimuli were arrays of inducer lines, white diagonal (+45 or −45°) line segments (0.23 arc min wide; 1.4° long, 50% contrast) on a uniform gray background, arranged to form horizontal or vertical abutting line gratings phase shifted by 180°, resulting in the percept of crisp illusory contours between adjacent inducer gratings. The position of each line was jittered by a random amount between ±0.15°, parallel to the orientation of the induced illusory contours. This jitter obscured the regularity of the inducer grating pattern, which might have provided a cue to the orientation of the illusory contours even in the absence of an illusory-contour percept. The spatial frequency of the inducer gratings was 2.8 cycles/° on average (ignoring the jitter). The spatial frequency of illusory contours was 1 contour/°.
In addition to the main experiment, we ran two control experiments that used adapter stimuli [see functional magnetic resonance imaging (fMRI) protocol below] designed to be similar to those in the main experiment but did not induce the perception of illusory contours. The test stimuli in these control experiments were the same illusory-contour stimuli that were used in the main experiment; only the adapter stimuli differed. In the first control experiment, each inducer line was shifted (misaligned), parallel to its orientation, by a random amount raging from −0.55 to 0.55° (Fig. 1B). By misaligning the endpoints of inducer lines, this manipulation eliminated the percept of illusory boundaries between adjacent inducer gratings. As in the main experiment, the orientation of the inducers was either +45 or −45°. In the second control experiment, the adapter stimuli were random-noise patterns generated by randomizing the phase spectrum of the corresponding illusory-contour stimuli (Fig. 1C).
For a separate psychophysical experiment, stimuli were generated with seven levels of misalignment, ranging from no misalignment, as in the main experiment (Fig. 1A), to the misalignment that was used for the first control experiment (Fig. 1B). As in the first control experiment, each inducer line was shifted parallel to its orientation by a random amount drawn from a uniform distribution.
For the fMRI experiments, stimuli were presented on an electromagnetically shielded analog NEC 2110 LCD display using the Psychophysics Toolbox (Brainard, 1997; Pelli, 1997) and a 10-bit graphics card. For the psychophysical experiments, stimuli were presented on a Nokia (Finland) 446 XPro CRT display. Both displays had a refresh rate of 60 Hz and spatial resolution of 800 × 600 pixel. The mean luminance of the display background was 18.14 cd/m2 in the fMRI experiments and 51.81 cd/m2 in the psychophysical experiments. The viewing distance for both the fMRI and psychophysical experiments was 157.5 cm
fMRI protocol.
We used an event-related fMRI adaptation protocol (Fig. 2) (Engel, 2005; Fang et al., 2005; Gardner et al., 2005; Larsson et al., 2006). In this protocol, observers viewed one illusory-contour stimulus repeatedly (the adapter), and we measured the fMRI response to a subsequently presented illusory-contour stimulus (the test) that had either the same (“parallel”) or perpendicular (“orthogonal”) illusory-contour orientation as the adapter.
At the beginning of each session, observers passively viewed the adapter stimulus for 100 s. Adaptation was maintained on subsequent trials by showing a top-up adapter during the first 4 s of each trial. A blank (uniform gray) screen followed the top-up adapter for 1 s, which was followed by the test stimulus for 1 s. The trial ended with a 1.2 s behavioral-response period, during which the display was blank except for the fixation point (Fig. 2A). Total trial duration was 7.2 s. To avoid adaptation to the inducers (independent of adaptation to the illusory contours), the orientation of the inducers was changed every 160 ms during both the adapter and test stimuli (Fig. 2A) while keeping the illusory-contour orientation constant. In addition, the spatial phase of the illusory contours and that of the inducers was varied randomly every 160 ms.
On one-third of the trials, the test stimulus had the same orientation as the adapter (parallel trials); on one-third, the test stimulus was perpendicular to the adapter (orthogonal trials); and on one-third, the test stimulus was blank (adapter-only trials) (Fig. 2B). The trial order was randomly shuffled so that the sequence of trials preceding and following any trial was equally likely to contain any of the three trial types. Specifically, 14 trials of each trial type were presented in seven blocks of six trials each, with trial order randomized within blocks.
Each observer participated in two scanning sessions, one with a horizontal adapter and the other with a vertical adapter. A single adapter orientation, vertical or horizontal, was used for each scanning session. A scanning session comprised 10 adaptation scans (runs), each consisting of 42 trials (∼5 min), plus two localizer scans (see below). The data were averaged across the two sessions for each observer (i.e., across 840 trials).
To control attention, observers performed an attention-demanding task at fixation (Fig. 2A) that forced them to divert their attention away from the peripheral illusory-contour stimuli. This was necessary, because spatial attention can modulate neuronal responses to visual stimuli measured with fMRI in a spatially specific manner (Tootell et al., 1998; Brefczynski and DeYoe, 1999; Gandhi et al., 1999; Kastner et al., 1999; Somers et al., 1999). Furthermore, attention modulates aftereffects, including the motion aftereffect (Chaudhuri, 1990; Lankheet and Verstraten, 1995; Rees et al., 1997), figural aftereffect (Shulman, 1992; Yeh et al., 1996; Suzuki, 2001), tilt aftereffect (Spivey and Spirn, 2000), and illusory-contour tilt aftereffect (Montaser-Kouhsari and Rajimehr, 2004). In our task, observers were required to count the number of Xs in a stream of rapidly (160 ms/letter) presented letters (Z, L, N, T, and X). The letters were shown from the beginning of the adapter until the end of the test stimulus. Afterward, the letters were replaced by a fixation point, cueing observers to respond by pressing one of four keys corresponding to the number of target Xs detected (1–4). Despite the great attentional and working-memory demands of the task, observers' performance after practice was well above chance (Fig. 3). Importantly, there was no difference in performance between the two experimental conditions, indicating that there was no difference in attentional resources devoted to the task between the parallel and orthogonal conditions that could contribute to differences in fMRI responses that we measured. This also indicates that there was no difference in eye position between the conditions, because breaking fixation would have adversely affected performance. Hence, eye movements did not confound the interpretation of our fMRI results.
We ran two control experiments in which the trial sequence, task, and experimental protocol were identical to the main experiment. In the first control experiment, we used the misaligned stimuli (Fig. 1B) as adapters and the same illusory-contour test stimuli as before. These adapter stimuli were visually similar to the illusory-contour adapter stimuli except for the lack of illusory contours. In the second control experiment, the adapter stimuli had identical power spectra as the illusory-contour stimuli, but the phase spectra were randomized (Fig. 1C). The idea behind both manipulations was to ensure that any adaptation effects observed in the main experiment reflected orientation-selective responses to the perceived illusory contours, rather than being a result of incidental features or first-order (luminance contrast) artifacts in the stimuli. As in the main experiment (see above), each observer participated in two scanning sessions for each control experiment, one with a horizontal adapter and the other with a vertical adapter. Each scanning session comprised 10 scans with 42 trials per scan, as in the main experiment.
Localizer scans.
During each scanning session, before and after the adaptation scans, we performed an additional pair of measurements to independently identify the cortical regions responding to the stimuli. The same abutting line-grating illusory-contour stimuli were used as in the main experiment. A block design was used, alternating between 9.6 s blocks of illusory-contour stimuli and 9.6 s blocks of blank (uniform gray) screen. Observers were instructed to maintain their gaze on a fixation mark at the center of the display throughout the scan. Each localizer scan consisted of 13 stimulus-blank alternations.
MRI acquisition.
A Siemens (Erlangen, Germany) Allegra 3T scanner was used to measure the blood oxygenation level-dependent (BOLD) signal (Ogawa et al., 1990) in T2*-weighted images. The scanner was equipped with a four-channel phased-array surface coil covering the back of the head (NM-011 transmit head coil and NMSC-021 receive coil; Nova Medical, Wakefield, MA). A bite bar was used to minimize head motion. Functional data in the adaptation scanning sessions were acquired using the following parameters: repetition time (TR), 1200 ms; echo time (TE), 30 ms; flip angle, 75°; 64 × 64 matrix size; 19 slices oriented perpendicular to the calcarine sulcus; voxel size, 3 × 3 × 3 mm. For the retinotopy measurements (see below), we used the same imaging parameters with the following exceptions: 24 slices and 1500 ms TR. At the beginning of each session, we also acquired anatomical T1-weighted magnetization-prepared rapid-acquisition gradient echo images that covered the same volume as the functional scans but with twice the in-plane resolution (voxel size, 1.5 × 1.5 × 3 mm).
Defining visual area regions of interest.
Nine regions of interest (ROIs) were defined based on retinotopy (V1, V2, V3, V3A/V3B, hV4, V7, LO1, LO2, and VO1). Standard traveling wave methods for retinotopic mapping were used to identify meridian representations corresponding to boundaries between retinotopically organized visual areas (Engel et al., 1994, 1997; Sereno et al., 1995; DeYoe et al., 1996). The retinotopic mapping scans were obtained in scanning sessions separate from the adaptation sessions. The ROIs were drawn on computationally flattened representations (“flat maps”) of the occipital cortex generated from high-resolution T1-weighted anatomy images using the public domain software SurfRelax (Larsson, 2001) (http://www.cma.mgh.harvard.edu/iatr). Areas V1, V2, V3, V3A/V3B, hV4, and V7 have been described extensively in the literature (Wandell et al., 2005). VO1 is a coarsely retinotopic area anterior and lateral to hV4 (Wandell et al., 2005). LO1 and LO2 are two retinotopic areas in the lateral occipital cortex between dorsal V3 and V5/MT+ (Larsson and Heeger, 2006; Larsson et al., 2006).
fMRI data analysis.
The time-series data from each scan were motion corrected within and between scans (Jenkinson et al., 2002). The time series of each voxel was normalized (divided by its mean intensity) to compensate for variations in intensity with distance from the receiver coil and to convert the data from arbitrary image intensity units to percentage signal change.
For the retinotopy and localizer scans, the time-series data from each voxel were high-pass filtered (cutoffs at 0.026 and 0.02 Hz for the retinotopy and localizer scans, respectively) to remove low-frequency noise and drift (Zarahn et al., 1997; Smith et al., 1999) and then were fit with a sinusoid with period equal to that of the stimulus (Bandettini et al., 1993; Engel et al., 1994) yielding both the phase of the best-fit sinusoid and the correlation (technically, coherence) between the best-fit sinusoid and the measured time series. The coherence reflected the signal-to-noise ratio of the evoked responses (Engel et al., 1997), ranging from 0 (no modulation) to 1 (perfect coherence). The best-fit phase measured the temporal delay of the fMRI signal relative to the stimulus cycle. For the retinotopy scans, the phase corresponded to the polar angle and radial components of the retinotopic map. For the localizer experiments, response phase reflected the stimulus condition that evoked the larger response (illusory-contour stimuli or blank field).
Event-related data from the adaptation scans were analyzed separately for each visual area ROI as follows. For each adaptation session, the ROI was restricted to include only those voxels that responded strongly in the localizer scans taken in the same session. Specifically, the ROIs were restricted to include voxels with a response coherence >0.2 and phase between 0 and π, corresponding to stimulus “on” blocks. This ensured that ROIs included only voxels corresponding to visual field locations within the stimulus annulus and reduced intersession variability (Aguirre et al., 1998) by excluding voxels that did not respond strongly to the stimuli. The normalized time courses were averaged across voxels within each ROI. The resulting mean ROI time courses were filtered using a bandpass filter (0.02–0.2 Hz) to remove low-frequency drift and high-frequency noise. Responses to individual trials were extracted from the mean ROI time course by extracting the 16 time points (19.2 s) beginning with the onset of each trial. For each observer, parallel, orthogonal, and blank trials were each pooled across the two scanning sessions (vertical and horizontal adapters). We estimated the response to the adapter stimulus as the mean response for the adapter-only (blank) trials. The mean response to the adapter stimulus was subtracted from the response to each orthogonal and parallel trial, and the resulting time courses were adjusted to a zero baseline by subtracting the mean of the first four time points (corresponding to the time of presentation of the 4 s adapter and the 1 s blank between adapter and test). The peak response amplitudes evoked by the test stimuli (orthogonal or parallel) for each trial were then computed by averaging the two values of the time course around the peak. These two time points, which were the same for all of the ROIs and observers, corresponded to 4.6 and 5.8 s after test stimulus onset (i.e., typical of the peak hemodynamic impulse response). The response amplitudes of individual parallel and orthogonal trials were averaged by trial type to compute the mean response amplitude for each trial type for each visual area ROI. Confidence intervals for mean response amplitudes were calculated for individual observers as the SEM across trials. We estimated the statistical significance of adaptation (i.e., larger response amplitudes to orthogonal than to parallel trials) using a one-tailed t test in individual observers. For group means, we averaged the response amplitudes across the four observers. Confidence intervals for the group amplitude means were computed as the confidence interval pooled across observers (i.e., by taking the square root of the mean squared SEMs of individual observers divided by square root of the number of observers).
We also computed an adaptation index (AI) for each visual area to quantify the difference between the fMRI responses to parallel and orthogonal test stimuli as a result of adaptation in each visual area. The index was calculated as follows: AI = (AO − AP)/(|AO| + |AP|), where AP was the mean response amplitude to the parallel stimulus, and AO was the mean response amplitude to the orthogonal stimulus. This index ranged from −1 to 1. The index was one if adaptation was completely effective, reducing the response to the parallel stimulus to zero. If adaptation was ineffective, so that responses to the parallel and orthogonal stimuli were identical, the index was zero.
For each ROI, the adaptation indices were computed separately for each observer and then averaged. Confidence intervals for adaptation indices were estimated using a bootstrap method (Efron and Tibshirani, 1993). We estimated the underlying distribution of the adaptation indices by resampling the data 1000 times. The response amplitudes from each ROI and each observer were sampled with replacement to yield a resampled data set with the same size as the original data. The resampled adaptation indices were averaged across the four observers, yielding a distribution of 1000 adaptation indices for each ROI. Upper and lower error bounds for the mean adaptation indices were estimated as the 16th and 84th percentiles of the resulting distributions.
Psychophysical protocol.
To verify that the adapter stimuli were behaviorally effective in eliciting adaptation to illusory contours in the main experiment but not in the first control experiment, we measured postadaptation discrimination thresholds to the stimuli with various amounts of misalignment. The stimuli and protocol were similar to those in the fMRI experiments.
Observers performed a two-interval forced-choice task. Two stimuli were displayed in succession. One had the maximum amount of misalignment (0.55°) and the other (the target) had a smaller amount of misalignment. Observers indicated which stimulus was less misaligned (i.e., the stimulus that had the stronger percept of illusory contours). Before each session, observers viewed an adapter stimulus for 100 s. The adapters were either the illusory contours used in the main fMRI experiment or the misaligned adapter stimuli used in the first control experiment. Each trial was 6.6 s long and began by presenting a top-up adapter for 4 s, followed by a blank screen for 0.5 s, followed by two test stimuli for 0.5 s each, separated by an interstimulus interval of 0.5 s. At the end of each trial, observers indicated by key press which interval contained the (less misaligned) illusory-contour target. The misalignment in the target stimulus was varied by two interleaved one-up, two-down staircases (Wetherill and Levitt, 1965). A single experimental session consisted of 10 blocks, each with 20 trials. Target orientation alternated from block to block but was constant within a block. Each observer participated in at least four experimental sessions (one for each combination of adapter type and adapter orientation). The sessions were run on different days to avoid potential confounding effects of long-term adaptation. There were at least 100 trials for each combination of adapter type (illusory contour or misaligned), adapter orientation (horizontal or vertical), and test-stimulus orientation (orthogonal or parallel). Results were pooled across adapter orientations, and psychometric functions were fit to the data using a maximum-likelihood method (Wichmann and Hill, 2001a,b) (http://bootstrap-software.org/psignifit). Discrimination thresholds were defined as the amount of misalignment corresponding to 75% correct responses.
Results
Cortical activations evoked by localizer stimuli
To identify the cortical regions responding to the illusory-contour stimulus, we measured responses evoked by a localizer stimulus (illusory contours alternating with a blank, uniform gray screen; see Materials and Methods). Localizer stimuli evoked robust contiguous activity across retinotopic visual areas. The spatial pattern of the evoked responses was similar in different scanning sessions and across observers. In early visual areas, we found a band of decreased activity peripheral and foveal to the region of increased activity. This decrement in activity may be attributable to hemodynamic effects (e.g., “blood stealing”) or decreases in neuronal activity (Shmuel et al., 2002, 2006). We defined ROIs based on the localizer responses in each visual cortical area (see Materials and Methods).
Orientation-selective fMRI response adaptation to illusory contours
We found evidence of orientation-selective adaptation to illusory contours in multiple visual areas. Time courses from V1 and LO1 for one observer are shown in Figure 4. Note that the fMRI responses are larger for the orthogonal (unadapted) than the parallel (adapted) stimuli, a signature of orientation-selective adaptation. The differences between the responses to parallel versus orthogonal test stimuli were not significant in individual observers in visual area V1 but were statistically significant when averaged across observers. In visual area LO1, the response differences were significant both in the pooled data and in all four observers.
When pooled across observers, the fMRI responses showed significant adaptation in all visual cortical areas studied (Fig. 5, Table 1). The group-averaged time courses for three early visual areas (V1, V2, and V3), two ventral higher-tier areas (hV4 and VO1), and two dorsal (V3A/B and V7) and two lateral higher-tier areas (LO1 and LO2) are shown in Figure 5. Robust responses to the test stimuli in both orthogonal and parallel trials were found in most visual areas, although the responses in V7 and LO2 were weak and variable. The response amplitudes for all visual areas are summarized in Figure 6A. Responses in early visual areas were generally larger than those in higher-tier visual areas; mean response amplitudes were ∼0.6% in V1 and ∼0.5% in V2 and V3, whereas the response amplitudes were smaller (∼0.3%) in ventral higher-tier areas (hV4 and VO1) and smaller still (∼0.1%) in dorsal and lateral higher-tier visual areas (V7 and LO2).
Adaptation increases from early to higher-tier visual areas
To quantify the amount of adaptation relative to the overall response to the test stimuli, we calculated an adaptation index (see Materials and Methods). Adaptation indices in all visual areas, averaged across observers, are plotted in Figure 7A. Adaptation indices were significantly different from zero showing orientation-selective adaptation to illusory contours in all visual areas studied.
Although we measured larger absolute responses to illusory-contour stimuli in early visual areas, the adaptation indices were larger in higher-tier visual areas. Specifically, the adaptation index was small (∼0.1) in V1 and significantly larger (∼0.4) in higher-tier areas such as V7 and LO2 (Fig. 7A), indicating that adaptation differed across visual areas, and the relative amount of response adaptation increased from early to higher-tier visual areas.
No orientation-selective adaptation to the control stimuli
We argue that the adaptation effects in the main experiment are caused by the reduction of responses in neurons that are selective for the orientation of illusory contours. However, we first prove that these response reductions were not the result of incidental properties of the stimuli unrelated to the illusory contours per se. Two control experiments were designed to rule out two such features of the stimuli: the pattern of the line segments and the distribution of energy across the Fourier spectrum.
A version of our illusory contour stimuli with regular spacing of inducer lines (no jitter) forms a distinct pattern, with diagonal line segments in regular columns (for a vertical illusory-contour stimulus) and regular shifts of line segments from one column to the next. Although the irregular spacing (i.e., jitter) of the inducer lines (see Materials and Methods) (Fig. 1A) served to perceptually mask this inducer line pattern, the underlying pattern could have contributed to the observed adaptation. The first control experiment used misaligned adapter stimuli (Fig. 1B) to rule out this possibility. The misaligned adapter stimuli contained a similar underlying inducer line pattern as the (non-misaligned) adapter stimuli used in the main experiment but did not contain illusory contours. With the misaligned adapter stimuli, no visual area showed orientation-selective response adaptation to the illusory-contour test stimuli (Fig. 6B, Table 1) and hence no adaptation indices differed significantly from zero (Fig. 7B). Thus, orientation-selective adaptation in the main experiment is not likely caused by an incidental effect of the pattern of phase-shifted inducers in the illusory-contour stimuli.
The horizontal and vertical illusory-contour stimuli do not have identical Fourier power spectra. The spatial phase of inducers and illusory contours was changed each 160 ms during the adaptation period, and thus it is unlikely the adaptation effects were caused by neurons with linear receptive fields. Even so, the adaptation effects in the main experiment could have resulted from adaptation of neurons selective for the orientation and spatial frequency content of the stimuli (Ginsburg, 1975; Skottun, 1994). In the second control experiment, the adaptor stimuli were phase-randomized versions of the illusory-contour stimuli (Fig. 1C). Therefore, if the adaptation was attributable to neurons selective for the distribution of spectral power, then orientation-selective adaptation should have been obtained again with these phase-randomized adapters. However, no area showed a significant difference in response to the orthogonal versus the parallel test stimuli (Fig. 6C, Table 1), and no adaptation index was significantly different from zero (Fig. 7C).
Psychophysics
The results of the behavioral experiment confirmed that the adapter stimuli were behaviorally effective in eliciting adaptation to illusory contours in the main experiment but not in the first control experiment (Fig. 8). For the illusory-contour adapter stimuli, discrimination thresholds were higher for the parallel (adapted) than for the orthogonal (nonadapted) test stimuli, indicating orientation-selective adaptation (p < 0.05). In contrast, when the misaligned stimuli were used as adapters, detection thresholds for the parallel and orthogonal test stimuli did not differ significantly (p = 0.137). Note that, unlike the fMRI experiment, attention was not diverted from the illusory-contour stimuli in the behavioral experiment. It is possible that the adaptation in the fMRI data would have been stronger if observers had attended to the illusory contour stimuli (see Discussion). Therefore, the degree of orientation-selective adaptation is not directly comparable in the two experiments, but it is reassuring nonetheless to have found a perceptual effect of adaptation using the same stimuli.
Discussion
Our results indicate that there are neurons selective for illusory-contour orientation in most retinotopically organized human visual areas. The lack of significant adaptation in the control experiments confirms that the adaptation found in the main experiment was in response to illusory contours and not to incidental features of the spatial pattern of line inducers or the distribution of spectral energy. These findings are consistent with single-unit recordings in cats and monkeys that found neurons selective for the orientation of illusory contours in V1 and V2 (Redies et al., 1986; Sheth et al., 1996; Lee and Nguyen, 2001) and also confirm imaging studies showing that illusory contours evoke responses in higher-tier visual areas (Mendola et al., 1999; Murray et al., 2002; Stanley and Rubin, 2003). Our results extend these studies by demonstrating orientation-selective responses in higher-tier visual areas beyond V2.
Origin of extrastriate adaptation
We found orientation-selective adaptation to illusory contours in multiple visual areas. A decreased response to the adapted stimulus in higher-tier areas could be the result of decreased input resulting from adaptation in earlier areas. Such “inherited adaptation” propagating from earlier areas (Krekelberg et al., 2006) has been observed in single-unit recording studies (Kohn and Movshon, 2004; Tolias et al., 2005). Kohn and Movshon (2004) showed that direction-selective response adaptation in MT neurons was the result of adaptation in direction-selective neurons in V1 that project to MT. In addition, Larsson et al. (2006) suggested that orientation-selective adaptation to luminance-defined gratings found in human extrastriate visual areas such as V2 and V3 could be accounted for by propagation of adaptation from V1.
In our experiment, adaptation indices increased significantly from V1 and V2 to higher-tier visual areas. This result indicates either that there was a greater proportion of neurons that exhibited orientation-selective adaptation in higher-tier areas than in early visual areas, or that the adaptation was stronger in higher-tier areas for each neuron that adapted. This suggests that response adaptation in these higher-tier visual areas was not trivially inherited from V1. If inherited, it would have to be the case that higher-tier areas received a dominant input from the subpopulation of V1 neurons that exhibited orientation-selective responses to illusory contours. Otherwise, our results imply that there was additional adaptation in higher-tier visual areas.
The progressive increase in orientation-selective adaptation from V1 to higher-tier visual areas might reflect the progressive increase in receptive field sizes across those cortical areas. Given the larger receptive fields in higher-tier visual areas compared with V1 and V2, a larger portion of our stimulus (more illusory contours) fell within the receptive fields of the neurons in higher-tier visual areas, which might have contributed to the greater adaptation observed in those areas. Furthermore, the adaptation in V1 and V2 might have been stronger had the spatial frequency of the illusory contours been higher than what we used and hence better matched to the spatial frequency tuning of V1 and V2 neurons. That is, the increase in adaptation from V1 to higher-tier visual areas may not have been caused by differences in illusory contour processing per se. However, it is believed that neurons in successively higher visual areas perform additional processing, meaning that they respond differently, even after compensating for receptive field size differences. For example, it was reported that monkeys with V4 lesion exhibited a significant deficit in recognizing illusory contours (De Weerd et al., 1996), whereas those with IT lesion had an impaired ability to discriminate shapes defined by illusory contours (Huxlin et al., 2000), suggesting that neurons in higher-tier visual areas represent different properties of illusory contours or shapes defined by such contours even after compensating for receptive field size. Therefore, we find it unlikely that the observed difference in orientation-selective adaptation between visual areas is simply because of a fortuitous choice of spatial frequency.
It is also possible that the adaptation seen in early visual areas could have been the result of adaptation in higher-tier visual areas containing neurons selective to illusory contours and that this adaptation propagated back to early visual areas (including V1) via feedback connections. We cannot determine from the current data whether the adaptation observed in early visual areas was a result of independent adaptation in those areas or inherited by feedback from selective adaptation in another upstream extrastriate area.
It is possible that the adaptation in V1 would have been stronger if observers had attended to the illusory contour stimuli (i.e., without the attention-diverting task), because it is known that attention can strongly modulate adaptation to illusory contours (Montaser-Kouhsari and Rajimehr, 2004). However, attentional modulation of activity is usually found to be weaker in V1 than in higher-tier areas. Thus, if attending to the stimuli were to cause an increase in orientation-selective adaptation, one would expect a disproportionately larger increase in higher-tier areas than in V1.
Comparison between V1 and V2
Our study is the first to use fMRI response adaptation as a tool to characterize orientation-selective responses to illusory contours. Consistent with the prediction from single-unit data (von der Heydt and Peterhans, 1989; Sheth et al., 1996; Lee and Nguyen, 2001), we found orientation-selective adaptation in V2 (i.e., we demonstrated that there are neurons selective for illusory-contour orientation in V2). We also found orientation-selective adaptation to illusory contours in V1, consistent with some studies (Redies et al., 1986; Grosof et al., 1993; Sheth et al., 1996; Lee and Nguyen, 2001; Ramsden et al., 2001) but not with others (Peterhans and von der Heydt, 1989; von der Heydt and Peterhans, 1989). However, in these latter studies, single units were isolated by assessing their responses to luminance-defined contours. Hence, they would not have found neurons selective for the orientation of illusory contours that did not respond to luminance-defined contours of the same orientation and spatial frequency.
In our data, there was no difference in the amount of adaptation observed in V1 and V2. Our interpretation of this result is that neurons in V1 and V2 respond to illusory contours consistent with previous optical imaging and single-unit recording results (Sheth et al., 1996). It could be that neurons in V1 respond selectively to illusory contours because of feedback signals from V2 neurons. This latter suggestion is supported by single-unit data reporting that the population-averaged response to illusory contours in V1 was delayed relative to those in V2 (Lee and Nguyen, 2001).
In imaging studies, the involvement of V1 and V2 in illusory-contour representation is controversial. Mendola et al. (1999) reported activity related to Kanizsa-type illusory-contour stimuli both in V1 and V2, but the activation was weak and found only after sensitive across-subject analysis. However, we observed large fMRI responses in V1 and V2 to illusory contours composed of abutting gratings, enabling us to measure orientation-selective adaptation in V1 and V2. Hirsch et al. (1995) reported activation near the cortical region corresponding to the projection of the vertical meridian between V1 and V2 using Kanizsa-type illusory contours, which is consistent with a role of V1 in illusory-contour representation. V1 also responds to dynamic Kanizsa-type illusory-contour stimuli (Seghier et al., 2000). Maertens and Pollman (2005) found retinotopic illusory-contour responses in V1 in a perceptual-training design while observers were performing curvature discrimination on the illusory boundaries of Kanizsa figures. However, other studies reported no activation evoked by illusory contours in V1 (Ffytche and Zeki, 1996; Kruggel et al., 2001; Murray et al., 2002; Ritzl et al., 2003; Stanley and Rubin, 2003).
It is difficult to compare our results with previous imaging studies. First, there are significant differences between the methodologies used. Previous imaging studies measured the overall fMRI responses to illusory-contour stimuli relative to control stimuli containing no illusory contours. In contrast, our study measured the orientation selectivity of responses. Second, the absence of V1 activity in previous studies could be a result of less-sensitive scanner hardware (lower field strength and less-sensitive receiver coils), different data analysis techniques, or a failure to define V1 retinotopically. Third, many of the previous studies used the Kanizsa-type illusory contours, which may be processed differently from the abutting-line gratings used in our study (Gurnsey et al., 1992, 1996).
Involvement of higher-tier visual areas in illusory contour perception
Our data showed orientation-selective responses in higher-tier visual areas (V3, V4v, VO1, V3A/B, V7, LO1, LO2), implying that there are neurons in those areas that selectively responded to the orientation of illusory contours. No single-unit study has measured neuronal responses to illusory contours in higher-tier visual areas (beyond V2). However, there are two lesion studies suggesting that higher-tier visual areas are involved in the processing and representation of illusory contours. In one study, monkeys with V4 ablations exhibited significant deficits in recognizing illusory contours and contours defined by texture, but smaller or no deficits for motion-, color-, and luminance-defined contours (De Weerd et al., 1996). In a second study, lesions of monkey IT (which may be homologous to human VO1, LO1, and LO2) were found to impair a monkey's ability to discriminate shapes defined by illusory contours, suggesting that IT plays a role in illusory-figure perception (Huxlin et al., 2000).
Imaging studies have also demonstrated the involvement of higher-tier visual areas in illusory-contour representation. Consistent with our data in which adaptation was greater in higher-tier visual areas, Mendola et al. (1999). reported stronger activity in response to illusory contours in V3A, V4v, V7, V8, and the LOC compared with early visual areas. We also found evidence that neurons selective for the orientation of illusory contours exist in LO1 and LO2, two retinotopic areas in the LOC located between V3d and MT (Larsson and Heeger, 2006; Larsson et al., 2006). Several other imaging studies have also found that illusory contours evoke fMRI activity in the LOC (Murray et al., 2002; Ritzl et al., 2003; Stanley and Rubin, 2003). However, because the LOC is a large region containing multiple different subregions (Grill-Spector et al., 1999; Sawamura et al., 2005), the illusory-contour-related activity observed in these studies could have been in subregions of the LOC other than LO1 and LO2.
Footnotes
-
This work was supported by National Institutes of Health Grant EY16165 (M.S.L., D.J.H.) and by the Seaver Foundation. We thank Justin Gardner and Tony Movshon for their helpful comments.
- Correspondence should be addressed to Leila Montaser-Kouhsari, Department of Psychology, New York University, 6 Washington Place, Room 957, New York, NY 10003. lmk306{at}nyu.edu