Abstract
The integration of visual details into a holistic percept is essential for object recognition. This integration has been reported as a key deficit in patients with autism spectrum disorders (ASDs). The weak central coherence account posits an altered disposition to integrate features into a coherent whole in ASD. Here, we test the hypothesis that such weak perceptual coherence may be reflected in weak neural coherence across different cortical sites. We recorded magnetoencephalography from 20 adult human participants with ASD and 20 matched controls, who performed a slit-viewing paradigm, in which objects gradually passed behind a vertical or horizontal slit so that only fragments of the object were visible at any given moment. Object recognition thus required perceptual integration over time and, in case of the horizontal slit, also across visual hemifields. ASD participants were selectively impaired in the horizontal slit condition, indicating specific difficulties in long-range synchronization between the hemispheres. Specifically, the ASD group failed to show condition-related enhancement of imaginary coherence between the posterior superior temporal sulci in both hemispheres during horizontal slit-viewing in contrast to controls. Moreover, local synchronization reflected in occipitocerebellar beta-band power was selectively reduced for horizontal compared with vertical slit-viewing in ASD. Furthermore, we found disturbed connectivity between right posterior superior temporal sulcus and left cerebellum. Together, our results suggest that perceptual integration deficits co-occur with specific patterns of abnormal global and local synchronization in ASD.
SIGNIFICANCE STATEMENT The weak central coherence account proposes a tendency of individuals with autism spectrum disorders (ASDs) to focus on details at the cost of an integrated coherent whole. Here, we provide evidence, at the behavioral and the neural level, that visual integration in object recognition is impaired in ASD, when details had to be integrated across both visual hemifields. We found enhanced interhemispheric gamma-band coherence in typically developed participants when communication between cortical hemispheres was required by the task. Importantly, participants with ASD failed to show this enhanced coherence between bilateral posterior superior temporal sulci. The findings suggest that visual integration is disturbed at the local and global synchronization scale, which might bear implications for object recognition in ASD.
- autism spectrum disorder
- coherence
- interhemispheric
- magnetoencephalography (MEG)
- oscillation
- perceptual integration
Introduction
Integration of visual features is essential for perceiving objects as a coherent whole, a key domain of difficulty in autism spectrum disorders (ASDs) as proposed by the weak central coherence account (WCC) (Happé and Frith, 2006). It has also been suggested that ASDs represent disorders of information integration or “binding” at the neural level (Brock et al., 2002). Frith (2004) even discussed the possibility of ASD reflecting a disconnection syndrome, similar to what has been hypothesized for other disorders, such as schizophrenia. In the typically developed brain, there is increasing evidence that synchronization of oscillatory brain activity in the gamma-frequency band constitutes a mechanism that binds information spatially distributed across the brain (Tallon-Baudry and Bertrand, 1999; Engel and Singer, 2001; Engel et al., 2001). Accordingly, neurons, which represent the same object or event, are active in temporal synchrony (Engel et al., 1991; von der Malsburg, 1994; Engel and Singer, 2001). Indeed, reduced gamma-band activity has been previously associated with perceptual binding deficits in ASD; these studies focused on local synchronization processes and static, illusory stimuli, such as the Kanizsa square and Mooney faces (Sun et al., 2012; e.g., Stroganova et al., 2012). Structural results from diffusion tensor imaging (Travers et al., 2012), as well as functional data from MRI (e.g., Anderson et al., 2011; Dinstein et al., 2011) and M/EEG coherence (e.g., Isler et al., 2010; Khan et al., 2013) also suggest generally compromised anatomical and functional connectivity between distant cortical areas in ASD.
We previously showed that the ability to bind moving dots into a coherent motion percept in an apparent motion stimulus was impaired in ASD, but only when binding across visual hemifields was required (David et al., 2010). Such horizontal binding relies on signal transfer across cortical hemispheres (Chaudhuri and Glaser, 1991; Rose and Büchel, 2005; Helfrich et al., 2014), which led us to assume that a reduction in interhemispheric synchrony might be associated with perceptual integration deficits in ASDs. Here, we recorded magnetoencephalography (MEG) signals from 20 adult participants with ASD and 20 matched controls using an object recognition paradigm that requires the temporal integration of visual details (Parks, 1965). In this paradigm, images of objects pass behind a vertical slit yielding only partial information at any given moment. In a previous study using this stimulus, impaired behavioral performance was shown in ASD (Nakano et al., 2010). We hypothesized that reduced long-range functional connectivity will be associated with perceptual binding difficulties in ASD, especially when coupling between hemispheres is required. To probe long-range integration across cortical hemispheres, we also used a horizontal slit, which spanned both visual hemifields. To quantify the functional connectivity between hemispheres, we calculated interhemispheric imaginary coherence at the cortical source level in ROIs previously implicated in shape and motion perception (Malach et al., 1995; Yin et al., 2002; Beauchamp et al., 2004; Beauchamp, 2005). Although the present study was focused on long-range or global synchronization processes that might be disrupted in ASD, we expected local synchronization as reflected in regional signal power also to be affected.
Materials and Methods
Participants.
Twenty adults with ASD (age 31.2 ± 8.5 years; 9 female) and 20 healthy controls (age 31.5 ± 8.8 years; 9 female) participated in the study. Six additional participants with ASD were excluded for the following reasons: IQ <85 (n = 1), drop out (n = 1), technical problems (n = 1), metal artifacts in MEG recordings (n = 2), and behavioral performance <5% (n = 1). ASD and control participants did not differ significantly with respect to gender, age, IQ, and verbal IQ (Table 1). IQ and Verbal IQ were measured with the Wechsler Adult Intelligence Scale (Wechsler, 1997; Jacobs and Petermann, 2007). To control for potential confounding effects of visual working memory on task performance, we assessed the Figural Memory subtest from the Intelligence Structure Test (Amthauer et al., 2001), which showed no group difference (Table 1). All participants had normal or corrected-to normal vision and were either right- or two-handed. They gave full written informed consent and received monetary compensation for their participation. The study was in accordance with the Declaration of Helsinki and approved by the ethics committee of the Hamburg Medical Association.
All participants in the ASD group have been previously been diagnosed meeting the DSM-IV criteria for Autism or Asperger Syndrome (American Psychiatric Association, 2000). Additionally, all diagnoses were confirmed by a local psychiatrist (D.S.), who conducted a semistructured anamnestic interview and the Structured Clinical Interview for DSM-IV axis I and II disorders (Fist et al., 1997, 2002). The Autism Spectrum Quotient (Baron-Cohen et al., 2001) was significantly higher for ASD than for control participants (Table 1). As a common comorbidity of ASD (Ghaziuddin et al., 2002), depressive symptoms were assessed using the Becks Depression Inventory-II (Beck et al., 1996) (Table 1). Among the ASD participants, five participants received antidepressant medication (selective serotonin reuptake inhibitors, tricyclic antidepressants), but they did not show significant differences of task performance compared with ASD participants without antidepressant medication (t(22.92) = 0.71; p = 0.48). Control participants reported no history of a psychiatric disorder confirmed by the screening modules of the Structured Clinical Interview for DSM-IV axis I and II (Fist et al., 1997, 2002).
Stimuli and task.
Stimuli were 244 line-drawings (black lines on white background) from a standardized picture set (Snodgrass and Vanderwart, 1980), depicting objects, animals, and humans. Stimuli extended 3° on average (±0.8° SE) and were presented on a back projection screen with an LCD projector (Sanyo Pro Xtrax PLC-XP51) with a refresh rate of 60 Hz and a spatial resolution of 1024 × 768 pixels. The entire display subtended ∼30 × 22° of visual angle at a viewing distance of 80 cm.
Participants performed an object recognition task. They were instructed to identify images of objects passing behind a narrow slit (“slit-viewing”) (Parks, 1965). That is, only fragments of the objects were visible at any given moment. For object recognition, these fragments had to be perceptually integrated over time. There were two conditions: Each of the 244 line drawings was presented behind a vertical or horizontal slit (Fig. 1A). At the beginning of each trial, a white slit (0.17° width) was displayed on a gray background. Participants were asked to fixate a red point centered in the slit, and to maintain fixation throughout the trial. After 1 s, the stimulus appeared moving behind the slit either from left to right (vertical slit) or from top to bottom (horizontal slit). This motion was achieved by shifting the stimulus at each monitor frame one or two pixels to the left (vertical slit condition) or bottom (horizontal slit), resulting in moving speeds of 3.5°/s and 7°/s and a total stimulus presentation time of 1 s. Notably, Nakano et al. (2010) demonstrated that moving speed does not affect performance in the slit-viewing task. After stimulus presentation, with a delay of 300 ms, participants were cued to verbally report the recognized object (or to indicate if the object was not identified). Each response was recorded on-line and transcribed off-line. Participants pressed a button to continue with the next trial, which started after a blank intertrial interval of 200–600 ms (Fig. 1A). The experimental design comprised variation of two factors: one between factor “group” (ASD vs control), one within factor “slit orientation” (vertical vs horizontal). Conditions were run block-wise in two MEG recording sessions, and the order of the blocks was counterbalanced across participants. In the horizontal slit condition, objects spanned both visual hemifields. Thus, only in this condition was integration across visual hemifields required.
Behavioral performance indexed by correct trials (in percentage) was compared between groups and conditions in a 2 × 2 mixed-model variance analysis with a level of significance at 5%. A trial was defined as correct if at least the stimulus category could be named correctly (e.g.: “animal” for a depicted “dog”). Data were analyzed using R (http://www.r-project.org).
Neurophysiological data acquisition and preprocessing.
MEG was recorded continuously using a 275-channel whole-head system (Omega 200, CTF Systems) in a magnetically shielded room. Four sensors were not operating, resulting in a total of 271 sensors. The electro-occulogram and electrocardiogram were recorded simultaneously for off-line artifact correction. The head position relative to the MEG sensors was measured continuously using a set of head localization coils (nasion, left and right ear). For all analyzed datasets, the head displacement across a recording session was <20 mm (mean ± SD: 8.4 ± 3.7 mm and 8.3 ± 3.5 mm in ASD group and control group). MEG signals were digitized at 1200 Hz with a low-pass filter at 300 Hz.
The data were bandpass filtered offline (0.5–250 Hz, low-pass filter at 250 Hz, Butterworth of order 4, forward and backward direction; high-pass filter at 0.5 Hz, Butterworth of order 3, forward and backward direction), and line noise was removed using a band-stop filter at 49.5–50.5, 99.5–100.5, and 149.5–150.5 Hz (Butterworth of order 4, forward and backward direction). The continuous data were epoched from −1 s to 1.2 s time-locked to onset of the moving picture. The mean signal was subtracted. Only correct trials were selected for further analyses. Trials containing signal jumps or strong muscle artifacts were visually detected and rejected from further analysis. Independent component analysis (Hyvärinen, 1999; Jung et al., 2000) was applied, and components containing artifacts related to heartbeats, eye blinks, eye movements, saccadic spike artifacts and remaining muscle activity were rejected from the data. All MEG analyses were performed in MATLAB (The MathWorks) using custom-written scripts and the open source MATLAB toolboxes Fieldtrip (Oostenveld et al., 2010) and SPM2 (http://www.fil.ion.ucl.ac.uk/spm).
Spectral analysis.
Spectral estimates were computed using the multitaper approach based on discrete prolate spheroidal (Slepian) sequences (Mitra and Pesaran, 1999) across equally scaled frequencies f from 2 to 102 Hz (in 4 Hz steps) and time t from −500 to 1000 ms (in 50 ms steps). A sliding window method was used with fixed taper length (250 ms) and fixed frequency smoothing (± 8 Hz). All transformations to the frequency domain were performed on the single-trial level before averaging across trials. Thus, spectral estimates contained signal components phase-locked and non-phase-locked to the stimulus onset. The resulting total power is reported (Fig. 2) as percentage of change at frequency f relative to the prestimulus baseline (500 ms before up to stimulus onset). Therefore, the average baseline power was first subtracted from the power of the poststimulus interval, and the difference was then divided by the average baseline power.
For analyses at the cortical source level, we computed spectral estimates independently for five separate frequency bands (4–8 Hz, theta; 8–12 Hz, alpha; 14–36 Hz, beta; 36–60 Hz, low gamma; 60–100 Hz, high gamma) and two time intervals (50–550 ms and 550–1000 ms after stimulus onset). Although deviating from classical frequency bands, we defined the border between beta- and low gamma-band at 36 Hz based on our power results because the upper boundary of the beta-band decrease was rather high. Spectral estimates were computed by different approaches for low (theta and α) and high frequency bands (beta and low/high gamma). In particular, for high frequencies, we used a multitaper approach based on discrete prolate spheroidal (Slepian) sequences (Mitra and Pesaran, 1999), computing spectral estimates on center frequencies with suitable smoothing. For low frequencies (theta and α), we applied Hanning windows on the two time intervals. Estimated power and cross-spectra were averaged across those frequencies within a frequency band. For instance, time-frequency analysis was performed at 4 and 6 Hz; then, power and cross-spectra were averaged across 4 and 6 Hz for the analysis of the theta-band. All transformations to the frequency domain were performed on the single-trial level before averaging across trials. Thus, spectral estimates contained signal components phase-locked and non–phase-locked to the stimulus onset. All further power and coherence analyses were performed on the cortical source level.
Source analysis.
To estimate the neuronal activity at the cortical source level, we used the “beamforming” adaptive linear spatial filtering technique (Van Veen et al., 1997; Gross et al., 2001). At each source location, three orthogonal filters were computed that pass activity from that location with unit gain while maximally suppressing activity from other sources. We selected different filters depending on the brain parameter of interest (see Coherence analysis and Power analysis). To derive the complex valued source estimates, we multiplied the complex frequency domain data with the real-valued filters.
For all participants, T1-weighted structural MRIs were obtained, which were used to build the individual head models for source analysis. Structural T1-weighted magnetization prepared gradient-echo images (TR = 2300 ms, TE = 2.98 ms, FoV = 256 mm, 1 mm slice thickness, TI = 1100 ms, 9° flip angle) with 1 × 1 × 1 mm3 voxel resolution were obtained on a 3T Siemens Magnetom Trio MRI scanner (Siemens Medical Systems) at the University Medical Center Hamburg-Eppendorf. To derive the physical relation between sensors and sources (i.e., the leadfield matrix), individual single-shell head models were constructed from the structural MRIs (Nolte, 2003). A regular 3D grid (1 cm spacing) was defined in stereotactic (i.e., MNI) space and transformed into individual head space using the individuals' MRI. MEG sensors were aligned to the individual head geometry based on three fiducial points (nasion, left and right ear).
All source level analyses were independently performed for the aforementioned five frequency bands and two time intervals. We characterized power and coherence responses relative to the prestimulus baseline interval (−500 to −50 ms before stimulus onset). To avoid spurious effects of the filter on the baseline correction, a common filter was used for stimulus and baseline interval.
Coherence analysis.
To quantify the frequency-dependent synchronization between pairs of signals, we estimated the imaginary part of coherence at the cortical source level. As a measure of connectivity, we have chosen the maximized imaginary coherence as proposed by Ewald et al. (2012). The idea is that the 3D activity estimated for each voxel by a spatial filter without specifying the source orientation represents mainly activity from neural sources in the vicinity of the voxel, which can have different orientations. In general, it is very well possible that relatively weak sources have a higher connection to a specific region than the strong sources. Choosing a source orientation to maximize power would then miss the interacting sources. It is also very well possible that the different sources with different orientations contributing to the estimate of the activity at a specific voxel interact with different regions. In this case, choosing any fixed source orientation would necessarily miss relevant interaction. To take these effects into account, we optimize the source orientations of two sources for each pair of voxels by maximizing the imaginary part of coherence between the two source activities. We recall here the analytic solution of this optimization problem and refer to Ewald et al. (2012) for the derivation. The input is the 6-dimensional cross-spectrum C (2 voxels with 3 orientations each), which we write as follows: where CAA and CBB are the 3 × 3 cross-spectra within voxel A and B, respectively, CAB is the cross-spectrum between the voxels, and the upper index H denotes hermitian conjugation (i.e., complex conjugation and transpose). We define a matrix X as follows: Then the maximized imaginary coherence is given by the following: where λmax is the maximal eigenvalue of XXT. As the distribution of imaginary coherence values is non-Gaussian, we applied a nonlinear z-transform to the coherence estimates (Nolte et al., 2004). As an additional estimate of coherence, we calculated “lagged coherence” (Pascual-Marqui et al., 2011), also maximized for imaginary coherence.
For baseline correction, activity of the baseline was subtracted from the activity of the stimulus interval. Before statistical testing, imaginary coherence values were averaged across the two time intervals. To provide a calculation of interhemispheric coherence between the left- and right-hemispheric part of the ROIs, we averaged the imaginary coherence between those regions. To quantify differences of interhemispheric coherence between groups and condition, a 2 × 2 mixed-model variance analysis was performed. The level of significance was adjusted to p = 0.0033 to control for multiple comparisons (three ROIs and five frequency bands), using the Bonferroni method (Bonferroni, 1936). To ensure that changes in imaginary coherence reflected changes in phase consistency, rather than changes in signal amplitude, we calculated spectral power averaged between ROIs.
As a post hoc analysis, we calculated imaginary coherence between the posterior superior temporal sulcus (pSTS) and the cerebellum to provide a link between the results of global and local synchronization analyses. Based on previous evidence (Jack et al., 2011; Jack and Morris, 2014; Sokolov et al., 2014; Jack and Pelphrey, 2015), we focused on the right pSTS and the left cerebellum. Baseline correction, averaging of imaginary coherence between regions, and statistical analysis were the same as for interhemispheric coherence analyses.
ROIs.
Interhemispheric coherence was calculated at three ROIs, which have previously been associated with motion and shape perception: lateral occipital area (LO) of the lateral occipital complex, human motion area (hMT/V5) and pSTS.
The LO comprises the ventral extent of lateral occipital complex that is implicated in shape perception (Malach et al., 1995; Grill-Spector et al., 1998). LO has been previously defined based on localizer scans (e.g., Grill-Spector et al., 1998) by identifying voxels that showed significantly greater responses to whole versus scrambled objects (Grill-Spector et al., 1998; Lerner et al., 2002; Downing et al., 2007; Eger et al., 2008; Mundy et al., 2012). Here, we used the average of the reported coordinates (MNI coordinates: −45.4, −75.6, −3.6; 44.8, −73.6, −3.0) as the center for a spherical ROI of 10 mm diameter using the WFU PickAtlas software (Lancaster et al., 1997, 2000; Maldjian et al., 2003).
Area hMT/V5, located in the vicinity of the posterior branch of the inferior temporal sulcus, is essentially involved in the perception of motion (Allman and Kaas, 1971; Watson et al., 1993). To define this area as a ROI, we used a probability map provided by the SPM Anatomy Toolbox (Eickhoff et al., 2005; Malikovic et al., 2007).
The pSTS has been associated with the integration of information both within modalities (e.g., visual form, visual motion) and across modalities (Beauchamp et al., 2004; Jellema et al., 2004; Beauchamp, 2005) and with the perception of biological motion (Bonda et al., 1996; Grossman and Blake, 2002). Anatomically defining this area at the posterior part of the superior temporal sulcus (MNI coordinates: ±58, −50, 26), we constructed a spherical ROI of 10 mm diameter using the WFU PickAtlas software (Lancaster et al., 1997, 2000; Maldjian et al., 2003). To calculate imaginary coherence between the pSTS and the cerebellum, the cerebellum has been defined as implemented in the WFU PickAtlas software (Lancaster et al., 1997, 2000; Maldjian et al., 2003).
To evaluate significant cognition-connectivity links more directly, we performed a Pearson-correlation analysis between significant interhemispheric coherence values and behavioral performance (i.e., accuracy on the slit-viewing task). To this end, we calculated the difference between experimental conditions (vertical–horizontal slit-viewing) for each group of participants.
Power analysis.
To quantify signal power at the cortical source-level, we selected the filter (i.e., the source orientation) which maximized power. This filter was multiplied with the complex frequency domain data. For baseline correction, activity of the baseline was first subtracted from the activity of stimulus interval; then, the difference was divided by baseline-activity.
Only the interaction of group × condition was statistically tested. Therefore, a nonparametric permutation test was applied based on spatial clustering (Maris and Oostenveld, 2007). In short, t tests were performed at all cortical locations. Only those data exceeding a criterion (p < 0.05) were included in a clustering procedure. Clusters are defined as sets of spatially connected significant data points. On the cluster level, a statistic was calculated by summing up the t values within a cluster. The largest cluster-level statistic was compared with a permutation distribution under the null hypothesis that assignments to groups or conditions are exchangeable. Therefore, data were randomly assigned to groups or conditions and clustered in the same way as the original data. This was done for 3000 repetitions to construct a distribution. Because of the decision on cluster level, only one statistical comparison was performed, and no further correction for multiple comparisons was necessary.
Results
Behavioral results
Behavioral performance was measured as the percentage of correctly identified objects per condition. A 2 × 2 mixed-measures ANOVA was used to calculate differences between groups (ASD vs control) and conditions (horizontal vs vertical slit). Overall, the two groups did not differ with respect to behavioral performance (F(1,38) = 2.04, p = 0.16). There were a significant main effect of condition (F(1,38) = 8.20, p = 0.01) and a significant interaction of group and condition (F(1,38) = 5.57, p = 0.02; Fig. 1B). Participants with ASD identified fewer objects in the horizontal compared with the vertical slit condition (t(19) = 3.82, p = 0.001), whereas control participants showed no significant performance differences between the vertical and horizontal slit (t(19) = 0.35, p = 0.73). These results indicate specific integration difficulties of ASD participants during horizontal slit-viewing.
Coherence results
As a measure of long-range connectivity between hemispheres, we quantified the imaginary coherence between hemispheres for each of the three ROIs (LO, hMT/V5, pSTS). For pSTS, there was a significant interaction of group × condition in interhemispheric coherence in the low gamma-frequency band (F(1,38) = 10.53, p = 0.002; Fig. 3). Participants of the control group showed stronger interhemispheric coherence in the horizontal compared with the vertical slit condition (t(19) = −3.26, p = 0.004), whereas ASD participants showed no significant difference between the vertical and horizontal slit (t(19) = 1.31, p = 0.21). To test the reliability of our method, we repeated the analysis using an additional estimate of coherence (“lagged coherence”) (Pascual-Marqui et al., 2011). This analysis confirmed the previously detected interaction effect between bilateral pSTS in the low gamma-band yielding almost identical results (F(1,38) = 10.53, p = 0.002). No main effects were observed in the low gamma-frequency band (all p > 0.0033). For the other frequency bands and for LO and hMT/V5, all main or interaction effects were not significant (all p > 0.0033). To ensure that the interaction effect in interhemispheric gamma-band coherence between pSTS reflected true changes in phase consistency rather than changes in signal amplitude, we calculated gamma-band power within pSTS, which showed no significant interaction effect (F(1,38) = 0.03, p = 0.87; Fig. 3B). The correlation between interhemispheric coherence of bilateral pSTS and behavioral performance was not significant for either group (ASD: r = −0.07, p = 0.76; control: r = 0.07, p = 0.77).
The post hoc analysis of imaginary coherence between right pSTS and left cerebellum yielded a significant group × condition interaction of in the low gamma-frequency band (F(1,38) = 5.39, p = 0.03). Participants with ASD showed reduced imaginary coherence in the horizontal compared with the vertical slit condition (t(19) = 2.42, p = 0.03), whereas participants of the control group showed no significant difference between the vertical and horizontal slit (t(19) = −1.40, p = 0.18). To ensure that the interaction effect in between right pSTS and left cerebellum reflected true changes in phase consistency rather than changes in signal amplitude, we calculated gamma-band power within these regions, which showed no significant interaction effect, either in pSTS (F(1,38) = 0.001, p = 0.97) or in the cerebellum (F(1,38) = 0.62, p = 0.43).
Power results
Figure 2 depicts the time-frequency representations of neural responses at posterior sensors. After stimulus onset, a transient response from ∼50–200 ms at frequencies <50 Hz preceded a tonic response. This tonic response consisted of an increase in signal amplitude in the gamma-frequency range (>50 Hz) and a decrease at frequencies ∼<36 Hz.
As a measure of local synchronization in ASD, we analyzed signal power at the cortical source level. To refer to the behavioral and the coherence results, we exclusively calculated the group × condition interaction term. Permutation tests yielded a significant cluster within the beta-frequency band within the early time window (50–550 ms after stimulus onset) that included visual and cerebellar regions (p = 0.01; Fig. 4). Beta-band activity was strongly reduced in participants with ASD in the horizontal compared with the vertical slit condition, whereas no difference between conditions was found in the control group. There were no group × condition effects in the other frequency bands (all p > 0.05).
Discussion
Here, ASD participants showed impaired object recognition when perceptual binding across hemifields and time was required (“horizontal slit-viewing”). This behavioral deficit was matched by neural synchronization abnormalities during horizontal binding in ASD. First, at the global synchronization scale, there was a lack of enhancement of imaginary coherence in the gamma-band between both pSTS. Second, at the local synchronization scale, beta-band power was reduced at inferior temporal/cerebellar cortices. Third, and establishing a link between the two previous results, interhemispheric coherence between pSTS and cerebellum was also reduced in ASD. Our data demonstrate a perceptual integration deficit in ASD at higher levels of information processing in accordance with the WCC account.
Global synchronization
Control participants showed increased interhemispheric coherence in the gamma-frequency range between bilateral pSTS for perceptual integration across hemifields during horizontal slit-viewing. In contrast, participants with ASD failed to show such enhancement in pSTS, which co-occurred with object recognition deficits when horizontal binding was required. ASD has been previously associated with (1) perceptual integration deficits (Plaisted et al., 1999; Milne et al., 2002; Nakano et al., 2010), (2) abnormalities in gamma synchronization (Grice et al., 2001; Brown et al., 2005; Uhlhaas and Singer, 2006; Sokhadze et al., 2009; Stroganova et al., 2012; Sun et al., 2012), (3) structural/functional abnormalities in the pSTS (Pelphrey et al., 2011), and (4) abnormal brain connectivity between hemispheres (Anderson et al., 2011; Verly et al., 2014; Hahamy et al., 2015). Here, we demonstrate a convergent deficit embracing all four aspects of disturbance.
As our task required the identification of moving objects, we calculated interhemispheric coherence at motion- and shape-selective regions (i.e., LO, hMT/V5, pSTS) (Beauchamp et al., 2004; Beauchamp, 2005). Although LO and hMT/V5 were presumably involved in the slit-viewing task (Yin et al., 2002), our data suggest a coupling deficit in ASD predominantly between left and right pSTS, which possibly relates to the complexity of our stimuli and suggests rather higher-order perceptual integration deficits. The pSTS has been considered as an integrative multimodal brain area (Beauchamp et al., 2004; Beauchamp, 2005), bridging across various functions, such as perceptual integration of visual shape and visual motion (Jellema et al., 2004), biological motion (Grossman and Blake, 2002), and social cognition (Yang et al., 2015). Indeed, people with ASD show deficits related to these functions (Koldewyn et al., 2011; Pelphrey et al., 2011; Scheel et al., 2011; Shih et al., 2011), clearly implicating pSTS as a critical area in disturbed brain networks in ASD.
Our behavioral finding of impaired object recognition during slit-viewing is in line with the WCC account (Happé and Frith, 2006), which has been empirically addressed by numerous studies of perceptual integration in ASD (Plaisted et al., 1999; Milne et al., 2002; Nakano et al., 2010). The pathophysiology underlying WCC has remained a matter of conjecture. The impaired performance during horizontal (i.e., across hemifields) but not vertical binding suggested a specific deficit in long-range interhemispheric connectivity. In accordance, we previously showed (1) specific impairments of apparent motion perception along the horizontal (i.e., also requiring binding across the hemifields) but not the vertical meridian in ASD (David et al., 2010; see also O'Keefe and Lindell, 2013), and (2) an association between horizontal apparent motion perception and enhanced gamma-band coupling between motion-selective cortical areas in typically developed participants (Rose and Büchel, 2005; Helfrich et al., 2014). Here, horizontal slit-viewing was also associated with interhemispheric neural coherence in the gamma-band in controls, but not in ASD participants. Abnormalities in the gamma-band have repeatedly been reported for ASD and have been discussed as possible neural signature of WCC (Uhlhaas and Singer, 2006). Brock et al. (2002) suggested that WCC-associated perceptual integration deficits reflect reduced synchronization of gamma-band activity between local networks. Our results strongly support this hypothesis (also see evidence on reduced interhemispheric M/EEG coherence in frequencies <13 Hz) (Isler et al., 2010; Catarino et al., 2013; Khan et al., 2013; Carson et al., 2014). Yet, in relation to WCC, it has also been suggested that local synchronization within networks is intact or even enhanced (Brock et al., 2002; Belmonte et al., 2004); this could be reflected in higher or no power differences in ASD. The available evidence, including our own, however, does not entirely support this aspect of the hypothesis (Brown et al., 2005; Sokhadze et al., 2009; Baruth et al., 2010; Stroganova et al., 2012; Sun et al., 2012; Buard et al., 2013; Snijders et al., 2013). The kind or degree of perceptual integration required by the visual stimulus or task and the presence or lack of a behavioral deficit might be able to explain empirical inconsistencies. In our study, a local synchronization effect in ASD was found in occipitocerebellar beta-band power, which was reduced for horizontal compared with vertical slit-viewing in ASD.
Local synchronization
Coherence estimates can be confounded by changes in signal-to-noise ratios imposed by fluctuations of local power. To ensure that changes in imaginary coherence reflected changes in phase consistency rather than power, we (1) used a coherence measure that maximizes the imaginary part of coherence, thereby eliminating artifacts from volume conduction (Ewald et al., 2012), and (2) ruled out power differences in the selected ROIs. Our results clearly differentiate global synchronization effects (coherence) from local synchronization effects (power), as they occurred in different frequency bands and at different brain sites.
Participants with ASD showed reduced beta-band power in temporal-cerebellar cortices during horizontal slit-viewing. Oscillatory activity in the beta-band oscillations has been associated with several functions relevant for the slit-viewing task, including sensorimotor predictions (Engel and Fries, 2010), sensory evidence accumulation during decision-making (Donner et al., 2007; Siegel et al., 2011), and visual short-term memory (Tallon-Baudry et al., 2001). Moreover, there is evidence for disturbed beta-band activity in ASD (Puzzo et al., 2010; Buard et al., 2013; Yoshimura et al., 2013; Leung et al., 2014), for example, during the perception of coherent compared with scrambled shapes (Bangel et al., 2014). The sites of disturbed beta-band activity in our ASD group (i.e., the inferior temporal cortex and the cerebellum) have also previously been implicated in both cognitive processes relevant to our task (e.g., complex object representations in the inferior temporal lobule) (Gauthier and Tarr, 2002; Gerlach et al., 2002) as well as ASD pathology (e.g., Dalton et al., 2005; Wang et al., 2014). Especially the cerebellum has long been implicated in ASD pathophysiology (Courchesne, 1997; Fatemi et al., 2012; Wang et al., 2014). Despite the classical assignment of motor functions to the cerebellum, there is growing evidence for its involvement in perception (Baumann et al., 2015), including the predictive processing of perceptual events (Shadmehr et al., 2010; Popa et al., 2012; Roth et al., 2013).
Given this previous evidence, the ASD-related reduction in cerebellar beta power might reflect impaired sensory prediction and timing in ASD, a deficit that would certainly interfere with our slit-viewing task, which requires the successful integration of visual details over time. This interpretation would be in line with the recent notion of autism as disorder of prediction (Sinha et al., 2014). It is unclear, though, why sensory prediction abnormalities in ASD would affect horizontal more strongly than vertical slit-viewing (see also David et al., 2010). Therefore, it seems more likely that our effect reflects deficits in perceptual motion integration rather than prediction, as the cerebellum has also been related to the perception of biological motion (Bonda et al., 1996; Grossman et al., 2000; Vaina et al., 2001). This interpretation is underlined by previously reported strong functional connectivity between the cerebellum, in particular left Crus 1, and right pSTS (Jack et al., 2011; Sokolov et al., 2012, 2014; Jack and Pelphrey, 2015). Moreover, Jack and Morris (2015) reported disturbed connectivity between these areas in ASD, which predicted mentalizing disabilities in their participants. Our results suggest reduced interhemispheric communication in ASD by means of synchronized activity in the low gamma-frequency range between left cerebellum and right pSTS for perceptual integration across hemifields. This finding nicely links the presently detected ASD-related neural abnormalities at the local and global synchronization level, suggesting that reduced beta-band power is closely related to the lack of enhancement of interhemispheric gamma-band coupling in ASD during perceptual integration.
Limitations
The lack of correlation between the neuronal dynamics and the behavioral effects limits our conclusions about the functional relationship between interhemispheric gamma-band coherence of bilateral pSTS and impaired perceptual integration across visual hemifields. For the following reasons, we do not think that these two findings are unrelated or merely coincidental: (1) interhemifield perception has been previously associated with interhemispheric signal transfer (e.g., Engel et al., 1991); (2) the pSTS represents a higher-order visual area that has been associated with perceptual-semantic functions closely related to the demands of the slit-viewing task (Beauchamp et al., 2004); (3) our analysis was conservative (e.g., using multiple comparison correction), hypothesis-driven, and focused on specific, task-relevant ROIs, providing a high level of anatomical specificity; and (4) direct correlations between behavior and neural changes are not easily obtained due to the heterogeneity of behavioral characteristics associated with ASD (Geschwind, 2009) and might require larger sample sizes.
Footnotes
This work was supported by European Union FP7-ICT-270212, ERC-AdG-269716, and H2020-641321 to A.K.E. We thank all volunteers who participated in the experiment; Shigeru Kitazawa for help with the slit-viewing paradigm; Christiane Reissmann and Gerhard Steinmetz for assistance with MEG data recording; and Karin Deazle for help with recruitment and data assessment.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Nicole David, Department of Neurophysiology and Pathophysiology, University Medical Center Hamburg-Eppendorf, Martinistrasse 52, 20246 Hamburg, Germany. ndavid{at}uke.de