Elsevier

NeuroImage

Volume 52, Issue 2, 15 August 2010, Pages 606-616
NeuroImage

Multisensory conflict modulates the spread of visual attention across a multisensory object

https://doi.org/10.1016/j.neuroimage.2010.04.245Get rights and content

Abstract

Spatial attention to a visual stimulus that occurs synchronously with a task-irrelevant sound from a different location can lead to increased activity not only in the visual cortex, but also the auditory cortex, apparently reflecting the object-related spreading of attention across both space and modality (Busse et al., 2005). The processing of stimulus conflict, including multisensory stimulus conflict, is known to activate the anterior cingulate cortex (ACC), but the interactive influence on the sensory cortices remains relatively unexamined. Here we used fMRI to examine whether the multisensory spread of visual attention across the sensory cortices previously observed will be modulated by whether there is conceptual or object-related conflict between the relevant visual and irrelevant auditory inputs. Subjects visually attended to one of two lateralized visual letter-streams while synchronously occurring, task-irrelevant, letter sounds were presented centrally, which could be either congruent or incongruent with the visual letters. We observed significant enhancements for incongruent versus congruent letter–sound combinations in the ACC and in the contralateral visual cortex when the visual component was attended, presumably reflecting the conflict detection and the need for boosted attention to the visual stimulus during incongruent trials. In the auditory cortices, activity increased bilaterally if the spatially discordant auditory stimulation was incongruent, but only in the left, language-dominant side when congruent. We conclude that a conflicting incongruent sound, even when task-irrelevant, distracts more strongly than a congruent one, leading to greater capture of attention. This greater capture of attention in turn results in increased activity in the auditory cortex.

Introduction

Attentional selectivity, the ability to select one particular source of information to guide action while ignoring others that are irrelevant to the current behavioral goal, is a critical cognitive function for successful navigation in the world. Our attentional selection is enhanced when the behaviorally relevant stimuli belong to the same object. Unisensory visual studies of object-based attentional selection have shown an apparently automatic spread of visual attention through a visual cued object (Egly et al., 1994, Abrams and Law, 2000). Correspondingly, an fMRI study has reported higher activation in human visual cortex representing locations in a cued object relative to equidistant locations on another uncued object (Müller and Kleinschmidt, 2003).

Based on these results, in our recent study, (Busse et al., 2005) we asked if visual attention can spread cross-modally to encompass other parts of a multisensory object. In that study, subjects attended to one of two lateralized streams consisting of rapidly presented simple visual stimuli. Half of these laterally presented visual stimuli were synchronously accompanied by a centrally presented, task-irrelevant tone. Activity associated with the central auditory stimuli occurring synchronously with an attended versus an unattended lateral visual stimulus was extracted by a set of subtraction contrasts. The EEG analysis revealed a late, sustained, frontally distributed pattern of brain activity for the central tones that occurred in the context of an attended relative to an unattended visual stimulus, an activation that resembled attention-related enhancements seen at earlier latencies during intra-auditory selective attention. The corresponding fMRI contrasts showed that this object-related attention effect included enhanced activity in auditory cortex. The pattern of results strongly suggested that attention to the visual modality can spread to encompass simultaneously occurring signals from the auditory modality, even when the auditory stimuli are task-irrelevant and from a different location.

In the Busse et al. (2005) study, the multisensory spread of visual attention across the sensory cortices during attended visual stimulation was revealed with visual–auditory stimulus combinations that were essentially neutral — i.e., they were neither related nor unrelated. In the present work, we asked whether this ‘spread-of-attention’ activity would be modulated by object-related conflict between the relevant visual input and the irrelevant auditory one. To produce object-related multisensory conflict, the stimulus inputs consisted of task-relevant, laterally presented, visual letters (“A” or “X”) to be discriminated that could be accompanied by simultaneous, task-irrelevant spoken letters presented centrally (also “A” or “X”). If the visual letter was an “A”, the spoken letter could be either congruent (“A”) or incongruent (“X”). In previous studies of such multisensory conflict, all the stimuli were typically presented exclusively from the central position, and these have indicated that reading of written letters can be impeded by incongruent letter–sounds and facilitated by congruent ones (e.g. van Atteveldt et al., 2007). Here, the question was whether and by how such multimodal object-related conflict might modulate the cross-modal ‘spread-of-attention’ activity.

When attention spreads across a multisensory object whose sensory components conflict in some way, we might expect to see increased activity in both the ACC and the visual cortex for incongruent compared to congruent letter/sound combinations. Such effects would be expected for several reasons. First, previous neuroimaging studies have implicated the ACC as being involved in object-related conflict processing (van Veen and Carter, 2005, Weissman et al., 2003, among others). Secondly, in a previous multisensory cueing fMRI study in which the relevant visual and irrelevant auditory letter stimuli were simultaneously presented from a central location (Weissman et al., 2004), the results indicated that conflict induced by co-occurring incongruent auditory letters led to increased activity in both the ACC and the visual cortex (Weissman et al., 2004). The authors concluded that during conflict between the modalities, attention toward the relevant input is enhanced by boosting activity in the sensory cortex of the task-relevant modality. In our present study, given that the visual letters were relevant and auditory letters to be ignored, we predicted activity in the visual cortex would be enhanced during conflicting stimulation to help counter the auditory semantic distraction. In the current case, we could assess whether such incongruency-related activity patterns in the ACC and visual cortex will occur even during the multisensory spreading of attention across space.

An additional important question investigated here, however, is whether and how the ‘spread-of-attention’ activity in the auditory cortex would differ for congruent versus incongruent semantic objects. On the one hand, there is evidence (e.g. Molholm et al., 2007) that visual attention would only spread cross-modally if both modalities involved were associated with the same object, and not if they were linked to different objects. In a recent EEG study, Molholm et al. (2007) presented a central stream of alternating pictures and sounds to subjects, with a task to detect a specific picture while ignoring the sounds. Results indicated an increased auditory ERP negativity beginning around 200 ms when the response to the ignored sound corresponded to the attended visual object (e.g. barking sound and dog picture) relative to when the sound that was not a feature of the attended visual object (e.g. barking sound and guitar picture). Thus, the object-based spread of visual attention to the unattended auditory modality was greater when the visual and auditory semantic features matched (Molholm et al., 2007; also see Fiebelkorn et al., 2010).

On the other hand, a task-irrelevant sound that is to be ignored might be more distracting when incongruent with a relevant visual stimulus. Distracting stimuli are known to capture attention (e.g. Sabri et al., 2006, Watkins et al., 2007; cf. Dalton and Spence, 2007 for multisensory distraction). For example, a combined EEG/fMRI auditory study revealed that the more an irrelevant auditory deviant was perceived as distracting, the higher the elicited activity in the auditory cortex (Sabri et al., 2006; see also Watkins et al., 2007).Thus, for the auditory cortex, two opposing predictions could be made. One prediction hypothesizes that there would be greater ‘spreading-of-attention’ activity from the attended visual component to the synchronous task-irrelevant auditory component when they are congruent, due explicitly to their congruence engendering increased activity. Enhanced activity for congruent trials relative to incongruent ones could also be because as soon as the incongruency is detected the brain would respond by quickly sending inhibitory signals to suppress its processing. In contrast, if a conflicting incongruent sound (even when task-irrelevant) distracts more strongly than a congruent one, it may trigger a greater capture of attention, thereby leading to more activity in auditory cortex.

In summary, we specifically asked how the ‘spread-of-attention’ from an attended visual stimulus to a synchronous task-irrelevant auditory stimulus arising from another location would be modulated differentially as a function of whether these components conflicted. Accordingly, we presented congruent and incongruent letter–sound combinations while subjects visually attended one of two lateralized visual letter-streams and ignored the synchronously occurring task-irrelevant central sounds, which could be either semantically congruent or incongruent. Analyses of the interactions of attention by congruency would then provide insight into the mechanisms of the multisensory ‘spread-of-attention’ process, and into multisensory feature processing interactions more generally.

Section snippets

Subjects

Sixteen healthy, right-handed subjects (ages 18–35 years; equal numbers of males and females) participated in the experiment. After receiving an explanation of the procedures, all subjects gave written informed consent according to a protocol approved by the Institutional Review Board of Duke University.

Paradigm

Sequences of lateralized visual letters (“A” or “X”) were presented to the left and right visual fields (see Fig. 1) while subjects covertly attended to a designated side in each run to

Behavioral data during fMRI scanning

Subjects were instructed to visually attend to the letters presented on the designated side and to discriminate between the letters “A” and “X” with a button press. Reaction times for correct responses, pooled over the responses to visual letters “A” and “X”, were as follows: pure visual presentation 573 ms (SD 61 ms); congruent presentation 541 ms (SD 59 ms), and incongruent presentation 549 ms (SD 59 ms). A repeated-measures ANOVA revealed a highly significant effect of stimulus presentation (F

Discussion

In the present study, we investigated how the ‘spread-of-attention’ across a multisensory audiovisual object shown in Busse et al. (2005) for simple unrelated stimuli would be modulated when the spatially discordant auditory and visual stimulus components conflicted. In agreement with our predictions, we observed significant enhancements for incongruent versus congruent letter–sound combinations in the ACC and in the contralateral visual cortex when the visual component was attended. In the

Acknowledgments

We thank Allen Song for his input on fMRI issues of this study and to Susan Music and Natalie Goutkin for technical assistance. This research was supported by NIH R01-NS051048 grant to M.G.W.

References (45)

  • N.M. van Atteveldt et al.

    Top-down task effects overrule automatic multisensory responses to letter–sound pairs in auditory association cortex

    Neuroimage

    (2007)
  • V. van Veen et al.

    Separating semantic conflict and response conflict in the Stroop task: a functional MRI study

    Neuroimage

    (2005)
  • M.B. Wall et al.

    Functional imaging of the human superior colliculus: an optimised approach

    Neuroimage

    (2009)
  • D.H. Weissman et al.

    Conflict monitoring in the human anterior cingulate cortex during selective attention to global and local object features

    Neuroimage

    (2003)
  • D.H. Weissman et al.

    Momentary reductions of attention permit greater processing of irrelevant stimuli

    Neuroimage

    (2009)
  • R.A. Abrams et al.

    Object-based visual attention with endogenous orienting

    Percept. Psychophys.

    (2000)
  • M.A. Burock et al.

    Randomized event-related experimental designs allow for extremely rapid presentation rates using functional MRI

    NeuroReport

    (1998)
  • L. Busse et al.

    The spread of attention across modalities and space in a multisensory object

    Proc. Natl Acad. Sci. USA

    (2005)
  • M. Corbetta et al.

    Control of goal-directed and stimulus-driven attention in the brain

    Nat. Rev. Neurosci.

    (2002)
  • S. Crottaz-Herbette et al.

    Where and when the anterior cingulate cortex modulates attentional response: combined fMRI and ERP evidence

    J. Cogn. Neurosci.

    (2006)
  • P. Dalton et al.

    Attentional capture in serial audiovisual search tasks

    Percept. Psychophys.

    (2007)
  • Dorsaint-Pierre, R., Penhune, V.B., Watkins, K.E., Neelin, P., Lerch, J.P., Bouffard, M., Zatorre, R.J., 2006....
  • Cited by (35)

    • ERP-study on the time course of disgust-motivated spatial avoidance

      2019, Biological Psychology
      Citation Excerpt :

      To select the P1 and P3 time windows, we extended the traditional ERP-data-driven approach (Keil et al., 2014) by choosing temporal ROI’s based on an averaged data set across all target conditions. Note, that a similar approach is also used in fMRI analyses to confine spatial ROIs (e.g., Macaluso, Frith, & Driver, 2000; Zimmer & Macaluso, 2007; Zimmer, Roberts, Harshbarger, & Woldorff, 2010). Importantly, such an averaged data set is comprised of the sum of all stimulus conditions and therefore orthogonal to the final analyses (i.e. subtraction comparisons of conditions).

    • An attentional mechanism for minimizing cross-modal distraction

      2017, Acta Psychologica
      Citation Excerpt :

      Investigating this hypothesis is important for three reasons. First, multisensory interactions are linked to unique distraction effects (Zimmer, Roberts, Harshbarger, & Woldorff, 2010). Second, some methods for reducing unimodal distraction (e.g., increasing the perceptual load of a visual display) do not minimize cross-modal distraction (Tellinghuisen & Nowak, 2003).

    • The interactions of multisensory integration with endogenous and exogenous attention

      2016, Neuroscience and Biobehavioral Reviews
      Citation Excerpt :

      There are no consistent conclusions regarding whether a congruent sound or an incongruent sound with a task-relevant visual stimulus can trigger a greater spread of attention. For example, Zimmer et al. (2010a,b) proposed that an incongruent sound is a stronger distraction than a congruent one and captures attention more intensively. However, the cross-modal spread of attention has also been demonstrated to occur only when a task-irrelevant sound is semantically congruent with a visual stimulus (Molholm et al., 2007).

    • Top-down attention regulates the neural expression of audiovisual integration

      2015, NeuroImage
      Citation Excerpt :

      Specifically this network is activated when participants have to override an automatic behavior in favor of a non-automatic one (e.g., Stroop or Flanker task). This type of conflict responses in the ACC have been reported before in multisensory contexts in written words, letters or pictures combined with corresponding or non-corresponding auditory counterparts (Noppeney et al., 2008; Orr and Weissman, 2009; Weissman et al., 2004; Zimmer et al., 2010) and in AV speech (Miller and D'Esposito, 2005; Pekkola et al., 2006; Szycik et al., 2009). Given the above, one has to consider that in our results of visual context down-regulation accompanied by the increment in the cingulate gyrus and the insula when attending to incongruent AV speech could reflect detection of the AV speech conflict (ACC) and, as a consequence of this conflict, a decrease in the processing of the least relevant modality, in this case the visual one (see Navarra et al., 2010, for a similar hypothesis based on behavioral findings).

    View all citing articles on Scopus
    View full text