The effects of auditory attention measured from human electrocorticograms

https://doi.org/10.1016/j.clinph.2005.11.009Get rights and content

Abstract

Objective

A central question in auditory electrophysiology has been whether selective attention can modulate exogenous components of the scalp-recorded N1 (the ‘N1 effect’). Intracranial electrocorticograms were used in the current work to investigate this issue in greater anatomical detail.

Methods

Data were recorded from subdural electrodes placed across temporal cortex in 6 patient-volunteers undergoing diagnostic procedures for medically intractable epilepsy. Patients performed a dichotic listening task in which they alternately attended to a series of tones presented to both ears (mean ISI 800 ms) by responding to rare frequency deviants.

Results

Effects of attention were measured on the largest negative and positive waveform deflections observed between 70 and 220 ms post-stimulus for stimuli presented contralateral to grid location. Peak deflections were most often recorded from the upper bank of the posterior superior temporal gyrus at approximately 89 and 173 ms on average (labeled N90stg and P170stg, respectively). Selective attention had little effect on peak latencies but significantly increased the N90stg for 3 subjects, increased the P170stg for two subjects, and decreased the P170stg for two other subjects.

Conclusions

Selective auditory attention can modulate neural response in auditory cortex.

Significance

The effects of attention on the scalp-recorded N1 component may arise in part from the enhancement of exogenous responses in temporal cortex.

Introduction

Many attempts have been made over the last several decades to provide a physiological foundation to the classic psychological debate regarding when and how selective attention affects sensory processing. This debate has traditionally centered around two opposing models of attentional intervention, the early and late selection theories. Both views are based on the assumption that cognitive resources are limited, and hence only some sensory input may be selected for continued processing en route to conscious awareness or a behavioral response. The role of selective attention is thought to determine when and how signals are selected for further processing.

The early-selection theory (Broadbent, 1958) posits that the attentional ‘bottleneck’ occurs at an early stage before sensory processing is complete. Such a view suggests an anatomical locus early in the sensory stream, perhaps at the thalamus or modality-specific cortex. Late-selection models argue that some information from the unattended channels is processed along with attended information (Treisman, 1960), or may even undergo full semantic analysis (Deutsch and Deutsch, 1963), before attention intervenes to select information for further cognitive processing. This latter view also implies a possibly distinct, endogenous source of the effects of selective attention on sensory processing (Kastner and Ungerleider, 2000, Kemner et al., 2004, Posner and Petersen, 1990).

Neurophysiological work in attention has investigated these implications by measuring when, where and how attention modulates neural response in the perceptual stream. Single-unit and neuroimaging studies have both found that selective attention can enhance visual neural activity, most commonly in extra-striate cortex (Maunsell and Cook, 2002, Treue and Maunsell, 1999), but also as early as V1 (Kastner and Ungerleider, 2000; Martínez et al., 1999). Neurophysiological research in auditory selective attention has been dominated by studies using non-invasive measures in humans. Electro- and magnetoencephalography (EEG, MEG) have been especially useful in this regard because their inherent temporal resolution is well suited to measuring the engagement of cognition (e.g. attention) on sensory events.

As in the psychological literature, two competing models of selective attention have emerged from these studies. In one view, Hillyard and colleagues have proposed an early-selection enhancement of the evoked auditory response (or event related potential, ERP) component occurring approximately 70–120 ms post-stimulus (the ‘N1 effect’, Hillyard et al., 1973, Näätänen and Picton, 1987, Woods, 1990). Though the N1 deflection in the ERP waveform may have several sub-components (Giard et al., 2000, Näätänen and Picton, 1987, Woods, 1990), its primary neural generator is believed to reside in auditory cortical fields along the supratemporal plane (Godey et al., 2001, Halgren et al., 1995, Liégeois-Chauvel et al., 1994, Näätänen, 1992, Picton et al., 1999, Woldorff et al., 1993, Yvert et al., 2001). This location, along with its early latency, tonotopic organization, and sensitivity to the state of the listener, has lead many to view the N1 as an exogenous (i.e. automatic) sensory response (see Näätänen, 1992 for a review).

An alternative position has been proposed by Näätänen and colleagues in which the effects of selective attention in auditory cortex arises primarily from an endogenous generator distinct from that producing the N1 (Näätänen, 1992). In this view, selective attention creates a generally broad negative component that adds to the N1 amplitude. The neural source of this negativity can purportedly be distinguished from that causing the N1 deflection mainly by the increased latency in the peak negative difference wave (Nd), formed by subtracting the ERP of the unattended signal from that of the attended signal. This negative enhancement has been labeled the ‘processing negativity’ (PN) and is viewed as the physiological index of an endogenous matching process occurring when subjects listen for targets in auditory input streams. Under conditions of high attentional load (e.g. signals with short inter-stimulus intervals (ISIs)), the peak PN latency may decrease, causing it to temporally overlap with the N1 in the ERP waveform and thus create an apparent N1 effect. The PN can also have a broader temporal effect such that later components like the P2, a positive deflection occurring on average 170–200 ms post-stimulus, may also exhibit a significant negative shift in attend relative to ignore conditions (Näätänen, 1992, Teder et al., 1993).

There is other evidence, however, to suggest that exogenous sensory components may indeed be directly modulated by selective attention. Woldorff and colleagues reported the attentional modulation of the positive mid-latency ERP deflection occurring around 20–50 ms post-stimulus (known alternately as P20–50 or P1), but only under conditions of high attentional load (e.g. ISIs of 200 ms) (Woldorff and Hillyard, 1991, Woldorff et al., 1993). These early attentional effects occur very near in time to the earliest evoked potentials recorded from human auditory cortex (Celesia, 1976, Howard et al., 2000), suggesting a potentially very early involvement of attention in sensory processing. In addition, the scalp distribution of some sub-components of the N1 effect can match that of the underlying N1 component itself, supporting the view of a common generator underlying the two phenomena (Giard et al., 2000, Woldorff et al., 1993, Woods et al., 1994). Nonetheless, a PN may still co-occur with attentional modulation of exogenous components due to its possibly distinct generator (Alho et al., 1994, Giard et al., 2000, Teder et al., 1993, Woods et al., 1991).

Patients undergoing evaluation for the surgical treatment of medically intractable epilepsy provide a unique opportunity to measure the effects of selective attention directly from human cortex. As part of their treatment, these patients at times require electrodes to be implanted either subdurally on the cortical surface or directly into the parenchyma of the brain in order to identify the locus of seizure activity and/or map cortical areas involved in speech and language. Electrophysiological recordings made from these implants are known as electrocorticograms (ECoG), and their increased signal-to-noise ratio and anatomical precision can help illuminate the sub-processes contributing to the scalp EEG.

Most ECoG studies have investigated aspects of auditory processing other than the potential influence of active attention on exogenous auditory response. This work includes general mapping of acoustically responsive cortex and its cortical connections (Brugge et al., 2003, Celesia, 1976, Halgren et al., 1995, Howard et al., 2000, Kropotov et al., 2000, Liégeois-Chauvel et al., 1994, Rosburg et al., 2004); mapping of responsiveness and discrimination of speech stimuli (Boatman et al., 1994, Crone et al., 2001, Liégeois-Chauvel et al., 1999, Matsumoto et al., 2004, Steinschneider et al., 1999); attention-independent ERP components such as the mismatch negativity (MMN) (Kropotov et al., 1995, Kropotov et al., 2000, Rosburg et al., 2005); and correlates of endogenous cognitive processing such as the late waveform component, P3/P300 (Halgren et al., 1995).

The goal of the current work is to investigate more explicitly the potential effects of selective auditory attention on human peri-Sylvian cortex as produced in the classic dichotic auditory oddball paradigm (Hillyard et al., 1973). It is hoped that results from this study will augment earlier depth electrode work which only indirectly considered the physiological effects of auditory attention (e.g. Halgren et al., 1995), and contribute to extant knowledge about how selective attention may modulate neural responses in auditory cortex.

It should be noted that the relationship between intracranial ERP components and their putative scalp counterparts is necessarily complex and can be difficult to ascertain (Nunez, personal communication, August 7, 2005). In the present work, no scalp EEG recordings to the experimental stimuli were available for any of the patients. However, even with such comparison recordings it may not always be theoretically possible to establish the desired isomorphisms between intra- and extra-cranial components. Such limits are primarily due to (1) electrical signal volume conduction across the scalp, and (2) the imposition by the high conductance skull of a strong low-pass filter on emanating electrical signals, in both the temporal and spatial domains (Srinivasan, 1999). One possible effect of these physical factors may be to allow a focal intracranial source with strong low temporal frequency content to impact scalp-recorded ERPs (e.g. on lower bandwidth components such as N1, P2, etc.), but simultaneously attenuate its higher spatial frequencies and thus obscure its cortical location.

Given the generally greater signal-to-noise ratio and spatial sensitivity of ECoG recordings, intracranial waveforms most likely represent only sub-components of scalp-recorded ERP components. It is also unclear, in light of the analysis complexities described earlier, whether such sub-components can be unequivocally isolated from scalp ERP component complexes even though they may contribute to the scalp ERP waveform. Hence, to distinguish intracranial sources from their possible scalp counterparts, we will identify the intracranial deflections under current investigation using a nomenclature of polarity (negative/positive), mean latency (in meter second), and anatomical location.

Section snippets

Subjects

Subjects were volunteers recruited from a patient population undergoing diagnostic and surgical procedures for medically intractable epilepsy. Protocols were approved by the University of Wisconsin–Madison and Middleton Veteran's Affairs Hospital institutional review boards and all patients provided informed consent. Electrodes were placed intracranially according to the clinical need of each patient. Patients were tested in either of two paradigms. (1) Semi-chronic—electrodes were implanted in

Behavioral results

Table 2 presents mean (SD) p(C) and d-prime averaged across all experimental sessions for all patients for the two attention conditions (‘attend toward’/‘attend away’). The nearly equivalent level of performance in the two conditions suggests that a similar level of attention was achieved across all patients when attending either toward or away from the contralateral ear. However, the large standard deviations for the p(C) measure indicate that some patients were much better at the deviant

Auditory responses recorded across the lateral surface of the superior temporal gyrus

The principal finding of this study was that presentation of brief tones produced a strong, multiphasic response in the human peri-Sylvian area, most often reaching a maximum over posterior portions of the upper superior temporal gyrus (STG) and Sylvian fissure (SF). The averaged ERP waveform recorded from the electrode with the generally strongest response to acoustic stimulation exhibited a large negative peak between 80 and 90 ms post-stimulus, and was subsequently labeled the N90stg based on

Acknowledgements

Funding for this research was provided in part by NIH/NIDCD 5K23DC006415 (PCG). The authors would like to thank Steven Hillyard, John Brugge, and two anonymous reviewers for their helpful comments on earlier drafts of this paper. Thanks also go to Prakash Khanikar, Lisa Rhuelow, and Sue Busta for their assistance in organizing, collecting and analyzing portions of the data reported in this paper. Finally, this work would not have been possible without the generosity of the patients who

References (70)

  • R.T. Knight et al.

    The effects of frontal and temporal–parietal lesions on the auditory evoked potential in man

    Electroencephalogr Clin Neurophysiol

    (1980)
  • R.T. Knight et al.

    The effects of lesions of superior temporal gyrus and inferior parietal lobe on temporal and vertex components of the human AEP

    Electroencephalogr Clin Neurophysiol

    (1988)
  • J.D. Kropotov et al.

    Human auditory-cortex mechanisms of preattentive sound discrimination

    Neurosci Lett

    (2000)
  • A. Liasis et al.

    Intracranial auditory detection and discrimination potentials as substrates of echoic memory in children

    Cogn Brain Res

    (1999)
  • C. Liégeois-Chauvel et al.

    Evoked potentials recorded from the auditory cortex in man: evaluation and topography of the middle latency components

    Electroencephalogr Clin Neurophysiol

    (1994)
  • W.C. McCallum et al.

    The form and distribution of auditory evoked potentials and CNVs when stimuli and responses are lateralized

  • R.W. McCarley et al.

    Event-related potentials in schizophrenia: their biological and clinical correlates and a new model of schizophrenic pathophysiology

    Schizophr Res

    (1991)
  • T. Rosburg et al.

    Short-term habituation of the intracranially recorded auditory evoked potentials P50 and N100

    Neurosci Lett

    (2004)
  • M.G. Woldorff et al.

    Modulation of early auditory processing during selective listening to rapidly presented tones

    Electroencephalogr Clin Neurophysiol

    (1991)
  • D.L. Woods et al.

    Intermodal selective attention. I. Effects on event-related potentials to lateralized auditory and visual stimuli

    Electroencephalogr Clin Neurophysiol

    (1992)
  • R.J. Zatorre et al.

    Auditory attention to space and frequency activates similar cerebral systems

    Neuroimage

    (1999)
  • M. Alcaini et al.

    Selective auditory attention effects in tonotopically organized cortical areas: a topographic ERP study

    Hum Brain Mapp

    (1995)
  • D.E. Broadbent

    Perception and communication

    (1958)
  • J.F. Brugge et al.

    Functional connections between auditory cortex on Heschl's gyrus and on the lateral superior temporal gyrus in humans

    J Neurophysiol

    (2003)
  • G. Celesia

    Organization of auditory cortical areas in man

    Brain

    (1976)
  • J.A. Deutsch et al.

    Attention: some theoretical considerations

    Psychol Rev

    (1963)
  • F. Di Nocera et al.

    Resampling approach to statistical inference: bootstrapping from event-related potentials data

    Behav Res Methods Instrum Comput

    (2000)
  • B. Efron et al.

    An introduction to the bootstrap

    (1993)
  • M.H. Giard et al.

    Neurophysiological mechanisms of auditory selective attention in humans

    Front Biosci

    (2000)
  • R. Hari

    The neuromagnetic method in the study of the human auditory cortex

  • S.A. Hillyard et al.

    Electrical signs of selective attention in the human brain

    Science

    (1973)
  • M.A. Howard et al.

    Auditory cortex on the human posterior superior temporal gyrus

    J Comp Neurol

    (2000)
  • S. Kastner et al.

    Mechanisms of visual attention in the human cortex

    Annu Rev Neurosci

    (2000)
  • J.D. Kropotov et al.

    Mismatch negativity to auditory stimulus change recorded directly from the human temporal cortex

    Psychophysiology

    (1995)
  • A. Liasis et al.

    Intracranial evidence for differential encoding of frequency and duration discrimination responses

    Ear Hear

    (2000)
  • Cited by (31)

    • Neural modulations in the auditory cortex during internal and external attention tasks: A single-patient intracranial recording study

      2022, Cortex
      Citation Excerpt :

      fMRI studies demonstrated enhancement of the activity of the auditory cortex by attention (e.g., Allen et al., 2019; Paltoglou et al., 2011; Petkov et al., 2004; Pugh et al., 1996; Woodruff et al., 1996), but fMRI cannot establish temporal precision of the effects. In non-invasive, scalp EEG/MEG recordings, the most reliable finding has been that the N100 − an auditory ERP component that peaks at about 100 msec from stimulus onset (Näätänen & Picton, 1987; Pratt, 2011) and likely has its primary source generator in the auditory cortex (Giard et al., 2000; Godey et al., 2001; Näätänen & Picton, 1987; Neelon et al., 2006; Vaughan & Ritter, 1970) − might be enhanced by attentional focus (Hackley et al., 1990; Hansen and Hillyard, 1980, 1983; Hillyard et al., 1973; Luck & Kappenman, 2012; Näätänen et al., 1981; Rosburg et al., 2009; Woldorff et al., 1993; Woldorff & Hillyard, 1991). In addition to N100, there were some reports that attention-driven enhancement can already be found for the P50 auditory ERP component (Guterman et al., 1992; Wan et al., 2008; Woldorff et al., 1993; Woldorff & Hillyard, 1991; Zabelina et al., 2015) − the component that peaks at about 50 msec from stimulus onset (for review: Pratt, 2011) and also likely originates in the auditory cortex (Huotilainen et al., 1998; Liegeois-Chauvel et al., 1994; Ponton et al., 2002; Pratt, 2011).

    • Differential modulation of auditory responses to attended and unattended speech in different listening conditions

      2014, Hearing Research
      Citation Excerpt :

      The attentional effect can occur at various processing stages. For example, the effects of attentional modulation appear as early as ∼100 msec in some experiments (e.g., Hillyard et al., 1973; Hansen and Hillyard, 1988; Alho et al., 1994; Melara et al., 2002; Choi et al., 2014) but later in other experiments (>150 msec; e.g, Picton and Hillyard, 1974; Neelon et al., 2006; Snyder et al., 2006; Ross et al., 2010). Stimulus properties and the subject's task are likely to determine the latency of top-down attentional modulations.

    • Generator localization by current source density (CSD): Implications of volume conduction and field closure at intracranial and scalp resolutions

      2012, Clinical Neurophysiology
      Citation Excerpt :

      MEG and intracranial methods suggest multiple generators in Heschl’s gyrus and the planum temporale (Godey et al., 2001). However, the human AEP N1 component occurs much later than the initial response of primary cortex, its generators are not as sharply localized to Heschl’s gyrus (Liégeois-Chauvel et al., 1994), and subdural electrode grids identify a corresponding maximum over posterior portions of the Sylvian fissure and upper superior temporal gyrus (Neelon et al., 2006). Despite these caveats, the auditory N1 component is particularly well-suited to showcase the merits of CSD methods.

    View all citing articles on Scopus

    Portions of this work have been presented earlier in abstract form.

    View full text