Skip to main content

Main menu

  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Collections
    • Podcast
  • ALERTS
  • FOR AUTHORS
    • Information for Authors
    • Fees
    • Journal Clubs
    • eLetters
    • Submit
  • EDITORIAL BOARD
  • ABOUT
    • Overview
    • Advertise
    • For the Media
    • Rights and Permissions
    • Privacy Policy
    • Feedback
  • SUBSCRIBE

User menu

  • Log out
  • Log in
  • My Cart

Search

  • Advanced search
Journal of Neuroscience
  • Log out
  • Log in
  • My Cart
Journal of Neuroscience

Advanced Search

Submit a Manuscript
  • HOME
  • CONTENT
    • Early Release
    • Featured
    • Current Issue
    • Issue Archive
    • Collections
    • Podcast
  • ALERTS
  • FOR AUTHORS
    • Information for Authors
    • Fees
    • Journal Clubs
    • eLetters
    • Submit
  • EDITORIAL BOARD
  • ABOUT
    • Overview
    • Advertise
    • For the Media
    • Rights and Permissions
    • Privacy Policy
    • Feedback
  • SUBSCRIBE
PreviousNext
Research Articles, Behavioral/Cognitive

Improved Speech Hearing in Noise with Invasive Electrical Brain Stimulation

Prachi Patel, Bahar Khalighinejad, Jose L. Herrero, Stephan Bickel, Ashesh D. Mehta and Nima Mesgarani
Journal of Neuroscience 27 April 2022, 42 (17) 3648-3658; DOI: https://doi.org/10.1523/JNEUROSCI.1468-21.2022
Prachi Patel
1Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York 10027
2Department of Electrical Engineering, Columbia University, New York, New York 10027
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bahar Khalighinejad
1Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York 10027
2Department of Electrical Engineering, Columbia University, New York, New York 10027
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jose L. Herrero
3Hofstra Northwell School of Medicine, New York, New York 11549
4Feinstein Institute for Medical Research, New York, New York 11030
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stephan Bickel
3Hofstra Northwell School of Medicine, New York, New York 11549
4Feinstein Institute for Medical Research, New York, New York 11030
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Ashesh D. Mehta
3Hofstra Northwell School of Medicine, New York, New York 11549
4Feinstein Institute for Medical Research, New York, New York 11030
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Ashesh D. Mehta
Nima Mesgarani
1Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, New York 10027
2Department of Electrical Engineering, Columbia University, New York, New York 10027
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF
Loading

This article has a correction. Please see:

  • Erratum: Patel et al., “Improved Speech Hearing in Noise with Invasive Electrical Brain Stimulation” - June 24, 2022

Abstract

Speech perception in noise is a challenging everyday task with which many listeners have difficulty. Here, we report a case in which electrical brain stimulation of implanted intracranial electrodes in the left planum temporale (PT) of a neurosurgical patient significantly and reliably improved subjective quality (up to 50%) and objective intelligibility (up to 97%) of speech in noise perception. Stimulation resulted in a selective enhancement of speech sounds compared with the background noises. The receptive fields of the PT sites whose stimulation improved speech perception were tuned to spectrally broad and rapidly changing sounds. Corticocortical evoked potential analysis revealed that the PT sites were located between the sites in Heschl's gyrus and the superior temporal gyrus. Moreover, the discriminability of speech from nonspeech sounds increased in population neural responses from Heschl's gyrus to the PT to the superior temporal gyrus sites. These findings causally implicate the PT in background noise suppression and may point to a novel potential neuroprosthetic solution to assist in the challenging task of speech perception in noise.

SIGNIFICANCE STATEMENT Speech perception in noise remains a challenging task for many individuals. Here, we present a case in which the electrical brain stimulation of intracranially implanted electrodes in the planum temporale of a neurosurgical patient significantly improved both the subjective quality (up to 50%) and objective intelligibility (up to 97%) of speech perception in noise. Stimulation resulted in a selective enhancement of speech sounds compared with the background noises. Our local and network-level functional analyses placed the planum temporale sites in between the sites in the primary auditory areas in Heschl's gyrus and nonprimary auditory areas in the superior temporal gyrus. These findings causally implicate planum temporale in acoustic scene analysis and suggest potential neuroprosthetic applications to assist hearing in noise.

  • auditory cortex
  • human electrophysiology
  • speech preception

Introduction

Speech communication often takes place in the presence of competing sound sources, yet we are able to carry on conversations effortlessly even when speech signals are mixed with considerable noise. The neural processes that enable the segregation and enhancement of the acoustic features of speech relative to background noise remain largely unknown (Bregman, 1994; Assmann and Summerfield, 2004). It has been shown that the invariance of speech emerges gradually as the neural responses to sound ascend in the auditory pathway from the cochlea (Wang and Shamma, 1994) to the thalamus (Miller et al., 2001) and primary and nonprimary cortical areas (Kumar et al., 2007; Hong et al., 2008; Rabinowitz et al., 2013; Mesgarani et al., 2014). Studies of speech-in-noise perception in humans have found an increasingly invariant cortical representation of speech in the nonprimary auditory cortex that is unaffected by noise (Kell and McDermott, 2019). This invariant representation is constructed by the selective suppression of the encoded noise features in neural responses that occur across multiple anatomic areas, including Heschl's gyrus (HG), the planum temporale (PT), and the superior temporal gyrus (STG) (Khalighinejad et al., 2019). The causal role of each of these regions in removing noise from speech is not yet fully understood; and in particular, the role of the PT in acoustic scene analysis remains debated (Hickok and Saberi, 2012).

Anatomical studies of the PT have shown that it is an intermediary area between HG and the STG (Griffiths and Warren, 2002; Morosan et al., 2005; Hickok and Saberi, 2012), placing its function between those of nonspecific responses to speech in HG and speech-specific responses in the lateral STG (Humphries et al., 2014). These findings are consistent with the functional studies that have speculated on the role of the PT in the separation and identification of sound sources from background sounds (Griffiths and Warren, 2004; Isenberg et al., 2012). Indeed, an increased cortical surface area in the PT has been shown to correlate with the accuracy of acoustically distorted speech sound categorization (Elmer et al., 2013), confirming the role of the PT as a primary processor of rapidly changing acoustic cues in speech and nonspeech signals (Meyer et al., 2012).

Beyond these correlative studies that suggest a role for the PT in processing speech in noise, the causal role of the PT in speech perception in noise has not yet been studied. This absence of progress is because of the lack of methods that can manipulate neural responses with a high degree of spatiotemporal specificity in awake behaving humans as they engage in realistic speech-in-noise perception tasks. One such method that is often used for clinical mapping is electrical brain stimulation (EBS). During EBS, an electrical charge is delivered to a focal brain area to disrupt its function (Parvizi and Kastner, 2018). EBS in humans is unique because humans can articulate their perceived experiences during stimulation, an advantage that is absent in animal models. In the auditory domain, previous studies have reported that EBS of the STG causes auditory hallucinations (e.g., hearing a buzzing sound), illusions, and errors (e.g., hearing a distorted version of a sound) (Selimbeyoglu and Parvizi, 2010; Leonard et al., 2019), suppression of tinnitus (Fenoy et al., 2006), transient hearing loss (Sinha et al., 2005), and have implicated the posterior STG in basic auditory functions, such as syllable discrimination (Boatman et al., 1995; Miglioretti and Boatman, 2003). On the other hand, EBS of the medial HG can induce the perception of distinct tones (Donovan et al., 2015). The effect of EBS in PT remains less clear because of the limited intracranial studies with depth coverage to target this region. Notably, none of the previous EBS studies in the human auditory cortex has demonstrated a facilitative role for EBS in speech perception, a task that remains challenging, particularly in adverse and noisy acoustic conditions.

In this study, we used a unique instance in which an epilepsy patient was implanted with both grid and depth electrodes (intracranial EEG [iEEG]) covering multiple auditory areas, including HG, the PT, and the STG. We stimulated and recorded the responses of HG, the PT, and the STG during a speech-in-noise perception task. Stimulation of the PT sites resulted in significant improvement in both the perceived quality and intelligibility of speech. Furthermore, we showed distinct local- and network-level properties for these neural sites using spectrotemporal receptive field analysis, corticocortical evoked potentials (CCEPs), and speech-nonspeech discriminability. This study suggests that the PT has a causal role in speech-in-noise perception and suggests potential neuroprosthetic solutions to enhance speech-in-noise perception, which currently remain a major challenge for individuals with hearing loss.

Materials and Methods

Experimental design and statistical analysis

Participants and data collection

A 27-year-old female adult with pharmacoresistant focal epilepsy was the subject of this study. The subject had self-reported normal hearing and showed no difficulty in speech communication during the neuropsychological language tests done before implant. The subject underwent chronic iEEG monitoring at North Shore University Hospital to identify epileptogenic focus in the brain for later removal. She was implanted with both surface grid & strip as well as depth electrodes (PMT). Electrodes showing any sign of abnormal epileptiform discharges, as identified in the epileptologists' clinical reports, were excluded from the analysis. All included iEEG time series were manually inspected for signal quality and were free from interictal spikes. All research protocols were approved and monitored by the institutional review board at the Feinstein Institute for Medical Research, and informed written consent to participate in research studies was obtained from the subject before the implantation of electrodes.

Stimulus

The subject participated in four experiments: EBS, listening to continuous speech, listening to speech versus nonspeech sounds, and CCEPs. The details of each experiment are described below.

Experiment 1: EBS during speech-in-noise

Bipolar electrical stimulation was delivered to neighboring contacts along the shaft of a stereotactic high-density depth electrode (16 contacts, 2.2 mm intercontact distance, 1.3 mm contact size) using 200 µs squared-wave biphasic pulses at 50 Hz for 4 s (S12 stimulator, Grass Technologies). Two intensities (1 and 3 mA) were tested. The patient was asked to keep her eyes open during the procedure. We used a standard speech-in-noise intelligibility test (BKB-SIN) (Wilson et al., 2007) to measure the intelligibility of speech during stimulation of electrodes. The speech-in-noise sentences were sampled at 20,000 Hz.

Although every contact was stimulated at least once, the objective and subjective measurements of the sham versus stimulation trials were calculated based on 60 stimulation trials of one site along electrode shaft in the left PT (see Fig. 1B, electrode contact 8-9) because they had the highest noise reduction effect based on subject self-report. Randomized sham stimulations (asking the patient to report the percepts without introducing any current into the electrodes) were performed for the quantification of intelligibility and quality improvement. During the sham trials, we specifically tested the placebo effect by counting “1, 2, 3” before clicking the stimulator button, similar to how the stim trials were performed, but the electrical current was set at 0 mA. EEG and clinical signs in the patient ruled out the involvement of HG, the PT, and the STG in the patients' seizures, and the patient was instructed to report whether the electrical stimulation caused her typical seizure auras. The patient did not report experiencing seizure auras during the experiment.

Experiment 2: listening to continuous speech stories

The subject listened to stories recorded by four voice actors (two males and two females) with a duration of 20 min and a sampling rate of 11,025 Hz. The sentences were on an average 5.2-s-long with intersentence interval of 0.5 s. The stimuli were presented using a single Bose SoundLink Mini 2 speaker situated directly in front of the subject. The stimuli and the neural responses to these stimuli were used to calculate the spectrotemporal receptive fields of each neural site.

Experiment 3: listening to speech versus nonspeech sounds

The subject listened to 20 min of 69 commonly heard natural sounds. Among these 69 sounds were 16 samples of English and foreign speech and 53 nonspeech sounds from 14 categories (coughing, crying, screaming, music [jazz, pop, classical], animal vocalizations, laughing, syllables, sneezing, breathing, singing, shooting, tones, drumming, and subway noise) (Khalighinejad et al., 2021). The sounds were on an average 12.5-s-long with intersentence interval of 0.5 s. To find the neural representation of each sound, we averaged the high γ (70-150 Hz) activity in response to the sound over time. Data from this experiment were used for the latency analysis (see Fig. 5) and for the speech/nonspeech analysis (see Fig. 6). The latency was measured as the excitatory peak of spectrotemporal receptive field (STRF) as the center of gravity along the time dimension. The STRF for each electrode was obtained using all the sounds (speech and nonspeech).

Experiment 4: CCEPs

CCEP mapping was performed with bipolar stimulation of each pair of adjacent electrodes with single pulses of an electrical current (10 mA, biphasic, 100 µs/phase, 30 trials per electrode pair) using a Grass S12 cortical stimulator. The interstimulation interval was 1.5 s (±0.5 s jitter). The current magnitude of 10 mA was chosen for the grid/strip electrodes and 4 mA for depth electrodes. These values were chosen as they were the maximum current that did not induce epileptiform discharges in areas outside of the seizure onset zone. The patient was awake and at rest at the time of CCEP recording.

Preprocessing neural data

iEEG signals were acquired continuously at 3 kHz per channel (16-bit precision, range ± 8 mV, DC) with a data acquisition module (Tucker-Davis Technologies). The skull electrodes were used as references, as dictated by the recording quality at the bedside after online visualization of the spectrogram of the signal. Speech signals were recorded simultaneously with the iEEG for subsequent offline analysis. All further processing steps were performed offline. The iEEG data were resampled to 512 Hz. A first-order Butterworth high-pass filter with a cutoff frequency of 1 Hz was used to remove DC drift. Line noise at 60 Hz and its harmonics (up to 240 Hz) were removed using second-order IIR notch filters with a bandwidth of 1 Hz. The envelope of high γ activity, which correlates with neural firing in the proximity of the electrodes (Ray and Maunsell, 2011; Buzsáki et al., 2012), was used as a measure of the neural response. To obtain the envelope of this broadband signal, we first filtered the data into eight frequency bands between 70 and 150 Hz (Edwards et al., 2009). Then, the envelope of each band was obtained by taking the absolute value of the Hilbert transform. We took the average of all eight frequency bands as the final envelope. The data were resampled to 100 Hz for further analysis. Neural sites that were responsive to sound were determined by a t test between responses to silence and all sounds from Experiment 3 (t test, FDR-corrected, q < 0.01) (Benjamini and Yekutieli, 2001).

Video-iEEG monitoring demonstrated seizures with an onset at the medial and inferior temporal lobe. Frequent interictal spike and wave discharges were also observed in the posterior subtemporal regions. None of the electrodes in HG, the PT, or the STG were part of the irritative or seizure onset zones.

Behavior quantification

To evaluate the quality of speech sentences, we asked the subject to repeat the sentence that was played and rate its quality using the Mean Opinion Scale (MOS) scale (Salza et al., 1996). An MOS rating of between 1 and 5 (1= bad; 2 = poor; 3 = fair; 4 = good; 5 = excellent) was given. Intelligibility was measured as the percentage of key words identified in presented sentences using the BKB-SIN test (Wilson et al., 2007).

To evaluate the perceived tone frequency when HG electrodes were stimulated, the subject was given a knob that varied the tone frequencies being played. The subject was asked to move the knob until the presented pitch matched the tone frequency she perceived when the HG electrode was stimulated.

STRFs

STRFs were computed by a normalized reverse correlation algorithm (Theunissen et al., 2001) using STRFLab (Theunissen et al., 2001). We first converted the sound waveform into a time-frequency representation using a cochlear frequency analysis model consisting of a bank of 128 asymmetric constant-Q filters spaced equally along a logarithmic axis (Chi et al., 2005). The MATLAB code used to calculate the auditory spectrogram is available at http://nsl.isr.umd.edu/downloads.html. The output of the filter bank was then resampled to 16 frequency bands to prevent parameter overfitting. The amplitude of the high γ band was used as the measure of neural activity. Regularization and sparseness parameters were found via cross-validation (David et al., 2007). The best frequency and response latency parameters were estimated by finding the center of the excitatory region of the STRF along the frequency and time dimensions.

Multidimensional scaling (MDS) analysis

To calculate the MDS diagram of speech and nonspeech sounds, we first found the average of the high γ activity across the duration of the sounds in each category. In determining these averages, we eliminated the small segments of silences during the trial by averaging only the samples with above-threshold spectrogram energy. Next, we calculated the Euclidean distance between the average response to different sounds to find a dissimilarity matrix. To visualize this dissimilarity matrix, we used a two-dimensional MDS algorithm based on Kruskal's normalized criterion to minimize stress for the two MDS dimensions (Cox and Cox, 2008).

Analysis of CCEPs

CCEP mapping was performed with bipolar stimulation of each pair of adjacent electrodes with single pulses of electrical current (10 mA, biphasic, 100 µs/phase, 30 trials per electrode pair) using a Grass S12 cortical stimulator (Grass Technologies). Electrical stimulation artifact was found in first 20 ms of the data. This artifact was removed from each CCEP before analysis. Recorded neural data were sampled at 3000 Hz and bandpass filtered at 0.1-1 kHz. Acquired data were notch filtered at 60 Hz. The interstimulation interval was 1.5 s (±0.5 s). Differences in the interstimulation interval had no effect on evoked potentials. A current magnitude of 10 mA was chosen for grid/strip electrodes and 4 mA for depth electrodes. CCEPs in the human cortex generally consist of an early sharp response (10-50 ms poststimulation) and a later slow wave (50-250 ms). However, it has been shown that polarity can be positive in some cases, and there is a similar correlation between CCEPs and resting when the N1 or P1 response is used (Keller et al., 2011). Therefore, to compare the connectivity among regions, we used the absolute value of the response during the duration of 20-30 ms, which corresponds to the N1 component (Keller et al., 2014a).

Graph analysis of CCEPs

For each stimulation and response site, mean evoked potentials (derived from 20 repetitions) were normalized relative to the mean and SD of the prestimulus baseline (−500 to −5 ms). To assess which electrodes are close to which other electrodes, an adjacency matrix was formed, with rows indicating stimulated sites and columns indicating CCEP recording sites. The maximum absolute value of the N1 component of the normalized CCEP between two sites was used to fill in the values of the adjacency matrix. A Z score of 12 was determined as a cutoff to convert the adjacency matrix to a sparse matrix of 0 s and 1 s that indicated whether the corresponding row electrode was connected to the corresponding column electrode. We did not find any difference in the results when a different threshold within the range of 5-15 was used. To evaluate electrode connectivity, specifically the directions of connections and relative node connectivity, digraphs were made from this adjacency matrix using the MATLAB function digraph. The electrode locations on the graph were obtained by using the MATLAB function layout with the parameter force, which estimates node coordinates based on the force-directed structure of the graph (Fruchterman and Reingold, 1991). This relative location of nodes was determined by the force-directed layout using node connectivity in the adjacency matrix, such that nodes that are not connected are placed farther apart in the plot and those that are connected are placed close together. Finally, the flow of information was assessed by measuring the shortest path in the digraphs using the function shortestpath with Dijkstra's algorithm.

Results

Direct electrical stimulation of auditory areas elicited hallucinations of tones and noise reduction effects

A 27-year-old patient with refractory epilepsy was implanted with intracranial depth and grid electrodes to localize the source of seizure activity. The positions of intracranial electrodes covering the perisylvian regions are shown on the subject's neuroanatomical space with the speech responsive sites (see Materials and Methods) in red (Fig. 1A). In Figure 1B, the anodes of the eight bipolar contacts that generated an auditory effect with bipolar electrical stimulation are shown. Stimulation of the four electrodes in the left HG induced an auditory hallucination effect, and the patient matched the induced frequencies to 230, 230, 250, and 250 Hz. Stimulation of the three electrodes in the left PT and one electrode on the boundary of the left PT and HG generated a noise-suppression effect, meaning that the patient reported an increase in the volume and clarity of speech compared with background noise. In the following section, we describe the behavioral task and subjective and objective measures we used to systematically evaluate the improved quality and intelligibility of speech in noise.

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Perceptual reduction in background noise by EBS. A, The anatomic location of the depth and grid electrodes. B, Behavioral report of the subject when bipolar, 50 Hz electrical stimulation was delivered to electrode pairs. The subject reported hearing tones when the blue electrodes (located in HG) were stimulated and reported background noise reduction when the red electrodes (located in the PT) were stimulated. C, Experimental design used to measure the subject's perceptions of the quality and intelligibility of speech in background noise during EBS. The subject heard sentences from the BKB-SIN speech intelligibility corpus in background noise, played either in sync with an electrical stimulation pulse train (stim trial, green) or with no electrical stimulation (sham trial, gray). An example sentence and the keywords used to calculate the intelligibility score are underlined. D, Perceived quality of speech (mean opinion score) in sham and stim trials under four conditions: (1) jet noise with a low SNR and 3 mA stimulation current; (2) bar noise with a low SNR and 3 mA stimulation current; (3) bar noise with a high SNR and 3 mA stimulation current; and (4) bar noise with a low SNR and 1 mA stimulation current. Middle line indicates median. Box represents quartiles. Whiskers represent maximum and minimum.

Measurement of improved intelligibility and quality of perceived speech after EBS of the PT electrode

We systematically tested the behavioral effect observed during clinical stimulation while the subject listened to speech in noise sentences (see Materials and Methods). Half of the trials were chosen randomly for simulation (stim trials), and the other half were performed with no stimulation (sham trials) (Fig. 1C). The patient was asked to repeat the sentence and rate its quality using the MOS scale from 1 (poor) to 5 (excellent) (Salza et al., 1996) (see Materials and Methods). The reported MOS scale was used as the measure of perceived quality of speech, and the percentage of correctly repeated key words was used as the measure of perceived intelligibility of speech (Wilson et al., 2007).

We tested four different conditions (Fig. 1D). In Condition 1, we used jet background noise; in Conditions 2, 3, and 4, we used bar background noise. In Condition 4, we reduced the stimulation current from 3 to 1 mA (Fig. 1D). In Condition 1, where the background noise was jet noise with a signal-to-noise ratio (SNR) of −2.27 dB, the patient rated the speech quality significantly higher in the stim trials than in the sham trials (MOS = 3.2 vs 2.4, p < 10–3, N = 24, Wilcoxon rank-sum test). In addition, speech intelligibility was significantly higher in the stimulated jet noise condition than in the sham trials (91% correct vs 46% correct, p < 0.005, N = 24, Wilcoxon rank-sum test).

In Condition 2, where the background noise was bar noise with an SNR of −2.9 dB, the patient rated the speech quality significantly higher in the stim trials than in the sham trials (3.7 vs 3.1, p = 0.02, N = 36, Wilcoxon rank-sum test). However, we did not find a significant difference between the intelligibility of speech in the stim trials compared with the sham trials (67% correct vs 65% correct, p = 0.9 [not significant], N = 36, Wilcoxon rank-sum test).

In Condition 3, we tested the effect of stimulation in speech in noise with a high SNR, where the background noise was bar noise with an SNR of 3 dB. All of the sentences in this high-SNR condition were intelligible with or without stimulation. However, the perceived quality of the speech was significantly higher in the stim trials than in the sham trials (4.5 vs 3, p < 10–3, N = 41, Wilcoxon rank-sum test).

In Condition 4, as a control condition, we reduced the stimulation current to 1 from 3 mA (with bar background noise and an SNR of −4 dB). We observed that stimulation at a lower current (1 mA) did not result in improvement in the perceived quality of speech, although the patient could detect that a lower stimulation current was being used (p = 0.57, Wilcoxon rank-sum test).

Description of the patient's experience during stimulation

Subjective reports of the effects of electrical stimulation are shown in Table 1 (see also Movie 1). Stimulation of electrode contacts 8-9 in the PT caused the patient to experience a complex auditory effect: she reported improved quality and intelligibility of speech in background noise. In her own words, “The voices get a lot more clear; I still hear the noise, but the voice gets a lot more clear”; “It is always the speech that goes stronger; it's never the background noise”; “If two people talk at the same time, they both increase; it only happens for the background noise”; and “[It's] as if you take an equalizer for music and change the mode of how you want to hear.” In Table 1, the patient's description of 1 versus 3 mA stimulation is also included; she could feel the stimulation in both cases, but only the 3 mA stimulation caused the noise reduction effect.

View this table:
  • View inline
  • View popup
Table 1.

The full subjective report of the stimulation experiment

Movie 1.

The patient describing her subjective experience during the stimulation and sham trials.

PT sites have distinct spectrotemporal tuning properties

To examine the functional properties of the cortical sites and calculate their temporal and spectral tuning, we calculated the STRF of each electrode. The subject listened to 30 min of clean speech stories without the application of any electrical stimulation. The envelope of the high γ frequency band (70-150 Hz), which correlates with neural firing in the proximity of the electrode (Ray and Maunsell, 2011; Buzsáki et al., 2012), was extracted as the neural response measure of the recorded signals (see Materials and Methods). We found that the electrodes in HG were narrowly tuned to a specific frequency range (Fig. 2A); on the other hand, the electrodes in the PT were responsive to temporal changes across a broad range of frequencies, as shown in Figure 2B.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

STRFs of neural sites in HG and the PT. A, STRFs of neural sites in HG that generated a perceived tone. B, STRFs of neural sites in the PT that caused a noise reduction effect.

It is worth mentioning that we accurately measured the frequency of the hallucinated tone in HG by asking the subject to manually adjust the frequency of a tone generator to the frequency of the induced tone. We did not find any relationship between the best frequency of the contacts in HG and the frequency that was induced through stimulation. Specifically, the subject heard 230, 230, 250, and 250 Hz as a result of stimulation of four contacts in HG with optimal frequencies of 715, 676, 684, and 963 Hz (see Materials and Methods).

Functional connectivity of HG and the STG through the PT

Stimulation of a neural site affects not only the site where the current is directed but also the network of regions that are connected to it. To examine the spatial extent of the stimulation and to test the differences in connectivity of the cortical sites located in HG and the STG versus the PT and the STG, we used the CCEP method. CCEP mapping applies single pulses of a current to measure electrophysiological responses with accurate localization and evaluate interareal connectivity (Keller et al., 2014a,b, 2018).

Most CCEPs consist of an early (10-30 ms) negative surface deflection labeled N1 and a later (80-250 ms) slow wave labeled N2. Although studies have shown a possibility of shorter N1 component at 1.7 and 2.3 ms (Brugge et al., 2003), the N1 of CCEPs from our study align with the 10-30 ms shown by Keller et al. (2014a). The N1 and N2 components of CCEPs exist across spatially diverse recording sites after stimulation. Therefore, the stimulation-evoked response provides a measure of directional connectivity that is calculated directly from the cortical areas with high spatial resolution. It has been shown that the number of white matter tracts measured with DTI is positively correlated with the strength of the N1 component of CCEPs (Keller et al., 2014a,b, 2018).

We divided the speech-responsive sites in this subject into three groups based on their anatomic locations: the HG, the PT, and the STG. We tested the connectivity among the sites in HG, the PT, and the STG (Fig. 3A). To test whether connectivity between the PT and the STG differs from connectivity between HG and the STG, we examined the N1 component of the evoked responses in the STG induced by stimulation of the PT versus stimulation of HG sites. We observed that the stimulation of electrodes in the PT, but not HG, resulted in a significantly stronger N1 in electrodes responsive to speech in the STG. In Figure 3B, representative evoked potentials for individual electrodes are shown. The N1 component induced by stimulation of the PT compared with stimulation of the HG was significantly stronger in recorded neural sites in the STG (20.45 ± 0.6 vs 5.7 ± 0.42 μV, t test, p < 0.001; Fig. 3C).

Figure 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 3.

CCEP activity in the STG through stimulation in PT versus HG. A, Location of electrodes in the HG, the PT, and the STG. B, Evoked responses in representative sites in the HG, in the PT, and in the STG using corticocortical stimulation of the HG versus the PT. Bottom row represents the average of the absolute value of the evoked responses for all neural sites in three areas (HG, PT, and STG). C, Comparison of the absolute value of amplitude of N1 recorded in the STG and generated by stimulation of the PT versus HG.

We generated an adjacency matrix and digraphs to express the connectivity of the neural sites (Biggs et al., 1993). The adjacency matrix of a graph is a square matrix in which the elements indicate the distance between each pair of nodes. From this adjacency matrix, we can generate a digraph (Gross and Yellen, 2005), which is a directed graph of the connectivity between the nodes, with edge arrows showing the direction of the connection and the relative location of nodes showing the strength of connectivity. From the previous analysis, we obtained the CCEPs between two electrodes and normalized their mean and variance based on the neural response before the start of the stimulation. We calculated the absolute value of the N1 component of this normalized CCEP between two electrodes to fill in the elements of the adjacency matrix as a measure of connectivity (rows indicate the stimulated electrode, and columns indicate the target electrode where the CCEP was measured). Thus, each element of this matrix gives a measure of connectivity between the respective electrodes. We further applied a threshold to the adjacency matrix by assigning a value of 1 for significant connections and a value of 0 for nonsignificant connections (see Materials and Methods). Using this adjacency matrix, we generated directional digraph plots with nodes indicating electrodes, edges with arrows indicating the direction of connectivity between the nodes, and the location of nodes based on the method of force-based graph drawing (Fruchterman and Reingold, 1991). In this method, nodes that are not connected are placed further apart in the graph, and those that are highly connected are placed close together in the graph (see Materials and Methods; Fig. 4A). This analysis revealed an overall pattern in which the electrodes in HG, the PT, and the STG were placed closer to the electrodes within the same anatomic division, indicating their stronger connectivity (Fig. 4A). Moreover, the electrodes in the PT were positioned closer to the electrodes in the STG than to the electrodes in HG, indicating a higher connectivity of the PT sites to the STG than to HG. Furthermore, we measured the shortest path of connectivity between these anatomic groups to estimate the direction of information flow. Since the edges connecting pairs of nodes all have the same weight, the shortest path between two nodes is the path with the smallest number of edges that must be traveled. We found that the direction of connectivity of the HG electrodes to the higher auditory areas of the STG went through the PT. (Fig. 4A1). In contrast, the connectivity of PT electrodes to the STG was direct and did not go through HG (Fig. 4A2). Finally, the connectivity between the PT and HG was also bidirectional and direct and did not through the STG (Fig. 4A3).

Figure 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 4.

Network connectivity between auditory areas of HG, the PT, and the STG. A1, The shortest path from HG to the STG is shown in black. The shortest path from the STG to HG is shown in red. A2, The shortest path from the PT to the STG is shown in black. The shortest path from the STG to the PT is shown in red. A3, The shortest path from HG to the PT is shown in black. The shortest path from the PT to HG is shown in red. B, Summary of shortest path connectivity.

A summary of the shortest path analysis is shown in Figure 4B. No direct path was found from HG to the STG; instead, the HG sites were connected to the STG sites through the PT sites. Moreover, the electrodes in the PT were placed closer to the electrodes in the STG than to the electrodes in HG. It is important to note that our analysis of the network properties is limited to the electrodes from which we were recording and does not represent all parts of HG or the PT.

The observed intermediate location of the PT is also supported by the results of a response latency analysis (see Materials and Methods). We first calculated the latency of each neural site and averaged the values in each area, as shown in Figure 5. This figure shows that response latency gradually increases from HG to the PT to the STG. The latency of the response along the auditory pathway approximately reflects the number of synapses away from the auditory periphery and hence has been used to speculate the direction of information processing in the auditory cortex (Da Costa et al., 2011; McMurray and Jongman, 2011; Nourski et al., 2014).

Figure 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 5.

Latency of the neural sites in the HG, the PT, and the STG. The height of bars indicates the mean latency of electrodes in respective anatomic regions. Error bars indicate SE.

Separation of speech and noise starts in the PT

In addition to latency, we also looked at the separation of speech from nonspeech sounds in each region. Neural responses to specific categories of sounds, such as speech versus nonspeech sounds, have been attributed to a higher level of neural processing in the auditory cortex (Chan et al., 2014). To examine the neural responses to speech versus nonspeech sounds, we designed a task that consisted of 69 samples of sounds: 16 of the sounds were English and foreign news segments, and 53 were diverse sounds, such as coughing, crying, screaming, noise, music, animal vocalization, laughing, singing, shooting, and drumming. We determined the separability of the responses to speech versus nonspeech category of sounds using unpaired t test. We observed that the segregation of speech from nonspeech sounds gradually increased from the HG to the PT to the STG (Fig. 6A). To better demonstrate how the separation of speech and nonspeech sounds takes place along these regions, we projected the response of sites in each region onto a two-dimensional MDS diagram (see Materials and Methods). Figure 6B shows that, while the responses to speech and nonspeech sounds fully overlap in HG, they begin to form separate categories in the PT and become fully separate in the STG.

Figure 6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 6.

Separation of speech and nonspeech sounds from HG to the PT to the STG. A, The t score between speech and nonspeech sounds for each region. Error bars indicate SE. B, Speech versus nonspeech neural responses on an MDS scale.

Our characterization of the electrodes located in HG and the PT and STG can be summarized as follows: (1) CCEP analysis showed that the PT electrodes are situated between the HG and STG electrodes; (2) response latency increases from the HG electrodes to PT electrodes to the STG electrodes; and (3) the responses to speech and nonspeech sounds becomes more distinct from the HG to the PT to the STG electrodes. Collectively, these results suggest that our electrodes in the PT are located between the electrodes in the HG and the STG. This intermediate location might explain the perceptual effect generated by stimulating the electrodes in this region, which is absent during the stimulation of electrodes in HG and the STG.

Discussion

We report that EBS of electrodes implanted in the PT in one patient significantly and reliably improved both the perceived quality and the intelligibility of speech in noise. The subject reported the suppression of background noise and the selective amplification of speech sounds. We observed significant improvement in both the subjective quality rating and the objective intelligibility score of speech in background noise. The neural sites whose stimulation resulted in enhanced perception of speech had distinct properties compared with the sites in HG; namely, their receptive fields were more tuned to rapid temporal changes in spectrally broad sounds and had a longer response latency. Moreover, network CCEP and speech/nonspeech separability analysis revealed an intermediary role of the neural sites in the PT, suggesting an intermediate functional role relative to the sites in HG and the STG.

Our results causally implicate the examined neural sites in the PT in the suppression of background noise. This triangular region, which occupies the superior temporal plane posterior to HG, is believed to be part of the auditory association cortex (Hickok and Poeppel, 2007). The PT has been suggested to play a role in auditory scene analysis because of its modular structure and multimodal and diverse patterns of neural processing (Griffiths and Warren, 2002). For example, studies have shown that portions of the PT selectively respond to spatial sounds, including moving sounds (Warren et al., 2002; Warren and Griffiths, 2003), with the anterior PT showing sensitivity to pitch-related acoustic changes and the posterior PT showing sensitivity to changes in spatial location (Warren and Griffiths, 2003). Other studies have shown that, in addition to this pure acoustic feature selectivity, the PT is sensitive to spatial sound source locations only when the location serves as a cue for auditory object disambiguation, as opposed to being purely selective of spatial features (Zatorre et al., 2002). Similarly, one study showed that no subregion of the PT was explicitly selective of spatial or object-related information; instead, the PT actively processes spatial cues to aid auditory stream segregation (Smith et al., 2010). Hence, a task/goal-based role of the PT in auditory stream segregation and integration has emerged (Ragert et al., 2014). Consistent with this view, our study is the first to provide experimental evidence in support of an active and causal role of the PT in the crucial task of separating speech from background noise. Moreover, we found that the receptive fields of the PT are distinctly more sensitive to broad spectral temporal changes than the sites in HG. This is in line with anatomic studies showing that the neurite architecture of the PT increases its temporal precision and thus facilitates the processing of auditory speech (Ocklenburg et al., 2018) and with functional studies showing the role of the PT in integrating sequential auditory events (Mustovic et al., 2003). This functional characteristic of the PT may be crucial in auditory scene analysis and noise suppression, as speech and most background noises differ substantially in the spectrotemporal domain (Mesgarani et al., 2006; Mesgarani and Shamma, 2007).

EBS is regarded as complementary to methods that study the neural correlates of stimuli and behavior because EBS directly tests the causal relationship between brain regions and perception. The efficacy of EBS has been demonstrated in vision and memory tasks (Jacobs et al., 2012; Parvizi et al., 2012; Mégevand et al., 2014; Keller et al., 2017; Schalk et al., 2017). Many previous EBS studies relied solely on qualitative descriptions of the subject's experience (Parvizi et al., 2012; Mégevand et al., 2014; Schalk et al., 2017). In contrast, we supplemented the subject's qualitative report with additional quantitative measures to determine the statistical significance of the perceived effects compared with the control (sham) condition. Previous studies of clinical language-mapping protocols used to guide neurosurgical resections have commonly shown the induction of transient language deficits (Borchers et al., 2012). These clinical mapping studies have identified sites that are critical for the perception and discrimination of speech and other complex sounds in the posterior temporal lobe, corresponding to the auditory association cortex (Boatman et al., 1995; Boatman and Miglioretti, 2005), by showing that EBS of the posterior STG causes selective suppression of speech over nonspeech sounds (Sinai et al., 2009). However, to our knowledge, the improvement of speech perception following direct brain stimulation has no precedence in the prior literature. Notably, noninvasive transcranial brain stimulation has also been shown to enhance auditory responses in EEG (Van Doren et al., 2014), improve auditory gap-detection performance (Rufener et al., 2017), bias attention in a spatial multitalker task (Wöstmann et al., 2017), and improve speech perception in cocktail party scenarios (Riecke et al., 2018). Compared with these noninvasive studies, invasive and direct EBS has the power to target a specific brain region, which can result in large and reproducible behavioral effects at the single-trial level, such as those we reported in this study.

Determining how EBS changes neural representation and computation is not a trivial endeavor. It is challenging because of the uncertainty of the spatial extent of the cortex that is responsive to stimulation and the relationship between the circuitry engaged by EBS and the types of neural responses elicited by sensory stimulation. EBS of similar sites in different patients has been shown to yield different results (Borchers et al., 2012; Desmurget et al., 2013). First, small differences in electrode position lead to large differences in targeted responsive neurons (Histed et al., 2009); second, the location of brain functions varies among individual patients (Borchers et al., 2012); and finally, these networks are susceptible to reorganization (Enatsu et al., 2013). Furthermore, EBS can generate complex behavioral effects in regions that are involved in neural processing (Fried et al., 1998; Desmurget et al., 2009; Parvizi et al., 2013). Similar to these studies, we found a specific behavioral effect, although the stimulation signal itself was generic and nonspecific. Additionally, whether EBS of a brain area results in the suppression of or an increase of neural activity is still a matter of scientific debate (Selimbeyoglu and Parvizi, 2010). Extensive preclinical studies have shown that high-frequency (∼100 Hz) EBS increases neuronal excitability (Bliss and Lømo, 1973; Douglas, 1977; Skrede and Malthe-Sørenssen, 1981), whereas low-frequency stimulation (∼1 Hz) decreases neuronal excitability (Mulkey and Malenka, 1992). As a result, several questions warrant further research, including the characterization of local and remote effects of EBS, its inhibitory and excitatory effects (Borchers et al., 2012).

Our study cannot adequately answer how the stimulation of electrodes in the PT results in a perceived reduction in noise. One possible explanation is motivated by our observation of a gradual separation of speech and noise encoding along the auditory pathway (Bar-Yosef and Nelken, 2007; Norman-Haignere et al., 2015). Given these findings, it is possible that the partially separated representations of speech and noise may be differentially targeted by electrical stimulation and hence may be selectively enhanced or suppressed. Consistent with this hypothesis, we found that the representation of speech and noise at PT sites was more separable than the representations at HG sites but less separable than those at STG sites. Additionally, our analysis of latency and network connectivity (CCEP) placed the PT sites between the HG and STG sites. It is plausible that selective suppression of the neural representation of background noise in the PT could reduce its perception. Alternatively, stimulation can activate and excite recurrent connections to upstream and downstream areas that may engage inhibitory connections and hence result in the increased removal of noise from the mixed sound. While we cannot determine the neural mechanism underlying this perceptual effect, future research that allows a better understanding of the role of EBS can result in a more conclusive explanation of the behavioral effect reported in this study and advance our knowledge of the role of the PT in speech-in-noise perception. Finally, speech perception in noise is a very challenging task for individuals with hearing loss. This study opens the possibility of neuroprosthetic solutions that can directly manipulate the cortical representation of speech to enhance the perception of a target sound among competing sources, which can assist individuals who struggle to hear speech in adverse acoustic conditions.

Footnotes

  • This work was supported by National Institutes of Health, National Institute on Deafness and Other Communication Disorders Grant DC014279. The data that support the findings of this study are available on request from corresponding author N.M. The codes for calculating the high gamma envelope, and spectrotemporal receptive field analysis are available at http://naplab.ee.columbia.edu/naplib.html (Khalighinejad et al., 2017).

  • The authors declare no competing financial interests.

  • Correspondence should be addressed to Nima Mesgarani at nima{at}ee.columbia.edu

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license, which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.

References

  1. ↵
    1. Assmann P,
    2. Summerfield Q
    (2004) The perception of speech under adverse conditions. In: Speech processing in the auditory system, pp 231–308. Springer: New York, NY.
  2. ↵
    1. Bar-Yosef O,
    2. Nelken I
    (2007) The effects of background noise on the neural responses to natural sounds in cat primary auditory cortex. Front Comput Neurosci 1:3. doi:10.3389/neuro.10.003.2007 pmid:18946525
    OpenUrlCrossRefPubMed
  3. ↵
    1. Benjamini Y,
    2. Yekutieli D
    (2001) The control of the false discovery rate in multiple testing under dependency. Annals of statistics 29:1165–1188.
    OpenUrlCrossRef
  4. ↵
    1. Biggs N,
    2. Biggs NL,
    3. Norman B
    (1993) Algebraic graph theory. Cambridge, MA: Cambridge UP.
  5. ↵
    1. Bliss TV,
    2. Lømo T
    (1973) Long-lasting potentiation of synaptic transmission in the dentate area of the anaesthetized rabbit following stimulation of the perforant path. J Physiol 232:331–356. doi:10.1113/jphysiol.1973.sp010273 pmid:4727084
    OpenUrlCrossRefPubMed
  6. ↵
    1. Boatman DF,
    2. Miglioretti DL
    (2005) Cortical sites critical for speech discrimination in normal and impaired listeners. J Neurosci 25:5475–5480. doi:10.1523/JNEUROSCI.0936-05.2005 pmid:15944375
    OpenUrlAbstract/FREE Full Text
  7. ↵
    1. Boatman D,
    2. Lesser RP,
    3. Gordon B
    (1995) Auditory speech processing in the left temporal lobe: an electrical interference study. Brain Lang 51:269–290. doi:10.1006/brln.1995.1061 pmid:8564472
    OpenUrlCrossRefPubMed
  8. ↵
    1. Borchers S,
    2. Himmelbach M,
    3. Logothetis N,
    4. Karnath HO
    (2012) Direct electrical stimulation of human cortex: the gold standard for mapping brain functions? Nat Rev Neurosci 13:63–70. doi:10.1038/nrn3140
    OpenUrlCrossRefPubMed
  9. ↵
    1. Bregman AS
    (1994) Auditory scene analysis: the perceptual organization of sound. Cambridge, MA: Massachusetts Institute of Technology.
  10. ↵
    1. Brugge JF,
    2. Volkov IO,
    3. Garell PC,
    4. Reale RA,
    5. Howard MA III.
    (2003) Functional connections between auditory cortex on Heschl's gyrus and on the lateral superior temporal gyrus in humans. J Neurophysiol 90:3750–3763. doi:10.1152/jn.00500.2003 pmid:12968011
    OpenUrlCrossRefPubMed
  11. ↵
    1. Buzsáki G,
    2. Anastassiou CA,
    3. Koch C
    (2012) The origin of extracellular fields and currents-EEG, ECoG, LFP and spikes. Nat Rev Neurosci 13:407–420. doi:10.1038/nrn3241 pmid:22595786
    OpenUrlCrossRefPubMed
  12. ↵
    1. Chan AM,
    2. Dykstra AR,
    3. Jayaram V,
    4. Leonard MK,
    5. Travis KE,
    6. Gygi B,
    7. Baker JM,
    8. Eskandar E,
    9. Hochberg LR,
    10. Halgren E,
    11. Cash SS
    (2014) Speech-specific tuning of neurons in human superior temporal gyrus. Cereb Cortex 24:2679–2693. doi:10.1093/cercor/bht127 pmid:23680841
    OpenUrlCrossRefPubMed
  13. ↵
    1. Chi T,
    2. Ru P,
    3. Shamma SA
    (2005) Multiresolution spectrotemporal analysis of complex sounds. J Acoust Soc Am 118:887–906. doi:10.1121/1.1945807 pmid:16158645
    OpenUrlCrossRefPubMed
  14. ↵
    1. Cox MA,
    2. Cox TF
    (2008) Multidimensional scaling. In: Handbook of data visualization, pp 315–347. New York: Springer.
  15. ↵
    1. Da Costa S,
    2. van der Zwaag W,
    3. Marques JP,
    4. Frackowiak RS,
    5. Clarke S,
    6. Saenz M
    (2011) Human primary auditory cortex follows the shape of Heschl's gyrus. J Neurosci 31:14067–14075. doi:10.1523/JNEUROSCI.2000-11.2011 pmid:21976491
    OpenUrlAbstract/FREE Full Text
  16. ↵
    1. David SV,
    2. Mesgarani N,
    3. Shamma SA
    (2007) Estimating sparse spectro-temporal receptive fields with natural stimuli. Network 18:191–212. doi:10.1080/09548980701609235 pmid:17852750
    OpenUrlCrossRefPubMed
  17. ↵
    1. Desmurget M,
    2. Reilly KT,
    3. Richard N,
    4. Szathmari A,
    5. Mottolese C,
    6. Sirigu A
    (2009) Movement intention after parietal cortex stimulation in humans. Science 324:811–813. doi:10.1126/science.1169896 pmid:19423830
    OpenUrlAbstract/FREE Full Text
  18. ↵
    1. Desmurget M,
    2. Song Z,
    3. Mottolese C,
    4. Sirigu A
    (2013) Re-establishing the merits of electrical brain stimulation. Trends Cogn Sci 17:442–449. doi:10.1016/j.tics.2013.07.002 pmid:23932195
    OpenUrlCrossRefPubMed
  19. ↵
    1. Donovan C,
    2. Sweet J,
    3. Eccher M,
    4. Megerian C,
    5. Semaan M,
    6. Murray G,
    7. Miller J
    (2015) Deep brain stimulation of Heschl gyrus: implantation technique, intraoperative localization, and effects of stimulation. Neurosurgery 77:940–947. doi:10.1227/NEU.0000000000000969 pmid:26280116
    OpenUrlCrossRefPubMed
  20. ↵
    1. Douglas RM
    (1977) Long lasting synaptic potentiation in the rat dentate gyrus following brief high frequency stimulation. Brain Res 126:361–365.
    OpenUrlCrossRefPubMed
  21. ↵
    1. Edwards E,
    2. Soltani M,
    3. Kim W,
    4. Dalal SS,
    5. Nagarajan SS,
    6. Berger MS,
    7. Knight RT
    (2009) Comparison of time–frequency responses and the event-related potential to auditory speech stimuli in human cortex. J Neurophysiol 102:377–386. doi:10.1152/jn.90954.2008 pmid:19439673
    OpenUrlCrossRefPubMed
  22. ↵
    1. Elmer S,
    2. Hänggi J,
    3. Meyer M,
    4. Jäncke L
    (2013) Increased cortical surface area of the left planum temporale in musicians facilitates the categorization of phonetic and temporal speech sounds. Cortex 49:2812–2821. doi:10.1016/j.cortex.2013.03.007 pmid:23628644
    OpenUrlCrossRefPubMed
  23. ↵
    1. Enatsu R,
    2. Kubota Y,
    3. Kakisaka Y,
    4. Bulacio J,
    5. Piao Z,
    6. O'Connor T,
    7. Horning K,
    8. Mosher J,
    9. Burgess RC,
    10. Bingaman W,
    11. Nair DR
    (2013) Reorganization of posterior language area in temporal lobe epilepsy: a cortico-cortical evoked potential study. Epilepsy Res 103:73–82. doi:10.1016/j.eplepsyres.2012.07.008 pmid:22819071
    OpenUrlCrossRefPubMed
  24. ↵
    1. Fenoy AJ,
    2. Severson MA,
    3. Volkov IO,
    4. Brugge JF,
    5. Howard MA III.
    (2006) Hearing suppression induced by electrical stimulation of human auditory cortex. Brain Res 1118:75–83. doi:10.1016/j.brainres.2006.08.013 pmid:16979144
    OpenUrlCrossRefPubMed
  25. ↵
    1. Fried I,
    2. Wilson CL,
    3. MacDonald KA,
    4. Behnke EJ
    (1998) Electric current stimulates laughter. Nature 391:650. doi:10.1038/35536 pmid:9490408
    OpenUrlCrossRefPubMed
  26. ↵
    1. Fruchterman TM,
    2. Reingold EM
    (1991) Graph drawing by force-directed placement. Softw: Pract Exp 21:1129–1164. doi:10.1002/spe.4380211102
    OpenUrlCrossRef
  27. ↵
    1. Griffiths TD,
    2. Warren JD
    (2002) The planum temporale as a computational hub. Trends Neurosci 25:348–353. doi:10.1016/S0166-2236(02)02191-4
    OpenUrlCrossRefPubMed
  28. ↵
    1. Griffiths TD,
    2. Warren JD
    (2004) What is an auditory object? Nat Rev Neurosci 5:887–892. doi:10.1038/nrn1538 pmid:15496866
    OpenUrlCrossRefPubMed
  29. ↵
    1. Gross JL,
    2. Yellen J
    (2005) Graph theory and its applications. Boca Raton, FL: CRC.
  30. ↵
    1. Hickok G,
    2. Poeppel D
    (2007) The cortical organization of speech processing. Nat Rev Neurosci 8:393–402. doi:10.1038/nrn2113 pmid:17431404
    OpenUrlCrossRefPubMed
  31. ↵
    1. Hickok G,
    2. Saberi K
    (2012) Redefining the functional organization of the planum temporale region: space, objects, and sensory–motor integration. In: The human auditory cortex, pp 333–350. New York: Springer.
  32. ↵
    1. Histed MH,
    2. Bonin V,
    3. Reid RC
    (2009) Direct activation of sparse, distributed populations of cortical neurons by electrical microstimulation. Neuron 63:508–522. doi:10.1016/j.neuron.2009.07.016 pmid:19709632
    OpenUrlCrossRefPubMed
  33. ↵
    1. Hong S,
    2. Lundstrom BN,
    3. Fairhall AL
    (2008) Intrinsic gain modulation and adaptive neural coding. PLoS Comput Biol 4:e1000119. doi:10.1371/journal.pcbi.1000119 pmid:18636100
    OpenUrlCrossRefPubMed
  34. ↵
    1. Humphries C,
    2. Sabri M,
    3. Lewis K,
    4. Liebenthal E
    (2014) Hierarchical organization of speech perception in human auditory cortex. Front Neurosci 8:406. doi:10.3389/fnins.2014.00406 pmid:25565939
    OpenUrlCrossRefPubMed
  35. ↵
    1. Isenberg AL,
    2. Vaden KI Jr.,
    3. Saberi K,
    4. Muftuler LT,
    5. Hickok G
    (2012) Functionally distinct regions for spatial processing and sensory motor integration in the planum temporale. Hum Brain Mapp 33:2453–2463. doi:10.1002/hbm.21373 pmid:21932266
    OpenUrlCrossRefPubMed
  36. ↵
    1. Jacobs J,
    2. Lega B,
    3. Anderson C
    (2012) Explaining how brain stimulation can evoke memories. J Cogn Neurosci 24:553–563. doi:10.1162/jocn_a_00170 pmid:22098266
    OpenUrlCrossRefPubMed
  37. ↵
    1. Kell AJ,
    2. McDermott JH
    (2019) Invariance to background noise as a signature of non-primary auditory cortex. Nat Commun 10:3958.
    OpenUrl
  38. ↵
    1. Keller CJ,
    2. Bickel S,
    3. Entz L,
    4. Ulbert I,
    5. Milham MP,
    6. Kelly C,
    7. Mehta AD
    (2011) Intrinsic functional architecture predicts electrically evoked responses in the human brain. Proceedings of the National Academy of Sciences 108:10308–10313.
    OpenUrlAbstract/FREE Full Text
  39. ↵
    1. Keller CJ,
    2. Honey CJ,
    3. Mégevand P,
    4. Entz L,
    5. Ulbert I,
    6. Mehta AD
    (2014a) Mapping human brain networks with cortico-cortical evoked potentials. Philos Trans R Soc Lond B Biol Sci 369:20130528. doi:10.1098/rstb.2013.0528
    OpenUrlCrossRefPubMed
  40. ↵
    1. Keller CJ,
    2. Honey CJ,
    3. Entz L,
    4. Bickel S,
    5. Groppe DM,
    6. Toth E,
    7. Ulbert I,
    8. Lado FA,
    9. Mehta AD
    (2014b) Corticocortical evoked potentials reveal projectors and integrators in human brain networks. J Neurosci 34:9152–9163. doi:10.1523/JNEUROSCI.4289-13.2014 pmid:24990935
    OpenUrlAbstract/FREE Full Text
  41. ↵
    1. Keller CJ,
    2. Davidesco I,
    3. Megevand P,
    4. Lado FA,
    5. Malach R,
    6. Mehta AD
    (2017) Tuning face perception with electrical stimulation of the fusiform gyrus. Hum Brain Mapp 38:2830–2842. doi:10.1002/hbm.23543 pmid:28345189
    OpenUrlCrossRefPubMed
  42. ↵
    1. Keller CJ,
    2. Huang Y,
    3. Herrero JL,
    4. Fini ME,
    5. Du V,
    6. Lado FA,
    7. Honey CJ,
    8. Mehta AD
    (2018) Induction and quantification of excitability changes in human cortical networks. J Neurosci 38:5384–5398. doi:10.1523/JNEUROSCI.1088-17.2018 pmid:29875229
    OpenUrlAbstract/FREE Full Text
  43. ↵
    1. Khalighinejad B,
    2. Nagamine T,
    3. Mehta A,
    4. Mesgarani N
    (2017) NAPLib: an open source toolbox for real-time and offline Neural Acoustic Processing. Acoustics, Speech and Signal Processing, 2017 IEEE International Conference, pp 846–850.
  44. ↵
    1. Khalighinejad B,
    2. Herrero JL,
    3. Mehta AD,
    4. Mesgarani N
    (2019) Adaptation of the human auditory cortex to changing background noise. Nat Commun 10:2509. pmid:31175304
    OpenUrlPubMed
  45. ↵
    1. Khalighinejad B,
    2. Patel P,
    3. Herrero JL,
    4. Bickel S,
    5. Mehta AD,
    6. Mesgarani N
    (2021) Functional characterization of human Heschl's gyrus in response to natural speech. Neuroimage 235:118003. doi:10.1016/j.neuroimage.2021.118003 pmid:33789135
    OpenUrlCrossRefPubMed
  46. ↵
    1. Kumar S,
    2. Stephan KE,
    3. Warren JD,
    4. Friston KJ,
    5. Griffiths TD
    (2007) Hierarchical processing of auditory objects in humans. PLoS Comput Biol 3:e100. doi:10.1371/journal.pcbi.0030100 pmid:17542641
    OpenUrlCrossRefPubMed
  47. ↵
    1. Leonard MK,
    2. Cai R,
    3. Babiak MC,
    4. Ren A,
    5. Chang EF
    (2019) The peri-Sylvian cortical network underlying single word repetition revealed by electrocortical stimulation and direct neural recordings. Brain Lang 193:58–72. doi:10.1016/j.bandl.2016.06.001 pmid:27450996
    OpenUrlCrossRefPubMed
  48. ↵
    1. McMurray B,
    2. Jongman A
    (2011) What information is necessary for speech categorization? Harnessing variability in the speech signal by integrating cues computed relative to expectations. Psychol Rev 118:219–246. doi:10.1037/a0022325 pmid:21417542
    OpenUrlCrossRefPubMed
  49. ↵
    1. Mégevand P,
    2. Groppe DM,
    3. Goldfinger MS,
    4. Hwang ST,
    5. Kingsley PB,
    6. Davidesco I,
    7. Mehta AD
    (2014) Seeing scenes: topographic visual hallucinations evoked by direct electrical stimulation of the parahippocampal place area. J Neurosci 34:5399–5405. doi:10.1523/JNEUROSCI.5202-13.2014 pmid:24741031
    OpenUrlAbstract/FREE Full Text
  50. ↵
    1. Mesgarani N,
    2. Shamma S
    (2007) Denoising in the domain of spectrotemporal modulations. EURASIP J 3:1–8.
    OpenUrl
  51. ↵
    1. Mesgarani N,
    2. Slaney M,
    3. Shamma SA
    (2006) Discrimination of speech from nonspeech based on multiscale spectro-temporal modulations. IEEE Trans Audio Speech Lang Process 14:920–930. doi:10.1109/TSA.2005.858055
    OpenUrlCrossRef
  52. ↵
    1. Mesgarani N,
    2. David SV,
    3. Fritz JB,
    4. Shamma SA
    (2014) Mechanisms of noise robust representation of speech in primary auditory cortex. Proc Natl Acad Sci USA 111:6792–6797. doi:10.1073/pnas.1318017111 pmid:24753585
    OpenUrlAbstract/FREE Full Text
  53. ↵
    1. Meyer M,
    2. Elmer S,
    3. Jäncke L
    (2012) Musical expertise induces neuroplasticity of the planum temporale. Ann NY Acad Sci 1252:116–123. doi:10.1111/j.1749-6632.2012.06450.x pmid:22524348
    OpenUrlCrossRefPubMed
  54. ↵
    1. Miglioretti DL,
    2. Boatman D
    (2003) Modeling variability in cortical representations of human complex sound perception. Exp Brain Res 153:382–387. doi:10.1007/s00221-003-1703-2 pmid:14534769
    OpenUrlCrossRefPubMed
  55. ↵
    1. Miller LM,
    2. Escabı MA,
    3. Read HL,
    4. Schreiner CE
    (2001) Functional convergence of response properties in the auditory thalamocortical system. Neuron 32:151–160. doi:10.1016/S0896-6273(01)00445-7 pmid:11604146
    OpenUrlCrossRefPubMed
  56. ↵
    1. Morosan P,
    2. Rademacher J,
    3. Palomero-Gallagher N,
    4. Zilles K
    (2005) Anatomical organization of the human auditory cortex: cytoarchitecture and transmitter receptors. In: The auditory cortex, pp 45–68. Hove, East Sussex, United Kingdom: Psychology Press.
  57. ↵
    1. Mulkey RM,
    2. Malenka RC
    (1992) Mechanisms underlying induction of homosynaptic long-term depression in area CA1 of the hippocampus. Neuron 9:967–975. doi:10.1016/0896-6273(92)90248-C pmid:1419003
    OpenUrlCrossRefPubMed
  58. ↵
    1. Mustovic H,
    2. Scheffler K,
    3. Di Salle F,
    4. Esposito F,
    5. Neuhoff JG,
    6. Hennig J,
    7. Seifritz E
    (2003) Temporal integration of sequential auditory events: silent period in sound pattern activates human planum temporale. Neuroimage 20:429–434. doi:10.1016/S1053-8119(03)00293-3 pmid:14527603
    OpenUrlCrossRefPubMed
  59. ↵
    1. Norman-Haignere S,
    2. Kanwisher NG,
    3. McDermott JH
    (2015) Distinct cortical pathways for music and speech revealed by hypothesis-free voxel decomposition. Neuron 88:1281–1296. doi:10.1016/j.neuron.2015.11.035 pmid:26687225
    OpenUrlCrossRefPubMed
  60. ↵
    1. Nourski KV,
    2. Steinschneider M,
    3. McMurray B,
    4. Kovach CK,
    5. Oya H,
    6. Kawasaki H,
    7. Howard MA
    (2014) Functional organization of human auditory cortex: investigation of response latencies through direct recordings. Neuroimage 101:598–609. doi:10.1016/j.neuroimage.2014.07.004 pmid:25019680
    OpenUrlCrossRefPubMed
  61. ↵
    1. Ocklenburg S,
    2. Friedrich P,
    3. Fraenz C,
    4. Schlüter C,
    5. Beste C,
    6. Güntürkün O,
    7. Genç E
    (2018) Neurite architecture of the planum temporale predicts neurophysiological processing of auditory speech. Sci Adv 4:eaar6830. doi:10.1126/sciadv.aar6830 pmid:30009258
    OpenUrlFREE Full Text
  62. ↵
    1. Parvizi J,
    2. Kastner S
    (2018) Promises and limitations of human intracranial electroencephalography. Nat Neurosci 21:474–483. doi:10.1038/s41593-018-0108-2 pmid:29507407
    OpenUrlCrossRefPubMed
  63. ↵
    1. Parvizi J,
    2. Jacques C,
    3. Foster BL,
    4. Witthoft N,
    5. Rangarajan V,
    6. Weiner KS,
    7. Grill-Spector K
    (2012) Electrical stimulation of human fusiform face-selective regions distorts face perception. J Neurosci 32:14915–14920. doi:10.1523/JNEUROSCI.2609-12.2012 pmid:23100414
    OpenUrlAbstract/FREE Full Text
  64. ↵
    1. Parvizi J,
    2. Rangarajan V,
    3. Shirer WR,
    4. Desai N,
    5. Greicius MD
    (2013) The will to persevere induced by electrical stimulation of the human cingulate gyrus. Neuron 80:1359–1367. doi:10.1016/j.neuron.2013.10.057 pmid:24316296
    OpenUrlCrossRefPubMed
  65. ↵
    1. Rabinowitz NC,
    2. Willmore BD,
    3. King AJ,
    4. Schnupp JW
    (2013) Constructing noise-invariant representations of sound in the auditory pathway. PLoS Biol 11:e1001710. doi:10.1371/journal.pbio.1001710 pmid:24265596
    OpenUrlCrossRefPubMed
  66. ↵
    1. Ragert M,
    2. Fairhurst MT,
    3. Keller PE
    (2014) Segregation and integration of auditory streams when listening to multi-part music. PLoS One 9:e84085. doi:10.1371/journal.pone.0084085 pmid:24475030
    OpenUrlCrossRefPubMed
  67. ↵
    1. Ray S,
    2. Maunsell JH
    (2011) Different origins of gamma rhythm and high-gamma activity in macaque visual cortex. PLoS Biol 9:e1000610. doi:10.1371/journal.pbio.1000610
    OpenUrlCrossRefPubMed
  68. ↵
    1. Riecke L,
    2. Formisano E,
    3. Sorger B,
    4. Başkent D,
    5. Gaudrain E
    (2018) Neural entrainment to speech modulates speech intelligibility. Curr Biol 28:161–169. doi:10.1016/j.cub.2017.11.033 pmid:29290557
    OpenUrlCrossRefPubMed
  69. ↵
    1. Rufener KS,
    2. Ruhnau P,
    3. Heinze HJ,
    4. Zaehle T
    (2017) Transcranial random noise stimulation (tRNS) shapes the processing of rapidly changing auditory information. Front Cell Neurosci 11:162. doi:10.3389/fncel.2017.00162 pmid:28642686
    OpenUrlCrossRefPubMed
  70. ↵
    1. Salza PL,
    2. Foti E,
    3. Nebbia L,
    4. Oreglia M
    (1996) MOS and pair comparison combined methods for quality evaluation of text-to-speech systems. Acta Acust. United with Acust 82:650–656.
    OpenUrl
  71. ↵
    1. Schalk G,
    2. Kapeller C,
    3. Guger C,
    4. Ogawa H,
    5. Hiroshima S,
    6. Lafer-Sousa R,
    7. Saygin ZM,
    8. Kamada K,
    9. Kanwisher N
    (2017) Facephenes and rainbows: causal evidence for functional and anatomical specificity of face and color processing in the human brain. Proc Natl Acad Sci USA 114:12285–12290.
    OpenUrlAbstract/FREE Full Text
  72. ↵
    1. Selimbeyoglu A,
    2. Parvizi J
    (2010) Electrical stimulation of the human brain: perceptual and behavioral phenomena reported in the old and new literature. Front Hum Neurosci 4:46. doi:10.3389/fnhum.2010.00046 pmid:20577584
    OpenUrlCrossRefPubMed
  73. ↵
    1. Sinai A,
    2. Crone NE,
    3. Wied HM,
    4. Franaszczuk PJ,
    5. Miglioretti D,
    6. Boatman-Reich D
    (2009) Intracranial mapping of auditory perception: event-related responses and electrocortical stimulation. Clin Neurophysiol 120:140–149. doi:10.1016/j.clinph.2008.10.152 pmid:19070540
    OpenUrlCrossRefPubMed
  74. ↵
    1. Sinha SR,
    2. Crone NE,
    3. Fotta R,
    4. Lenz F,
    5. Boatman DF
    (2005) Transient unilateral hearing loss induced by electrocortical stimulation. Neurology 64:383–385. doi:10.1212/01.WNL.0000149524.11371.B1 pmid:15668450
    OpenUrlCrossRefPubMed
  75. ↵
    1. Skrede KK,
    2. Malthe-Sørenssen D
    (1981) Increased resting and evoked release of transmitter following repetitive electrical tetanization in hippocampus: a biochemical correlate to long-lasting synaptic potentiation. Brain Res 208:436–441. doi:10.1016/0006-8993(81)90573-4 pmid:6260292
    OpenUrlCrossRefPubMed
  76. ↵
    1. Smith KR,
    2. Hsieh IH,
    3. Saberi K,
    4. Hickok G
    (2010) Auditory spatial and object processing in the human planum temporale: no evidence for selectivity. J Cogn Neurosci 22:632–639. doi:10.1162/jocn.2009.21196 pmid:19301992
    OpenUrlCrossRefPubMed
  77. ↵
    1. Theunissen FE,
    2. David SV,
    3. Singh NC,
    4. Hsu A,
    5. Vinje WE,
    6. Gallant JL
    (2001) Estimating spatio-temporal receptive fields of auditory and visual neurons from their responses to natural stimuli. Netw Comput Neural Syst 12:289–316. doi:10.1080/net.12.3.289.316
    OpenUrlCrossRef
  78. ↵
    1. Van Doren J,
    2. Langguth B,
    3. Schecklmann M
    (2014) Electroencephalographic effects of transcranial random noise stimulation in the auditory cortex. Brain Stimul 7:807–812.
    OpenUrl
  79. ↵
    1. Wang K,
    2. Shamma S
    (1994) Self-normalization and noise-robustness in early auditory representations. Speech Audio Process IEEE Trans 2:421–435. doi:10.1109/89.294356
    OpenUrlCrossRef
  80. ↵
    1. Warren JD,
    2. Griffiths TD
    (2003) Distinct mechanisms for processing spatial sequences and pitch sequences in the human auditory brain. J Neurosci 23:5799–5804. doi:10.1523/JNEUROSCI.23-13-05799.2003 pmid:12843284
    OpenUrlAbstract/FREE Full Text
  81. ↵
    1. Warren JD,
    2. Zielinski BA,
    3. Green GG,
    4. Rauschecker JP,
    5. Griffiths TD
    (2002) Perception of sound-source motion by the human brain. Neuron 34:139–148. doi:10.1016/S0896-6273(02)00637-2 pmid:11931748
    OpenUrlCrossRefPubMed
  82. ↵
    1. Wilson RH,
    2. McArdle RA,
    3. Smith SL
    (2007) An evaluation of the BKB-SIN, HINT, QuickSIN, and WIN materials on listeners with normal hearing and listeners with hearing loss. J Speech Lang Hear Res 50:844–856. doi:10.1044/1092-4388(2007/059)
    OpenUrlCrossRefPubMed
  83. ↵
    1. Wöstmann M,
    2. Vosskuhl J,
    3. Obleser J,
    4. Herrmann C
    (2017) Opposite effects of lateralised transcranial alpha versus gamma stimulation on spatial attention. bioRxiv 180836.
  84. ↵
    1. Zatorre RJ,
    2. Bouffard M,
    3. Ahad P,
    4. Belin P
    (2002) Where is 'where' in the human auditory cortex? Nat Neurosci 5:905–909. doi:10.1038/nn904 pmid:12195426
    OpenUrlCrossRefPubMed
Back to top

In this issue

The Journal of Neuroscience: 42 (17)
Journal of Neuroscience
Vol. 42, Issue 17
27 Apr 2022
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
  • Ed Board (PDF)
Email

Thank you for sharing this Journal of Neuroscience article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Improved Speech Hearing in Noise with Invasive Electrical Brain Stimulation
(Your Name) has forwarded a page to you from Journal of Neuroscience
(Your Name) thought you would be interested in this article in Journal of Neuroscience.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Print
View Full Page PDF
Citation Tools
Improved Speech Hearing in Noise with Invasive Electrical Brain Stimulation
Prachi Patel, Bahar Khalighinejad, Jose L. Herrero, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani
Journal of Neuroscience 27 April 2022, 42 (17) 3648-3658; DOI: 10.1523/JNEUROSCI.1468-21.2022

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Respond to this article
Request Permissions
Share
Improved Speech Hearing in Noise with Invasive Electrical Brain Stimulation
Prachi Patel, Bahar Khalighinejad, Jose L. Herrero, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani
Journal of Neuroscience 27 April 2022, 42 (17) 3648-3658; DOI: 10.1523/JNEUROSCI.1468-21.2022
Reddit logo Twitter logo Facebook logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • Introduction
    • Materials and Methods
    • Results
    • Discussion
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • eLetters
  • PDF

Keywords

  • auditory cortex
  • human electrophysiology
  • speech preception

Responses to this article

Respond to this article

Jump to comment:

No eLetters have been published for this article.

Related Articles

Cited By...

More in this TOC Section

Research Articles

  • Optogenetics reveals roles for supporting cells in force transmission to and from outer hair cells in the mouse cochlea
  • Pre-saccadic neural enhancements in marmoset area MT
  • Interareal synaptic inputs underlying whisking-related activity in the primary somatosensory barrel cortex
Show more Research Articles

Behavioral/Cognitive

  • Featural representation and internal noise underlie the eccentricity effect in contrast sensitivity
  • Dissociative effects of age on neural differentiation at the category and item level
  • Intonation Units in Spontaneous Speech Evoke a Neural Response
Show more Behavioral/Cognitive
  • Home
  • Alerts
  • Visit Society for Neuroscience on Facebook
  • Follow Society for Neuroscience on Twitter
  • Follow Society for Neuroscience on LinkedIn
  • Visit Society for Neuroscience on Youtube
  • Follow our RSS feeds

Content

  • Early Release
  • Current Issue
  • Issue Archive
  • Collections

Information

  • For Authors
  • For Advertisers
  • For the Media
  • For Subscribers

About

  • About the Journal
  • Editorial Board
  • Privacy Policy
  • Contact
(JNeurosci logo)
(SfN logo)

Copyright © 2023 by the Society for Neuroscience.
JNeurosci Online ISSN: 1529-2401

The ideas and opinions expressed in JNeurosci do not necessarily reflect those of SfN or the JNeurosci Editorial Board. Publication of an advertisement or other product mention in JNeurosci should not be construed as an endorsement of the manufacturer’s claims. SfN does not assume any responsibility for any injury and/or damage to persons or property arising from or related to any use of any material contained in JNeurosci.