Elsevier

NeuroImage

Volume 21, Issue 2, February 2004, Pages 494-506
NeuroImage

Learning new sounds of speech: reallocation of neural substrates

https://doi.org/10.1016/j.neuroimage.2003.09.071Get rights and content

Abstract

Functional magnetic resonance imaging (fMRI) was used to investigate changes in brain activity related to phonetic learning. Ten monolingual English-speaking subjects were scanned while performing an identification task both before and after five sessions of training with a Hindi dental–retroflex nonnative contrast. Behaviorally, training resulted in an improvement in the ability to identify the nonnative contrast. Imaging results suggest that the successful learning of a nonnative phonetic contrast results in the recruitment of the same areas that are involved during the processing of native contrasts, including the left superior temporal gyrus, insula–frontal operculum, and inferior frontal gyrus. Additionally, results of correlational analyses between behavioral improvement and the blood-oxygenation-level-dependent (BOLD) signal obtained during the posttraining Hindi task suggest that the degree of success in learning is accompanied by more efficient neural processing in classical frontal speech regions, and by a reduction of deactivation relative to a noise baseline condition in left parietotemporal speech regions.

Introduction

Infants aged 6 months or younger are able to discriminate speech sounds, including many that are not used to distinguish words in their native language. However, during development and starting as early as at 6 months of age, lack of experience with certain nonnative speech sounds results in a developmental shift from a language-general to a language-specific pattern of phonetic perception Best et al., 1988, Jusczyk, 1995, Kuhl, 2000, Kuhl et al., 1992, Polka and Werker, 1994, Werker and Lalonde, 1988, Werker and Tees, 1984a. Most adults can better distinguish two speech sounds belonging to different phonetic categories than ones belonging to the same category, even when the physical differences separating the stimuli have been equated Flege, 1984, Liberman, 1957, Liberman et al., 1957, Liberman et al., 1967, Pisoni et al., 1982, Werker and Tees, 1984b. Despite native-language phonetic perception, adults are capable of learning new languages, and thereby of learning to distinguish nonnative phonetic contrasts. Interestingly, even amongst adults with very similar language backgrounds, considerable individual differences exist in their ability to improve following phonetic training Polka, 1991, Priutt et al., 1990, Strange and Dittman, 1984, Strange et al., 1989, Werker et al., 1981. This finding leads to important questions regarding the functional neural substrates underlying the perception of native versus newly learned, nonnative speech sounds, and more specifically, regarding possible differences in functional anatomy between individuals who successfully learn new speech sounds and those who do not benefit from training.

The neural correlates of phonetic perception have been studied using functional brain imaging techniques such as PET and fMRI. These experiments have involved auditory presentation of stimuli including words, speech syllables, and meaningless speech sounds, and tasks used have included passive listening, phoneme monitoring, discrimination, or identification, and rhyming judgments. Generally, the results have shown the involvement of regions in and around what is classically known as “Wernicke's area”, including left-sided activations in perisylvian temporoparietal areas including the supramarginal and angular gyri Binder et al., 1996, Binder et al., 1997, Démonet et al., 1994a, Paulesu et al., 1993, Petersen et al., 1988, Zatorre et al., 1992, Zatorre et al., 1996. Consistent with functional imaging work, there is also evidence from lesion studies that deficits in phonological processing may arise from damage to perisylvian regions in and around Wernicke's area, including the left superior temporal gyrus and the supramarginal gyrus Benson, 1967, Benson et al., 1973, Geschwind, 1970, Geschwind, 1971. Results of functional imaging work specifically examining phonetic perception have also typically shown activity in the superior temporal gyrus (STG) bilaterally Binder et al., 1994, Hickok and Poeppel, 2000, Jäncke et al., 1998, Mazoyer et al., 1993, Mummery et al., 1999.

The involvement of regions in and around the frontal speech area classically known as Broca's area in phonological processing has been the subject of controversy. Results of some studies involving receptive speech-related tasks have not shown activation in this region Petersen et al., 1989, Rumsey et al., 1992. In contrast, a larger number of studies have shown its involvement in purely receptive language tasks that make certain specific demands Burton et al., 2000, Démonet et al., 1992, Démonet et al., 1994b, Fiez et al., 1995, Zatorre et al., 1992, Zatorre et al., 1996. The frontal regions identified differ across studies, making the interpretation of the roles of such regions more difficult. Although speech perception has not been investigated extensively in aphasic patients with lesions in and around Broca's area, existing studies have shown deficits in phonetic discrimination Blumstein et al., 1977, Tallal and Newcombe, 1978, and in temporal perception (Tallal and Newcombe, 1978) in such patients.

Plasticity of auditory function resulting from training and experience has been shown using techniques such as single cell recordings in animals Kraus and Disterhoft, 1982, Recanzone et al., 1993, magnetoencephalography (Pantev et al., 1999), and event-related potentials (ERP) Kraus et al., 1995, Tremblay et al., 1998 in humans. For example, behavioral training of two slightly different native speech stimuli in adults results in a significant change in the duration and magnitude of the mismatch negativity (MMN) (Kraus et al., 1995), an auditory cortical response to acoustic change that is introduced in a repetitive stimulus sequence Näätänen et al., 1978, Näätänen et al., 1993. This physiological change precedes behavioral discrimination improvements (Tremblay et al., 1998), suggesting that the MMN is a measure of preattentive learning (see Kraus and Cheour, 2000). A number of studies show hemispheric asymmetries in the MMN Alho et al., 1998, Csépe, 1995, Tervaniemi et al., 2000. Tremblay et al. (1997) showed that MMNs elicited by nonnative speech syllables were initially symmetrical, but that they became enhanced over the left hemisphere following training.

The aim of the present study was to determine how the pattern of brain activity may change as a result of training with speech sounds from a nonnative language. Subjects were scanned using fMRI before and after a 2-week period of phonetic training with a Hindi dental–retroflex contrast. During scanning, a native phonetic contrast was used as a control. A noise control condition was also used to subtract out lower level acoustic processing, and to make the results more comparable to those of previous studies on phonetic processing Binder et al., 2000, Zatorre et al., 1992. We wanted to address the following questions. First, does the identification of newly learned speech sounds recruit the same neural substrates as does the identification of a known, native phonetic contrast, or are new areas recruited? The second question relates to whether we can differentiate “learners” from “nonlearners” on the basis of their pattern of activation while they classify the new speech sounds. We predicted firstly that the native identification task would reveal the bilateral involvement of superior temporal regions, stronger in the left than in the right hemisphere, of the left temporoparietal region, and of the left inferior frontal gyrus (IFG) in and adjacent to Broca's area. Second, based on the above reported lateralization of the MMN response to nonnative speech sounds following training, we predicted that before training, the neural response to nonnative speech sounds would be bilateral, but that it would be more left lateralized after training. We also predicted that after training, the pattern of activation outside of the auditory regions (i.e., in the left temporoparietal and inferior frontal regions) would be similar to that found in the native condition. This prediction is also based on results of neuroimaging studies of language function in healthy bilinguals, some of which show that at the single word level, brain regions subserving the native language (L1) and the second language (L2) in fluent bilinguals appear to overlap Chee et al., 1999, Illes et al., 1999, Klein et al., 1994, Klein et al., 1995. Last, based on the assumption that more successful task performance recruits underlying neural substrates more actively, we predict that correlations between a behavioral learning measure and the blood-oxygenation-level-dependent (BOLD) signal during the posttraining nonnative task would reveal a positive relationship between learning and signal in left prefrontal and left temporoparietal speech areas.

Section snippets

Subjects

Ten right-handed monolingual English-speaking participants (4 men), ranging in age from 20 to 29 participated in the study. None had been exposed to or had experience with languages in which the retroflex speech sound is phonologically represented.

Stimulus selection

We selected the dental–retroflex place-of-articulation contrast, which is used in languages of India such as Hindi or Urdu. Retroflex consonants require a relatively complex articulation, they are rare across languages (only 11% of the world's

Behavioral results

During familiarization, subjects could identify the native /da/ and /ta/ sounds 100% of the time. None of the subjects could hear any difference between the dental /da/ and retroflex /da/ sounds, all subjects identified both of these sounds as the dental /da/. The following are the behavioral results of identification performance during scanning. One out of the 10 subjects did not respond to over 50% of the pre- and posttraining identification trials, therefore we excluded this subject's

Behavioral results

The behavioral results followed the expected pattern. They indicate that the training procedure was effective in producing an overall improvement in subjects' identification of the dental–retroflex contrast during the posttraining relative to the pretraining fMRI test sessions, although not all subjects learned to the same extent. This finding is consistent with results of a previous behavioral study (Golestani et al., submitted for publication), in which we showed, using the same paradigm as

Acknowledgements

We thank Pierre Ahad for help in creating the synthetic stimuli and Rhonda Amsel for statistical advice. We also thank Michael Petrides, Valentina Petre, Pascal Belin, Keith Worsley, Marc Bouffard, and Peter Neelin for technical assistance and consultation, and Bruce Pike for access to the MNI Brain Imaging Centre facilities.

Funding was provided by the Canadian Institutes of Health Research (Operating grants 11541 and 14995) and by the McDonnell-Pew Cognitive Neuroscience Program.

References (96)

  • G Hickok et al.

    Towards a functional neuroanatomy of speech perception

    Trends Cogn. Sci.

    (2000)
  • J Illes et al.

    Convergent cortical representation of semantic processing in bilinguals

    Brain Lang.

    (1999)
  • L Jäncke et al.

    Intensity coding of auditory stimuli: an fMRI study

    Neuropsychologia

    (1998)
  • P.W Jusczyk

    Language acquisition: speech sounds and the beginning of phonology

  • T Klingberg et al.

    Microstructure of temporo-parietal white matter as a basis for reading ability: evidence from diffusion tensor magnetic resonance imaging

    Neuron

    (2000)
  • N Kraus et al.

    Response plasticity of single neurons in rabbit auditory association cortex during tone-signalled learning

    Brain Res.

    (1982)
  • A.M Liberman et al.

    The motor theory of speech perception revised

    Cognition

    (1985)
  • P Lieberman et al.

    Speech production, syntax comprehension, and cognitive deficits in Parkinson's disease

    Brain Lang.

    (1992)
  • R Näätänen et al.

    Early selective attention effect on evoked potential reinterpreted

    Acta Psychol.

    (1978)
  • C Pantev et al.

    Short-term plasticity of the human auditory cortex

    Brain Res.

    (1999)
  • B Pfleiderer et al.

    Visualization of auditory habituation by fMRI

    NeuroImage

    (2002)
  • E.R Pickett et al.

    Selective speech motor, syntax and cognitive deficits associated with bilateral damage to the putamen and the head of the caudate nucleus: a case study

    Neuropsychologia

    (1998)
  • R.A Poldrack

    Imaging brain plasticity: conceptual and methodological issues—A theoretical review

    NeuroImage

    (2000)
  • K.N Stevens et al.

    Quantal aspects of consonant production and perception: a study of retroflex stop consonants

    J. Phon.

    (1975)
  • P Tallal et al.

    Impairment of auditory perception and language comprehension in dysphasia

    Brain Lang.

    (1978)
  • J.F Werker et al.

    Cross-language speech perception: evidence for perceptual reorganization during the first year of life

    Infant Behav. Dev.

    (1984)
  • R.J Wise et al.

    Brain regions involved in articulation

    Lancet

    (1999)
  • R.J Zatorre et al.

    Functional and structural imaging of the human auditory system

  • S Aglioti et al.

    Neurolinguistic and follow-up study of an unusual pattern of recovery from bilingual subcortical aphasia

    Brain

    (1996)
  • K Alho et al.

    Processing of novel sounds and frequency changes in the human auditory cortex: magnetoencephalographic recordings

    Psychophysiology

    (1998)
  • D.F Benson et al.

    Conduction aphasia: a clinicopathological study

    Arch. Neurol.

    (1973)
  • C.T Best et al.

    Examination of perceptual reorganization for nonnative speech contrasts: Zulu click discrimination by English-speaking adults and infants

    J. Exp. Psychol. Hum. Percept. Perform.

    (1988)
  • J.R Binder et al.

    Functional magnetic resonance imaging of human auditory cortex

    Ann. Neurol.

    (1994)
  • J.R Binder et al.

    Function of the left planum temporale in auditory and linguistic processing

    Brain

    (1996)
  • J.R Binder et al.

    Human brain language areas identified by functional magnetic resonance imaging

    J. Neurosci.

    (1997)
  • J.R Binder et al.

    Human temporal lobe activation by speech and non-speech sounds

    Cereb. Cortex

    (2000)
  • R.L Buckner et al.

    Dissociation of human prefrontal cortical areas across different speech production tasks and gender groups

    J. Neurophysiol.

    (1995)
  • D.K Burnham

    Developmental loss of speech perception: exposure to and experience with a first language

    Appl. Psycholinguist.

    (1986)
  • M.W Burton et al.

    The role of segmentation in phonological processing: an fMRI investigation

    J. Cogn. Neurosci.

    (2000)
  • M.W.L Chee et al.

    Mandarin and English single word processing studied with functional magnetic resonance imaging

    J. Neurosci.

    (1999)
  • D.L Collins et al.

    Automatic 3D intersubject registration of MR volumetric data in standardized Talairach space

    J. Comput. Assist. Tomogr.

    (1994)
  • V Csépe

    On the origin and development of the mismatch negativity

    Ear Hear.

    (1995)
  • J.F Démonet et al.

    The anatomy of phonological and semantic processing in normal subjects

    Brain

    (1992)
  • J.F Démonet et al.

    A PET study of cognitive strategies in normal subjects during language tasks: influence of phonetic ambiguity and sequence processing on phoneme monitoring

    Brain

    (1994)
  • J.A Fiez

    Phonology, semantics, and the role of the left inferior prefrontal cortex

    Hum. Brain Mapp.

    (1997)
  • J.A Fiez et al.

    PET studies of auditory and phonological processing: effects of stimulus characteristics and task demands

    J. Cogn. Neurosci.

    (1995)
  • J.E Flege

    The effect of linguistic experience on Arabs' perception of the English /s/ vs /z/ contrast

    Folia Linguist.

    (1984)
  • N Geschwind

    The organization of language and the brain

    Science

    (1970)
  • Cited by (203)

    • Cross-linguistic influences of L1 on L2 morphosyntactic processing: An fNIRS study

      2022, Journal of Neurolinguistics
      Citation Excerpt :

      Learning a second language (L2) yields both functional and neuroanatomical changes (e.g., Golestani & Zatorre, 2004; Jasińska et al., 2017; Jasińska & Petitto, 2013; Jasińska & Petitto, 2014; Klein et al., 2014; Mechelli et al., 2004).

    View all citing articles on Scopus
    View full text