Abstract
Are experts born with particular predispositions, or are they made through experience? We examined brain structure in expert phoneticians, individuals who are highly trained to analyze and transcribe speech. We found a positive correlation between the size of left pars opercularis and years of phonetic transcription training experience, illustrating how learning may affect brain structure. Phoneticians were also more likely to have multiple or split left transverse gyri in the auditory cortex than nonexpert controls, and the amount of phonetic transcription training did not predict auditory cortex morphology. The transverse gyri are thought to be established in utero; our results thus suggest that this gross morphological difference may have existed before the onset of phonetic training, and that its presence confers an advantage of sufficient magnitude to affect career choices. These results suggest complementary influences of domain-specific predispositions and experience-dependent brain malleability, influences that likely interact in determining not only how experience shapes the human brain but also why some individuals become engaged by certain fields of expertise.
Introduction
Expertise has been shown to have both functional and structural correlates in the human brain. For example, expert golfers show a different pattern of neural activity than novice golfers when planning shots (Milton et al., 2007), and London taxi drivers have a larger posterior hippocampal volume than matched controls (Maguire et al., 2000). It can be difficult to establish, however, the extent to which these effects relate to preexisting differences between the novice and expert groups, or whether these effects mainly arise from training-induced plasticity. Here we investigate brain anatomy in expert phoneticians—individuals who are specialized in the study of phonetics. We aimed to distinguish experience-dependent plasticity from brain structural features that existed before the onset of expertise training.
Phoneticians typically spend one to four years of formal training learning to identify speech sounds and to transcribe them into an international phonetic alphabet. In many languages, there is no one-to-one correspondence between pronunciation and orthography, and narrow phonetic transcription involves parsing and identifying speech sounds so as to provide a direct correspondence between sound and symbol. Phoneticians are thus very quick and skilled at identifying even subtle regional accents. Unlike other kinds of expertise, such as musical skills, phonetic transcription skills are acquired in adulthood, and we can precisely measure the amount of phonetic transcription training and experience. This allowed us to specifically test the effects on brain structure of extensive and naturalistic training (i.e., not in the laboratory) during adulthood.
To more generally characterize brain structural features underlying this form of language expertise, we also tested for brain structural differences between phoneticians and nonexpert controls. Functional brain imaging and transcranial magnetic stimulation studies have shown that the posterior portion of the left inferior frontal gyrus (the left pars opercularis) and the left supramarginal gyrus are involved in and even necessary for phonological processing and for subvocal rehearsal in verbal working memory (Paulesu et al., 1993; Zatorre et al., 1996; Burton et al., 2000; Nixon et al., 2004; Gough et al., 2005; Hartwigsen et al., 2010). Further, previous findings have shown that brain structure in the left auditory cortex, the parietal cortices, and the left inferior frontal cortex partly predicts individual differences in the perception and production of foreign speech sounds in healthy nonexperts (Golestani et al., 2002, 2007; Golestani and Pallier, 2007). We thus aimed to extend these normative findings to expertise by testing for group differences in left auditory, frontal, and parietal cortex morphology between phoneticians and nonphoneticians. Findings are discussed in terms of experience-dependent plasticity and of genetic studies suggesting differential degrees of experience-dependent malleability of different brain regions.
Materials and Methods
Participants
Seventeen phoneticians (11 women) and 16 age- and gender-matched controls (nine women) participated in the study. All participants were healthy (screened for neurological and psychiatric problems) and right-handed. The mean age of the phoneticians was 39.76 (SD, 13.17) years and that of the controls was 36.35 (SD, 10.55) years. Ten of the 17 phoneticians were native English speakers, four were early bilinguals (i.e., they started learning two languages in parallel early in life), and the remaining three were native speakers of a language other than English. All participants gave informed consent and the study was approved by the National Hospital and Institute of Neurology's joint medical ethics committee.
Phoneticians were recruited via a posting to a mailing list dedicated to phonetics. They worked in fields including academia, speech and language therapy, radio broadcasting (e.g., pronunciation experts), and forensics. Within the group, phoneticians had 1–9 years (mean, 2.8 ± 2.3 years) of formal training in phonetic transcription. This range provided the necessary variation for a correlation that explicitly tested how brain structure was related to the number of years of phonetic training. The phoneticians' ability to learn unfamiliar phonetic contrasts was tested using a synthetic Hindi dental-retroflex contrast. Prior work has already established that the speed at which healthy (nonexpert) individuals learn new phonetic contrasts on this task is positively related to brain structure in left Heschl's gyrus (Golestani et al., 2007). The choice of this task allowed us to integrate the current study with previous findings. Details of the stimuli and training procedure can be found in Golestani et al. (2002, 2007). See Other findings, below for further details on the language backgrounds and early language experience of our participants.
Image acquisition and analysis
We used a 1.5 tesla Siemens Sonata MRI scanner to acquire high resolution, three-dimensional, T1-weighted images using a phased array coil (scanning parameters: matrix, 256 × 224 × 176; isotropic resolution, 1 mm; repetition time, 20.66 ms; echo time, 8.42 ms; inversion time, 640 ms; α = 25°).
Table 1 provides an overview of the different analysis methods used, the motivation for using these, the regions and measures tested, and the results obtained.
Overview of morphometric analyses
Exploratory, whole-brain analysis: voxel-based morphometry
Voxel-based morphometry (VBM) (Ashburner and Friston, 2000), implemented in SPM5 (Wellcome Department of Imaging Neuroscience, London, UK), was used to look for brain structural differences between phoneticians and controls. This method allows an exploratory search throughout the brain; it does not require subjective identification of anatomic boundaries and, hence, the use of arbitrary or conventional definitions of particular brain structures (Golestani et al., 2002).
For VBM, the anatomical images were processed in three steps, as follows: tissue segmentation, linear spatial normalization, and smoothing at 6 mm. Note that VBM does not provide a direct measure of the volume or surface area of specific structures. Specifically, since VBM is based on tissue-segmented gray or white matter maps, it is particularly sensitive to differences in morphology that exist at gray/white matter boundaries, and such sensitivity may hide the presence of morphological differences that lie within a gray or white matter region or structure per se. Given that we complemented VBM with morphometric analyses, including manual and automatic volume and surface area extraction (see Image acquisition and analysis, below), we did not modulate the images (i.e., adjust for volume) during preprocessing. We compared the smoothed gray and white matter images between the phoneticians and controls on a voxel-by-voxel basis using independent samples t tests. We used an uncorrected voxelwise threshold of p < 0.005 with small-volume corrections in regions of interest predicted by previous studies, these being the left and right transverse gyri (Golestani et al., 2007). For the small volume corrections, we used the 40% thresholded probability maps of the macroanatomically defined left and right transverse gyri in the phoneticians.
Hypothesis-driven, region of interest analyses
Transverse gyri of the auditory cortex: macroanatomical labeling.
We performed macroanatomical labeling on the left and right transverse gyri of the auditory cortex on anatomically normalized images using Anatomist (http://brainvisa.info/). This software allows simultaneous real-time viewing and voxel labeling on sagittal, coronal, and horizontal planes, and computation of three-dimensional volumes. We labeled multiple transverse gyri in cases where they existed. The choice of this labeling criterion was motivated by previous results showing gross morphology differences in this region between faster and slower phonetic learners (Golestani et al., 2007), and by the location of the VBM result in the left auditory cortex (see Results, below). Our transverse gyrus measure includes the subregion of the planum temporale that is adjacent to Heschl's gyrus. It is known that the primary auditory cortex (PAC) is usually confined to the transverse gyrus in cases where only one is present, but that when multiple transverse gyri are present, the more anterior one is most likely to include PAC but the most posterior one(s) is (are) likely to include secondary auditory cortex (Pfeifer, 1920; Rademacher et al., 2001). Hence, the planum temporale includes secondary (and tertiary, etc.) transverse gyri and, as such, our inclusion of multiple transverse gyri allowed us to simultaneously examine regions likely to include both the primary auditory cortex as well as planum temporale subregions directly adjacent to core primary auditory cortex.
The macroanatomical definition of the transverse gyrus used for labeling was as follows (Fig. 1). The first transverse sulcus (FTS) always constituted the anterior boundary. The posterior boundary was the most posterior transverse sulcus, this being posterior to second or third transverse gyri when they existed, or being Heschl's sulcus in the case of a single or split transverse gyrus. The medial boundary was drawn from the medial end of the FTS to the medial end of the most posterior transverse sulcus, and the inferior boundary was drawn along a plane running from the depth of the FTS to the depth of the posterior temporal sulcus or of Heschl's sulcus.
Transverse gyrus landmarks and boundaries shown on left hemisphere with two transverse gyri.
The rater was blind to participant group and to hemisphere, and the labeling was performed twice. The total (left and right) transverse gyrus volumes were significantly correlated across labeling sessions (Pearson's r = 0.984), providing evidence for labeling reliability.
Transverse gyri of the auditory cortex: gross morphology.
We quantified the gross morphology of the transverse gyri as follows. We gave a score of 1 if there was one transverse gyrus as viewed on a horizontal section, 2 if there were two, and 3 if there were three. We gave a score of 1.25 if the gyrus was split such that the intermediate sulcus (SI) extended to approximately one-quarter of the length of Heschl's gyrus (HG), 1.5 if it extended to approximately half the length of HG, and 1.75 if it extended to approximately three-quarters the length of HG. Again, the rater was blind to group and to hemisphere, and the ratings were performed twice and were found to be reliable (Person's r = 0.948 for the mean left vs right transverse gyrus splitting measure). Validation of the findings was further provided by the automated gyrification measure (see below).
Independent samples t tests were used to compare transverse gyrus volumes and the degree of splitting between groups. Based on previous findings showing that faster phonetic learners have a larger left Heschl's gyrus and greater left transverse gyrus splitting or duplication than slower learners (Golestani et al., 2007), these tests were unidirectional. A measure of asymmetry [lateralization index (LI)] was then calculated on the manual gyrification measure (Gm) for each participant as follows: LI = [(GmR − GmL)/(GmR + GmL)/2], where R and L indicate right and left, respectively (cf. Penhune et al., 1996). Rightward asymmetries result in positive values, and local gyrification was considered to be asymmetric when |LI| > 0.10.
We also tested for experience-dependent differences in these measures within the phoneticians group. For this, we performed Pearson's correlations between years of formal phonetic transcription training (see Participants, above) and these measures.
Transverse gyri of the auditory cortex: automated measure of transverse gyrus local gyrification.
To further examine gross morphology differences between groups, Freesurfer was used to automatically provide a measure of mean local gyrification (mean curvature) in the transverse gyri of all participants and in both hemispheres. LI was then calculated on the automated gyrification measure (Ga) for each participant as follows: LI = [(GaR − GaL)/(GaR + GaL)/2] (cf. Penhune et al., 1996). Rightward asymmetries result in positive values, and local gyrification was considered to be asymmetric when |LI| > 0.10.
Frontal and parietal regions of interest
Freesurfer (Fischl et al., 2004; Desikan et al., 2006) was used to automatically extract gray matter volumes and surface areas of the left pars opercularis and of the left supramarginal gyrus. We tested for differences between the phoneticians and controls in these measures using independent samples t tests, and performed Bonferroni corrections for the number of regions of interest (ROIs) tested. Based on previous findings that a greater tissue density positively predicts individual differences in phonetic processing in these regions (Golestani and Pallier, 2007; Golestani et al., 2007), all tests were unidirectional. We also tested for experience-dependent differences in these measures within the phoneticians group. For this, we used partial correlations controlling for overall cortical volume to correlate years of phonetic transcription training with these measures. Given that partial correlations are sensitive to outliers, we excluded two outliers who had the most years of phonetic transcription training (>2 SDs above the mean) and who strengthened some of the results (see below).
We were not able to provide measures of surface area for the transverse gyri that are methodologically comparable to those derived for the supramarginal gyrus/pars opercularis because the Freesurfer software does not allow computation of surface area on regions that are imported from other sources, such as those defined by the macroanatomical labeling.
Other findings
Language experience
To statistically account for language experience in the phoneticians group, we evaluated language experience in the following manner. Each individual answered a detailed, nonstandardized questionnaire about their language experience background, including the number of languages that they had learnt formally and informally, the fluency with which they spoke each language, and the age at which they had acquired each language. The phoneticians were more multilingual than were controls. On average, they had had formal language instruction in up to 10 languages (mean number of languages, 5.6 ± 2.1), whereas the controls had had formal language instruction in up to four languages. Phoneticians also received more years of education than the controls (means: phoneticians, 19.29 ± 1.64 years in school plus university; controls, 15.23 ± 3.56 years; t(31) = 3.85, p = 0.001, n = 33, two-tailed). Knowledge of each language was weighted according to the age at which it had been learnt (higher weight for languages learnt earlier in life) and according to the fluency with which it was spoken (higher weight for more fluently spoken languages). The following weights were used: 1) proficiency: not fluent = 1, somewhat fluent = 2, moderately fluent = 3, quite fluent = 4, very fluent = 5, native = 6; 2) age of acquisition: ages 21+ = 1, ages 13–20 = 2, ages 7–12 = 3, ages 1–6 = 4, at birth = 5. A continuous language score was then calculated for each individual and this measure was used to determine whether language experience could account for some of the observed results.
Educational level
To ensure that the educational level of the phoneticians does not explain the structural effects (phoneticians had an average of 19.3 ± 1.6 years of education in total), we tested to see whether the number of years of education predicts various structural measures within the phoneticians.
Relationship between auditory, frontal, and parietal cortex ROI measures and years of transcription training, including two outliers
We repeated the partial correlations examining the relationship between various ROI measures and the amount of transcription training, including the two outliers (all 17 phoneticians).
Relationship between years of transcription training and brain structure in control regions
To ensure that our results were specific to the left pars opercularis, we also used partial correlations, controlling for overall cortical volume, to examine the relationship between years of transcription training and brain structure in several control regions: the right hemisphere homolog to the left pars opercularis, and the left and right lateral occipital gyri.
Relationship between brain structure and nonnative phonetic learning in phoneticians
To determine whether individual differences in skill among the phoneticians predicts aspects of brain structure in this group, we behaviorally tested the phoneticians on their perceptual learning of a synthetic Hindi dental-retroflex contrast. This procedure and the stimuli used were the same as have been used in several studies examining brain structural correlates of phonetic learning (Golestani et al., 2002, 2007), and it has been shown that healthy (nonexpert) individuals who learn to hear this contrast more rapidly have a larger left Heschl's gyrus than do slower learners. We calculated a continuous learning rate measure (L scores) on how quickly participants learned to hear the dental-retroflex contrast during an identification-with-feedback training paradigm, and used this to test for relationships between nonnative phonetic learning and brain structure.
Results
Table 1 provides an overview of the exploratory and hypothesis-driven analyses and results.
Voxel-based morphometry
VBM showed that phoneticians are more likely than controls to have more white matter in the right Heschl's gyrus (t = 3.34; Pfamily-wise error corrected = 0.05 after small-volume correction) (Fig. 2) compared with nonexperts. There was a similar trend in the left hemisphere, in a region just posterior to left Heschl's gyrus (LHG) (t = 2.80; p < 0.005 uncorrected) (Fig. 2), at a location where a second or third transverse gyrus would be expected to lie in cases of transverse gyrus splitting/duplication. This more posterior region behind LHG was not included in our a priori regions of interest but is noted here for completeness. There were no gray matter differences between groups.
VBM anatomical difference in left and right transverse gyri: statistical parametric map showing greater white matter density in phoneticians compared with controls (MNI coordinates: left: x = −56, y = −22, z = 4; and right: x = 50, y = −16, z = 2). Note that for display purposes, images are presented at a threshold of 0.01, uncorrected.
ROI analyses
We subsequently examined brain structure in three ROIs: the auditory cortex bilaterally, the left pars opercularis, and the left supramarginal gyrus.
Transverse gyri
First, to further characterize the VBM finding described above, we macroanatomically defined and labeled (Fig. 1) the left and right transverse gyri of the auditory cortex on anatomically normalized images. Previous findings in healthy, nonexpert adults have shown that people who are faster at learning to hear and identify foreign speech sounds have not only a larger left Heschl's gyrus than slower phonetic learners, but also that they are more likely to have multiple or split left transverse gyri than slower learners (Golestani et al., 2007). Note that the location of the above left hemisphere result (posterior to Heschl's gyrus, at the location of secondary or tertiary transverse gyri in brains where these exist), suggests that phoneticians may be more likely than nonphoneticians to have multiple or split left transverse gyri. Thus, based on previous findings in healthy individuals, and based on the above VBM result, we tested for differences in both transverse gyrus volumes and also for local differences in brain shape, or gross morphology. We found that phoneticians have larger left (t(31) = 2.02, p < 0.05) and right (t(31) = 3.07, p < 0.005) transverse gyri compared with nonexperts (Fig. 3), but the number of years of phonetic training was not related to the left (r = −0.33, p = 0.19) nor right (r = 0.09, p = 0.75) transverse gyrus volumes within the phoneticians group. The effects we observed were specific to the transverse gyri; additional analyses of the area that Freesurfer labels the “posterior banks of the superior temporal sulcus” (the supratemporal plane behind Heschl's gyrus) found no significant difference for phoneticians compared with controls in volume (t(31)= −1.21, p = 0.24) or surface area (t(31)= −1.65, p = 0.11), and no significant relationship between the number of years of phonetic transcription training and left volume (r = −0.21, p = 0.47) or surface area (r = −0.17, p = 0.56). Both the whole brain VBM and region of interest analyses therefore focus the effects of interest to the transverse gyri rather than the surrounding area.
Transverse temporal gyri manual labeling: group differences and fifty-percent thresholded probability maps for phoneticians (red) and nonexperts (blue).
We also tested for gross morphology differences in the transverse gyri between groups, and found that phoneticians are more likely to have multiple or split left (t(31) = 2.39, p < 0.01) but not right (t(31) = 0.19, p > 0.05) transverse gyri than controls (Fig. 4). Specifically, 14 of 17 (82%) of the phoneticians but only 7 of 16 (44%) of the controls had either a duplicate or a split left transverse gyrus, and 13 of 17 (77%) of the phoneticians and 11 of 16 (69%) of the controls had either a duplicate or a split right transverse gyrus. A formal measure of asymmetry showed the same result: there was rightward asymmetry in the degree of splitting of the transverse gyri in the controls (LI, 0.20 ± 0.41), whereas in the phoneticians, the degree of splitting in this region was symmetrical (LI, −0.02 ± 0.28). There was also a significant group difference in the lateralization index, with phoneticians showing less rightward asymmetry than controls (t(31) = 1.77, p < 0.05, one-tailed). In support of this finding, a complementary, automated measure of local gyrification (mean cortical folding, or curvature) in the transverse gyri showed a similar result: there was a trend toward lesser right > left asymmetry in the phoneticians compared with the controls (t(31) = 1.49, p = 0.07, one-tailed).
Hemisphere by group means showing degree of transverse gyrus splitting, or duplication, showing greater likelihood of multiple or split left transverse in phoneticians compared with controls.
There was no relationship between the number of years of phonetic training and degree of left (r = −0.19, p = 0.47) nor right (r = −0.01, p = 0.98) transverse gyrus splitting within the phoneticians group.
Left pars opercularis and supramarginal gyrus
Finally, we examined brain structure in two other ROIs known to be involved in phonetic processing: the left pars opercularis and the left supramarginal gyrus. We found that phoneticians had a larger left pars opercularis volume compared with controls (t(31) = 2.02, p < 0.05, Bonferroni correction). A similar trend was observed for the surface area of the left pars opercularis (t(31) = 1.51, p = 0.07). There were no group differences in the volume or surface area of the left supramarginal gyrus.
Partial correlations controlling for overall cortical volume on the volume and surface area of these regions revealed that within the phoneticians group, years of phonetic transcription training positively predicts the surface area of the left pars opercularis (r = 0.64, p < 0.01, Bonferroni correction) (Fig. 5). A similar trend was observed with the volume of the left pars opercularis (r = 0.49, p = 0.04, Bonferroni correction) (Fig. 5). There were no significant effects of phonetic transcription training on the volume or surface area of the left supramarginal gyrus.
Scatter plots showing relationship between phonetic transcription training and left pars opercularis surface area and volume within the phoneticians group.
Note that the amount of foreign language experience did not predict various structural measures in the three regions of interest within the phoneticians group (see Other findings, below). Also, partial correlations showed no relationship between transcription training and brain structure in control regions (the right hemisphere homolog of the left pars opercularis, and unrelated visual control regions, see Other findings, below). Thus, results appear to be specific to the left pars opercularis, and are not driven by more general factors that may be confounded with training.
We also tested the phoneticians on a measure of nonnative speech sound learning to examine whether this measure of phonetic ability may predict aspects of brain structure within the phoneticians group. Specifically, we administered a short training protocol that evaluates the rate of perceptual learning of a synthetic Hindi dental-retroflex contrast (for details of the stimuli and of the training procedure, see Golestani et al., 2002, 2007). Analyses examining relationships between learning rate and brain structure within the phoneticians group did not, however, reveal significant results (see Other findings, below). This may be because this highly preselected expert group had all received extensive training with speech perception and transcription and were all working as professional phoneticians. Consequently, the correlation between brain structure and rate of learning of one particular synthetic nonnative phonetic contrast did not adequately capture existing variability in performance.
Other findings
Language experience
The amount of foreign language experience did not predict various structural measures within the phoneticians group. Specifically, partial correlations, controlling for cortical volume, did not show significant relationships within the 15 phoneticians between foreign language experience and the left pars opercularis volume (r = −0.19, p = 0.51) or surface area (r = −0.15, p = 0.62), and the left supramarginal gyrus volume (r = −0.43, p = 0.12) or surface area (r = −0.30, p = 0.30). Further, Pearson's correlations did not reveal a relationship across the 17 phoneticians between foreign language experience and left (r = −0.07, p = 0.79) nor right (r = 0.03, p = 0.91) transverse gyrus volumes, and the left (r = −0.10, p = 0.69) nor right (r = 0.13, p = 0.63) transverse gyrus splitting measures (all above tests are two-tailed).
Educational level
The number of years of education did not predict various structural measures within the phoneticians group. Specifically, partial correlations, controlling for cortical volume, did not show significant relationships within the 15 phoneticians between education and the left pars opercularis volume (r = −0.34, p = 0.24) or surface area (r = −0.28, p = 0.33), and the left supramarginal gyrus volume (r = 0.22, p = 0.46) or surface area (r = 0.07, p = 0.82). Further, Pearson's correlations did not reveal a relationship across the 17 phoneticians between education and left (r = 0.26, p = 0.32) or right (r = 0.29, p = 0.26) transverse gyrus volumes, and the left (r = −0.09, p = 0.74) or right (r = 0.21, p = 0.41) transverse gyrus splitting measures (all above tests are two-tailed).
Relationship between auditory, frontal, and parietal cortex ROI measures and years of transcription training, including two outliers
Results are very similar to those found when excluding the two outliers. Specifically, in all 17 phoneticians, partial correlations controlling for overall cortical volume show no significant relationships between phonetic training and the left supramarginal gyrus volume (r = −0.10, p = 0.72) or surface area (r = −0.01, p = 0.97). As when excluding the two outliers, there is a relationship between transcription training and structure of the left pars opercularis. Specifically, transcription training significantly predicts the volume of the left pars opercularis (r = 0.54, p < 0.05, Bonferroni correction) (surface area was significantly predicted when excluding the outliers), and there is a similar trend for the surface area (r = 0.42, p = 0.05).
Pearson's correlations also revealed no relationship within the 17 phoneticians between phonetic training and the left (r = −0.33, p = 0.19) or right (r = 0.09, p = 0.75) transverse gyrus volumes, and the left (r = −0.19, p = 0.47) or right (r = −0.01, p = 0.98) transverse gyrus splitting measures.
Relationship between years of transcription training and brain structure in control regions
There was no relationship between phonetic training and the volume of the right pars opercularis (r = −0.1, p = 0.74), the left lateral occipital gyrus (r = 0.02, p = 0.96), or the right lateral occipital gyrus (r = 0.12, p = 0.70). Similarly, there was no relationship between phonetic training and the surface area of the right pars opercularis (r = 0.24, p = 0.41), the left lateral occipital gyrus (r = 0.33, p = 0.26), or the right lateral occipital gyrus (r = 0.38, p = 0.18).
Relationship between brain structure and nonnative phonetic learning in phoneticians
Multiple regression analyses between the L scores and brain structure throughout the brain in a voxel-based morphometry analysis did not reveal any significant relationship between brain structure and skill within the phoneticians group. Also, there was no relationship within this group between the L scores and the left (r = −0.29, p = 0.27) and right (r = −0.12, p = 0.65) transverse gyrus volumes, or with the left (r = 0.02, p = 0.96) or right (r = −0.29, p = 0.27) transverse gyrus splitting measures.
Discussion
We find complementary evidence for experience-dependent plasticity and for brain structural features that could have existed before the onset of expertise training. The effect of experience-dependent plasticity was observed in the left pars opercularis, which was also found to be larger in phoneticians than matched controls, with the amount of phonetic transcription training within the phoneticians group predicting the surface area of this region. The left pars opercularis lies within the posterior portion of the left inferior frontal gyrus, and together with the more anterior left pars triangularis, it constitutes the classic anterior Broca's speech region. Broca's area is constituted of several anatomically distinct subregions (Amunts et al., 1999; Newman et al., 2003; Gough et al., 2005; Amunts et al., 2010; Kaplan et al., 2010). It has been shown to be involved in a range of linguistic and nonlinguistic tasks, including working memory and sequencing of verbal and nonverbal information (Braver et al., 1997; Gelfand and Bookheimer, 2003), and activation in this region has been shown to be modulated by the complexity of language processing (Sharp et al., 2010). In relation to language, it has been argued that the pars triangularis is involved in semantic processing, whereas the more posterior pars opercularis is involved in phonetic processing (Buckner et al., 1995; Price et al., 1996; Fiez, 1997; Poldrack et al., 1999; Gold and Buckner, 2002; Papoutsi et al., 2009). The finding that transcranial magnetic stimulation of the left pars opercularis disrupts phonetic processing suggests that it is not only implicated but that it is necessary for phonologically based working memory mechanisms (Nixon et al., 2004; Gough et al., 2005). The pars opercularis has also been implicated in subvocal rehearsal in verbal working memory more generally (Paulesu et al., 1993; Démonet et al., 1994), and in the extraction and manipulation of phonetic segments in verbal working memory (Zatorre et al., 1996; Burton et al., 2000). This phonetic segmentation process parallels some of the demands of phonetic transcription, where the ability to effectively parse and identify phonetic segments and to provide a one-to-one correspondence between sounds and symbols is very important. Our findings suggest that the acquisition of expertise performing this very specific linguistic task results in structural plasticity in the associated Broca's area subregion.
We also find gross morphology differences in the left auditory cortex between phoneticians and controls. Previously, we showed that faster phonetic learners have greater left transverse gyrus splitting or duplication than slower learners (Golestani et al., 2007). Here we extend this finding to an expert population and also show that the number of years of phonetic transcription training does not predict this aspect of gross morphology within the phoneticians group. Previous studies have shown transverse gyrus morphology to be normatively asymmetric, with greater likelihood of multiple transverse gyri in the right but not the left hemisphere (Pfeifer, 1920; von Economo and Horn, 1930; Beck, 1955; Campain and Minkler, 1976; Chi et al., 1977), as can be seen in our nonexpert, control group (t(15) = 1.89; p < 0.05; mean right, 1.58 ± 0.59; mean left, 1.25 ± 0.39). Other studies (Penhune et al., 1996; Warrier et al., 2009), however, have shown that the gross morphology of the transverse gyri is quite variable. Our results suggest that this variability may be explained by differences in the populations tested, ones that may in part be explained by systematic differences in domain-specific skills (Leonard et al., 2006). Specifically, we propose that relatively greater gyrification of the left transverse gyrus, which includes primary and secondary auditory cortex in the hemisphere dominant for speech, could be a neuroanatomical marker of skill (whether within normal variability or expertise) in the domain of detailed auditory processing of speech. In line with this interpretation, it has been shown that cortical gyrification at birth predicts functional outcome later in life in preterm infants (Dubois et al., 2008).
As mentioned above, the amount of transcription training did not predict the gross morphology or volume of the left transverse gyrus within the phoneticians group. It is known that the development of the transverse gyrus is established during the 31st week of gestational age, and that secondary (or tertiary) gyri appear after the 36th week of gestation (Chi et al., 1977). Also, there is no evidence, as of yet, that secondary or tertiary transverse gyri could grow during adulthood, and it has been shown that the development of the transverse gyri is stabilized by the age of seven both morphologically (Leonard et al., 1998; Preis et al., 1999) and myeologenetically (Yakovlev and Lecours, 1967). Our results therefore suggest that the left auditory cortex gross morphology differences could have existed before the onset of phonetic training. Further, it is possible that the presence of such a morphological feature may make it more likely for individuals to become phoneticians or to work in other domains requiring detailed auditory processing. In other words, the presence of such a feature may have an impact on professional specialization and on career choices. Consistent with our interpretation, genetic studies show that Broca's area morphology may be more malleable to experience than Heschl's gyrus morphology. In contrast, Heschl's gyrus shows higher heritability estimates than Broca's area (Peper et al., 2007). In other words, we have found evidence for a potential brain structural intermediate phenotype, or structural marker (Meyer-Lindenberg and Weinberger, 2006), not for disorder or disease, but for a domain-specific aptitude that can, with adequate opportunity and training, lead to expertise.
Greater folding, or gyrification, can arise from developmental mechanisms explained by the radial unit model cortical evolution (Rakic, 1988). According to this model, a higher number of radial units, or minicolumns, results in an increase in cerebral gyrification and surface area. Greater gyrification and surface area, in turn, allow for a greater capacity for establishing new patterns of brain connectivity. This radial unit model could explain individual differences in observed local aspects of brain morphology. Thus, our observations that phoneticians have greater gyrification in the left auditory cortex and overall larger transverse gyri bilaterally could be explained by more complex patterns of brain connectivity in phoneticians compared with controls.
Greater gyrification could also arise from other aspects of microstructural differences between individuals, such as ones that have been proposed to explain left > right asymmetries in core auditory regions. Here, neuronal tract tracing has revealed that there are interhemispheric differences in the organization of the intrinsic microcircuitry of the primary auditory cortex such that more functionally distinct columnar systems are included per surface unit in the left than in the right area 22 (Galuske et al., 2000).
In conclusion, our results demonstrate that extensive training with speech sound analysis, and maybe even with articulation-based approaches, can lead to plastic changes in Broca's area. This has potential implications for training-related remediation for individuals with phonological difficulties (e.g., developmental dyslexia), and in the domain of second language learning. This is in line with other studies showing, for example, that greater musical expertise starting at a young age is associated with more gray matter in the auditory cortex (Gaser and Schlaug, 2003).
Footnotes
This work was supported by a Marie Curie International Incoming Fellowship from the European Commission awarded to N.G., and by Wellcome Trust grants to C.J.P. and S.K.S. We thank M. Eckert, C. Leonard, R. Amsel, R. Davis, B. Draganski, S. Kloppel, A. Brennan, J. Glensman, D. Bradbury, and the phoneticians who participated in this study for their help.
- Correspondence should be addressed to Narly Golestani, University Medical School, 1 Rue Michel-Servet, CH-1211 Geneva, Switzerland. n.golestani{at}ucl.ac.uk