Abstract
Hearing loss is one of the most common complaints in adults over the age of 60 and a major contributor to difficulties in speech comprehension. To examine the effects of hearing ability on the neural processes supporting spoken language processing in humans, we used functional magnetic resonance imaging to monitor brain activity while older adults with age-normal hearing listened to sentences that varied in their linguistic demands. Individual differences in hearing ability predicted the degree of language-driven neural recruitment during auditory sentence comprehension in bilateral superior temporal gyri (including primary auditory cortex), thalamus, and brainstem. In a second experiment, we examined the relationship of hearing ability to cortical structural integrity using voxel-based morphometry, demonstrating a significant linear relationship between hearing ability and gray matter volume in primary auditory cortex. Together, these results suggest that even moderate declines in peripheral auditory acuity lead to a systematic downregulation of neural activity during the processing of higher-level aspects of speech, and may also contribute to loss of gray matter volume in primary auditory cortex. More generally, these findings support a resource-allocation framework in which individual differences in sensory ability help define the degree to which brain regions are recruited in service of a particular task.
Introduction
Speech comprehension relies on a distributed network of brain regions that minimally includes bilateral superior and middle temporal gyri, left prefrontal and premotor cortex, and left inferior temporal cortex. However, at its heart, successfully understanding speech depends on a listener's ability to process a complex acoustic signal. In normal aging, auditory processing is often affected by hearing loss, arising from mechanical and neural disruptions along the pathway between the outer ear and auditory cortex (Morrell et al., 1996; Schneider, 1997). Not surprisingly, these challenges to acoustic analysis impede older adults' understanding of spoken language. In addition to sensory declines, however, adult aging is associated with changes in cognitive function in domains critical for speech comprehension, including working memory and information processing speed (Wingfield and Stine-Morrow, 2000). These findings suggest that age-related changes in speech comprehension are due to a complex combination of perceptual and cognitive factors (Wingfield et al., 2005). To date, however, there is no account linking sensory change to the cortical speech processing network that might explain this interaction.
The neural response to degraded speech suggests that effects of both age and hearing should be evident in older listeners. When young adults with no report of hearing difficulty are presented with acoustically degraded speech, they show changes in the degree and localization of neural recruitment throughout the speech processing network (Davis and Johnsrude, 2003; Obleser et al., 2007; Harris et al., 2009). We hypothesized that challenges to peripheral hearing would produce qualitatively similar effects, resulting in an impoverished acoustic signal reaching auditory cortex. The reduced clarity of this signal should alter the degree of neural activation in response to speech, and over time may also lead to structural transformation in auditory cortex. Although we would predict individual differences in hearing ability to exert this influence regardless of age, the loss of auditory sensitivity and increased variability in hearing associated with normal aging (Humes et al., 2010) suggest that such neural effects would be more apparent in older listeners. Behavioral data support this expectation: older adults with poorer hearing perform worse on speech comprehension tasks not only compared to older adults with better hearing (Stewart and Wingfield, 2009; Adank and Janse, 2010; Tun et al., 2010), but also compared to young adults with equivalently poor hearing (Wingfield et al., 2006).
Here, we investigate whether normal variations in hearing ability affect the structure or function of neural systems supporting speech comprehension in older adults. We report the results of two studies: a functional magnetic resonance imaging (fMRI) study examining effects of hearing ability on neural activity, and a voxel-based morphometry (VBM) study examining the relationship between hearing ability and cortical brain volume. We hypothesize that reduced hearing ability will be associated with differences in language-related (and not merely acoustic) processing, reflecting an interaction of perceptual and cognitive factors at the neural level. Although critically important for speech perception, these results also speak to the broader issue of how variability in sensory processing is expressed at multiple levels of cognitive function.
Materials and Methods
Participants.
Participants in the fMRI study were 16 healthy right-handed adults aged 60–77 years [mean (M) = 64.9, SD = 4.9; 9 female, 7 male], all with normal self-reported hearing. Audiograms based on pure-tone audiometry for these participants are shown in Figure 1A. We summarized hearing ability using a pure tone average (PTA; the average threshold in each participant's better ear for 1, 2, and 4 kHz). PTAs in participants' better ears ranged from 10 to 33.3 dB hearing level (HL) (M = 18.4, SD = 6.9). As PTAs ≤25 dB HL are considered clinically normal for speech (Hall and Mueller, 1997), all but three listeners had clinically normal hearing, and these three had what would typically be considered a mild hearing loss. Hearing did not differ across ear (left PTA mean = 21.3 dB HL, SD = 6.98; right PTA mean = 20.1 dB HL, SD = 8.60), t(15) = 0.87, p = 0.40, nor did better-ear PTA correlate with age (Pearson r = 0.10, p = 0.70).
Participants in the VBM study were 25 adults (16 of whom participated in the fMRI study) between the ages of 60–77 years (M = 66.3 years, SD = 5.5 years; 12 female, 13 male). All were right handed and had good self-reported health and hearing. Audiograms for the VBM participants are shown in Figure 1B. PTAs in participants' better ears ranged from 10 to 38.3 dB HL (M = 21.3, SD = 7.8); again, this placed most listeners in the clinically normal hearing range, with a few having mild to moderate hearing loss. Hearing in this group was largely symmetrical (left PTA mean = 24.3 dB HL, SD = 8.47; right PTA mean = 24.1 dB HL, SD = 10.91) and did not differ across ear, t(24) = 0.12, p = 0.91, nor did better-ear PTA correlate with age (Pearson r = 0.28, p = 0.17).
Written informed consent was obtained from all participants according to a protocol approved by the University of Pennsylvania Institutional Review Board.
Materials.
Stimuli were 240 six-word sentences that contained either a subject-relative (e.g., “Boys that help girls are nice”) or syntactically more complex object-relative (e.g., “Boys that girls help are nice”) center-embedded clause (Peelle et al., 2010b). Because this difference hinged solely on word order, sentences were equated for lexical factors across syntactic condition. Sentences were presented at three speech rates (mean duration = 1.1 s, SD = 0.22); given the lack of a rate effect in a previous analysis (Peelle et al., 2010b), in the current analysis we have collapsed across speech rate.
Procedure.
Each participant in the fMRI study heard all 240 sentences, half with subject-relative and half with object-relative structure. Following each sentence, listeners pressed a button to indicate whether the person performing the action in the sentence was a male or a female. Hearing ability was assessed using pure-tone thresholds, measured on a Maico MA39 audiometer (Maico Diagnostics) using standard audiometric practices.
MRI data acquisition and processing.
MRI data were acquired on a Siemens Trio scanner (Siemens Medical Systems) at 3 T, beginning with acquisition of a T1-weighted structural volume using an MPRAGE sequence [repetition time (TR) = 1620 ms, echo time (TE) = 3 ms, flip angle = 15°, 1 mm slice thickness, 192 × 256 matrix, voxel size = 0.98 × 0.98 × 1 mm]. Blood oxygenation level-dependent functional MRI images were acquired with 3 mm isotropic voxels, flip angle = 15°, TR = 8 s, acquisition time (TA) = 3 s, TEeff = 30 ms, and a 64 × 64 matrix. We used a sparse imaging design in which the TR was longer than the TA to allow presentation of sentences with minimal acoustic scanner noise.
Analysis of the fMRI data was performed using SPM5 software (Wellcome Trust Centre for Neuroimaging). For each participant, images were realigned to the first image, coregistered to the structural image, and normalized to Montreal Neurological Institute (MNI) space using unified segmentation (Ashburner and Friston, 2005), including resampling to 2 × 2 × 2 mm voxels, and spatially smoothed with a 10 mm full-width at half-maximum (FWHM) Gaussian kernel. Only sentences that resulted in a correct behavioral response were included in the analysis. Each event onset was convolved with a canonical hemodynamic response function to produce a predicted neural response; additional regressors were included to account for session effects. The modeled conditions for the effect of syntactic complexity were object-relative > subject-relative sentences. To assess the degree to which individual hearing ability predicted language-driven activity, a one-sample t test was used, with hearing level as a mean-centered covariate. For this analysis, a single contrast image reflecting the effect of syntactic complexity was entered for each participant, as well as their PTA. We then tested the hypothesis that hearing ability predicted the magnitude of this effect by examining the parameter estimate reflecting the PTA covariate relative to the error across participants.
For the VBM study, data were analyzed with SPM8. Before normalization using the diffeomorphic DARTEL approach, each individual's structural image was coregistered to an MNI-space average template distributed with SPM using normalized mutual information and segmented into tissue classes using unified segmentation as implemented in SPM8, making use of tissue probability maps for gray matter, white matter, and CSF. Segmented images were normalized to MNI space (preserving volume by Jacobian modulation) and smoothed using a 10 mm FWHM Gaussian kernel.
Statistical maps for the MRI analyses were rendered on 3D MNI-space templates from SPM8 and overlaid on slices from an MNI-space template included with MRIcron (Rorden and Brett, 2000).
Results
Participants performed well on the task, showing the anticipated advantage for subject-relative sentences (mean accuracy = 93%, SD = 5.1) compared to the more difficult object-relative sentences (mean accuracy = 80%, SD = 10.5). We examined participants' behavioral data to see whether there was any evidence for a relationship between comprehension performance and hearing ability for the more complex object-relative sentences by running bivariate Pearson correlations. There was no significant relationship between hearing ability and either accuracy (Pearson r = 0.47, n.s.) or response times to correct responses (Pearson r = −0.17, n.s.). Thus, behaviorally, mild hearing loss did not appear to affect participants' performance.
The primary comparison of interest was to see whether hearing ability significantly predicted neural responses related to linguistic complexity. We operationally defined language-driven activity as increased activity for grammatically complex object-relative sentences compared to subject-relative sentences. We then conducted a whole-brain correlation analysis, entering each participant's hearing ability (PTA) as a mean-centered covariate against estimates of language-driven neural activity. We used a cluster-defining threshold of p < 0.005 (uncorrected), corrected at p < 0.05 across the whole brain using cluster extent (Worsley et al., 1992) (FWHM = 14.6 × 14.2 × 11.9 mm, resel count = 510.1). There were several regions in which participants with poorer hearing showed reduced linguistic activation, shown in Figure 2 and listed in Table 1. They were largely bilateral and included superior temporal gyri, thalamus, and brainstem. There were no regions in which listeners with worse hearing showed increased activity related to linguistic complexity.
To assess the degree to which these differences included primary auditory regions, we used the SPM anatomy toolbox (Eickhoff et al., 2005) to delineate regions TE1.0 and TE1.1 of bilateral primary auditory cortex (Morosan et al., 2001) as regions of interest: voxels included had at least an 80% chance of belonging to these subdivisions based on cytoarchitectonic characteristics of postmortem samples. These auditory regions are outlined in blue in Figure 2B, and clearly encompass areas of significant hearing-related differences. To ensure that these effects were not driven by age differences, we extracted the mean parameter estimates from these primary auditory cortex regions of interest and performed a multiple regression analysis in which we included both age and PTA as predictors. For left auditory cortex, the overall model fit was significant, F(2,15) = 5.6, p = 0.017, explaining 47% of the total variance. Age did not show a significant contribution (β = −0.14, SE = 0.14, standardized β = −0.20, t = 0.99, p = 0.34), but hearing did (β = −0.31, SE = 0.09, standardized β = −0.63, t = 3.08, p < 0.01). A similar pattern held in right auditory cortex, with the overall model fit also being significant, F(2,15) = 5.5, p = 0.019, explaining 46% of the variance. Again, age did not contribute significantly (β = −0.14, SE = 0.17, standardized β = −0.17, t = 0.82, p = 0.43), but PTA did (β = −0.37, SE = 0.12, standardized β = −0.64, t = 3.09, p < 0.01).
To examine whether the observed changes in thalamic activity might impact frontal and temporal components of the speech processing network, we characterized the cortical connectivity of the thalamic cluster using the Oxford Thalamic Connectivity Atlas (Behrens et al., 2003), which provides probabilistic connectivity values based on diffusion tractography. Figure 2C shows the significant thalamic activation outlined in white overlaid with connectivity probabilities for each of seven cortical regions; the strongest cortical connectivity is to prefrontal cortex, followed by premotor and temporal cortices.
These initial results demonstrate a coupling between hearing ability and neural activity, suggesting more generally a link between sensory ability and cortical structural integrity—that is, given widespread age-related gray matter loss (Good et al., 2001), these functional changes may reflect an acceleration of underlying cortical atrophy. To explicitly test this possibility, we performed a voxel-based morphometric analysis of gray matter volume in 25 older adults. For all participants, we calculated PTA values for their better ear, and correlated these values with estimates of regional gray matter volume obtained from segmented structural MRI images. We focused our analysis on the same cytoarchitectonically defined probabilistic primary auditory regions of interest we used in interpreting the fMRI data. As control regions, we used left and right motor cortex (Brodmann area 4a) (Geyer et al., 1996). For all regions, we constructed a statistical model that included a second-order polynomial expansion of PTA, total gray matter, total intracranial volume (TIV), and a constant term. For display purposes, we plotted the gray matter residuals, having removed effects of total gray matter and total intracranial volume; statistics displayed are based on the full model, but just for the linear term of PTA (the quadratic component had a negligible contribution). As shown in Figure 3, this analysis showed a significant relationship between hearing ability and gray matter volume in right auditory cortex, and a nonsignificant trend in left auditory cortex. We found no relationship between hearing ability and gray matter volume in motor cortex.
Finally, we performed an additional exploratory whole-brain analysis using the same approach to identify any other regions showing a similar relationship located outside auditory cortex. For this analysis, we used a voxelwise threshold of p < 0.001 (uncorrected), and included total gray matter and TIV as covariates. There were no regions outside of right superior temporal gyrus that had a significant negative relationship with peripheral hearing ability. This effect remained after including age as a covariate in the design matrix.
Discussion
The hierarchy of cortical regions that support speech processing begins with primary auditory cortex before branching into multiple parallel processing pathways (Davis and Johnsrude, 2003; Hickok and Poeppel, 2007; Rauschecker and Scott, 2009). Here we demonstrate that peripheral hearing acuity predicted both the neural response to speech and gray matter volume in these primary auditory regions. These findings provide some of the first measures of neural activity to support behavioral evidence of perceptual-cognitive interactions in connected speech processing, and provide a potential explanation for how they may contribute to age-related changes in spoken language comprehension.
Although it is well accepted that age-related hearing loss affects auditory processing, there has been increasing agreement that these sensory declines increase listening effort in the course of successful speech comprehension. Behaviorally, this increase in effort can exert influence on a variety of cognitive operations (Rabbitt, 1991; Stewart and Wingfield, 2009; Miller and Wingfield, 2010; Piquado et al., 2010; Tun et al., 2010; Heinrich and Schneider, 2011; Zekveld et al., 2011). Our current observation of changes in neural activity in core auditory processing areas makes a mechanistic neurobiological account of these interactions tractable. Importantly, we observed changes in activation that were associated specifically with grammatically complex sentences relative to simpler sentences. Thus, modulations of neural activity related to sensory loss appear to impact not simply the ability to process the perceptual aspects of the speech signal, but the resources required to perform higher-level cognitive operations. This is consistent with the view that, during speech comprehension, cognitive and neural resources are dynamically assigned based on acoustic and linguistic task demands and the individual abilities of listeners (Peelle et al., 2010b; Tyler et al., 2010; Obleser et al., 2011).
It is worth noting that increased listening effort has often been associated with increased activity in prefrontal and premotor cortices (Davis and Johnsrude, 2003, 2007; Peelle et al., 2010a), areas with high probabilistic connections to regions of the thalamus in which we found hearing-related changes in neural activity. The thalamus is essential for transfer of ascending auditory information to prefrontal and premotor areas, playing a crucial function in the network of regions supporting speech comprehension. In addition, poorer-hearing participants also showed less syntax-related activity in the left putamen, which in concert with frontal regions also plays an important role in prediction and timing for auditory tasks (Grahn and Rowe, 2009; Kotz and Schwartze, 2010). Thus, although we observed direct changes in neural activity primarily in auditory regions, it is likely these changes reverberate throughout a much broader network.
Our finding that gray matter density in primary auditory areas is predicted by peripheral hearing ability suggests a link between sensory stimulation and cortical volume. In previous studies of single word recognition, decreases in gray matter volume in primary auditory cortex have been linked to both behavioral performance and neural activity elsewhere in the speech comprehension system (Eckert et al., 2008; Harris et al., 2009). Although we cannot make a definitive conclusion regarding the nature of this linkage, there is some support for interpreting the direction of causality in this relationship. Reorganization of sensory cortex is common when sensory input is altered (Rauschecker, 1995), including tonotopic reorganization of auditory cortex following peripheral hearing loss (Schwaber et al., 1993; Cheung et al., 2009). Thus, we think it plausible that changes in older adults' peripheral hearing ability had a causal role in reducing gray matter volume in auditory cortex.
To the extent that these functional and structural changes are associated with perceptual difficulty in processing speech sounds, these interactions fit well within a broader resource-allocation framework of speech comprehension. That is, the neural activity required to successfully navigate a cognitive task is a function of both task demands and individual cognitive ability. Perceptual demands can be increased externally (e.g., through acoustic properties of the stimuli) or internally (e.g., through biologically mediated sensory decline). From this vantage point, it may be less constructive to think about a single network of brain regions supporting any given task. Rather, the degree of activity—and in some instances, the set of regions active—form a family of networks that can be dynamically called upon as a function of both task demand and individual ability. Although in cases of more extreme individual differences (as when comparing young and older adults) these patterns may appear to be easily dichotomized, ultimately this may not prove to be the best way to characterize the underlying interindividual variability.
Taken together, our data provide a mechanism whereby changes in peripheral hearing impact cortical speech processing networks, and suggest that sensory acuity has cascading consequences for the neural processes supporting both perceptual and higher-level cognitive functions. Of particular note is the fact that these variations are seen within listeners with age-normal hearing ability; in cases of greater hearing loss, we would expect these findings to be exaggerated. Finally, our results also reinforce the modulatory influence of individual differences in sensory ability on brain activity, and thus the advantage of incorporating these sources of variability in models of speech and language processing.
Footnotes
This research was supported by the National Institutes of Health (NS54575, AG019714, AG17586, AG15116, AG32953, NS53488, and NS44266). V.T. is funded by a National Science Foundation Graduate Fellowship. We also gratefully acknowledge support from the W. M. Keck Foundation. We thank Michael Bonner for helpful comments on this manuscript, the radiographers at the Hospital of the University of Pennsylvania for their assistance with data collection, and our volunteers for their participation.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Jonathan Peelle, Department of Neurology, University of Pennsylvania, 3400 Spruce Street, Philadelphia, PA 19104, peelle{at}gmail.com