Abstract
Learning to read proceeds smoothly for most children, yet others struggle to translate verbal language into its written form. Poor readers often have a host of auditory, linguistic, and attention deficits, including abnormal neural representation of speech and inconsistent performance on psychoacoustic tasks. We hypothesize that this constellation of deficits associated with reading disorders arises from the human auditory system failing to respond to sound in a consistent manner, and that this inconsistency impinges upon the ability to relate phonology and orthography during reading. In support of this hypothesis, we show that poor readers have significantly more variable auditory brainstem responses to speech than do good readers, independent of resting neurophysiological noise levels. Thus, neural variability may be an underlying biological contributor to well established behavioral and neural deficits found in poor readers.
Introduction
As infants, we learn language from analyzing the properties of our acoustic environment (Saffran et al., 1996). This process depends on the nervous system's ability to access and respond to sound in a faithful and consistent manner. When input to the auditory system is not coherent from one instance to the next, this can interrupt the ability to link sound to meaning and impede language development (Friel-Patti and Finitzo, 1990). We hypothesize that stable neural representation of sound supports the sound-to-meaning connections crucial for the development of language and reading skills. Children with dyslexia, who often exhibit impairments in auditory-based perceptual skills and auditory neurophysiology, may suffer from variable interactions with sound. Behavioral manifestations of auditory impairments in dyslexia include impaired perception of speech in background noise (Bradlow et al., 2003; Ziegler et al., 2009), speech discrimination (Bogliotti et al., 2008), rhythm perception (Goswami et al., 2011; Huss et al., 2011), temporal processing (Van Ingelghem et al., 2001; Cohen-Mimran and Sapir, 2007), and difficulty directing and switching auditory attention (Hari and Renvall, 2001; Facoetti et al., 2010; Lallier et al., 2010). Neurophysiological abnormalities include delayed and harmonically impoverished responses from the auditory brainstem (Banai et al., 2009; Basu et al., 2010; Hornickel et al., 2011; Hornickel et al., 2012b), reduced subcortical and cortical representation of stimulus differences (Banai et al., 2005; Hornickel et al., 2009; Noordenbos et al., 2012), and atypical cortical activity in the auditory and reading networks (Kraus et al., 1996; Pugh et al., 2001; Wible et al., 2002; Abrams et al., 2006; Cao et al., 2006; Bonte et al., 2007; Gou et al., 2011), which can be predictive of later reading ability when assessed in young children (for review, see Gabrieli, 2009). This constellation of behavioral and neurophysiological deficits combine, resulting in impaired perception of speech that can adversely affect reading acquisition. We hypothesize that an inconsistency of the auditory nervous system in responding to sound may contribute to poor auditory processing and reading impairments in children. To investigate this, we assessed the consistency of auditory brainstem responses across a recording session by measuring brainstem responses to speech syllables from normal hearing children ages 6–13 years with a wide range of reading abilities. We predicted, and found, that children who are poor readers have more variable auditory responses than good readers.
Materials and Methods
Participants
One hundred children (age range, 6–13 years; mean age, 9.7 years; 42 girls) participated. All had normal hearing, passing a hearing screening at a 20 dB hearing level (air conduction) for octaves from 250 to 8000 Hz; normal click-evoked brainstem responses when compared with laboratory norms; no neurological disorders; and a performance IQ >75 (Woerner and Overstreet, 1999). Based on the approximately trimodal distribution of performance on a measure of oral word reading fluency (Test of Oral Word Reading Efficiency, Sight subtest; Torgesen et al., 1999), children were divided into the following three groups: good readers (>120; n = 34); average readers (95–105; n = 34); and poor readers (<90; n = 32). Scores for the average readers straddled the mean, whereas the good and poor readers performed ∼1 SD above and below the mean, respectively. Participants' diagnoses (if applicable), family socioeconomic status (as assessed by mother's education), and attention skills were collected from a parental report via a laboratory internal demographic survey and the ADHD Rating Scale IV (DuPaul et al., 1998). Twenty-eight of the poor readers had an external diagnosis of a reading impairment, and 13 of those children additionally had diagnoses of attention impairments; however, parent ratings of attention abilities were not predictive of the physiologic measures (see greater detail in Results). All procedures were approved by the Internal Review Board at Northwestern University, and parents and children gave their informed consent and assent, respectively.
Stimuli
Stimuli were 170 ms [ba] and [ga] syllables created using a Klatt-based synthesizer (Klatt, 1980). Both stimuli had a 50 ms formant transition in which the first, second, and third formants were dynamic, and stable fundamental frequency, fourth formant, and fifth formant for their duration. See Hornickel et al. (2009) for additional stimulus details. Stimuli were presented at 80 dB SPL in alternating polarities monaurally to the right ear through insert earphones (ER-3, Etymotic Research).
Neurophysiological data recording and reduction
Auditory brainstem responses were collected using a vertical montage (active Cz, forehead ground, ipsilateral earlobe reference) using Ag-AgCl electrodes with impedances <5 kΩ. A total of 6000 artifact-free responses (3000 for each polarity) were obtained for each sound. During the recording, children sat quietly watching a movie and heard the soundtrack in their unoccluded left ear presented at ∼40 dB SPL. Movie watching encouraged compliance during the passive recordings.
Data were pooled across two studies, which used slightly different presentation and collection equipment and methods. For 64 of the participants (18 good readers, 18 average readers, and 28 poor readers), the two stimuli were presented intermixed with six other speech sounds by the Compumedics NeuroScan Stim2 presentation software at a rate of 4.35 stimuli/s (Hz). Responses were collected in Compumedics NeuroScan Acquire, digitized at 20,000 Hz, and off-line bandpass filtered from 70 to 2000 Hz (12 dB/octave roll off). For 36 participants (16 good readers, 16 average readers, and 4 poor readers), stimuli were presented and responses were recorded with SmartEP using the cABR module (Intelligent Hearing Systems). The two stimuli were presented in alternating blocks at a rate of 4.35 Hz, and responses were digitized at 13,333 Hz and on-line bandpass filtered from 50 to 3000 Hz (6 dB/octave roll off).
In both recording systems, responses were epoched into 230 ms windows (40 ms of prestimulus activity), and responses greater than ±35 μV were rejected as artifact.
Data analyses
Response consistency.
Consistency of the auditory brainstem response over time was computed by comparing response subaverages from the first and last halves of the recording (3000 events each). Comparisons were made by correlating the two subaverage waveforms, with r values closer to 1 representing more morphologically coherent subaverages. For the data collected with the Compumedics NeuroScan Acquire system (n = 64), a second analysis technique was used in which an average of the 3000 even-numbered events was compared with an average of the 3000 odd events Correlations between the two analysis techniques were high (formant transition: r = 0.800; vowel: r = 0.882), and we suggest that they are reliably reflecting the same underlying mechanism of variability present throughout the recording. The even–odd analysis was not possible for the data collected on the Intelligent Hearing Systems SmartEP system because of how the data are recorded and exported.
Response consistency calculations were made separately for the portions of the response reflecting the formant transition (7–60 ms) and the steady-state vowel (60–180 ms). For statistical analyses, response consistency was collapsed across the two stimuli to form one metric. All data were Fisher transformed before statistical analyses; in Figures 1 and 2, and Table 1, response consistency values are reported as r values. See Hornickel et al. (2012a) for additional details.
To confirm that we could collapse data from the two recording systems, we compared children from the two studies pooled across the good and average reader groups and found no significant differences between the recording systems in response consistency values for the formant transition (Compumedics NeuroScan system: r = 0.657; Intelligent Hearing Systems system: r = 0.667; t(66) = −0.299, p > 0.766) or vowel portions of the responses (Compumedics NeuroScan system: r = 0.678; Intelligent Hearing Systems system: r = 0.693; t(66) = −0.467, p > 0.642). The poor reader group was excluded from this analysis due to a disproportionate number of children in that group participating in the study using the Compumedics NeuroScan system (28 of 32 children).
Additionally, response consistency was calculated as the correlation between the click-evoked brainstem responses collected at the start and at the end of the recording session (3000 sweeps each). Due to data recording errors, click response files were missing for four children (three good readers and one poor reader).
Baseline neurophysiological activity and response amplitude.
Baseline neurophysiological activity was measured as the root mean square (RMS) amplitude of the response 40 ms time window preceding each sound presentation (−40–0 ms) and for the entire response window (0–180 ms). As with response consistency, data from responses to the two stimuli were averaged together. Because prestimulus amplitude was collected in the absence of the evoking sound, it was thought to reflect resting neurophysiological activity or neurophysiological noise.
Statistical analyses
As IQ was marginally correlated with response consistency across the group, a multivariate analysis of covariance (ANCOVA) was used to determine whether group differences existed for response variability (both time ranges), prestimulus RMS, and response RMS. Follow-up independent t tests were conducted when appropriate, and Mann–Whitney U tests were used for the even/odd comparison due to unequal group sizes. Paired t tests were used to compare response consistency in the formant and vowel regions within individuals. Statistical analyses were performed in SPSS (IBM), and outliers greater than ±2 SDs were corrected to exactly ±2 SDs.
Results
As predicted, there was a main effect of reading group for consistency of the brainstem response to speech in the formant transition (Multivariate ANCOVA, F(2,92) = 6.14, p = 0.003) with a trending effect for the vowel response (F(2,92) = 2.86, p = 0.062), but not for neurophysiological noise (prestimulus amplitude), the overall amplitude of the response, or the consistency of the brainstem response to clicks (F(2,92) = 0.86, p > 0.425; F(2,92) = 2.18, p > 0.115; F(2,92) = 1.74, p > 0.180, respectively). Post hoc tests indicated that poor readers had more variable brainstem responses to speech than did good readers (formant transition: p = 0.002; vowel: p = 0.038) and that they were marginally worse than the average readers in the formant transition portion (formant transition: p = 0.055; Table 1; Figs. 1, 2). Within the poor reader group, variability in the formant transition region was greater than in the vowel portion of the response (t(31) = −2.18, p = 0.037), an effect that was marginal for the average readers (t(33) = −1.88, p = 0.069) and absent for the good readers (t(33) = −1.08, p > 0.250).
The mechanisms responsible for increased response variability include fatigue over the course of the experiment and/or intertrial variability (Fig. 2). For a subset of the subjects (n = 64), response variability was also computed for averages reflecting every other sweep. This approach revealed the same deficits in response consistency for poor (n = 29) relative to good readers (n = 18; formant transition: Mann–Whitney U test, −2.88, p = 0.004; vowel: Mann–Whitney U test, −2.11, p = 0.035) that were seen for the first–last analysis, including marginal deficits for poor readers relative to average readers in the formant transition (n = 18; Mann–Whitney U test, −1.67, p = 0.095). A fatigue-only model would predict no differences between good and poor readers in the consistency of the response over alternating trials. The decreased response consistency among alternating trials observed here suggests that poor readers exhibit trial-by-trial variability of their responses, which impedes how the brain responds to the dynamic elements of speech.
Relationships between reading ability and response consistency do not appear to be mediated by participant factors such as age, socioeconomic status, and attention, as these factors are not correlated with response consistency within the dataset and thus were not included as covariates (formant transition: r = −0.011, p > 0.91; r = −0.114, p > 0.281; r = −0.145, p > 0.16, respectively; vowel: r = −0.010, p > 0.92; r = −0.037, p > 0.728; r = −0.151, p > 0.14, respectively). Additionally, boys and girls within each reading group did not differ on response consistency measures in the formant transition (good: Mann–Whitney U test, −1.33, p > 0.193; average: Mann–Whitney U test, −0.28, p > 0.796; poor: Mann–Whitney U test, −0.56, p > 0.584) or vowel (good: Mann–Whitney U test, −0.02, p > 0.986; average: Mann–Whitney U test, −0.42, p > 0.691; poor: Mann–Whitney U test, −1.69, p > 0.096) portions of the response. While these and other factors are known to influence reading ability and auditory brainstem function, the relationships seen here specifically reflect greater neural variability in response to speech in poor readers and are not directly mediated by participant characteristics.
Discussion
Overall, our results suggest that neural inconsistency in response to sound is a biological mechanism that may contribute to reading impairment. Deficits in auditory task performance may be traced to variability in responses and reaction times that have been documented for children who are poor readers (Roach et al., 2004; Cohen-Mimran and Sapir, 2007; Li et al., 2009; Moore et al., 2010; Van De Voorde et al., 2010). Here we demonstrate that children who are poor readers have more variable neural responses to speech, reflecting an inconsistency in the brain's response to sound from trial-to-trial. While we cannot determine whether auditory brainstem variability was present since birth in our participants, it is likely that initial variability in speech sound representations led to weaker phonological development in the preschool years, and weaker phonological development failed to tune the auditory system to selectively represent meaningful speech sound differences as being important. Children with reading difficulties appear to be more sensitive to perceiving and encoding nonmeaningful speech contrasts than their typically developing peers (Bogliotti et al., 2008; Noordenbos et al., 2012), and our current and prior results suggest that they may have additional difficulty in forming robust internal representations of meaningful sounds (Hornickel et al., 2009, 2011), leading to deficits in perception and ultimately in reading. Weaker stability of auditory brainstem representation would also likely lead to less efficient learning of sound statistics and neural adaption to sound repetition, an effect previously linked to speech-in-noise perception, language ability, reading, and musical aptitude in children (Bonte et al., 2007; Chandrasekaran et al., 2009; Evans et al., 2009; Strait et al., 2010). Therefore, we believe that the neural variability seen in the poor readers in our study reflects a combination of “bottom-up” and “top-down” influences, in which neurophysiological impairments and lack of selective attention to meaningful speech sounds interact in a negative feedback loop, likely on a variety of developmental timescales.
Because auditory brainstem responses are generated by the simultaneous and synchronous firing of neurons throughout auditory system nuclei (for review, see Chandrasekaran and Kraus, 2010), a failure in the synchronicity of activity, greater receptor adaptation or fatigue, and/or slower recovery from firing (i.e., longer refractory periods) would result in broader and delayed response peaks and greater interspike variability (Don et al., 1977; Schaette et al., 2005). Additionally, inconsistencies in the response may be due to increased neural “noise”; but increased response variability in poor relative to good readers reported here is not due to greater resting neural activity. Previous studies suggest children with dyslexia have higher internal noise (Sperling et al., 2005), leading to greater difficulty in excluding external noise (Bradlow et al., 2003; Anderson et al., 2010). The present results reveal that the greater internal noise seen for children with dyslexia is independent of general neurophysiological noise and manifests as increased jitter among responses to sound. Additionally, weaker response consistency in poor readers is seen only in response to complex speech-like sounds and not acoustically simple click stimuli, with the greatest deficits seen in response to the formant transition portion of the syllable. Greater neural variability may therefore contribute to deficits in auditory brainstem function that have been reported previously by negatively impacting the fidelity with which neurons represent the most acoustically complex and linguistically meaningful parts of speech sounds.
Here we provide evidence that poor readers have less stable auditory nervous system function than do good readers, a relationship that is not mediated by participant factors such as age, sex, attention, IQ, and socioeconomic status. Although causality cannot be determined, heightened variability in nervous system function may underlie fluctuations in directed attention and impaired speech understanding due to inconsistent encoding. Our results suggest that good readers profit from a stable neural representation of sound, and that children who have inconsistent neural responses are likely at a disadvantage when learning to read. Encouragingly, response consistency can be improved with auditory training, particularly for children with the most variable responses before training (Hornickel et al., 2012a).
Footnotes
This work was supported by National Institutes of Health Grant R01DC01510, the Hugh Knowles Center of Northwestern University, and an Interdisciplinary Research Fellowship from the Northwestern University Cognitive Science program. We thank Erika Skoe, Trent Nicol, and Samira Anderson for their review of the manuscript; the members of the Auditory Neuroscience Laboratory for their assistance with data collection; and the children and their families for participating.
The authors declare no conflicts of interest, financial or otherwise.
- Correspondence should be addressed to Nina Kraus, 2240 Campus Drive, Evanston, IL 60208. nkraus{at}northwestern.edu