Abstract
Although acquired amusia is a relatively common disorder after stroke, its precise neuroanatomical basis is still unknown. To evaluate which brain regions form the neural substrate for acquired amusia and its recovery, we performed a voxel-based lesion-symptom mapping (VLSM) and morphometry (VBM) study with 77 human stroke subjects. Structural MRIs were acquired at acute and 6 month poststroke stages. Amusia and aphasia were behaviorally assessed at acute and 3 month poststroke stages using the Scale and Rhythm subtests of the Montreal Battery of Evaluation of Amusia (MBEA) and language tests. VLSM analyses indicated that amusia was associated with a lesion area comprising the superior temporal gyrus, Heschl's gyrus, insula, and striatum in the right hemisphere, clearly different from the lesion pattern associated with aphasia. Parametric analyses of MBEA Pitch and Rhythm scores showed extensive lesion overlap in the right striatum, as well as in the right Heschl's gyrus and superior temporal gyrus. Lesions associated with Rhythm scores extended more superiorly and posterolaterally. VBM analysis of volume changes from the acute to the 6 month stage showed a clear decrease in gray matter volume in the right superior and middle temporal gyri in nonrecovered amusic patients compared with nonamusic patients. This increased atrophy was more evident in anterior temporal areas in rhythm amusia and in posterior temporal and temporoparietal areas in pitch amusia. Overall, the results implicate right temporal and subcortical regions as the crucial neural substrate for acquired amusia and highlight the importance of different temporal lobe regions for the recovery of amusia after stroke.
SIGNIFICANCE STATEMENT Lesion studies are essential in uncovering the brain regions causally linked to a given behavior or skill. For music perception ability, previous lesion studies of amusia have been methodologically limited in both spatial accuracy and time domain as well as by small sample sizes, providing coarse and equivocal information about which brain areas underlie amusia. By using longitudinal MRI and behavioral data from a large sample of stroke patients coupled with modern voxel-based analyses methods, we were able provide the first systematic evidence for the causal role of right temporal and striatal areas in music perception. Clinically, these results have important implications for the diagnosis and prognosis of amusia after stroke and for rehabilitation planning.
Introduction
Functional and structural neuroimaging studies in healthy subjects have provided evidence for the large-scale involvement of bilateral temporal, frontal, parietal, and subcortical networks in the neural processing of music (Samson et al., 2010; Zatorre and Salimpoor, 2013; Koelsch, 2014). This evidence is, however, largely correlational, and lesion-based studies are needed to pinpoint which brain regions are crucial and directly related to music processing. Amusia is a neural disorder characterized by severe impairment of music perception and/or production caused by abnormal brain development (congenital amusia) or brain damage (acquired amusia). Deficit in perceiving fine-grained pitch changes is the hallmark symptom of amusia, but also other domains of music, such as rhythm, can be affected (Stewart et al., 2006).
Although acquired amusia is relatively common after stroke (ranging from 35 to 69%; Ayotte et al., 2000; Schuppert et al., 2000; Särkämö et al., 2009), the exploration of its neuroanatomical basis has been limited to symptom-led and lesion-led studies of individual cases or small (N ≤ 20) patient groups. Acquired amusia has been associated with damage to different temporal, frontal, parietal, and subcortical regions (Stewart et al., 2006), but results regarding lesion lateralization (left/right) and type of musical deficit have been mixed, with some studies reporting spectral (e.g., pitch) or temporal (e.g., rhythm) deficits mainly after right hemisphere damage (Kester et al., 1991; Rosslau et al., 2015) or after both left and right hemisphere damage (Liégeois-Chauvel et al., 1998; Ayotte et al., 2000; Schuppert et al., 2000). Overall, constrained by small sample sizes and low spatial accuracy, these studies provide only coarse information about which specific brain areas are crucial for perceiving the different elements of music.
Voxel-based lesion-symptom mapping (VLSM) is a method for analyzing the relationship between focal brain damage and behavior by using the same voxel-based procedures used in analyzing functional neuroimaging data (Bates et al., 2003). Compared with traditional lesion analysis methods, VLSM is more sophisticated and avoids some of the limitations of classic lesion-led or symptom-led approaches (see Materials and Methods). Previously, VLSM has been used with stroke patients to determine lesion–symptom associations related to aphasia (Geva et al., 2012; Henseler et al., 2014; Fridriksson et al., 2015) as well as attention (Grandjean et al., 2008) and motor deficits (Schoch et al., 2006). In the music domain, VLSM has never been used before, whereas voxel-based morphometry (VBM), a method for analyzing gray matter (GM) and white matter (WM) differences between groups or across time (Ashburner and Friston, 2000), has been used in studies of congenital amusia (Hyde et al., 2006, 2007; Mandell et al., 2007; Albouy et al., 2013), musical training (Vaquero et al., 2016), and music-based interventions in stroke patients (Särkämö et al., 2014).
For congenital amusia, VBM and cortical-thickness studies have implicated structural abnormalities in superior temporal and inferior frontal areas as a key mechanism underlying amusia (Hyde et al., 2006, 2007; Mandell et al., 2007; Albouy et al., 2013). However, the results are mixed regarding the lateralization and type (increased/decreased cortical thickness or volume) of these abnormalities, perhaps reflecting the multifaceted nature of congenital amusia (Omigie et al., 2012). Acquired amusia and congenital amusia may stem at least in part from different neural bases, since the latter reflects not just impaired music perception, but a developmental deficit in acquiring musical syntax or a lack of exposure to music (Stewart, 2008). To uncover the brain regions crucial for music perception, systematic and longitudinal research on the neural basis of acquired amusia and its recovery is still needed. Clinically, this information is also important for establishing a more accurate diagnosis and prognosis of amusia and for rehabilitation planning.
Using a large (N = 77) sample of stroke patients, the aim of this exploratory combined VLSM-VBM study was to determine the specific lesion patterns associated with acquired amusia and with changes in GM volume (GMV) and WM volume (WMV) related to amusia recovery in a 6 month follow-up.
Materials and Methods
Subjects and study design.
Subjects (N = 77) were stroke patients enrolled in two music intervention studies in Helsinki and Turku. Fifty patients were recruited during 2004–2006 from the Department of Neurology of the Helsinki University Central Hospital (HUCH) and 27 patients during 2013–2014 from the Department of Neurology of the Turku University Hospital. All of the patients had an MRI-verified acute ischemic stroke or intracerebral hemorrhage in the left or right hemisphere, primarily in middle cerebral artery (MCA) territory, had cognitive or motor deficits, and were right-handed. All patients had normal hearing. Patients with prior neurological or psychiatric disease or substance abuse were not included. Demographic and clinical characteristics of the patients are presented in Table 1. The individual studies were approved by the Ethics Committees of the Hospital District of Helsinki and Uusimaa and of the Hospital District of Southwest Finland. Studies were performed in conformance with the Declaration of Helsinki. All patients signed an informed consent and received standard stroke treatment and rehabilitation. All participants underwent a behavioral assessment and an MRI within 3 weeks of the stroke. Behavioral assessment was repeated during the follow-up at 3 months and MRI was repeated at 6 months poststroke.
Demographic and clinical characteristics of the patients
Behavioral assessment.
Music perception was evaluated by using a shortened version (Särkämö et al., 2009) of the Montreal Battery of Evaluation of Amusia (MBEA; Peretz et al., 2003) at the acute stage (<3 weeks after stroke) and at the 3 month poststroke stage as a part of a larger neuropsychological testing battery. We used the MBEA Scale and Rhythm subtests as indices of musical pitch and rhythm perception, respectively, and their average score as an overall index of music perception (referred to hereafter as MBEA total score; Särkämö et al., 2009). Following our previous study (Särkämö et al., 2009) and the established cutoff values of the original MBEA (Peretz et al., 2003), patients with the MBEA total score of <75% were classified as amusic [amusic, N = 49; nonamusic (NA), N = 28]. The amusic and NA groups were relatively well matched demographically and clinically (Table 1), but the amusic group had less education, larger overall lesion volume, and higher incidence of neglect. Importantly, there was no significant difference on the prestroke musical background of the patients. The Rhythm and Scale subtest scores correlated strongly (r = 0.71) at the acute stage, and therefore only parametric analyses were carried out.
For the longitudinal VBM analyses (using only the data from the largest cohort of 47 patients from Helsinki), we further subdivided the amusic patients to those who showed and who did not show recovery on the MBEA from the acute to the 3 month stage. Amusic patients with >20% increase in MBEA score were classified as recovered amusics (RAs; N = 13; mean, 42%; range, 21–85%), amusic patients with a ≤20% increase in MBEA score were classified as nonrecovered amusics (NRAs; N = 16; mean, 5%; range, −21 to 19%), and patients above the acute stage cutoff were classified as NAs (N = 18). Overall, the three groups differed with respect to education, lesion volume, and neglect, but there were no differences between the RA and NRA patients. The prestroke musical background of the groups was also comparable. Across patients, changes in the Rhythm and Scale scores from acute to 3 months did not correlate significantly (r = 0.25, p = 0.085).
To control the specificity of the lesion patterns to amusia, aphasia was also assessed using the Aphasia Severity Rating Scale (ASRS) from the Boston Diagnostic Aphasia Examination (BDAE; Goodglass and Kaplan, 1983). In addition, the performance of the patients in three language tests was used to derive the clinical ASRS estimate: Verbal Fluency Test [listing words from a semantic category (animals); Lezak et al., 2012], shortened Token Test (De Renzi and Faglioni, 1978), and shortened Boston Naming test (Laine et al., 1993).
MRI data acquisition and preprocessing.
Patients from Helsinki were scanned with a 1.5T Siemens Vision scanner of the HUCH Department of Radiology to obtain high-resolution T1 images (flip angle, 15°; TR = 1900 ms; TE = 3.68 ms; voxel size, 1.0 × 1.0 × 1.0 mm). Patients from Turku were scanned using a 3T Siemens Verio scanner of the Medical Imaging Centre of Southwest Finland. The structural MRI was a T1-weighted MPRAGE image (flip angle, 9°; TR = 2300 ms; TE = 2.98 ms; voxel size, 1.0 × 1.0 × 1.0 mm).
Normalization of MRI images to a standard template is a necessity for precise comparisons between different subjects or groups, which is especially important when dealing with abnormal brain tissue, such as in stroke patients. To achieve optimal normalization with no postregistration lesion shrinkage or out-of-brain distortion, cost function masking (CFM) was applied (Brett et al., 2001). To define the cost function masks, A.J.S. and T.S. created binary masks of the lesioned areas by manually depicting, on a slice-by-slice basis, the precise boundaries of the lesion directly into the T1 image (Ripollés et al., 2012). An overlap image of all patients' lesions and a statistical power map are shown in Figure 1. The MRIcron software package (http://people.cas.sc.edu/rorden/mricron/index.html, RRID:SCR_002403) was used to do the aforementioned lesion tracing (Rorden and Brett, 2000).
Lesion-overlap maps for all subjects. A, Lesion distribution for the whole sample. Warmer areas indicate areas of greater lesion overlap. Accordingly, the color scale ranges from 2 to 26 overlapping subjects. B, Statistical power map for VLSM analyses with 77 patients. The map is thresholded at Z > 4 to view lesion areas with sufficient statistical power to detect an effect of Z ≥ 4.
T1 images and binary lesion masks were then processed using the Statistical Parametric Mapping software (SPM8, Wellcome Department of Cognitive Neurology, University College London, RRID:SCR_007037) under Matlab 8.0.0 (MathWorks; version R2012b, RRID:SCR_001622). Unified segmentation (Ashburner and Friston, 2005) with medium regularization and CFM was applied to T1 images, segmenting them precisely into GM, WM, and CSF probability maps before normalizing them into the Montreal Neurological Institute (MNI) space. This technique has been widely used with stroke patients (Crinion et al., 2007; Andersen et al., 2010; Ripollés et al., 2012). To preserve the original signal strength during the normalization, GM and WM segmented images were modulated. Smoothing of the tissue probability maps was performed by using an isotropic spatial filter (FWHM, 6 mm) to reduce residual interindividual variability. In addition, the lesion masks defined in native space were also registered to MNI space using the normalization parameters obtained during the segmentation process.
Voxel-based lesion symptom mapping.
VLSM enables analysis of the relationship between local brain tissue damage and behavioral data in a voxel-by-voxel basis (Bates et al., 2003). By providing a detailed functional map of lesioned brain regions affecting in a given task, VLSM is more sophisticated than traditional lesion analysis, where either a lesion-led or a symptom-led approach has been used. In lesion-led studies, patients are grouped on the basis of lesions to a particular brain region and then tested for behavioral differences on a given domain compared with a control group (Chao and Knight, 1998). Although this provides information about the functional roles of the lesioned brain areas, the spatial resolution of this method is coarse and often overlooks lesion areas outside the specific region of interest (areas outside the specific injury site are not considered affecting the behavioral performance and valuable information may be lost). In symptom-led studies, the brain region contributing to the cognitive deficit is being deduced from lesion-overlap images of patients sharing the same behavioral deficit (Dronkers, 1996). To classify patients, this method requires a specific cutoff value on a behavioral task, and parametric analysis is not possible. Thus information of varying performance across a broad spectrum outside cutoff values is lost. As distinct from the previous, VLSM does not require patient grouping by lesion site or behavioral data cut-off, although allows binary variables as well. In three-dimensional lesion maps, formed from MRI images, each voxel is analyzed and the presence, or absence, of a lesion is correlated with the behavioral data. Finally, statistical results and maps of areas associated with the behavioral deficits are shown.
VLSM was performed with nonparametric mapping software (Chris Rorden's NPM, version 6, June 2013) using the normalized acute phase lesion maps for all patients. Parametric analyses were performed using the MBEA total (Scale and Rhythm average) score, Rhythm score, Scale score, and the BDAE-ASRS score. Also, the following binary analyses were performed: amusic versus NA; amusic (no aphasia) versus aphasic (no amusia); amusic versus NA (all aphasics excluded); aphasic versus nonaphasic (all amusics excluded); and amusic versus amusic and aphasic. All voxels damaged ≥10% of the patients were included in the statistical analysis (Dovern et al., 2011; Mirman et al., 2015; Timpert et al., 2015). Correction for multiple comparisons was achieved by using a false discovery rate (FDR)-corrected p < 0.05 threshold.
Voxel-based morphometry.
Morphometric analysis was performed using SPM8. Individual smoothed GM and WM images were entered into a second-level analysis using a Group (RA/NRA/NA) × Time (Acute/6 months) mixed between-subject and within-subject ANOVA. Six different Group (RA > NRA, RA > NA, NRA > NA) × Time (Acute >6 months, 6 months > Acute) contrasts were calculated. Unless otherwise noted, results were thresholded at a whole-brain uncorrected p < 0.001 threshold at the voxel level and a FWE-corrected p < 0.05 at the cluster level with a cluster extent of >50 contiguous voxels. Anatomical and cytoarchitectonical areas were identified using the Automated Anatomical Labeling Atlas (Tzourio-Mazoyer et al., 2002) included in the xjView toolbox (http://www.alivelearn.net/xjview8/, RRID:SCR_008642). As VBM uses voxel intensities and different acquisition settings in different MRI scanners can have a great effect on the results, we used only patients from the Helsinki study with acute and 6 month images (N = 47). As stated earlier, patients were from a larger music intervention study (Särkämö et al., 2008), but importantly the intervention and control groups of this previous study did not differ in the recovery from amusia as indexed by change in the MBEA average score (Särkämö et al., 2009). Additionally, the number of NA and amusic patients in the music intervention group was approximately the same (Särkämö et al., 2009). Therefore, the results were not biased in any way by the intervention received.
Results
Lesion patterns associated with amusia and aphasia
To identify lesion areas associated with amusia and aphasia, we first performed parametric VLSM analyses for the MBEA total scores and BDAE-ASRS scores across all patients. Lower MBEA total scores were associated with lesions in the right superior temporal gyrus (STG), middle temporal gyrus (MTG), Heschl's gyrus (HG), putamen, and insula (Fig. 2A). In contrast, lower BDAE-ASRS scores were associated with lesions in the left STG and insula (Fig. 2B). We then performed binary VLSM analyses directly comparing amusic versus NA patients and aphasic versus nonaphasic patients. As in the parametric analyses, amusia was associated with lesions in the right STG, HG, putamen, and insula (Fig. 2C), and aphasia was associated with lesions in the left STG and insula (Fig. 2D).
VLSM results for amusia and aphasia. A, MBEA Scale and Rhythm average score (%). B, BDAE-ASRS score. C, Amusic versus NA patients. D, Aphasic versus nonaphasic patients. Amusia is based on MBEA Scale and Rhythm average score (cutoff, 75%) and aphasia is based on BDAE-ASRS (cutoff, <5). N = 77. Neurological convention is used with MNI coordinates at the bottom left of each slice. All statistical maps are thresholded at a FDR-corrected p < 0.05 threshold. INS, Insula; PUT, putamen.
To control for the comorbidity of amusia and aphasia (41% of amusics had concurrent aphasia, 65% of aphasics had concurrent amusia), we performed the same binary analyses first excluding aphasic patients and then comparing pure amusic versus NA patients. Correspondingly, we excluded amusic patients and compared pure aphasic versus nonaphasic patients. This yielded essentially the same results as the first binary analysis, i.e., lesions in the right STG, MTG, HG, putamen, and insula were linked to amusia (Fig. 3A), and lesions in the left HG and insula were linked to aphasia (Fig. 3B). Similarly, when amusic patients were compared with patients with both amusia and aphasia, lesions in the right STG, MTG, putamen, and insula were again significant (Fig. 3C). We additionally compared purely amusic (no aphasia) to purely aphasic (no amusia) patients, and lesions associated with amusia comprised yet again the right STG, putamen, and insula, and in aphasia the lesion pattern was localized in the left HG and insula (Fig. 3D).
Binary lesion analyses with pure amusics and pure aphasics. A–D, Comparisons between A, amusic versus NA patients (all aphasic patients excluded; N = 47); B, aphasic versus nonaphasic patients (all amusic patients excluded; N = 28); C, amusic versus aphasic and amusic patients (N = 49); and D, amusic (no aphasia) versus aphasic (no amusia; N = 40). Amusia is based on MBEA Scale and Rhythm average score (cutoff, 75%) and aphasia is based on BDAE-ASRS (cutoff, <5). Neurological convention is used with MNI coordinates at the bottom left of each slice. All statistical maps are thresholded at a FDR-corrected p < 0.05 threshold. INS, Insula; PUT, putamen.
To determine whether the lesion patterns were similar for pitch and rhythm perception, we performed separate VLSM analyses for the MBEA Scale and Rhythm scores. Parametric analyses (Fig. 4A,B) indicated a largely overlapping pattern of lesions in the right STG, MTG, HG, insula, and basal ganglia (BG; putamen, caudate, pallidum) for low scores on both Scale and Rhythm subtests (Fig. 4C). Differences in amount of voxels damaged in separate anatomical areas are presented in Table 2. The same anatomical areas were seen in both Scale and Rhythm, but the lesions associated with low Rhythm scores covered a larger proportion of the areas. When the lesion pattern associated with poor performance in the Rhythm subtest was further analyzed by limiting the results only to the most significant areas (t value >4.5), the right putamen and caudate remained as the most significant components.
VLSM results for scale and rhythm amusia. A, MBEA Scale score. B, MBEA Rhythm score. C, An overlap image comparing Rhythm (red) and Scale (blue) subtests' parametric results (N = 77). Neurological convention is used with MNI coordinates at the bottom left of each slice. All statistical maps are thresholded at a FDR-corrected p < 0.05 threshold. GP, Globus pallidum; PUT, putamen.
Anatomical correlates of VLSM results for MBEA Scale and Rhythm scores
GM and WM changes associated with amusia recovery
The NRAs had greater GMV decrease in the right STG and MTG than the NAs, as shown by a Time (Acute >6 months) × Group (NRA > NA) interaction (Fig. 5A). No significant changes in WM were found in the aforementioned interaction. Other contrasts (NRA > RA; RA > NA) did not yield any significant results.
VBM results of 47 Helsinki patients. Time (Acute >6 months) × Group (NRAs > NAs) in GMV or WMV interaction. A, GM NRAs (N = 16) versus NAs (N = 18). B, GM (hot) and WM (cold) nonrecovered rhythm amusics (N = 11) versus nonrhythm amusics (N = 18). C, GM nonrecovered pitch amusics (N = 13) versus nonpitch amusics (N = 17). Bar plots for GMV differences in Acute >6 months. Bar, mean; error bar, SEM. The results are reported at an uncorrected p < 0.001 threshold at the voxel level (extent threshold: k > 50 voxels) using MNI coordinates. Clusters for analysis A and B pass a p < 0.05 FWE-corrected threshold at the cluster level.
When the two MBEA subtests (Rhythm and Scale) were analyzed separately, we found that in the Rhythm subtest the NRAs showed more GMV decrease in anterior temporal areas than the NAs (Fig. 5B). In this comparison, there was also greater WMV decrease in inferior temporal areas in the NRAs than the NAs (Fig. 5B). In the Scale subtest, the NRAs showed greater GMV decrease in the posterior temporal/temporoparietal area than the NAs [interaction: Time (Acute >6 months) × Group (NRA, Scale > NA, Scale; p < 0.01 uncorrected; Fig. 5C]. All significant GMV and WMV decreases are shown in Table 3.
GMV and WMV decreases (6 month–acute)
Discussion
The main finding of the present study was the distinct pattern of lesions in right temporal, insular, and striatal areas underlying acquired amusia after stroke, with additional temporal GMV and WMV decrease observed in patients with persistent amusia. To our best knowledge, this is the first large-scale VLSM-VBM study to systematically explore the neural basis of acquired amusia and its recovery. Importantly, both lesion studies and neuroimaging studies of healthy subjects are needed to better understand the neural systems underlying a given cognitive task (Price and Friston, 2002; Rorden and Karnath, 2004), and our results provide more direct evidence for the role of the aforementioned regions in music perception. Moreover, compared with results from congenital amusics, who are born with a dysfunctional music perception system or are not able to acquire musical syntax during early development, leading to a life-long musical deficit, results from acquired amusia can reveal information about the neuroanatomy of a normally developed and previously functioning music perception system.
Lesions associated with amusia and lower MBEA total scores were all located in the right MCA territory whereas the lesions associated with aphasia and lower BDAE-ASRS scores were located in the left MCA territory. This is in line with previous group and case studies that have linked the right MCA territory (Ayotte et al., 2000; Rosslau et al., 2015) and especially the right temporal lobe lesions (Mazzoni et al., 1993; Kohlmetz et al., 2003; Terao et al., 2006; Särkämö et al., 2010; Hochman and Abrams, 2014) to acquired amusia. Previous VLSM studies on aphasia have also indicated lesions in the left STG and insula (Geva et al., 2012; Henseler et al., 2014; Fridriksson et al., 2015). In the present sample, 41% of the amusic patients had at least minor aphasia, which is similar to the percentages (43–55%) reported in previous amusia case and group studies (Schuppert et al., 2000; Stewart et al., 2006) and suggests that musical and language deficits can often occur together. Although the present sample size prevented us from performing a full 2 × 2 analysis [amusia (no/yes) × aphasia (no/yes)] or separate analyses within left hemisphere damage (LHD) and right hemisphere damage (RHD) subgroups, we were able to control for the comorbidity of these disorders by performing binary analysis on amusics (no aphasia) versus aphasics (no amusia), amusics versus NAs with all aphasics excluded, aphasics versus nonaphasics with all amusics excluded, and amusics versus amusics and aphasics. Importantly, compared with the primary analyses (Fig. 2), these secondary analyses (Fig. 3) yielded essentially the same key areas, thereby providing further support for the dissociation of the neuroanatomical correlates of amusia and aphasia. This finding is important given the longstanding discrepancy between the results of lesion studies and functional neuroimaging studies of healthy subjects, the former reporting cases of selective impairment of music and language (Sidtis and Volpe, 1988; Piccirilli et al., 2000; Mendez, 2001; for review, see Peretz and Coltheart, 2003) and the latter often reporting significant anatomical overlap in responses to music and speech (Maess et al., 2001; Abrams et al., 2011; Rogalsky et al., 2011; for a recent meta-analysis, see LaCroix et al., 2015). Although our results point to a clear lateralization of amusia and aphasia, music processing is a largely bilateral process, and studies with larger samples of LHD patients are still needed to determine how different left hemisphere regions overlap or separate in amusia and aphasia.
Separate VLSM analyses of the parametric MBEA Scale and Rhythm scores revealed a largely overlapping lesion pattern in the aforementioned right hemisphere network. Naturally, stroke lesions follow the vasculature of the brain and thus are not restricted to a single functional brain area. To pursue the most crucial areas in VLSM results, the t-value cutoff can be adjusted to only view the most significant areas in that certain lesion pattern. When this approach was applied to the lesion pattern associated with poor performance in the Rhythm subtest, the right BG remained the most significant element. To our best knowledge, this is the first lesion study to directly implicate the right BG in rhythm perception deficits. In previous studies of healthy subjects, BGs have been linked closely to rhythm analysis (Penhune et al., 1998; Grahn and Brett, 2009; Grahn and Rowe, 2009; Alluri et al., 2012). Recently, professional pianists were also shown to have skill-related plastic GMV changes in the putamen (Vaquero et al., 2016). Another key area associated with rhythm processing is the cerebellum (Penhune et al., 1998; Sakai et al., 1999); however, due to our inclusion criteria, cerebellar involvement in the rhythm amusia could not be evaluated.
In line with previous studies, we found lesions in the right superior temporal cortex to be associated with pitch amusia (Liégeois-Chauvel et al., 1998; Ayotte et al., 2000; Terao et al., 2006; Hochman and Abrams, 2014). Converging evidence from neuroimaging studies of healthy subjects indicates that right superior temporal areas are crucial for pitch and melodic processing (Griffiths et al., 1998; Gutschalk et al., 2002; Patterson et al., 2002; Tramo et al., 2002; Hyde et al., 2008) and that their volume correlates with musical practice (Seither-Preisler et al., 2014) and absolute pitch (Wengenroth et al., 2014). In all of our analyses, right insular lesions were also significantly associated with amusia. Lesions in the insula have previously been linked to pitch amusia after stroke (Terao et al., 2006; Hochman and Abrams, 2014), but may also represent damage to WM fibers in the right frontotemporal pathway, the connectivity of which has been found to be decreased in congenital amusics (Loui et al., 2009; Albouy et al., 2013).
In the longitudinal VBM analysis, persistent amusia was linked to increasing GMV decline in the right STG, where structural abnormalities have also been observed in congenital amusics (Hyde et al., 2007; Albouy et al., 2013). GM anomalies in congenital amusics are associated with decreased connectivity between the right STG and the right frontotemporal areas (Hyde et al., 2006; Albouy et al., 2013). Correspondingly, we observed a WMV decrease in a region of this same pathway in the NRA patients. Interestingly, further GM atrophy was present in anterior temporal areas in rhythm amusics and posterior temporal areas in pitch amusics. Similar within-hemisphere functional segregation with the anterior STG and superior temporal sulcus showing more sensitivity to changes in the temporal domain and posterior regions showing more sensitivity to changes in the spectral domain has been reported both in animals (Bendor and Wang, 2008) and in humans (Warren et al., 2005; Jamison et al., 2006; for a meta-analysis, see Samson et al., 2010). Also in temporal lobectomy patients, anterior temporal lesions have been linked to deficits in the perception of temporal cues (meter, tempo; Kester et al., 1991; Liégeois-Chauvel et al., 1998).
Another feature of spectral versus temporal processing observed in neuroimaging studies is their lateralization: the right auditory cortex (AC) is more specialized in fine-grained spectral processing and the left AC in rapid temporal processing of sounds (Zatorre and Belin, 2001; Jamison et al., 2006). However, no clear lateralization effects for musical rhythm processing have been observed in healthy subjects (Samson et al., 2010; Alluri et al., 2012). Meanwhile, lesion studies have provided mixed results, some reporting rhythm or meter discrimination deficits in RHD patients (Kester et al., 1991; Schuppert et al., 2000; Rosslau et al., 2015) or in both LHD and RHD patients (Liégeois-Chauvel et al., 1998; Ayotte et al., 2000). Unfortunately, our sample was not sufficiently powered to detect lesion patterns or GMV/WMV changes for pitch and rhythm amusia separately in LHD and RHD, and larger studies are still needed.
As for the limitations of our study, the groups in the VLSM and VBM studies showed differences on some demographical (education) and clinical (lesion volume, spatial neglect) factors, which may influence the results of the binary analyses. Lower education level (Cox et al., 2006) and larger lesion size (Pan et al., 2006) are associated with higher severity of the stroke, which may lead to a higher likelihood of amusia. The coincidence of neglect and amusia is expected given their similar lesion locations (Chechlacz et al., 2012). Importantly, the RA and NRA patients did not show any significant differences in demographic data. There was only one patient with any formal musical training in our sample. Evaluating the potential facilitating effect of musical training on amusia recovery would call for a larger sample of patients with different levels of musical background. Also, the possible protective effect of higher education in acquired amusia still needs further investigation. Due to the study population (34 LHD vs 43 RHD patients), a mild imbalance was visible in the lesion-overlap image but importantly this difference did not show up as a statistical limitation (Fig. 1): areas in both hemispheres had sufficient statistical power. Because the patients were initially recruited to intervention studies that required some degree of verbal communication and speech comprehension, patients with global or more severe aphasia were in most cases not able to participate, which explains why some of the classical aphasia regions (e.g., Broca's area) do not show up in our VLSM lesion patterns for aphasia.
In conclusion, the results of the present study show that damage to the right temporal areas, insula, and putamen forms the crucial neural substrate for acquired amusia after stroke. Persistent amusia is associated with further atrophy in the right STG and MTG, locating more anteriorly for rhythm amusia and more posteriorly for pitch amusia. It is possible that lesions associated with acquired amusia damage the right frontotemporal pathway, explaining the WMV decrease, possibly leading to neuronal degeneration and consequent GMV decrease. Overall, compared with older lesion studies, these findings provide a more detailed and fine-grained view of the neuroanatomy of amusia and, conversely, highlight which areas are crucial and necessary for normal music perception. In clinical stroke practice, these results have important implications for more accurate identification of amusia, better prediction of the outcome of amusia recovery, and more precise targeting of music-based rehabilitation methods.
Footnotes
This work was supported by the Academy of Finland program (1257077, 1277693), Turku University Hospital Research Funding, the Finnish Brain Research and Rehabilitation Foundation, the Ella and Georg Ehrnrooth Foundation, the Signe and Ane Gyllenberg Foundation, the Finnish Cultural Foundation, the National Doctoral Programme of Psychology, the Jenny and Antti Wihuri Foundation, the Formación de Profesorado Universitario program (AP2010-4170), and the Generalitat de Catalunya (2014 SGR1413). We thank the staffs of the HUCH Department of Neurology, Turku University Hospital Department of Neurology, and other rehabilitation hospitals in the Hospital District of Southwest Finland and Helsinki metropolitan area for their collaboration. We thank Prof. Mari Tervaniemi, Prof. Riitta Parkkola, Prof. Taina Autti, Dr. Heli Silvennoinen, Jani Saunavaara, PhD, and radiographers Ulla Anttalainen (†), Riku Luoto, Pentti Pölönen, and Tuija Vahtera. We also thank the patient subjects and their families for their participation and effort.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Aleksi Sihvonen, Cognitive Brain Research Unit, Institute of Behavioural Sciences, Siltavuorenpenger 1 B, FI-00014 University of Helsinki, Finland. ajsihv{at}utu.fi