Abstract
We detected and mapped a dynamically spreading wave of gray matter loss in the brains of patients with Alzheimer's disease (AD). The loss pattern was visualized in four dimensions as it spread over time from temporal and limbic cortices into frontal and occipital brain regions, sparing sensorimotor cortices. The shifting deficits were asymmetric (left hemisphere > right hemisphere) and correlated with progressively declining cognitive status (p< 0.0006). Novel brain mapping methods allowed us to visualize dynamic patterns of atrophy in 52 high-resolution magnetic resonance image scans of 12 patients with AD (age 68.4 ± 1.9 years) and 14 elderly matched controls (age 71.4 ± 0.9 years) scanned longitudinally (two scans; interscan interval 2.1 ± 0.4 years). A cortical pattern matching technique encoded changes in brain shape and tissue distribution across subjects and time. Cortical atrophy occurred in a well defined sequence as the disease progressed, mirroring the sequence of neurofibrillary tangle accumulation observed in cross sections at autopsy. Advancing deficits were visualized as dynamic maps that change over time. Frontal regions, spared early in the disease, showed pervasive deficits later (>15% loss). The maps distinguished different phases of AD and differentiated AD from normal aging. Local gray matter loss rates (5.3 ± 2.3% per year in AD v 0.9 ± 0.9% per year in controls) were faster in the left hemisphere (p < 0.029) than the right. Transient barriers to disease progression appeared at limbic/frontal boundaries. This degenerative sequence, observed in vivo as it developed, provides the first quantitative, dynamic visualization of cortical atrophic rates in normal elderly populations and in those with dementia.
Introduction
Strategies to chart how Alzheimer's disease (AD) spreads dynamically in the brain are vital for understanding how it progresses and for mapping treatment effects. A dynamic map of AD could uncover the path of degeneration for different brain systems and define a powerful biological marker for clinical trials. Such brain maps would visualize the order in which cortical systems are affected in living populations and identify how changes correlate with cognitive decline.
This study used serial brain imaging and cortical mapping to visualize how AD spreads spatially and temporally in the brain. It reveals anatomically selective, stage-specific deficits. It also allows us to visualize the sequence in which deficits appear. It also provides the first detailed, three-dimensional (3D) quantitative maps of cortical gray matter and whole brain changes over time in any disease.
In early AD, intraneuronal filamentous deposits, or neurofibrillary tangles (NFTs), accumulate within neurons. These deposits are composed of hyperphosphorylated tau-protein (Hulstaert et al., 1999). This cellular pathology disrupts axonal transport and induces widespread metabolic decline. The resulting neuronal loss is observable as gross atrophy with magnetic resonance imaging (MRI). Temporoparietal association cortices and the medial temporal lobe are severely atrophied in AD (DeCarli, 2000), with the entorhinal cortex and hippocampus the earliest and most severely affected (Janke et al., 2001; Thompson et al., 2001b). Profound atrophy is also observed in the posterior cingulate gyrus and adjacent precuneus. Specific atrophic patterns differentiate AD from frontotemporal, semantic, and Lewy body dementias (O'Brien et al., 2001; Studholme et al., 2001). Patients with AD show minimal primary visual, sensorimotor, and frontal atrophy until late in the disease. Before symptom onset in AD, and also in those at genetic risk, gray matter loss is detectable in the anterior hippocampal/amygdala region (Lehtovirta et al., 2000; Reiman et al., 2001).
Of particular interest is the temporal sequence of deficits in AD because they spread across the cortex. Braak and Braak (1997)noted at autopsy that NFT distribution was initially restricted to entorhinal cortices, spreading to higher-order temporoparietal association cortices, then to frontal, and ultimately, to primary sensory and visual areas (Delacourte et al., 1999; Price and Morris, 1999).
We set out to determine whether a similar wave of cortical atrophy could be mapped in patients while they were alive. The goal was to visualize the transit of the disease within cortex and relate it to cognitive decline. Recently, we used dynamic brain mapping to uncover the trajectory of cortical change as schizophrenia develops in the teenage brain (Thompson et al., 2001d) and as normal adolescents lose gray matter (Sowell et al., 2001). In dementia, we expected a similarly selective profile of brain changes, instead emerging from temporal cortices and sparing primary cortices until late in the disease. We also hypothesized first that frontal and association cortices would be progressively enveloped as cognitive function declined, and second, that the pathology would evolve differently in each brain hemisphere, with the left hemisphere engulfed earlier and more severely than the right.
Materials and Methods
Subjects
Over a time interval of 3 years, we used longitudinal MRI scanning (two scans, baseline, and follow-up) and cognitive testing to study a group of subjects with AD as their disease progressed. A second, demographically matched group of healthy, elderly control subjects was also imaged longitudinally (two scans) as they aged. The AD subject group consisted of 12 patients who were scanned twice (six men, six women; mean age ± SE at first scan 68.4 ± 1.9 years; at final scan, 69.8 ± 2.0 years; mean interval between first and last scans, 1.5 ± 0.3 years). These patients were diagnosed with AD using criteria listed in the Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Association, 2000), and they had a typical clinical presentation. They also fulfilled the standards for probable AD according to criteria established by the National Institute of Neurological Disorders and Stroke and the Alzheimer's Disease and Related Disorders Association (McKhann et al., 1984). All patients were reassessed at intervals of 3–6 months with a full clinical evaluation, and their cognitive status was evaluated using the Mini-Mental State Exam (MMSE) (Folstein et al., 1975). During the study, subjects' cognitive status declined rapidly from an initial MMSE score of 17.7 ± 1.9 to 12.9 ± 2.5 (mean change 5.5 ± 1.9 points; p < 0.00054, one-tailed t test; maximum score 30). This corresponds approximately to a transition from moderate to severe AD. At the same time, a group of 14 healthy elderly controls was scanned twice (seven men, seven women; age at first scan 71.4 ± 0.9 years; age at final scan 74.0 ± 0.9 years; mean interval between first and last scans 2.6 ± 0.3 years). During the study, the control subjects' cognitive status remained stable. Their MMSE score was 29.5 ± 0.3 at both baseline and follow-up, with no change.
Exclusion criteria for the two groups included the presence of white matter lesions on T2-weighted MRI scans, preexisting psychiatric illness or head injury, and history of substance abuse or depression as measured by the Geriatric Depression Scale (Yesavage, 1988). All subjects were right-handed. Informed consent was obtained from all participants before scanning. AD and control samples consisted of subjects who were entirely different from the group of patients with AD and controls from whom data were reported in our previous cross-sectional study (Thompson et al., 2001b). Three disease stages were effectively mapped in this new, longitudinal sample. These broadly correspond to healthy aging and the transition from moderate to severe AD (with MMSE scores of 29, 18, and 13, respectively).
MRI scanning
Patients and controls were scanned identically with the same scanning protocol over time. Each subject underwent two MRI scans separated by >1 year. Images were acquired on a 2 Tesla Bruker Medspec S200 whole-body scanner (Bruker Medical, Ettingen, Germany) at the Centre for Magnetic Resonance (University of Queensland, Australia). A linearly polarized birdcage head-coil (Bruker Medical) was used for signal reception. Three-dimensional T1-weighted images were acquired with an inversion recovery segmented 3D gradient echo sequence (known as magnetization prepared rapid gradient echo; MP-RAGE) to resolve anatomy at high resolution. Acquisition parameters were as follows: inversion time (TI)/repetition time (TR)/echo time (TE) = 850/1000/8.3 msec; flip angle = 20°; 32 phase-encoding steps per segment; a 23 cm field of view. Images were acquired in an oblique plane perpendicular to the long axis of the hippocampus (Jack et al., 1998), with an acquisition matrix of 256 × 256 × 96, and zero-filled to 2563.
Image processing and analysis
Serial images acquired across the 2 year time span were processed as follows. Briefly, for each scan, a radio frequency bias field correction algorithm eliminated intensity drifts attributable to scanner field inhomogeneity, using a histogram spline sharpening method (Sled et al., 1998). Images were then normalized by transforming them to a standard 3D stereotaxic space in a two-step process that retained information on brain change over time. First, each initial T1-weighted scan was aligned linearly (registered) to a standard brain imaging template (International Consortium for Brain Mapping nonlinear average brain template, ICBM152) (Evans et al., 1994) with automated image registration software (Collins et al., 1994). Follow-up scans were then rigidly aligned to the baseline scan from the same subject (Collins et al., 1994). These mutually registered scans for each patient were then linearly mapped into ICBM space by combining the intrapatient transform with the previously computed transform to stereotaxic space.
Tissue maps
To equalize image intensities across subjects, registered scans were matched by histogram. A supervised tissue classifier generated detailed maps of gray matter, white matter, and CSF. Briefly, 120 samples of each tissue class were interactively tagged to compute the parameters of a Gaussian mixture distribution that reflects statistical variability in the intensity of each tissue type (Zijdenbos and Dawant, 1994). A nearest-neighbor tissue classifier assigned each image voxel to a particular tissue class (gray, white, or CSF) or to a background class (representing extracerebral voxels in the image). The inter-rater and intra-rater reliability of this protocol, and its robustness to changes in image acquisition parameters, have been described previously (Sowell et al., 1999). Gray and white matter maps were retained for subsequent analysis.
Three-dimensional cortical maps
A surface model of the cortex was automatically extracted (MacDonald et al., 2000) for each subject and time point as described in our previous studies (Thompson et al., 2001c). This software creates a mesh-like surface that is continuously deformed to fit a cortical surface tissue threshold intensity value from the brain volume. The software was modified to permit high-resolution extraction of both the lateral and medial hemispheric surfaces, including the cingulate, primary visual cortex, and corpus callosum (Fig.1). The intensity threshold was defined as the MRI signal value that best differentiated cortical CSF on the outer surface of the brain from the underlying cortical gray matter.
Cortical pattern matching
An image analysis technique known as cortical pattern matching was used to better localize disease effects on cortical anatomy over time and to increase the power to detect systematic changes. The approach models and controls for gyral pattern variations across subjects, and it visualizes average maps of cortical change in a population and encodes its variance and any group differences.
On the basis of cortical models for each subject at different time points, a 3D deformation vector field was computed measuring shape change in the brain surface across the time interval (Thompson et al., 2001c). This directly accommodates any brain shape changes when comparing cortical gray matter within a subject across time. The deformation reconfigures the earlier cortex into the shape of the later one, matching the entire gyral patterns and cortical surfaces in the pair of 3D image sets.
Matching cortical anatomy across subjects. A second deformation was computed that matches gyral patterns across all the subjects in the study as well as the deformation that matches anatomy over time. This allows data to be averaged and compared across corresponding cortical regions [the algorithm for this is described inThompson et al. (2000a,b)]. As shown in Figure 1, a large set of 72 sulcal landmarks per brain is used to constrain the mapping of one cortex onto another. This associates corresponding cortical regions across subjects. An image analyst (K.M.H.) who was blind to subject diagnosis, gender, and age traced each of 30 sulci in each hemisphere on the surface rendering of each subject's brain (13 on the medial surface, 17 on the lateral surface). On the lateral brain surface these included the Sylvian fissure; central, precentral, and postcentral sulci; superior temporal sulcus (STS) main body, STS ascending branch, and STS posterior branch; primary and secondary intermediate sulci; and inferior temporal, superior and inferior frontal, intraparietal, transverse occipital, olfactory, occipitotemporal, and collateral sulci. On the medial surface these included the callosal sulcus, the inferior callosal outline, the paracentral sulcus, anterior and posterior cingulate sulci, the outer segment of a double parallel cingulate sulcus (where present) (Ono et al., 1990), the superior and inferior rostral sulci, the parieto-occipital sulcus, the anterior and posterior calcarine sulci, and the subparietal sulcus. In addition to contouring the major sulci, a set of six midline landmark curves bordering the longitudinal fissure was outlined in each hemisphere to establish hemispheric gyral limits. Spatially registered gray scale image volumes in coronal, axial, and sagittal planes were available simultaneously to help disambiguate brain anatomy. Landmarks were defined according to a detailed anatomical protocol (Steinmetz et al., 1990; Leonard, 1996; Sowell et al., 2001; Hayashi et al., 2002) based on the Atlas of the Cerebral Sulci (Ono et al., 1990). This protocol is available on the internet (Hayashi et al., 2002) and has known inter-rater and intra-rater reliability, as reported previously (Sowell et al., 2001).
Average cortical model construction. To create an average 3D cortical model for each group of subjects (e.g., elderly normal or AD), the following steps were used (Figs. 1,2) [for details, see Thompson et al. (2001)]. For each subject, all sulcal/gyral landmarks (Fig.1b,c) were flattened into a 2D plane along with the cortical model (Fig. 1e). Technical issues of minimal distortion and the computation of these flat mappings are addressed inThompson et al. (2001c). A color code (Fig. 1f) retains the original 3D position of each cortical point as a red, green, and blue color triplet plotted in the 2D parameter space (Fig.1f). Once data are in this flat space, sulcal features are aligned across subjects with a warping technique. To illustrate this process, Figure 2a shows part of the flat map of one subject and its corresponding color code (Fig.2c). An average set of sulcal curves, derived from many subjects, is overlaid on the flat map (Fig. 2a). These maps are warped (Fig. 2b) so that individual sulcal features in them are driven into correspondence with the average set of sulcal curves. Figure 2d shows how this warping process affects a regular grid ruled over an individual color-coded flat map. The warped color images (Fig. 2d) from many subjects are averaged together pixel by pixel and decoded; it can be shown mathematically that this image averaging creates a crisp average cortical model with gyral features in their mean anatomic locations (Fig.2f) (Thompson and Toga, 1997; Thompson et al., 2000a,b). The point of this procedure is that the computational matching of sulci avoids destructive cancellation of features (Fischl et al., 1999). This cancellation happens when images are directly averaged together (Fig. 2e). Common features, reinforced in the group average, appear in their group mean anatomic locations (Fig.2f). It is important to note that local measures of gray matter density (see below) (Fig. 3) may be convected along with these warps and plotted on the average cortex, before statistical analysis. Confounding effects of cross-subject anatomical variance are greatly reduced, thereby empowering detection of disease effects.
Averaging cortical gray matter maps. Given that the deformation maps associate cortical locations with the same relation to the primary folding pattern across subjects, a local measurement of gray matter density was made in each subject and averaged across equivalent cortical locations. To quantify local gray matter, we used a measure termed gray matter density, which has been used in many previous studies to compare the spatial distribution of gray matter across subjects (Wright et al., 1995; Bullmore et al., 1999; Sowell et al., 1999; Ashburner and Friston, 2000; Mummery et al., 2000; Rombouts et al., 2000; Baron et al., 2001; Good et al., 2001; Thompson et al., 2001a,b). This measures the proportion of gray matter in a small region of fixed radius (15 mm) around each cortical point. Given the large anatomic variability in some cortical regions, high-dimensional elastic matching of cortical patterns (Thompson et al., 2000a,b,2001a) was used to associate measures of gray matter density from homologous cortical regions first across time and then also across subjects (Fig. 3). One advantage of cortical matching is that it localizes deficits relative to gyral landmarks; it also averages data from corresponding gyri, which would be impossible if data were mapped only linearly into stereotaxic space. Annualized 4D maps of gray matter loss rates within each subject were elastically realigned for averaging and comparison across diagnostic groups (Fig. 4).
Mapping gray matter loss
Statistical maps were generated indicating locally the degree to which gray matter loss rates were statistically linked with diagnosis and cognitive performance (MMSE scores) (Fig.5). To do this, at each cortical point, a multiple regression was run to assess whether the gray matter loss rate at that point depended on the covariate of interest (e.g., test scores, diagnosis). The p value describing the significance of this linkage was plotted at each point on the cortex using a color code to produce a statistical map (Figs. 5, 6,7).
Permutation testing. Maps identifying these linkages were computed pointwise across the cortex and assessed statistically by permutation. We preferred this process over using an analytical null distribution to avoid assuming that the smoothness tensor of the residuals of the statistical model were stationary across the cortical surface. The presence of significant effects in maps of statistics can be tested typically using parametric inference based on Gaussian random field theory (Friston et al., 1996) or by nonparametric (e.g., permutation) methods, both of which have been applied widely in functional (Holmes et al., 1996) and structural brain imaging (Bullmore et al., 1999; Sowell et al., 1999). We used permutation here to avoid making assumptions about the spatial covariance of the residuals (Nichols and Holmes, 2002) and to avoid complex corrections for data localized on surfaces [for a discussion of this issue, see Thompson et al. (2000a,b)]. Permutation methods, used here, measure the distribution of features in statistical maps (such as the area with statistics above a predetermined threshold) that would be observed by accident if the subjects were randomly assigned to groups. This computed distribution is then used to compare the features that occurred in the true experiment with those that occurred by accident in the random groupings. A ratio is computed describing what fraction of the time an effect of similar or greater magnitude to the real effect occurs in the random assignments. This is the chance of the observed pattern occurring by accident. This fraction provides an overall significance value for the map (corrected for multiple comparisons) (Nichols and Holmes, 2002).
To define a corrected significance value (i.e., an overall pvalue) for a map, we established a primary threshold ofp = 0.01 for all significance maps and measured the total area of the cortex with statistics more significant, at a voxel level, than this threshold [as in previous work, for example, Sowell et al. (2001)]. This threshold (p = 0.01) was chosen a priori on the basis of our previous work in an independent, cross-sectional sample (Thompson et al., 2001b) to optimize detection of broad, diffuse effects. The previous study suggested that large areas of cortex would be diffusely affected at a voxel-wise significance level stronger than 0.01. The total area of the suprathreshold regions on the surface was computed. This area measure was used in the permutation tests to control the number of false positives per map. A total suprathreshold area statistic was chosen rather than a measure of cluster extent or peak height (Friston et al., 1996) to sensitize the permutation test to the detection of subtle but diffuse effects that occur over large (and possibly disconnected) regions of cortex. Despite the selection of a low significance threshold at the voxel level to define clusters, the overallp value for the map still controls for type I error, because the threshold to define significant clusters is the same in the real and in the null simulations from which the permutation distribution is drawn. The total area of the average surface with suprathreshold statistics was used rather than the number of surface vertices with suprathreshold statistics because the total area is invariant to the sampling density of points on the surface (so long as the surface is sufficiently highly sampled).
Control for multiple comparisons. In this type of statistical map, the number of tests that are computed is 65,536 per hemisphere, because this is the number of vertices in the surface model of each cortical hemisphere. However, these tests are highly correlated, and the spatial covariance of their residuals is incorporated into the null distribution; this controls for the number of independent tests (Thompson et al., 2000a,b).
Specification of a region of interest. Because we expected highly significant effects, we did not in general restrict the anatomical search space for the permutation tests, as is common in functional imaging (Friston et al., 1996). Permutations were conducted over the entire hemisphere (this results in conservative control over type I error, because features anywhere on the cortex enter the null distribution). Left and right hemispheres were assessed separately. In one case, we assessed whether the medial surface was affected. To do this we created a volume of interest (or search region) that contained the medial hemispheric surface only, using the surface lines traced at the interhemispheric margin to define its boundary on the average surface. Suprathreshold effects in this cortical region were then assessed to compute a permutation distribution on their total area, and an overall corrected p value was derived for the medial wall effects.
In each case, the covariate vector was permuted 1 million times on an SGI RealityMonster supercomputer with 32 internal R10000 processors, and a null distribution was developed for the area of the average cortex with statistics above a fixed threshold (p < 0.01) in the significance maps. (Post hoc tests revealed that the corrected pvalues were robust to differences in the choice of this primary threshold.) An algorithm was then developed to report the significance probability for each map as a whole (Thompson et al.,2000a,b,2001a), so the significance of the loss patterns could be assessed after the appropriate correction for multiple comparisons. Separate maps were made to show average rates of loss (Fig. 5) and the significance of this loss in patients relative to controls (Fig.6a,b).
Results
Mapping average gray matter deficits in AD
In patients with AD, a highly significant gray matter deficit was observed in a broad anatomical region encompassing bilateral temporal and parietal cortices (Fig. 6a) (using permutation tests for an effect of diagnosis; p < 0.00495 left hemisphere, p < 0.0154 right hemisphere). The most significant impairments occurred in temporal and parietal regions (Fig. 6a, red), where deficits exceeded 15% (Fig. 6c). More intriguing was the anatomical specificity of the loss. A sharp division occurred in the loss maps (Fig. 6b, blue), with central and post-central gyri displaying minimal loss compared with the parietal association cortices immediately posterior to them. Primary sensory and motor cortices were comparatively spared in the disease (with a 0–5% deficit on average in the central and postcentral gyri (Fig.6b, S/M).
Figure 6e–h (bottom row) shows the average deficit pattern in AD 1.5 years later, after a rapid decline in MMSE score from 17.7 ± 1.9 to 12.9 ± 2.5 (p < 0.00054). Again, the cortex exhibits substantial gray matter loss, with deficits intensifying still further in the most severely affected regions (Fig.6e,f) (using permutation tests for an effect of diagnosis; p < 0.00027 left hemisphere,p < 0.00056 right hemisphere). Two features are apparent. First, the frontal cortices, initially only mildly affected (with 6–10% loss), are now severely affected (deficits exceeded 15% everywhere) (Fig. 6g), and second, the sensory and motor territory is still comparatively spared (Fig. 6f,blue), despite the frontal spread of the deficits.
Dynamic, four-dimensional maps
The time course of these gray matter losses, as they emerge over a period of cognitive decline lasting 1.5 years, is observed in the accompanying video sequences (see supplementary data, available at:http://www.loni.ucla.edu/∼thompson/AD_4D/dynamic.html). These 4D data were computed from the maps with a previously described algorithm (Thompson and Toga, 1997). The transit of deficits from temporoparietal into frontal cortices is dramatic and so is the sparing of sensorimotor areas.
Medial wall effects
Given the interest in early limbic changes in AD, we also mapped the loss pattern across the medial wall of the brain hemispheres (Fig.8). Again, sulcal pattern matching was used to pool information from corresponding cortical regions (e.g., cingulate) across subjects and to create the average maps of gray matter deficits in AD.
In the left hemisphere, the entire medial cortex was in deficit at the first scan (Fig. 8b). In the right hemisphere, we found it intriguing that the anatomy was segregated into three major systems (Fig. 8a). (1) Greatest impairment (>15%) was observed in the temporal/entorhinal regions and the parietal lobule, and, perhaps surprisingly, the adjacent occipital/visual cortices; (2) the cingulate/paralimbic belts (10 to 15% loss) were significantly but less severely affected (Fig. 8a, yellow); (3) frontal regions, and in particular orbitofrontal regions (Fig.8a), were comparatively spared (0–5% loss) (Fig.8a, blue). The left hemisphere was significantly more severely impaired than the right (p < 0.029 for disease-specific asymmetry; corrected p < 0.0018 left hemisphere, p < 0.014 right hemisphere, for the effect of diagnosis at the initial scan on the medial wall; permutation tests). Most left hemisphere regions showed a >15% deficit relative to healthy controls (Fig. 8b), consistent with our earlier cross-sectional studies in an independent sample (Thompson et al., 2001b). Both left and right hemisphere maps suggested that limbic regions are more intensely impaired than frontal cortices in AD. This remained true even late in the disease when multiple systems are severely affected (Fig. 8c). After a sharp drop in MMSE score from 18 to 13, most of the cortex was engulfed (Fig. 8,bottom) (corrected p < 0.00015 left hemisphere, corrected p < 0.00015 right hemisphere, for the effect of diagnosis on the medial wall; permutation tests). Even then, some cortical regions were mostly intact (Fig.8c, blue), specifically in the frontal and sensorimotor cortices. A frontal band (0–5% loss) was sharply delimited (Fig. 8c) from the limbic and temporoparietal regions that showed severest deficits in AD (>15% loss). This pattern is consistent with the hypothesis that AD pathology spreads centrifugally from limbic/paralimbic to higher-order association cortices (Mesulam, 2000).
Cognitive correlates
It was also important to confirm that these structural differences were functionally significant. To do this, we tested whether gray matter differences linked with differences and declines in cognitive performance, as quantified by MMSE scores. MMSE scores were lower in the AD group than in controls both initially (p< 2.85 × 10−7) and after 1.5 years (p < 2.6 × 10−8). When rates of change were considered, MMSE decline over time was highly significant in AD (p < 0.00054), whereas no changes were detected in the controls (mean change, 0.0; p > 0.05).
Highly significant linkages were found relating lower cognitive scores to greater gray matter deficits (Fig. 5). These correlations were observed in all brain regions in which there was significant loss (Fig.5a–d), including the temporalparietal and limbic cortices. Linkages were also found between frontal gray matter reduction and lower MMSE scores, but only at the later time point, when frontal gray matter was in significant deficit (Fig. 5e). As expected, no correlations were found between gray matter differences in sensory and motor cortices and cognitive performance (Fig. 5b,blue, S/M). These maps support the idea that structure/cognition effects are regionally specific in AD, at least initially (Fig. 5b). Correlations were strongest in regions with greatest average loss (left cingulate and left temporal and parietal cortices) (Fig. 5d). These correlation maps were confirmed to be significant by permutation (p < 0.0013 left hemisphere, p< 0.0028 right hemisphere, when mean MMSE score was 18, andp < 0.0028 left hemisphere, p < 0.0047 right hemisphere, when mean MMSE score had declined to 13).
Mapping rates of loss
The availability of repeat scans from the same subjects allowed rates of tissue loss to be computed locally for each subject and each location on the cortex. These changes are shown as a map (Fig. 4). This illustrates the group average rate of gray matter loss across the cortical surface, in patients and controls. Even the healthy controls showed a trend for diffuse gray matter loss (Fig. 4a–d) (p < 0.08) at an annual rate of 0.91 ± 0.92% per year overall (left hemisphere 1.16 ± 1.41% per year; right hemisphere 0.67 ± 1.25% per year). However, very few regions exceeded 1% annual gray matter loss (Fig. 4, top). This approximates the tissue loss rate seen in normal adolescence (Thompson et al., 2001d). The anatomy of the healthy controls therefore remained relatively stable, whereas their MMSE score remained unchanged at 29.5 ± 0.3.
Patients with AD lost significant gray matter (p< 0.05 for overall annual loss of gray matter) (Fig. 4e–h) and at a significantly more rapid rate than controls (p < 0.042), with a total gray matter loss rate of 5.03 ± 2.28% per year (left hemisphere 5.43 ± 3.29% per year; right hemisphere 4.64 ± 3.31% per year). Regions with a prominent 4–5% annual loss included the right cingulate, temporal, and frontal cortices bilaterally (Fig. 4, bottom row). The loss rate patterns were also more spatially diffuse than the deficit maps at baseline and follow-up (Figs. 5, 6). In summary, the leading edge of the region with significant deficits spreads somewhat centrifugally (from medial temporal/limbic to frontal regions). Nonetheless, the loss rate maps show progressively intensifying deficits at the leading edge (frontal cortex) as well as in regions that are already severely affected, (e.g., the lateral surfaces of the temporal lobes).
Loss rate asymmetries
In patients with AD, the left hemisphere lost gray matter faster than the right (0.79% per year faster) (p < 0.04). This is consistent with previous reports of the left hemisphere being more severely affected in AD, both metabolically and structurally (Friedland and Luxenberg, 1988; Loewenstein et al., 1989; Johnson et al., 1998; Janke et al., 2001; Thompson et al., 2001b). A trend for faster gray matter loss in the left hemisphere was also observed in controls (0.50% per year faster; p < 0.057).
Logically, if an asymmetric loss process had already been occurring for many years, one might expect the left hemisphere to have significantly less gray matter than the right at the baseline scan, in both patients and controls. This was confirmed to be the case. Figure 7 shows the average gray matter asymmetry in the controls, with a prominent sensorimotor region showing 15–20% less gray matter in the left hemisphere (Fig. 7a) compared with its counterpart on the right. These maps were created by subtracting the gray matter map in the left hemisphere from that on the right, in each individual, before averaging data from corresponding cortical regions across subjects. To account for gyral pattern asymmetries (Geschwind and Levitsky, 1968), this subtraction was performed after computing an additional warp in the cortical parameter space to align each subject's left and right gyral patterns. Both controls (p < 0.0069) (Fig. 7c,d) and patients (p < 0.0091) had significantly less cortical gray matter in the left hemisphere. In AD, there was also an increase in asymmetry over time (p < 0.028) in the overall quantity of gray matter (including deep nuclei as well as cortex). This was not detected in controls. In AD, the left hemisphere had 2.4% (±2.0%) less gray matter overall than the right at baseline, and 3.5% (±2.0%) less gray matter at follow-up.
This indicates faster left hemisphere cortical degeneration in AD, at least during the time interval observed in this study. When the components of this gray matter asymmetry are factored apart, the loss process in AD therefore occurs first at an asymmetric rate (left faster than right), and second, on top of a prevailing gray matter asymmetry (left less than right) observed in AD and even in healthy elderly subjects.
Whole-brain atrophic rates
Because cortical gray matter maps discriminated patients from controls so strongly (with p < 0.005 initially andp < 0.0005 at follow-up; permutation tests), we wanted to test whether simpler measures would also reveal disease-related differences or rates of change. The goal was to compare volumes and maps to see which detected losses most effectively.
Whole-brain atrophic rates (i.e., the loss rate for total cerebral volume) were found to discriminate patients and controls. Total cerebral volume was computed from the cortical models (Fig. 1), and its annual rate of change was computed for all subjects in the study. Overall cerebral volume loss rates were significantly faster in AD than in controls, with a loss rate of 5.22 ± 2.04% per year in AD (p < 2.3 × 10−5), compared with 0.88 ± 0.15% per year in controls (p < 0.013 for significant loss in controls, p < 0.003 for group difference; both hemispheres pooled). In patients with AD, the left hemisphere lost volume at 5.86 ± 8.60% per year, compared with 0.99 ± 0.59% in controls (p < 0.019 for significant loss in controls, p < 0.023 for group difference), whereas with AD the right hemisphere lost volume at 4.57 ± 5.61% per year, compared with 0.88 ± 0.46% in controls (p < 0.0083 for significant loss in controls,p < 0.0095 for group difference; all tests one-tailed). These disease effects were found in multiple regressions that controlled for age and gender (although effects of these covariates were not significant; overall multiple r = 0.48; p < 0.006). A trend for faster left hemisphere loss also was observed for cerebral volume (p < 0.055), in line with the significantly faster gray matter loss rate mapped in the left hemisphere.
To determine whether the cerebral volume loss rate was attributable primarily to gray or white matter degeneration, we also evaluated white matter loss rates. We did not find a faster white matter loss rate in AD relative to controls (p = 0.41), although both groups lost significant white matter over time (p < 0.035). The white matter loss rate in AD (3.30 ± 2.06% per year; left hemisphere 3.20 ± 2.91% per year, right hemisphere 3.40 ± 3.03% per year) was comparable with that in controls (2.72 ± 1.44% per year; left hemisphere 2.91 ± 2.09% per year, right hemisphere 2.52 ± 2.07%/year), and no asymmetries were detected (p > 0.1).
Discussion
This study used new longitudinal brain mapping techniques to chart the transit of structural deficits across the cortex in AD. When the deficit data are visualized in 4D, a dynamic, spreading wave of loss is observed. The left hemisphere was engulfed fastest, with the right after a similar sequence after a time lag. In the right hemisphere, sharp boundaries appeared in the deficit patterns at the cingulate/frontal border on the medial wall (Fig. 8c), indicating differential susceptibility, at least transiently, in mild to moderate AD. Regionally selective atrophy was found in distinct disease phases, with initial sparing of frontal and then only sensorimotor cortices. Deficits and rates of change were coupled with declining cognitive status and quantified by MMSE scores. As a supplement to clinical measures of disease progression, which can be notoriously variable (DeCarli, 2000), cortical maps quantitatively store information on changes expected in AD, offering a standard against which drug effects can be calibrated.
The path of disease progression is appreciated most clearly in the video sequences (see supplementary data). These suggest a spatially complex model of different atrophic patterns as AD progresses. Three main features are observed: (1) the overall deficit pattern spreads through the brain in a temporal–frontal–sensorimotor sequence, with a time lag in the right hemisphere; (2) the left hemisphere degenerates faster than the right, and this asymmetric loss rate increases the existing asymmetry in cortical gray matter (L<R) found in healthy elderly subjects; and (3) some phylogenetically older brain systems are spared late in the disease (e.g., sensorimotor cortices). The resultingin vivo maps differentiate AD from normal aging and visualize atrophic patterns associated with specific disease stages.
Pathology
The deficit sequence also matches the trajectory of NFT distribution observed postmortem in patients with increasing dementia severity at death (Braak and Braak, 1991, 1997). Consistent with the deficit maps observed here, NFT accumulation is minimal in sensory and motor cortices, but it occurs preferentially in entorhinal pyramidal cells, the limbic periallocortex (layers II/IV), the hippocampus/amygdala and subiculum, the basal forebrain cholinergic systems, and subsequently, in temporoparietal and frontal association cortices (layers III/V) (Pearson et al., 1985; Arnold et al., 1991). Neuropathologic studies also reveal that cortical layers III and V selectively lose large pyramidal neurons in association areas (Brun and Englund, 1981). Immunocytochemical studies report 11–50% synaptic loss in superior temporal and inferior parietal cortices in AD (Clinton et al., 1994). A heavy loss of cholinergic axons occurs, with a variable decrease in cholinoreceptive pyramidal neurons (Mesulam, 2000). Gray matter deficits may indicate a depletion in cholinoreceptive neurons and perhaps a reduced receptiveness to cholinergic therapy (Hampel et al., 2002). Gomez-Isla et al. (1997)noted that in AD, both neuronal loss and NDT density were correlated and increased in parallel with the duration and severity of illness, whereas the number of senile plaques and amyloid burden in the superior temporal sulcus were not related to neuronal loss, number of NFTs, or duration of disease.
Gray matter atrophy observed with MRI may also be attributable to a combination of processes other than, or in addition to, neuronal loss, including cell shrinkage, reduced dendritic extent, and synaptic loss (for review, see McEwen, 1997; Uylings and de Brabander, 2002). In healthy aging, age-related neuronal loss does not occur in most regions of the neocortex (Terry et al., 1987; Morrison and Hof, 1997) and appears specific to the frontal cortex (de Brabander et al., 1998) and some hippocampal regions (e.g., CA1 and the subiculum) (Simic et al., 1997). By contrast, marked neuronal loss occurs in early AD (Gomez-Isla et al., 1997), with severe early losses in layer II of the entorhinal cortex. Normal age-related cortical changes may be attributable in part to cell shrinkage (Shimada, 1999), reduced dendritic length (Flood et al., 1987; Hanks and Flood, 1991), and changes in perfusion, fat and water content, and other chemical constituents (Weinberger and McClure, 2002). Age-related dendritic reduction may be region- and lamina-specific (Uylings and Brabander, 2002). Nakamura et al. (1984) found the greatest reductions in layer V pyramidal basal dendrites with normal aging, and dentate granule cells also display significantly reduced apical dendritic length (>40% in the dentate gyrus) (Hanks and Flood, 1991). In summary, changes observed here in normal aging may primarily reflect cell shrinkage, reductions in dendritic extent, and synaptic loss; the changes in AD may reflect a combination of these processes as well as substantial neuronal loss (Gomez-Isla et al., 1997).
Metabolism
Metabolic scans in AD, acquired with [18F]-fluorodeoxyglucose positron emission tomography, show a similar deficit pattern. Decreased metabolic activity is found in temporal and parietal lobes and in the posterior cingulate cortices (Mazziotta et al., 1992; Mega et al., 1997). Frontal deficits typically occur later. These decreases predict cognitive decline rate (on the MMSE) and also survival (Jagust et al., 1996). A recent review of vascular and perfusion changes in AD (de la Torre, 2002) noted that cerebral hypoperfusion typically predates cortical hypometabolism in AD. Microvascular changes may therefore contribute to changing cortical metabolism and neocortical atrophy in AD, because these changes progress in a similar sequence.
It is interesting that gray matter loss at autopsy is predominantly cortical in patients with AD who are <80 years of age (Hubbard and Anderson, 1981). This induces corpus callosum (Thompson et al., 1998;Hampel et al., 2002) and thalamic atrophy (Jernigan et al., 1991), leading to a widespread cortical disconnection syndrome. The transition of AD pathology into frontal association cortices suggests a degeneration of synaptically linked cortical pathways, with a relative sparing of phylogenetically older, sensorimotor cortices. Occipital regions are also atrophic in this study; changes are not always detected in metabolic or perfusion studies of AD, in which visual cortices often serve as a control region in PET or single photon emission computed tomography imaging studies (Jagust et al., 1996).
Differential susceptibility
The anterior and ventromedial temporal lobes may be especially susceptible to AD pathology. The vulnerability of neocortical association areas may relate to their degree of functional connectivity with limbic structures, in which pathology begins (Arriagada, 1992). As noted in a recent physiological model (Mesulam, 2000), increased expression and phosphorylation of tau occurs in regions with high levels of neuroplasticity (Brion et al., 1994). This risk factor for NFT formation disrupts the cytoskeleton and ultimately leads to cell death. The spread of NFT pathology (Hof and Morrison, 1994; Morrison and Hof, 2002) may originate in limbic regions because of their high levels of baseline plasticity. Later cell loss in association areas may result from cortical remodeling attributable to impaired input activity in limbic–paralimbic neurons that innervate them.
Cortical specificity
In the deficit maps, some barriers to disease progression appear at architectonic boundaries. The sensorimotor division is clearest (Figs. 5b, 6b,f); the right cingulate sulcus (Fig. 8c) also delimits spared frontal from severely affected limbic cortex. These barriers may be transient, but they suggest that structural deficits may differ sharply on either side of known architectonic boundaries, a feature seen in our earlier studies (Thompson et al., 2001b).
Detection power
Practical questions arise in determining the most powerful measure to discriminate AD from healthy aging and in resolving treatment effects. Here, gray matter maps, at a single time point, better discriminated AD (p < 0.00027 at follow-up) than longitudinal loss rates for total cerebral volume (p < 0.003) and rates of overall gray matter loss (p < 0.04). All of these measures correlated significantly with cognitive decline. Brain volume change rates, derived from high-resolution cortical surface models (Fig. 1), may be more effective at discriminating AD than total gray matter changes, which may be more susceptible to partial volume error. Map-based measures showed vastly greater effect sizes than any measures based on volumes (e.g., p < 0.00027 at follow-up). Increased power may result from restricting the search space for disease effects to the cortical sheet. Permutation tests on cluster size (Bullmore et al., 1999; Thompson et al., 2001c) in the significance maps also sensitize the maps for detecting disease effects.
Advantages of this study
This study builds on previous work on AD progression (DeCarli, 2000; Ashburner et al., 2003). The advantages of this study over previous work are that advancing deficits are shown in the form of dynamically changing maps. Cortical pattern matching, a technique used here, also relates deficits to gyral anatomy (e.g., the cingulate/frontal division) (Fig. 8c). The image analysis method involves a high-dimensional registration followed by a test of gray scale differences and is therefore a hybrid in terms of the continuum between deformation morphometry and voxel-based morphometry discussed previously in the literature (Ashburner and Friston, 2001;Bookstein, 2001). As advocated previously (Bookstein, 2001), high-dimensional warping is used to align structures across subjects before comparing gray matter differences. This can increase detection power by reducing the anatomical variance present in image subtraction methods (Fig. 1). Asymmetries and group effects can then be mapped using surface-based statistics, after explicitly modeling cortical pattern differences across hemispheres, subjects, and time (Cannon et al., 2002).
Relation to previous work
Recent techniques to map AD progression use serial image alignment (Woods et al., 1993; Fox et al., 1996; Subsol et al., 1997; Wang et al., 2002), sometimes in conjunction with image deformation techniques (Freeborough and Fox, 1998; Thompson et al., 2000a,b; Janke et al., 2001; Scahill et al., 2002). These techniques produce an overall measure of change (e.g., brain volume loss in percentage) (Fox et al., 1999; Wang et al., 2002) or detailed maps of these changes (Thompson et al., 2000a,b; Janke et al., 2001). Four-dimensional maps of degenerative rates may also be derived from a deformation field that elastically transforms a subject's anatomy from its earlier configuration to its later shape (Janke et al., 2001). Using a brain boundary shift integral technique, Fox et al. (1999, 2000) noted that yearly rates of overall brain atrophy, based on their measures of total cerebral volume, were 2.4 ± 1.1% per year in AD and 0.4 ± 0.5% per year in matched elderly controls (MMSE score 19.6 ± 4.1 and 29.2 ± 1.0 at baseline, for patients and controls, respectively). These measures are slightly lower than ours (5.22 ± 2.04% per year in AD; 0.88 ± 0.15% per year in controls), although our patients are slightly more severely impaired (MMSE score falling from 18 to 13 at follow-up). This may support the idea that atrophic rates accelerate as the disease progresses (Kaye et al., 1999). A limitation of our current study is our use of only two time points per subject, which forces us to assume linear loss during the interscan interval. Future studies using multiple time points per patient (Janke et al., 2001) will reveal whether AD accelerates or progresses nonlinearly over time.
The profound deficits observed here further support the focus on temporal lobe as a site of early and progressive change in AD (Murphy et al., 1993; Kaye et al., 1997; Jack et al., 1998; Laakso et al., 2000). Jobst et al. (1994) noted faster change rates, even before symptom development, in normal individuals who went on to develop mild cognitive impairment. In the future, cortical maps may help to map preclinical brain change in those at genetic risk for AD (e.g., ApoE4 carriers) (Small et al., 2000; Reiman et al., 2001; see alsoThompson et al., 2003).
In summary, the dynamic maps presented here suggest that dynamic structural changes in AD, mapped in living patients, are congruent with earlier cross-sectional metabolic and pathologic changes. This sheds light on the complex pattern and timing of these cortical events. The overall strategy described here also provides quantitative and visual criteria to assess genetic effects on brain structure (Thompson et al., 2001a; Thompson and Toga, 2002) and to map drug effects in clinical trials.
Footnotes
This work was supported by National Library of Medicine Grant R01 LM05639, by National Center for Research Resources Grants P41 RR13642 and M01 RR00865, by a grant from GlaxoSmithKline Pharmaceuticals UK, and by Human Brain Project Grant P01 MH52176 to the International Consortium for Brain Mapping, which is funded jointly by the National Institute of Mental Health and the National Institute on Drug Abuse.
Correspondence should be addressed to Dr. Paul Thompson, Room 4238, Reed Neurological Research Center, Laboratory of Neuro Imaging, Department of Neurology, University of California Los Angeles School of Medicine, 710 Westwood Plaza, Los Angeles, CA 90095-1769. E-mail:thompson{at}loni.ucla.edu.