Introduction

Alzheimer’s disease (AD) is an age-related neurodegenerative disorder that results in progressive loss of cognitive function. AD is characterized by the accumulation of the amyloid-beta (Aβ) peptide into amyloid plaques in the extracellular brain parenchyma and by intraneuronal neurofibrillary tangles caused by the abnormal phosphorylation of the tau protein [1]. Amyloid deposits and tangles are necessary for the post mortem diagnosis of AD [2].

Imaging techniques, such as positron emission tomography (PET), have long been used to visualize brain damage in AD and in mild cognitive impairment (MCI), often a prodrome to AD [3]. There is increasing evidence that reductions in the cerebral metabolic rate of glucose (MRglc), as measured with PET using [18F]-fluoro-2-deoxyglucose (FDG) as the tracer, can be consistently detected in MCI patients compared to age-matched normal controls, mostly involving the parieto-temporal, posterior cingulate, and medial temporal cortices [4–7]. MRglc is an index of synaptic functioning and density [8], but hypometabolism is not specific to AD, as it is also observed in other neurodegenerative disorders (see [9] for review). Moreover, MCI is a clinical diagnosis in need of confirmatory biological evidence for disease. A recent large population-based study showed that up to 40% of patients with MCI were subsequently diagnosed as cognitively normal [10].

The PET tracer N-methyl[11C]2-(4'-methylaminophenyl)-6-hydroxy-benzothiazole, better known as Pittsburgh Compound-B (PIB), has been used to detect amyloid deposition in vivo. Prior PIB-PET studies demonstrate quantitative increases in PIB uptake, reflecting greater amyloid burden, in AD and MCI patients compared to controls [11–14]. In AD, PIB uptake is particularly evident in the frontal, parieto-temporal, and posterior cingulate cortices, in keeping with the known distribution of amyloid plaques [15–17]. However, recent data also show that many MCI patients fall in between AD and control values for PIB binding, and some clinically normal subjects also show elevated PIB uptake [13, 14]. These findings are consistent with clinico-pathological studies showing that typical amyloid lesions are found in both demented and non-demented individuals [18, 19]. These results suggest that the presence of amyloid may be necessary, though not sufficient, for the symptoms consistent with the MCI stage of AD. The present study used a newly developed automated region of interest technique to compare the diagnostic value and concordance of FDG-PET and PIB-PET in AD and MCI.

Materials and methods

Subjects

Thirty-seven subjects, comprising 17 AD patients, 13 MCI patients, and 7 normal elderly (NL) subjects, were examined at the University of Turku, Finland. All subjects underwent thorough clinical examinations including a medical history corroborated by a close informant, neurological and neuropsychological examinations, routine blood analysis, and magnetic resonance imaging (MRI). The AD patients were diagnosed according to the National Institute of Neurological and Communicative Disorders and Stroke/Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA) [20] and DSM-IV criteria [21] by an experienced neurologist. Dementia severity was evaluated with the Mini-Mental State Examination (MMSE) [22]. All AD patients had progressive impairment of memory and impairment in at least one additional field of cognitive function.

All 13 MCI patients met the criteria for “amnestic MCI” [23]. The NL controls were healthy volunteers who contacted Turku PET Centre in response to announcements, made in newspapers or at public lectures, seeking participants for memory studies. None of the controls reported or revealed on examination any neurological or psychiatric disease, prior head trauma, sensory impairment, or subjective cognitive complaints. The PET scans were used for the research evaluations and were not used for selection purposes. The study was approved by the Ethics Committee of Southwest Finland Health Care District.

Brain imaging

PET imaging

All subjects underwent two PET scans, one with PIB and one with FDG on a GE Advance PET scanner (GE Medical Systems, Milwaukee, WI, USA) in the three-dimensional scanning mode (septa retracted), yielding 35 slices with 4.25 mm thickness that covered the whole brain. The spatial resolution (full width at half-maximum) of the camera is 4.3 mm transaxially and 4.3 mm axially. Laser light beams were used for head positioning, with alignment determined by orbitomeatal and sagittal lines. Before the injection of either radiotracer, an 8-min transmission scan with 68Ge rod sources was done for attenuation correction. All imaging data were reconstructed into a 128 × 128 matrix using a transaxial Hanning filter with a 4.6-mm cutoff, and an axial ramp filter with an 8.5-mm cutoff.

PIB-PET

PIB-PET scans were acquired during a 90-min dynamic PET acquisition. [11C]PIB was injected into an antecubital vein as a bolus, with a mean dose of 382 ± 103 MBq, and flushed with saline. The frame sequence of the PIB scan consisted of four 30-s frames, nine 1-min frames, three 3-min frames, ten 5-min frames, and two 10-min frames. PIB-PET parametric images were obtained by noninvasive Logan graphical analysis [24], using the 60- to 90-min scans and the cerebellum as the reference region, to estimate the PIB distribution volume ratios (DVR). This procedure has proved to be reliable and valid for PIB-PET studies in AD [25].
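As a rough illustration of the acquisition and fitting steps described above, the following Python sketch rebuilds the PIB frame schedule from the stated frame counts and fits a Logan-type slope over the late time points. It is a minimal sketch under stated assumptions, not the study's implementation: the fit uses the simplified reference-region form that omits the k2' term, the time-activity curves are made-up values, and all function names are hypothetical.

```python
import math

# (number of frames, frame length in minutes), taken from the text above
PIB_FRAMES = [(4, 0.5), (9, 1.0), (3, 3.0), (10, 5.0), (2, 10.0)]


def frame_edges(spec):
    """Return a (start, end) pair in minutes for every frame in the schedule."""
    edges, t = [], 0.0
    for count, length in spec:
        for _ in range(count):
            edges.append((t, t + length))
            t += length
    return edges


def running_integral(times, values):
    """Trapezoidal integral from t = 0 to each time point (curve assumed 0 at t = 0)."""
    out, total, prev_t, prev_v = [], 0.0, 0.0, 0.0
    for t, v in zip(times, values):
        total += 0.5 * (v + prev_v) * (t - prev_t)
        out.append(total)
        prev_t, prev_v = t, v
    return out


def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx


def logan_dvr(times, target, reference, t_start=60.0):
    """Simplified reference Logan slope (k2' term omitted, an assumption)."""
    int_t = running_integral(times, target)
    int_r = running_integral(times, reference)
    late = [i for i, t in enumerate(times) if t >= t_start]
    xs = [int_r[i] / target[i] for i in late]
    ys = [int_t[i] / target[i] for i in late]
    return ols_slope(xs, ys)


edges = frame_edges(PIB_FRAMES)
mid_times = [(start + end) / 2 for start, end in edges]

# Hypothetical curves: the target is a scaled copy of the reference,
# so the fitted slope should recover the scale factor.
reference = [t * math.exp(-t / 30.0) for t in mid_times]
target = [1.8 * c for c in reference]
dvr = logan_dvr(mid_times, target, reference)
```

The schedule yields 28 frames ending at 90 min, which matches the stated acquisition length; with real data the late-time slope would only approximate the DVR.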

FDG-PET

FDG-PET scans were acquired with a 55-min dynamic PET acquisition. During the uptake period, arterialized venous blood was sampled every 20 s for the first 3 min, every 1 min from 3 to 5 min, every 2.5 min from 5 to 10 min, every 5 min from 10 to 35 min, and every 10 min from 35 to 55 min. The frame sequence of the FDG scan consisted of four 30-s frames, three 1-min frames, and ten 5-min frames. A Patlak plot was used to estimate the brain MRglc [26].
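The Patlak step can be sketched as follows. This is a minimal illustration, not the study's code: the plasma and tissue curves are synthetic, the start of the linear phase (`t_start`) is an assumed value, and the conversion to MRglc uses the standard relation MRglc = Ki × plasma glucose / lumped constant, with both inputs left to the caller because the text does not report them.

```python
import math


def running_integral(times, values):
    """Trapezoidal integral from t = 0 to each time point (curve assumed 0 at t = 0)."""
    out, total, prev_t, prev_v = [], 0.0, 0.0, 0.0
    for t, v in zip(times, values):
        total += 0.5 * (v + prev_v) * (t - prev_t)
        out.append(total)
        prev_t, prev_v = t, v
    return out


def patlak_ki(times, plasma, tissue, t_start=10.0):
    """Slope of C_tissue/C_plasma against int(C_plasma)/C_plasma for t >= t_start.

    t_start is an assumed start of the linear phase, not a value from the text.
    """
    int_p = running_integral(times, plasma)
    late = [i for i, t in enumerate(times) if t >= t_start]
    xs = [int_p[i] / plasma[i] for i in late]
    ys = [tissue[i] / plasma[i] for i in late]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx


def mrglc(ki, plasma_glucose, lumped_constant):
    """Convert the influx constant Ki to a metabolic rate (units follow the inputs)."""
    return ki * plasma_glucose / lumped_constant


# Synthetic example: tissue is built as Ki * int(Cp) + V0 * Cp with Ki = 0.03,
# so the fitted slope should recover 0.03.
times = [float(t) for t in range(1, 56)]
plasma = [10.0 * math.exp(-t / 5.0) + 1.0 for t in times]
int_plasma = running_integral(times, plasma)
tissue = [0.03 * ip + 0.5 * cp for ip, cp in zip(int_plasma, plasma)]
ki = patlak_ki(times, plasma, tissue)
```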

MRI

MRI was performed with a Philips Gyroscan Intera 1.5 T CV Nova Dual scanner (Philips, the Netherlands). MRI included axial spin echo T2-weighted images (TR = 4488 ms; TE = 100 ms, slice thickness = 6 mm, matrix = 512 × 512), and 3D T1-weighted images (TR = 25 ms, TE = 5 ms, slice thickness = 1 mm, matrix = 512 × 512).

Image analysis

All image processing and data analyses were performed at NYU blind to clinical diagnoses. MRI and PET scans were transferred to a Sun Sparc workstation (Sun Microsystems, Mountain View, CA, USA), where PIB- and FDG-PET scans were each co-registered with the corresponding MRI using a three-dimensional method based on minimizing the variance of the signal ratios [27], as implemented in the Multimodal Image Data Analysis System package (MIDAS, version 1.6) [28]. The implementation calls for a preliminary spatial alignment using intrinsic anatomical landmarks.

An MRI-based automated region of interest (ROI) technique was used to sample each individual’s FDG and PIB images. The technique was validated against a manual ROI technique that is described in detail in the Appendix. The template ROIs were first developed on seven MRI scans and then transferred to a co-registered MNI PET template. All PET scans were normalized to the PET template by a high-order polynomial transformation [29]. With the spatial normalization parameters saved, an inverse transformation was applied to morph the ROIs back to the original FDG-PET. The standard FDG ROIs were then transferred to the PIB scan in real space through the co-registration. ROI positioning was verified on the MRI, but no positioning adjustments were made in this project. To maximize gray matter (GM) sampling, a probabilistic gray matter template image was derived from SPM [30] and added as a template ROI (see Appendix). Nine automated ROIs validated on MRI and FDG-PET were studied: anterior putamen (APu), gray matter (GM), hippocampus (HIP), inferior parietal lobe (IP), middle frontal gyrus (MFG), posterior cingulate cortex (PCC), cerebellum (C), superior temporal gyrus (STG), and thalamus (TH).
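One way to picture how a probabilistic gray matter image can bias ROI sampling toward gray matter is sketched below. This is an assumption-laden illustration only: the actual sampling rule is described in the Appendix and may differ, the 0.5 probability threshold is hypothetical, and the function name and flat-list image representation are chosen for brevity.

```python
def gm_weighted_roi_mean(pet, roi_mask, gm_prob, gm_threshold=0.5):
    """Mean PET value inside an ROI, weighted by gray-matter probability.

    pet, roi_mask, and gm_prob are flat, equally long lists (one entry per voxel).
    Voxels outside the ROI or below gm_threshold (a hypothetical value) are skipped.
    """
    weighted_sum, weight_total = 0.0, 0.0
    for value, inside, prob in zip(pet, roi_mask, gm_prob):
        if inside and prob >= gm_threshold:
            weighted_sum += value * prob
            weight_total += prob
    if weight_total == 0.0:
        raise ValueError("no gray-matter voxels inside the ROI")
    return weighted_sum / weight_total


# Toy four-voxel image: the third voxel is mostly white matter and the fourth
# lies outside the ROI, so only the first two voxels contribute.
roi_mean = gm_weighted_roi_mean(
    pet=[10.0, 20.0, 30.0, 40.0],
    roi_mask=[1, 1, 1, 0],
    gm_prob=[1.0, 0.5, 0.2, 1.0],
)
```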

The posterior lobe of the cerebellar cortex was used as the reference region for both PIB and FDG analyses. The cerebellum is minimally affected by either MRglc reductions [4, 31] or by amyloid pathology [32, 33] in AD.

Statistical analysis

The general linear model (GLM) univariate analysis of variance (ANOVA), with Tukey post hoc tests, was used to examine demographic, clinical, FDG MRglc, and PIB uptake measures across the three clinical groups. All significant results were confirmed using the nonparametric Mann–Whitney test with Bonferroni correction for multiple comparisons. Categorical demographic variables were examined with chi-square analysis and confirmed with Fisher’s exact tests. PIB DVR is expressed as a ratio to cerebellar uptake. MRglc measures were adjusted for cerebellar MRglc as a covariate in the GLM. The bilateral regions showing the largest group effects (as determined by MANOVA) were examined with logistic regressions and receiver operating characteristic (ROC) curves to assess their diagnostic accuracy in classifying the NL, MCI, and AD groups. The ROC curves were also used to determine the optimal cutoff values for MRglc and PIB DVR in separating NL, MCI, and AD. Results were considered significant at p < 0.05. All analyses were done using SPSS 12.0 (SPSS, Chicago, IL, USA, 2004).
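The text does not state how the optimal ROC cutoff was chosen. A common choice, assumed here purely for illustration, is the cutoff that maximizes Youden's J (sensitivity + specificity − 1). The sketch below uses made-up values, not study data, and treats higher values as abnormal (as for PIB DVR); for MRglc, where lower values are abnormal, the values could be negated first.

```python
def roc_points(controls, patients):
    """Return (cutoff, sensitivity, specificity) for each observed value used as a cutoff.

    A value at or above the cutoff is classified as positive (patient-like).
    """
    points = []
    for cutoff in sorted(set(controls) | set(patients)):
        sens = sum(v >= cutoff for v in patients) / len(patients)
        spec = sum(v < cutoff for v in controls) / len(controls)
        points.append((cutoff, sens, spec))
    return points


def youden_cutoff(controls, patients):
    """Pick the ROC point with the largest Youden's J (an assumed criterion)."""
    return max(roc_points(controls, patients), key=lambda p: p[1] + p[2] - 1.0)


# Hypothetical measurements for two groups, for illustration only.
control_values = [0.9, 1.0, 1.1, 1.2]
patient_values = [1.3, 1.6, 1.9, 2.1]
best = youden_cutoff(control_values, patient_values)
```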

Results

Clinical data

The NL, MCI, and AD groups were comparable for age, gender, and education (see Table 1). The MMSE was significantly lower in AD subjects than in NL and MCI (p < .05), but did not differ between MCI and NL. MMSE scores in the AD group ranged from 17 to 27, which corresponds to mild to moderate dementia.

Table 1 Subjects’ characteristics

Group differences

FDG-PET

Of the nine regions tested, five regions showed significant post hoc differences between the NL and AD groups (Fig. 1, Table 2), with the AD group showing reduced MRglc compared to NL in regions including the HIP (43%), PCC (21%), IP (18%), and MFG (13%) (ps < .05). Significant MRglc reductions in MCI compared to NL were found for the HIP (16%) and IP (13%), whereas reductions in AD compared to MCI were found only in the HIP (23%; ps < .05). For all group comparisons, the HIP MRglc was the most significant group discriminator (F[2,34] = 33.9, p < .001). No cerebellar differences were observed for any group comparisons.

Fig. 1 Scatter plots showing regional cerebellar-adjusted MRglc (μmol/100 g/min) for the HIP, MFG, IP, and PCC in NL, MCI, and AD groups. The horizontal lines show the group means

Table 2 FDG-PET MRglc data by diagnostic group

PIB-PET

AD patients showed significantly higher PIB uptake than both the NL and MCI groups in the GM, MFG, PCC, IP, and STG (F[2,34] = 14.6, p < .005; Fig. 2, Table 3). The MFG was the region showing the highest PIB uptake (F[2,34] = 14.6, p < .001), which was 66% higher in AD compared to NL (p = .0001) and 29% higher compared to MCI (p = .004). No significant differences were found between NL and MCI; however, the APu region showed a trend for higher PIB uptake in MCI (p = .06). No cerebellar differences were observed for any group comparisons.

Fig. 2 Scatter plots showing regional PIB binding from the ROI analysis in the HIP, MFG, IP, and PCC in NL, MCI, and AD subjects. The small horizontal lines show the group mean values. The large horizontal line shows, for reference purposes, the DVR set to 1.4

Table 3 PIB-PET DVR data by diagnostic group

PIB-FDG associations

Negative correlations between the FDG and PIB modalities were observed when the three groups were combined: IP (r = −0.43, p = .001), STG (r = −0.41, p = .001), and PCC (r = −0.40, p = .001). However, no significant intraregional correlations were observed within any of the three diagnostic groups.
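For readers who want to reproduce this kind of regional association, the Pearson coefficient between paired MRglc and PIB DVR values can be computed as in the minimal sketch below. The paired values are invented for illustration and are not study data; the function name is hypothetical.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Invented paired regional values: higher DVR paired with lower MRglc.
pib_dvr = [1.0, 1.2, 1.5, 1.9, 2.2]
fdg_mrglc = [34.0, 31.0, 30.0, 26.0, 25.0]
r = pearson_r(pib_dvr, fdg_mrglc)
```

Because pooling groups with different means can by itself produce a correlation, the same function would need to be applied within each diagnostic group to check whether the association holds there as well.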

As shown in Fig. 3, the FDG-PET and PIB-PET data show different regional patterns of involvement in AD compared with NL. The most significantly affected region was the hippocampus on FDG and the middle frontal gyrus on PIB (Fig. 4).

Fig. 3 PIB and FDG-PET scans from two representative subjects: (a) a 71-year-old male AD subject, GDS 5, MMSE 19; (b) a 65-year-old male NL subject, GDS 1, MMSE 29. Top row: PIB-PET images; bottom row: co-registered FDG-PET images. PET scans are displayed in the axial plane, from the top to the bottom of the brain, at the level of the centrum semiovale (left), basal ganglia (center), and medial temporal lobe (right)

Fig. 4 Mean MCI and AD Z-scores relative to NL. The most significantly affected region is the hippocampus for FDG and the frontal neocortex for PIB

Diagnostic models

Logistic regression models were used to examine regional PIB uptake and FDG MRglc as predictors of group membership. Diagnostic classification accuracy, sensitivity (SS), and specificity (SP) are given in Table 4.

Table 4 Diagnosis classification accuracy, sensitivity, and specificity (%) of PIB-PET and FDG-PET

AD vs. NL

HIP MRglc yielded an accuracy of 92% (χ²[1] = 21.8, p < .001, SS = 100, SP = 88). MFG PIB uptake distinguished AD from NL with 96% accuracy (χ²[1] = 29.0, p < .001, SS = 94, SP = 100). Using a PIB DVR cutoff of 1.4, each of three regions (MFG, PCC, and IP) correctly classified 16 out of 17 AD and 7 out of 7 NL subjects, giving 94% sensitivity, 100% specificity, and an overall accuracy of 96%. The one misclassified AD subject had a PIB DVR lower than 1.4 in all three regions (Fig. 2). Comparing PIB-MFG and FDG-HIP, there was high diagnostic agreement for the classification of AD (94%) and NL (86%) subjects.
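The sensitivity, specificity, and accuracy quoted for the DVR cutoff of 1.4 follow directly from the reported counts (16 of 17 AD and 7 of 7 NL subjects correctly classified). The short worked example below reproduces that arithmetic; the function name is illustrative.

```python
def diagnostic_summary(true_pos, n_patients, true_neg, n_controls):
    """Return (sensitivity, specificity, accuracy) from classification counts."""
    sensitivity = true_pos / n_patients
    specificity = true_neg / n_controls
    accuracy = (true_pos + true_neg) / (n_patients + n_controls)
    return sensitivity, specificity, accuracy


# Counts reported in the text for MFG, PCC, and IP at a PIB DVR cutoff of 1.4.
sens, spec, acc = diagnostic_summary(true_pos=16, n_patients=17,
                                     true_neg=7, n_controls=7)
print(f"sensitivity {sens:.0%}, specificity {spec:.0%}, accuracy {acc:.0%}")
# prints "sensitivity 94%, specificity 100%, accuracy 96%"
```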

MCI vs. NL

The accuracy of HIP MRglc in distinguishing MCI from NL was 85% (χ²[1] = 9.2, p < .01, SS = 85, SP = 86), and the accuracy of MFG PIB uptake was 75% (χ²[1] = 7.2, p < .01, SS = 62, SP = 100). Thus, about 60% of the MCI subjects showed an AD-like DVR > 1.4 (see Fig. 5). Agreement between PIB-MFG and FDG-HIP in the classification of MCI was poor (54%): only 7 out of 13 MCI cases showed both high PIB binding and low MRglc. Combining the MFG PIB and HIP FDG measures improved the classification of NL and MCI to 90% (increment χ²[1] = 4, p < .05, SS = 92, SP = 86). In an effort to relate these MCI data to the clinical findings, we examined the association between MFG PIB uptake and MMSE scores. All MCI patients with lower MMSE scores (25–27) were AD-like (PIB DVR > 1.4), and all NL-like MCI patients (PIB DVR < 1.4) had MMSE scores ≥ 28 (see Fig. 3, t = 2.7 [1,11], p < 0.05). There were no regional MRglc differences between the high and low MMSE MCI groups. As mentioned above, the MCI patients typically showed low HIP MRglc. The MFG PIB and HIP FDG relationships by MMSE score are depicted in Fig. 5. These data show that all MCI patients with lower MMSE scores (25–27) fall in the high PIB binding, low MRglc quadrant. For the low MMSE group, there was 100% agreement between PIB and FDG, as opposed to 33% for the high MMSE group.
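The MCI agreement figure can be laid out as a 2 × 2 cross-classification. In the sketch below, the number of MCI subjects abnormal on each modality is back-calculated from the reported sensitivities (85% and 62% of 13), which is an assumption of this example rather than a count given in the text; only the 7 of 13 with both abnormalities is stated directly.

```python
n_mci = 13
both_abnormal = 7      # stated in the text: high PIB binding and low MRglc

# Assumed counts, back-calculated from the reported sensitivities.
fdg_abnormal = 11      # 11/13 = 85% (HIP MRglc)
pib_abnormal = 8       # 8/13 = 62% (MFG PIB)

fdg_only = fdg_abnormal - both_abnormal    # low MRglc, low PIB
pib_only = pib_abnormal - both_abnormal    # normal MRglc, high PIB
neither = n_mci - both_abnormal - fdg_only - pib_only

both_fraction = both_abnormal / n_mci
print(f"both abnormal: {both_fraction:.0%}")
# prints "both abnormal: 54%"
```

Under these assumed counts, the off-diagonal cells hold four subjects abnormal on FDG only and one abnormal on PIB only.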

Fig. 5 Scatter plots showing the combined use of MFG PIB and HIP FDG in the classification of the NL, MCI, and AD groups

AD vs. MCI

MFG PIB uptake yielded an overall accuracy of 77% in distinguishing AD from MCI (χ²[1] = 8.9, p < .01). HIP MRglc yielded an accuracy of 80% (χ²[1] = 11.6, p < .01). The combination of MFG PIB uptake and HIP MRglc improved the classification of AD and MCI to 83% (increment χ²[1] = 4.1, p < .01, SS = 82, SP = 85).

Discussion

The uptake of the beta amyloid PET tracer [11C]PIB was significantly increased in AD compared to age-matched healthy controls. This effect was found bilaterally in the middle frontal gyrus, anterior putamen, inferior parietal lobule, and the posterior cingulate cortex. In MCI, a lesser pattern of PIB uptake was found involving the middle frontal gyrus and inferior parietal lobule. This observation is also consistent with findings reported in previous studies [14, 34]. The FDG-PET data demonstrated that AD patients show a pattern of bilateral MRglc reductions in the hippocampus, posterior cingulate, inferior parietal, and frontal cortices, while MCI patients presented with hypometabolism most consistently in the hippocampus and in the parietal cortex. These findings are also consistent with prior FDG-PET studies [6, 7, 37, 38, 39, 40].

The MFG PIB uptake separated 16 out of 17 AD patients from NL controls with 100% specificity and 94% sensitivity. This contributes to the view that [11C]PIB-PET will have utility as a diagnostic marker for AD. Only one 73-year-old male AD patient [Global Deterioration Scale (GDS) 5, MMSE 19] showed low PIB retention (DVR = 1.29). The finding of occasional “PIB negative” AD patients has been previously reported [14, 35], and the reason is unclear. Such findings require post mortem clarification. On the other hand, the FDG-PET scan of this AD patient showed evidence for a neurodegenerative disorder consistent with AD, as reflected in the bilateral MRglc reductions in the parieto-temporal and posterior cingulate cortices and medial temporal lobes. While direct diagnostic comparison between PIB and FDG imaging is uncommon, a previous study [36] reported PIB-PET to be superior to FDG-PET in classifying AD from NL. Unlike our study, that study examined only neocortical regions and not the hippocampus, which we found most discriminative on FDG-PET. Our study shows that the best regions for FDG-PET (hippocampus) and PIB-PET (middle frontal gyrus) have high diagnostic agreement for AD (94%) and NL (86%), indicating approximately equal value in the clinical diagnosis of AD. The combination of the two PET modalities did not improve the diagnostic discrimination between AD and NL. The lack of an appreciable increase in the classification of AD and NL is attributable to the very high accuracy each modality achieved on its own.

We found that several PIB regions discriminated between MCI and NL with accuracies in the 60–75% range, compared with FDG regions that performed in the 70–85% range. FDG-PET appears to be superior to PIB-PET in the classification of NL and MCI. Moreover, the diagnostic agreement between the two PET modalities for MCI was only 54%; four MCI subjects with MRglc reductions showed low PIB retention and one MCI subject with normal MRglc showed high PIB retention. With MCI subjects separated into high PIB uptake (AD-like) and low PIB uptake (NL-like) groups (Fig. 5), we found that the MCI subjects with high MMSE scores (≥ 28) had NL-like PIB scans and the MCI subjects with low MMSE scores had AD-like PIB scans. Moreover, elevated PIB and low MRglc were found together in 100% of low performing MCI subjects, compared with only 33% agreement for the high performing MCI subjects. These data suggest that low performing MCI subjects with PIB positive scans are at increased risk for dementia. Overall, the data show that the combination of the two PET modalities improves the diagnostic discrimination between MCI and NL. Longitudinal studies are needed to clarify the utility of PIB and FDG-PET imaging in assessing risk for AD.

Our results comparing MCI and AD showed different patterns of regional involvement depending on the PET imaging modality. We observed that the mean PIB values for several regions differed between MCI and AD and that their diagnostic classifications were significant. Consistent diagnostic PIB effects (~70% accuracy) were found in the middle frontal gyrus, posterior cingulate, inferior parietal lobule, and the superior temporal gyrus. For FDG, the hippocampus, the only region that showed a mean MRglc reduction in AD relative to MCI, also showed the only significant diagnostic effects (~80%). This finding of modality-specific informative regions provides a second example in which the combination of the two PET techniques yields complementary information in the detection of pathology. In our study, the combination of the two PET modalities improved the diagnostic discrimination between MCI and AD and between MCI and NL.

We did not observe an inverse relationship between the regional PIB and FDG data in AD as reported by others [14, 35]. This discrepancy may be due in part to the statistical designs used to study the data. Klunk et al. demonstrated a significant correlation in the inferior parietal cortex in a combined group of AD and NL subjects. However, they observed that this effect did not remain significant when only AD patients were studied [14]. While Edison et al. showed that lower CMRglc values correlated with higher PIB uptake ratios in the temporal (p = .05, r = −0.58) and parietal lobes (p = .041, r = −0.60) in 12 AD subjects, they also observed a high frontal amyloid load in the face of spared glucose metabolism. These preliminary results suggest that the weak inverse correlations observed may be due either to group diagnostic effects or to different regional metabolic vulnerability arising from the complex neuropathology of AD. Overall, it appears that amyloid plaques may not be directly responsible for neuronal dysfunction in AD.

In the present study, we describe the continued development of an automated ROI technique custom-tailored for FDG and PIB-PET images (see Appendix for technical details). Several automated tools are used in neuroimaging studies to examine and sample brain regions. Foremost, voxel-based analysis (VBA) techniques with statistical parametric mapping procedures allow statistical effects to be examined throughout the whole brain on a voxel-by-voxel basis [41, 42]. The basic procedure in VBA involves the spatial normalization and smoothing of each individual PET scan to a spatially standardized brain reference image (i.e., the “template” image) in stereotactic space, thus enabling automated voxel-by-voxel assessment of statistical effects [41, 42]. However, the MRI-guided ROI technique remains the gold standard for PET sampling, especially in aging and degenerative diseases, because of its superior anatomical precision. On the other hand, conventional manual ROI sampling is time consuming and operator dependent, and PET images are often acquired without a corresponding 3D research MRI. To examine large data sets with reasonable anatomical precision, we developed an automated ROI technique custom-tailored for sampling the cortical and medial temporal lobe regions affected in AD on FDG and PIB-PET images. These procedures were validated against the gold-standard manual ROIs determined on the co-registered MRI scans. The automated ROI data in this project showed high anatomical precision as assessed on the MRI scans and high agreement with the manual MRglc sampling (r’s > 0.90) and manual PIB sampling (r’s > 0.90; details are described in the Appendix). In the present study, the anatomical accuracy of the automated ROIs was excellent in all subjects, and positional adjustments were not made.

Our automated method offers several advantages compared to other commonly used image analysis tools. First, the ROIs are applied via inverse transformation to the original image instead of the spatially normalized image, thus preserving the anatomical shape of the region and avoiding possible sampling errors. Moreover, the anatomical precision of the ROIs can be directly examined on the MRI and PET scans in the original space, and manual adjustments can be made if necessary. Furthermore, we applied a sampling strategy to optimize gray matter sampling, which minimizes partial volume effects and nonspecific white matter binding, which may confound detection of regional changes on PIB-PET.

There are some limitations in this study. First, the recruitment of patients at a university-based unit limits the generalization of the results. Second, we used a probabilistic gray matter sampling technique instead of the traditional MRI-based atrophy correction. While our tests suggest comparability between the two techniques, there remains the possibility that it may not remove partial volume effects thoroughly. Third, this study relied on cross-sectional data; longitudinal follow-up studies are needed to determine the predictive accuracy of PIB-PET and FDG-PET for the progression from MCI to AD.

Conclusion

We developed and applied an automated image analysis system for the regional sampling of PIB and FDG PET scans in a cross-sectional study of normal elderly, MCI, and AD. We observed widespread cortical amyloid deposition and widespread reductions in brain glucose metabolism in AD patients that were of equivalent high value in diagnosis. For MCI, the regional patterns were less prominent. For both the AD and the MCI groups, the diagnostically most useful PIB region was the middle frontal gyrus, and the most useful MRglc reductions were found in the hippocampus. For MCI, the two modalities were diagnostically inconsistent, and this contributed to their combined added value. Little evidence was found for an inverse relationship between regional PIB uptake and reduced MRglc. Longitudinal studies are needed to clarify the early and relative predictive utility of PIB and FDG-PET imaging in assessing the risk for progressive cognitive decline among MCI patients and for assessing probable AD patients without significant PIB uptake.