Abstract
Gene expression microarrays provide a powerful new tool for studying complex processes such as brain aging. However, inferences from microarray data are often hindered by multiple comparisons, small sample sizes, and uncertain relationships to functional endpoints. Here we sought gene expression correlates of aging-dependent cognitive decline, using statistical profiling of gene microarrays in well powered groups of young, mid-aged, and aged rats (n= 10 per group). Animals were trained on two memory tasks, and the hippocampal CA1 region of each was analyzed on an individual microarray (one chip per animal). Aging- and cognition-related genes were identified by testing each gene by ANOVA (for aging effects) and then by Pearson's test (correlating expression with memory). Genes identified by this algorithm were associated with several phenomena known to be aging-dependent, including inflammation, oxidative stress, altered protein processing, and decreased mitochondrial function, but also with multiple processes not previously linked to functional brain aging. These novel processes included downregulated early response signaling, biosynthesis and activity-regulated synaptogenesis, and upregulated myelin turnover, cholesterol synthesis, lipid and monoamine metabolism, iron utilization, structural reorganization, and intracellular Ca2+ release pathways. Multiple transcriptional regulators and cytokines also were identified. Although most gene expression changes began by mid-life, cognition was not clearly impaired until late life. Collectively, these results suggest a new integrative model of brain aging in which genomic alterations in early adulthood initiate interacting cascades of decreased signaling and synaptic plasticity in neurons, extracellular changes, and increased myelin turnover-fueled inflammation in glia that cumulatively induce aging-related cognitive impairment.
Introduction
Brain aging processes are enormously complex phenomena that affect multiple systems, cell types, and cellular pathways, and eventually induce cognitive decline and increased risk of Alzheimer's disease (AD) (Landfield et al., 1992;Tanzi and Bertram, 2001). Several biological processes have been associated with normal brain aging and neurodegenerative conditions, including inflammatory responses (Rogers et al., 1996; Murray and Lynch, 1998; Hauss-Wegrzyniak et al., 2000; Andreasson et al., 2001;Finch et al., 2002; Gemma et al., 2002; Wyss-Coray and Mucke, 2002), oxidative stress (Carney et al., 1991; Butterfield et al., 1999;Bickford et al., 2000; Nicolle et al., 2001; Wang et al., 2001), reduced mitochondrial function (Nicotera and Orrenius, 1998; Wallace, 2001), and altered Ca2+ regulation (Landfield and Pitler, 1984; Michaelis et al., 1984; Gibson and Peterson, 1987; Khachaturian, 1989; Disterhoft et al., 1993; Franklin and Johnson, 1994; Lipton and Rosenberg, 1994; Foster and Norris, 1997;Nicotera and Orrenius, 1998; Verkhratsky and Toescu, 1998).
The triggers and consequences of these putative brain aging mechanisms are not well understood, although several are accompanied by altered gene expression (Rogers et al., 1996; Finch and Tanzi, 1997; Chen et al., 2000; Lee et al., 2000; Nicolle et al., 2001). Moreover, specific gene mutations are directly involved in age-dependent neurodegenerative disease (for review, see Price and Sisodia, 1998; Selkoe, 2001;Tanzi and Bertram, 2001). Consequently, gene microarray technology, which can monitor the parallel expression of thousands of genes (Schena et al., 1996; Lockhart and Barlow, 2001), appears to be a powerful new tool for investigating the complex processes of brain aging (Lee et al., 2000; Jiang et al., 2001).
Nonetheless, microarray approaches pose significant resource and bioinformatics problems. With only a few exceptions (Pletcher et al., 2002), microarray studies have not used the formal statistical analyses and sample sizes (power) necessary to estimate expected false positives or detect moderate changes in expression. Thus, given the large multiple-comparison error anticipated in microarray analyses (Miller et al., 2001), both false positive (type I) and false negative (type II) errors are often high. As corollaries, the statistical reliability of microarray results is often clouded, and many potentially important processes have likely been overlooked (Watson et al., 2000; Miller et al., 2001; Becker, 2002). Additionally, it has frequently been difficult to assess which of many hundreds of observed gene alterations are potentially relevant to functional endpoints.
Here we used the advantages of microarray analyses to identify novel brain aging processes with potential relevance to cognitive decline. We addressed the key problems of reliability, sensitivity, and functional relevance noted above by using adequately powered statistical profiling and a strategy of correlating the expression of each gene with a defined functional endpoint (memory performance). Several studies have previously found links between specific genes and plasticity or memory (Gall et al., 1990; Steward et al., 1998; Guzowski et al., 2000; Silva et al., 2000; Cavallaro et al., 2001; Kandel, 2001; Luo et al., 2001,2002; Nicolle et al., 2001). However, this appears to be the first study to correlate massively parallel expression microarrays with behavior across individual subjects. Collectively, the data suggest a new integrative model in which cascades of neuronal and glial processes interact cumulatively to induce aging-related cognitive impairment.
Materials and Methods
Subjects and behavioral testing
All procedures involving rats were performed in accordance with the guidelines set forth by the Institutional Animal Care and Use Committee. Male Fischer 344 rats aged 4 months (young,n = 10), 14 months (mid-aged, n = 10), and 24 months (aged, n = 10) were used in these studies. Animals were trained sequentially on two tasks, first in the Morris spatial water maze (SWM) and then in the object memory task (OMT). Overall, the training/testing sequence lasted 7 d, and hippocampal tissue was collected 24 hr later. Training or testing occurred on each day except for the second and third day of the 7 d sequence.
Morris spatial water maze
Methods used here for cognition assessment in the SWM, a task sensitive to both hippocampal function and aging, have been described previously (Norris and Foster, 1999). Briefly, rats were trained in a black tank, 1.7 m in diameter, that was filled with water (27 ± 2°C). Behavioral data were acquired with a Columbus Instruments tracking system. After habituation to the pool, animals were given cue training with a visible platform (five blocks of three trials, maximum of 60 sec per trial, 20 sec intertrial interval, and a 15 min interval between blocks). Rats remained in home cages under warm air after each block. Cue training was massed into a single day, and the criterion for learning was finding the platform on four of the last six trials. For all animals that met this criterion, spatial discrimination training was initiated 3 d later in which the escape platform was hidden beneath the water but remained in the same location relative to the distal cues in the room. Fifteen minutes after the end of spatial training, a 1 min duration free-swim probe trial with the platform absent was administered, during which crossings over the former platform site (platform crossings) were recorded to test acquisition, followed by a refresher training block. Retention for platform location was again tested 24 hr later using a second 1 min free-swim probe trial.
Object memory task
The OMT is also both sensitive to hippocampal function and affected by aging but is less dependent on physical strength and endurance (Markowska et al., 1998). On the afternoon of the final spatial maze probe trial, animals were administered a habituation session (15 min) in the empty mesh cage to be used for the OMT (63.5 × 63.5 cm). OMT training began 24 hr after habituation and consisted of a 15 min acquisition session during which two three-dimensional objects were placed at opposite sides of the cage, followed by two 15 min retention test sessions at 1 and 24 hr after training. During the acquisition session, the cage contained two sample objects (A and B), and the time spent actively exploring each object was recorded. After 1 hr, the rat was reintroduced into the cage, and the time spent exploring a novel object, C, relative to the familiar object, B, was recorded. On the 24 hr test, familiar object A was reintroduced and object B was replaced by a second novel object, D. Objects were randomized across individuals, and timed measures of exploration were used to calculate a memory discrimination index (DI) as follows: DI = (N −F)/T, where N is time spent exploring the novel object, F is time spent exploring the familiar object, and T is total time spent exploring the two objects. More time spent exploring the novel object (higher DI) is considered to reflect greater memory retention for the familiar object.
Tissue collection
Twenty-four hours after completion of the OMT testing, animals were anesthetized with CO2 gas and decapitated. The brains were rapidly removed and immersed in ice-cold, oxygenated artificial CSF consisting of (in mm): 124 NaCl, 2 KCl, 1.25 KH2PO4, 2 MgSO4, 0.5 CaCl2, 26 NaHCO3, and 10 dextrose. Hippocampi were removed, and the CA1 region from one hippocampus per animal was dissected by hand under a stereomicroscope. The CA1 tissue block from each animal was placed in a microcentrifuge tube and flash frozen in dry ice for RNA isolation. Microarray analyses were performed on hippocampal CA1 tissues from each of the same behaviorally characterized 30 animals (one chip per animal), but one chip was lost for technical reasons, leaving a data set of 29 microarrays (young = 9; mid-aged = 10; aged = 10). Each U34A rat chip (Affymetrix, Santa Clara, CA) contained 8799 probe sets (gene representations).
RNA isolation and Affymetrix GeneChip processing
Total RNA was isolated using the TRIzol reagent and following the manufacturer's RNA isolation protocol (Invitrogen, #15596). One milliliter of TRIzol solution was added to each tube containing the frozen tissue block, and the tissue was homogenized by 10 passages through an 18.5 ga syringe needle. After centrifugation, the RNA was precipitated from the aqueous layer, washed, and dissolved in RNase-free water. RNA concentration and integrity were assessed by spectrophotometry and gel electrophoresis. The RNA samples were stored at −80°C.
Gene expression analyses were performed using the Affymetrix GeneChip System. The labeling of RNA samples, rat GeneChip (RG-U34A) hybridization, and array scanning were performed according to the Affymetrix GeneChip Expression Analysis Manual (r.4.0, 2000). Each animal's CA1 subfield RNA was processed and run on a separate rat gene chip. Briefly, an average yield of 40 μg of biotin-labeled cRNA target was obtained from 5 μg of total RNA from each CA1 sample, of which 20 μg of cRNA was applied to one chip. The hybridization was run overnight in a rotating oven (Affymetrix) at 45°C. The chips were then washed and stained on a fluidics station (Affymetrix) and scanned at a resolution of 3 μm in a confocal scanner (Agilent Affymetrix GeneArray Scanner).
Microarray data analysis
Microarray suite software (MAS 4.0, Affymetrix) calculated the overall noise of the image (Qraw), which was highly similar across arrays in all three age groups (young: 21.81 ± 1.55; mid-aged: 21.25 ± 2.24; aged: 20.66 ± 2.06; NS). “All probe set scaling” was used to set overall intensities of different arrays to an arbitrary target central intensity of 1500. There was no significant difference in the scaling factor across ages (young: 1.58 ± 0.14; mid-aged: 1.46 ± 0.20; aged: 1.63 ± 0.16, NS).
The algorithms used to determine average difference expression (ADE) scores (expression level) and presence/absence calls are described in the Microarray Suite 4.0 Manual and formed the basis for determining expression (relative abundance) of transcripts and whether a particular transcript was reliably detectable, respectively.
Statistical analysis
The presence/absence calls and ADE scores for all probe sets on all 29 arrays were then copied from the MAS pivot table to an Excel 9.0 (Microsoft, SR-1) workbook. The data transformations, filtering, and most statistical analyses of our gene identification algorithm (see Results) were performed within Excel. Statistical tests were performed using a combination of Excel (Microsoft, v.9, SR-1) and SigmaStat (SPSS, v.2).
Absence calls. Each probe set on each chip had a “present,” a “marginal,” or an “absence” call. We considered a probe set present for the purposes of our analysis if, within at least one age group, it showed a present or marginal call in at least 40% of the chips.
Minimum–maximum. For the purposes of one phase of the filtering procedure (see Results), each probe set was normalized according to the formula: Equation 1where x is ADE score, X̅min is the mean for the age group with the lowest ADE score, and X̅max is the mean for the age group with the largest ADE score. Thus, normalized mean values varied between 0 (lowest) and 1 (highest) for each probe set.
Standardization (Z-score). For the purpose of obtaining means within functional categories and graphing, data were normalized using the Z-score method: Equation 2where X̅ is the mean, and SD(x) is the SD of ADE across all age groups for an individual probe set.
Functional categorization by gene ontology. We used two methods for categorizing identified genes. One method assigned genes to broad functional categories that we defined on the basis of gene functions derived from literature searches (see Results), and one method relied on the category assignments in the gene ontology (GO) system. The Gene Ontology Consortium (www.geneontology.org/doc/GO.doc.html) maintains a controlled vocabulary database of functional descriptions for genes. These are divided into three families: biological process, cellular component, and molecular function. We searched the NetAffx site (www.affymetrix.com/analysis/index.affx) for GO terms associated with all unique known gene name/symbols (3789) for the entire RG-U34A chip and were able to associate 3540 of them with GO terms (GO numbers). We used the total list of GO terms within the biological process and molecular function categories to build custom GO functional trees (outlines) that reflected only the functional attributes (and their parent attributes) for the genes that were rated present in our study. The number of times a GO term (or group of terms) appeared at each sub-level of the GO tree was counted, and hierarchical sums were calculated (of the number of occurrences at or below each sub-branch). A similar procedure was performed separately for statistically identified upregulated and downregulated aging-dependent genes. To assess relative over- or underrepresentation of GO functional terms associated with identified genes, we used the binomial statistic. The ratio of “significantly increased (or decreased)/entire chip” formed the basis of the binomial statistic. For testing the null hypothesis, it was assumed that the ratio of identified aging-dependent genes to all genes at each functional level of the tree was similar to the ratio for the entire chip and that deviations of the binomial statistic (at p ≤ 0.05) from that proportion reflected significant over- or underrepresentation of identified genes at that level of the GO tree (Pletcher et al., 2002).
Results
Behavioral results
During cue training in the SWM, all animals were able to locate the visible escape platform (the criterion used for adequate sensorimotor performance) and were then trained on the hidden platform spatial task. During acquisition, aged animals learned the task more slowly (longer latencies) than mid-aged or young but performed similarly to young and mid-aged animals on the 1 hr probe retention test (data not shown). However, an aging-dependent decrease in 24 hr retention, as measured by platform crossings (one-way ANOVA;p ≤ 0.01), was observed on the 24 hr retention probe trial (Fig. 1). Post hocanalysis indicated that young and mid-aged animals exhibited more platform crossings relative to aged animals on this test but did not differ from each other. In the OMT, aged animals performed as well as young or mid-aged on the 1 hr retention test (data not shown), but there was a significant age-related decline in recall (one-way ANOVA;p ≤ 0.001, for the main effect of age) on the 24 hr test (Fig. 1). At 24 hr, young and mid-aged groups were significantly different from the aged group but not from one another (young vs aged:p < 0.001; mid-aged vs aged: p < 0.05; young vs mid-aged: NS, Tukey's post hoc test).
Gene identification algorithm
To identify aging and cognition-related genes (ACRGs) while managing multiple comparison error, we used a multistep gene identification algorithm comprising a priori filtering, ANOVA, and correlation testing (Fig. 2). The filtering was aimed at reducing the total comparisons by excluding unnecessary or less interesting gene probes. Because the number of expected false positives equals the p value percentage of the total number of comparisons tested (tests of individual genes), reducing total comparisons is useful for managing false positive error. That is, if 10,000 genes are tested at the p < 0.05 level of significance, 500 (5%) can be expected to be positive by chance alone (Miller et al., 2001). Such high false positives can rival or obscure true positives and thereby detract from statistical confidence. Therefore, reducing the number of total comparisons on the basis of clearly specified a priori filters can be statistically advantageous.
Data filtering step
We reduced the total gene probe sets to be tested according to three a priori filters (Fig. 2,1a–c). First, all probe sets rated “absent” (4118) by our criteria (see Materials and Methods) were excluded. (However, it should be emphasized that a substantial number of low-abundance neuronal molecules that are known to be expressed in hippocampus, including many ion channels and membrane receptors, were rated absent in this study.) Second, all present transcript sets representing unannotated “expressed sequence tags” (ESTs) (1213) and control probe sets (60) also were excluded. Although ESTs contain true positives, they add comparisons to be managed without providing further insight into known pathways (ESTs are being analyzed separately in a similar analysis). Third, we reasoned that genes that changed with aging at the mid-aged point and then reversed would be less reliable biomarkers of “progressive aging” than genes that changed by mid-age and stabilized or changed further in the aged group. Therefore, we also excluded genes in which the young and the aged groups were not different by at least 75% of the maximal difference among groups. This criterion excluded an additional 1483 probe sets (Fig. 2). (It should be noted that although such “reversing” genes may not be as useful as monotonically altered genes for biomarkers, they might reflect important initiating mechanisms that are subsequently offset by compensatory changes. Therefore, the set of ANOVA-significant genes excluded from analysis by this filter is included on-line). These filters retained 1985 gene probes. If the original 8799 gene probes had all been tested at, for example, the p≤ 0.025 α level, ∼220 false positives would have been expected (2.5% of 8799). However, by using defined a priori filters to decrease the total genes tested, we reduced the absolute number of expected false positives by >75% (to ∼50). Thus, this approach represents a useful adjunct to rigorous statistical corrections for multiple comparison error (Bonferroni correction) (Benjamini et al., 2001; Miller et al., 2001).
Group statistical testing step (ANOVA)
In this second main step of the algorithm (Fig. 2, 2), each of the retained 1985 gene probes was tested by one-way ANOVA across the three age groups (n = 9–10 per group) for a significant effect of aging using a relatively rigorous pvalue (p ≤ 0.001). To estimate the fraction of observed total positives expected to be false because of multiple comparison error, we used a modified version of the false discovery rate (FDR = expected false positives/observed positives) (Benjamini et al., 2001). At p ≤ 0.001, 2 false positives are expected in 1985 comparisons (i.e., 0.001 × 1985), but 77 total positives were observed. Thus, the FDR was 2 of 77 = 0.026, indicating that only 2.6% of the 77 observed total positives should be positive by chance alone and that any single positive result had a 2.6% chance of being a false positive. This FDR compares very favorably with the p ≤ 0.05 α level conventionally accepted for statistical significance and shows that the sample sizes used provided adequate statistical power and sensitivity to detect many more positives than would be expected by chance alone.
On the basis of this favorable FDR, we operationally defined these 77 ANOVA-positive genes (at p ≤ 0.001) as a subset of genes that changed with aging with high statistical reliability. Nonetheless, there are also important advantages to identifying larger sets of genes using less stringent p values, even at the expense of some statistical confidence. One advantage is fewer false negatives (less type II error). Furthermore, the lower stringency is partly offset by the increased confidence gained via detection of co-regulation among larger numbers of functionally related genes (Mirnics et al., 2000). Finally, a third advantage is that a more comprehensive picture of the associated processes/pathways is generated by larger sets of genes. Consequently, we also assessed the FDR using three less stringent p values: p ≤ 0.05,p ≤ 0.025, and p ≤ 0.01. Atp ≤ 0.05, 348 genes were found positive, but 99 were expected to be false positives, yielding an FDR of 99 of 348 = 0.29. At p ≤ 0.025, 50 false positives are expected in 1985 tests, but 233 total positives were found, yielding an FDR of 50 of 233 = 0.21. At p ≤ 0.01, ∼20 genes should be found positive by chance alone among the 1985 transcripts tested, but 145 positives were observed, yielding an FDR of 20 of 145 = 0.14. To balance the competing advantages of a stringent and a relaxedp value, we selected the genes obtained at thep ≤ 0.025 as the primary set of aging biomarkers for use in the next step of the analysis, the behavioral correlation. Thisp value level provided an intermediate FDR (0.21) and number of genes (233) for functional analysis. (However, all genes that changed at p ≤ 0.05 are posted on the web site.)
Cognitive performance correlation step (Pearson's test)
In the next step (Fig. 2, 3a,b), we used Pearson's test (p ≤ 0.025) to test for correlation of each of the 233 age-dependent biomarker genes with memory performance. Across all 29 animals, the expression level of each gene was tested for correlation with cognitive performance in both the OMT and SWM. However, these correlation tests were not fully independent of the previous ANOVA test that selected the aging-dependent genes to be tested (i.e., genes that changed with aging would be more likely to show a chance correlation with performance, because the latter also changed with aging). Thus, it was not feasible to calculate an independent FDR estimating false positives among the behavioral correlation results. Despite this, however, correlated genes clearly seemed more likely to be relevant to cognitive decline than noncorrelated genes. Additionally, given the decrease in cognition with age, genes downregulated with aging could only correlate positively, and genes upregulated with aging could only correlate negatively, with performance. Consequently, one-tailed tests were used.
Gene expression was also tested for correlation with performance within the aged group alone. This approach can reveal expression fluctuations associated with different degrees of impairment in animals of the same age. (In addition, because this correlation is relatively independent of the aging variable, it allows an FDR to be calculated.) Each aging biomarker gene was tested for correlation with 24 hr memory performance on the OMT in the aged group. The OMT was selected over the SWM for this test because it showed a more appropriate dispersion of performance among aged animals. Correlation tests limited to subjects in the aged group (n = 10) of course had considerably less power than tests across all three groups (n = 29), and therefore the criterion for significance was set atp ≤ 0.05.
Aging- and cognition-related genes
Of the 233 genes found to differ with aging at thep ≤ 0.025 level, 161 (69%) also correlated significantly (at the p ≤ 0.025 level) with either the OMT or SWM across all age groups. These genes were defined as the primary set of ACRGs. Of these 161, 84 (51%) were correlated with both tasks. Of the 161 genes that were correlated significantly with memory performance on at least one task, 64 (∼40%) were downregulated with aging (and positively correlated with performance) and 97 (∼60%) were upregulated with aging (and negatively correlated with performance). Of the core subset of 77 genes identified at the higher confidence level (p ≤ 0.001), 66 (86%) were among the 161 correlated with behavioral performance on at least one task, and of these, 49 (74%) were correlated with both tasks. Eleven of these 77 genes were not correlated with behavior atp < 0.025, but because of their higher reliability as aging markers were included with the 161 ACRGs (for a total of 172 ACRGs) in the functional categorization step (see below). Of these 172, we were unable to functionally categorize 6 transcripts, and 20 other transcripts also were deleted as duplicate gene representations, leaving a total of 146 primary ACRGs. In addition, 8 of 65 (12%) of the downregulated ACRGs but 25 of 81 (31%) of the upregulated ACRGs were correlated with retention performance on the OMT within the aged group only (indicated by asterisks in all tables). The overall FDR in the aged group-only correlation for ACRGs was 0.25. Examples of the correlation patterns with performance across all 29 animals for five ACRGs highly correlated in each direction are shown in Figure3.
Functional categorization
We assigned the 146 primary ACRGs to functional categories by two approaches. In the first approach, we used extensive literature searches in PubMed and other databases, including SwissProt, Trembl, NetAffx, GeneCards, Pfam, and InterPro to characterize each ACRG and assign it a broad functional category. Many of the same categories appeared to fit multiple ACRGs. In the second approach, we relied on the GO database. Although the GO system provides information that can be used to quantify over- or underrepresentation of identified genes relative to total microarray genes within a functional category (see Materials and Methods), it does not provide a functional designation for all genes. Therefore, our functional interpretations relied primarily on the first approach. Using the first approach, we identified 7 downregulated and 10 upregulated functional categories of ACRGs (Table 1). Some categories are broader than others, and some headings are modified by parenthetical characteristics that apply to a majority of ACRGs in that category. Of course, many of the genes fit within multiple categories.
Four ACRG examples for each category identified by the first approach are shown in Table 1. The examples were selected in each category first from ACRGs correlated with both tasks and then by lowest ANOVA p value. Selected levels within the GO biological process and GO molecular function categories (to which ACRGs were assigned through the GO system) are shown in Table2. The full set of 146 primary ACRGs identified here, and their assigned categories through the first classification approach, are shown in on-line Tables 3 and 4 (those correlated with both behavioral tasks are listed at the top of each category and ranked by ANOVA p value for aging changes). The set of all potential aging biomarker genes (those that changed with aging by ANOVA at p ≤ 0.05 but did not meet ACRG criteria) is posted in on-line Table 5 available atwww.jneurosci.org).
Categories of downregulated genes
Multiple genes related to energy metabolism, particularly to mitochondrial function and the electron transport chain (Rieske's iron-sulfur protein, NADH dehydrogenase) were downregulated with aging (Table 1A, on-line Table 3). Similarly, many more energy metabolic genes, including those involved in catabolism of glucogenic amino acids, were downregulated with aging (on-line Table 5A).
One of the most intriguing downregulated categories comprised ACRGs related to activity-dependent synaptic/neurite plasticity (agrin, Gap-43, Narp, Arc, Vgf) (Table 1A, on-line Table 3), many of which have been linked previously to synaptogenesis, neurite remodeling, plasticity, or memory (Biewenga et al., 1996; Steward et al., 1998; Guzowski et al., 2000; Bezakova et al., 2001; French et al., 2001). However, Gap-43 appears to be one of the few, or possibly the only, of these yet reported to change with aging and AD (Coleman et al., 1991). Importantly, several of these genes (agrin, Narp) are also significant components of the extracellular matrix (ECM). Because other major ECM elements including collagen IA, were also downregulated (Table 1A), the results suggest genomically mediated erosion or reorganization of the ECM.
Many other neural activity-dependent genes, including immediate early response genes (IEGs) in the transcription (Egr1, NGFI-C) and signaling (MAPKK6) categories were downregulated ACRGs (Table 1A, on-line Table 3). In addition, multiple ACRGs important for biosynthesis (nucleoporin, histone H2AZ) and protein trafficking (chaperones: Hsp60, DnaJ-like homolog) were also downregulated with aging as were specific neuronal marker and signaling genes (receptors, neuropeptide Y) (Table1A, on-line Table 3). Decreased chaperone capacity could have substantial implications for protein aggregation and vulnerability to AD.
Categories of upregulated genes
Not unexpectedly, many upregulated ACRGs were associated with glial functions, inflammation, immunity, and oxidative stress (Table1B, on-line Table 4). However, several unanticipated findings included extensive upregulation of genes encoding proteins for myelin synthesis and cholesterol and lipid metabolism (Table 1B, on-line Table 4). Several genes important for lipid β-oxidation and free fatty acid (FFA) catabolism (carnitine palmitoyltransferase, acyl-CoA oxidase) (on-line Tables 4, 5B), the primary pathway for FFA catabolism, were upregulated. The increase in myelin synthesis programs could entail elevated lipid turnover. Recent studies show that stimulation of myelin synthesis programs in oligodendrocytes is associated with induction of genes for both myelin proteins and lipogenic pathways (Nagarajan et al., 2001).
Consistent with the upregulation of myelin-related proteins, moreover, was the increased expression of genes related to protein/vesicle trafficking, including SNARE (N-ethylmaleimide-sensitive factor attachment protein receptor) proteins (Table 1B, on-line Table 4). Although several of these molecules are associated with neuronal vesicle transport and fusion, they are also known to play a major role in transport of myelin vesicles in oligodendrocytes (Madison et al., 1999). In addition, myelin is normally degraded to FFAs through the endosomal–lysosomal pathway and cathepsin S, which also was upregulated (Table 1B, Protein Processing), is particularly important in the processing of antigenic myelin fragments (Nixon et al., 2000; Dickinson, 2002). Notably, lysosomal alterations appear to be important in aging and AD (Bi et al., 2000; Nixon et al., 2000).
Expression was also upregulated for multiple genes encoding enzymes related to metabolism of the ketogenic/glucogenic amino acids, tyrosine, phenylalanine, and tryptophan (DHPR, KAT, FAH) (Table 1B, Amino Acid), which can be used for either energy metabolism or lipogenesis. Moreover, upregulation of DHPR, which catalyzes the formation of a critical cofactor (tetrahydrobiopterin) for tyrosine and monoamine synthesis, together with concomitant upregulation of MAO-B (on-line Table 5), suggests greater monoamine turnover. Another unexpected observation was a consistent upregulation of genes involved in iron utilization or storage [globins, transferrin, Nramp2 (Table1B); ferritin (on-line Table 5B]. This may be related to increased oxygen utilization, oxidative stress, or inflammatory responses in activated glia, because iron accumulation is associated with multiple age-related neurodegenerative conditions.
As noted, there also was massive upregulation of genes encoding major histocompatibility complex (MHC) class I antigen presenting molecules, as well as numerous other inflammatory/immune/oxidative stress molecules [GST (Table 1B, on-line Tables 4, 5B)]. ACRGs in the inflammation category exhibited some of the most robust monotonic changes with aging seen in this study (most were significant at or below the p ≤ 0.001 criterion) (on-line Table 4). In addition, the DHPR product, tetrahydrobiopterin, is also an essential cofactor for nitric oxide synthase (Boyhan et al., 1997); therefore, oxyradicals formed from increased nitric oxide could play a major role in inflammatory neuronal damage (Calingasan and Gibson, 2000; Bal-Price and Brown, 2001).
Astrocyte reactivity and other glial markers are well recognized to increase in the aged rodent and human hippocampus (Landfield et al., 1992; Finch and Tanzi, 1997), and the present data confirm upregulation of several glial marker genes [vimentin, GFAP (Table 1B, on-line Table 4)]. Furthermore, genes for extracellular components of astroglial scars [proteoglycans, fibronectin (Table 1B, on-line Tables 4, 5B)] also were upregulated.
Several signal transduction genes related to calcium-regulating or G-protein-coupled pathways also were upregulated ACRGs (Table 1B, on-line Table 4). In particular, S100A1 modulates Ca2+-induced Ca2+ release (Treves et al., 1997; Fulle et al., 2000), and phosphatidylinositol 4-kinase (PI 4-kinase) catalyzes a key step in IP3 production. Phospholemman (Table 1B), which regulates ion exchange, inhibits the Na+/Ca2+exchanger (Zhang et al., 2003), and the mitochondrial voltage-dependent anion channel (VDAC), which plays a central role in apoptosis, also appears to mediate Ca2+ flux into mitochondria (Gincel et al., 2001). Genes for other Ca2+-binding proteins [S100A4 (on-line Tables 4, 5B)] and annexin, a Ca2+-dependent, membrane-binding molecule (Table 1B), also were upregulated ACRGs.
In addition, there was wide upregulation of ACRGs related to growth and protein synthesis (Table 1B, on-line Table 4). This upregulation may be linked to the apparently major activation of MHC, proteoglycan, and myelin synthesis in glial compartments. The algorithm also identified a number of upregulated transcription factors, including KZF-1, Roaz, and members of the NFI family (Table 1B, Transcriptional Regulation), which can function as broadly acting negative transcriptional regulators.
Gene ontology: downregulated biological process ACRGs (Table 2A)
In the GO analysis, there was a significant overrepresentation of G-protein coupled processes among downregulated ACRGs, indicating that a greater number of genes in this category decreased with aging than would be expected relative to the experiment-wide proportion of genes that decreased with age. This agrees with previous work describing decreased G-protein activity with age in vivo (Roth et al., 1995) or in vitro (Blalock et al., 1999). An overrepresentation of downregulated ACRGs was also found in categories related to developmental process, second messenger signaling, and electron transport. Conversely, downregulated ACRGs in the categories of lipid metabolism (particularly lipid biosynthesis) and protein metabolism were significantly underrepresented. These results are consistent with the large numbers of downregulated ACRGs in the energy, signaling, and synaptic plasticity categories in the first categorization approach and the upregulation of lipid and protein processing ACRGs (Table 1B).
Gene ontology: upregulated biological process ACRGs (Table 2A)
For genes upregulated with age, there was a significant overrepresentation of ACRGs in stress response and inflammatory categories. Conversely, upregulated ACRGs in several other categories were significantly underrepresented, including carbohydrate and phosphate metabolism. These results also are generally consistent with the upregulation of inflammatory/immune ACRGs and the downregulation of energy ACRGs identified through the first categorization (Table 1).
Gene ontology: molecular function categories (Table 2B)
Downregulated ACRGs in categories of lipases, carbohydrate kinases, monooxygenases and oxidoreductases, and transcription factors were overrepresented. Upregulated ACRGs related to defense/immunity, vesicle transport, cytokine function, growth factor secretion, serine protease activity, and heavy metal binding also were significantly overrepresented.
Relationship to fold change
The large majority of microarray analyses to date have used fold-change criteria to identify changes in expression. However, apart from the statistical issues raised by these approaches (Miller et al., 2001), the 1.7- to twofold change detection criteria commonly used are relatively insensitive, particularly for identifying the moderate changes that might be expected in normal biological processes such as aging. On the basis of mean fold-changes between the young and aged groups, <15% of our results would have been detected by the twofold-change criterion used in many microarray studies (on-line Tables 3, 4). Furthermore, the rank order of group mean fold-change correlated only very modestly (r2 = ∼0.20) with that ofp values on the ANOVA for all ACRGs.
Age course of gene expression changes
Importantly, using an experimental design with three age groups elucidated the general chronological patterns of age-dependent change (Fig. 4). ACRGs could be classified by whether 75% of the maximal change occurred between the young and mid-aged groups (Yng to Mid), the mid-aged and aged groups (Mid to Age), or the young and aged groups (Monotonic).
Many downregulated ACRGs, in particular, exhibited their greatest mean change in expression between the young and mid-aged points. Less than 50% showed a monotonic pattern (Fig. 4A, pie chart inset). In contrast, >50% of upregulated ACRGs showed a monotonic age course, with change beginning between the young and mid-aged points and continuing to increase between the mid-aged and aged points (Fig.4B, pie chart inset). Interestingly, only a few scattered upregulated or downregulated genes showed a predominantly mid to aged change pattern (Fig. 4A,B, pie charts). Thus, nearly all genomic alterations began before midlife, and many, particularly among upregulated ACRGs, continued to advance between midlife and late life.
Strongest correlations with memory performance
To determine which processes were most closely correlated with memory performance, we calculated the percentage of genes in each of our functional categories that were correlated significantly (atp ≤ 0.025) with both memory tests. We reasoned that because each test is subject to its own error and contributions from noncognitive performance factors, genes that correlated with both tasks were more likely to be associated consistently with cognitive processes.
More upregulated (42) than downregulated (29) ACRGs were correlated with both behavioral tasks. Among downregulated functional categories, those with the highest percentages of ACRGs correlated with both tasks were the energy (9 of 13, 69%), protein trafficking (3 of 4, 75%), and biosynthesis (5 of 10, 50%) categories (on-line Table 3). Among upregulated categories, those with the highest percentages of ACRGs correlated with both tasks were the inflammation/immunity/oxidative (13 of 17, 76%), signal transduction (Ca2+-related) (6 of 8, 75%), and myelin-related (3 of 5, 60%) categories (on-line Table 4). However, subsets within larger categories in some cases showed a notably higher percentage than the overall category [(on-line Table 3) ECM subset of the ECM/structural category, 3 of 5 or 60%]. The criterion of highest percentage ACRGs correlated with retention within the aged group alone was used to identify two additional categories, the downregulated synaptic plasticity (activity regulated) category (43%) (on-line Table 3) and the upregulated protein processing/vesicle trafficking category (44%) (on-line Table 4).
Discussion
Novel processes associated with functional brain aging
The present studies used well powered statistical analyses and a strategy of correlating gene expression profiles with behavioral endpoints to identify a wide range of aging- and cognition-related genes. Some of the identified ACRGs were associated with processes that have been implicated previously in brain aging or AD, including inflammation, oxidative stress, glial activation, mitochondrial/metabolic dysfunction, protein processing, and some growth factors. However, many other ACRGs identified here have not been previously linked to normal brain aging or cognitive decline and therefore appear to be novel biomarkers of functional brain aging. The processes associated with these novel ACRGs included downregulation of synaptic structural plasticity, extracellular matrix formation/turnover, activity-regulated signaling, and transcription, biosynthesis, and protein chaperone functions (Table 1A, Fig.4A), and concomitant upregulation of myelin turnover, cholesterol synthesis/transport, lipid metabolism, vesicle trafficking, cytoskeletal reorganization, iron utilization, tyrosine/tryptophan/monoamine metabolism, protein synthesis, transcriptional regulation (negative), and signal transduction (especially involving Ca2+ regulation, e.g., phospholemman, PI 4-kinase, S100A1, annexin A3). Although a few scattered molecules within several of these novel categories of ACRGs have been found previously to change with brain aging (e.g., c-Fos, Gap-43, KAT, some chaperones), the present findings of co-regulation among numerous related genes provide essentially the first evidence implicating many of these larger functional processes in normal brain aging and cognitive decline.
Implications for theories of brain aging
Brain expression of a number of genes is known to change with aging (Finch and Tanzi, 1997; Lee et al., 2000). However, the present data indicate that many more genes and gene programs than previously thought may be altered. Additionally, the present studies indicate that reasonably high statistical confidence can be associated with this evidence of widespread alterations in expression, particularly for co-regulated ACRGs. Importantly, nearly all of the expression alterations began before midlife (Fig. 4) and therefore appear to be regulated rather than nonspecific responses to generalized senescent deterioration of the brain. Furthermore, few if any of these pre-midlife changes seem to reflect early developmental changes, given that most continued to advance after midlife. ACRGs also were identified by correlation with cognitive decline that did not develop until late life. Finally, the young group, at 4 months of age, was fully mature.
Genomic regulation by itself does not constitute sufficient evidence that brain aging is “programmed” or evolutionarily selected. Instead, altered regulation could be a response to subtle early random damage (e.g., molecular errors, oxidative stress) or reflect pleiotropic genomic effects that are adaptively controlled until adulthood (Austad, 1999). Nonetheless, the wide extent of genomic orchestration of brain aging seen in early adulthood here will have to be addressed by aging theories.
Functional implications
By the criteria for strongest correlation (highest percentage ACRGs correlated with both tasks or with performance in the aged group), the categories of energy metabolism, protein trafficking, biosynthesis, a subset of ECM, and synaptic plasticity (downregulated) (Table 1A, on-line Table 3), and of inflammation, signal transduction (Ca2+-related), myelin, and protein/vesicle trafficking (upregulated) (Table 1B, on-line Table 4) were most closely correlated with cognitive impairment. Taken together, these functionally correlated categories appear to reflect patterns of metabolic and biosynthetic involution and reduced synaptogenesis in neurons, in parallel with elevated myelin turnover, phagocytosis, and inflammation in glia. Although correlation alone does not demonstrate causation, it fulfills a key prediction of a causal relationship (i.e., that two causally linked variables will covary) and consequently can substantially focus the search for causal factors. Thus the functionally correlated age-dependent processes identified here likely represent particularly good candidates for factors that impact memory decline. At the least, they clearly represent potentially valuable early biomarkers of functional brain aging. Because the genomic alterations apparently precede measurable cognitive impairment (compare Figs. 1, 4), however, any deleterious cognitive impact of these expression changes presumably depends on cumulative actions between mid and late life.
In addition, several of the observed expression changes may have implications for functions other than cognition. For example, reductions in mitochondrial/biosynthetic functions and protein folding/chaperoning (Table 1A, on-line Table 3) or increases in oxidative stress, inflammation, and Ca2+signaling (Table 1B, on-line Table 4) could be associated with increased vulnerability to apoptosis. In particular, the increased expression of the mitochondrial VDAC (Table 4) seems to have major implications for release of apoptogenic proteins.
Behavioral arousal and activity-regulated genes
Many of the downregulated ACRGs are IEGs that are highly responsive to neural activity, stress, and arousal and are correlated with synaptic plasticity (Gall et al., 1990; Steward et al., 1998;Guzowski et al., 2000; French et al., 2001). Aspects of memory consolidation are also arousal-dependent (McGaugh, 2000). Thus, because the training procedures are highly arousing, expression of some activity-regulated ACRGs may well have remained elevated 24 hr after the last retention test, when brain samples were collected. This suggests that age differences and correlation with memory for these ACRGs may be detectable only under arousing but not baseline conditions.
Myelin turnover as an inflammatory trigger
Clearly, a critical unresolved question is what triggers the widespread neuroinflammatory response seen in brain aging and AD. Although oxidative stress is one important possibility (Gemma et al., 2002), a new candidate mechanism, demyelination, is suggested by the findings here that genes for myelin and cholesterol synthesis were upregulated in normal brain aging (Table 1B, on-line Table 4). That is, because upregulation of myelin synthesis programs in adult animals is often stimulated by demyelination (Kristensson et al., 1986), the myelin program activation seen here might well be a compensatory response to an underlying demyelinating process. Myelin fragments are extremely potent antigens for triggering autoimmunity and thus could account for the major increases observed in MHC antigen-presenting molecules, cytokines, and other inflammation/phagocytic-related factors (Table 1B, on-line Tables 4, 5B). Furthermore, because elevation of some proinflammatory cytokines (e.g., IL-18) (on-line Table 4) can induce hypomyelination (Corbin et al., 1996), inflammation might well exacerbate the initial demyelinating process, thereby creating an accelerating positive feedback loop between demyelination and inflammation. A similar low-grade demyelination (with or without compensatory activation of myelin programs) could also account for the hypomyelination that accompanies human brain aging (Golomb et al., 1995). Additionally, the increased cholesterol synthesis/transport accompanying myelin synthesis might contribute separately to functional decline, because cholesterol metabolism has recently been implicated in AD (Puglielli et al., 2001; Petanceska et al., 2002). Thus we suggest that a chronic demyelinating process and activation of myelin and cholesterol synthesis may act as triggers for inflammation in the aged brain.
Neuronal triggers of demyelination
This suggestion, in turn, raises the question of what might initiate the demyelinating process. Interestingly, both reduced mitochondrial function (Kalman et al., 1997) and reduced neuronal activity (Demerens et al., 1996) are sufficient to induce demyelination. Widespread evidence was found here of decreased mitochondrial function (Table 1A, on-line Table 3). In addition, genes for GluR5–2 and KAT, which both favor synaptic inhibition (Vignes et al., 1998; Moroni, 1999), were upregulated ACRGs (Table 4), and the product of KAT, kynurenic acid, increases with aging (Finn et al., 1991; Moroni, 1999). Electrophysiological data also support the conclusion that glutamatergic and adrenergic transmission are decreased with aging (Barnes, 1994; Bickford et al., 2000). Conceivably, either reduced energy metabolism or neuronal activity could trigger demyelination, perhaps by impairing synaptogenesis or axonal maintenance, but other pathways are obviously possible.
Ca2+ regulation
Elevated intracellular Ca2+concentrations can also reduce neuronal activity by activating inhibitory Ca2+-dependent conductances, and several Ca2+-regulating genes were upregulated ACRGs, including phospholemman (Zhang et al., 2003), PI 4-kinase, and S100A1, which interacts with the ryanodine receptor (RyR) to increase Ca2+-induced Ca2+ release (Treves et al., 1997; Fulle et al., 2000). Intracellular Ca2+ release also has been found to be enhanced in some AD models (Ito et al., 1994;Mattson et al., 2000). In addition, although many Ca2+ channel subunits were rated “not present” (see Results, Data filtering), L-type Ca2+ channel availability appears increased with aging (Thibault and Landfield, 1996), and the L-type Ca2+ channel is closely coupled to the RyR (Chavis et al., 1996). Together, these changes could amplify Ca2+ influx and release in aged hippocampal neurons, thereby dampening neuronal responsiveness and synaptic plasticity (Disterhoft et al., 1993; Foster and Norris, 1997;Thibault et al., 2001).
New model of functional brain aging
A more comprehensive and complex picture of functional brain aging emerges from these microarray analyses. On the basis of this overview, we suggest a new integrative model of aging-related cognitive decline (Fig. 5). In this model, alterations in neuronal activity or metabolism inhibit neurite growth that, in turn, triggers a demyelination process and an inflammatory cascade. Together, these neuronal and glial processes eventually impair memory. The sequential cascades hypothesized in Figure 5, of course, represent only one of numerous possible models. Nonetheless, because these microarray analyses identified many novel potential cause and effect interactions in aging, tests of the proposed mechanisms, whether they reject or support the hypotheses, should greatly clarify basic processes of brain aging and identify potential new targets for therapeutic interventions.
Footnotes
↵* E.M.B. and K.-C.C. contributed equally to this work.
This work was supported in part by National Institute on Aging Grants AG04542, AG10836, AG18228, and AG14979 and by National Institute of Mental Health Grant MH59891. We thank Kelley Secrest for excellent contributions to this manuscript and Arnold Stromberg and Xuejun Peng for valuable statistical discussion.
Correspondence should be addressed to Dr. Philip W. Landfield, University of Kentucky, Department of Molecular and Biomedical Pharmacology, MS-305, UKMC, Lexington, KY 40536-0298. E-mail:pwland{at}uky.edu.