Abstract
Electrophysiological and hemodynamic measures of human brain activity have been shown to distinguish between episodes of encoding items that are later recalled versus those that are not recalled (Paller and Wagner, 2002). Using intracranial recordings from 793 widespread cortical and subcortical sites in 10 epileptic patients undergoing invasive monitoring, we compared oscillatory power at frequencies ranging from 2 to 64 Hz as participants studied lists of common nouns. Significant increases in oscillatory power during encoding predicted subsequent recall, with this effect predominantly in the 4-8 Hz (theta) and 28-64 Hz (gamma) frequency bands. Sites exhibiting increased theta activity during successful encoding were clustered in right temporal and frontal cortex, whereas those exhibiting increased gamma activity appeared bilaterally at widespread cortical locations. These findings implicate theta and gamma oscillatory activity, across a widespread network of cortical regions, in the formation of new episodic memories.
Introduction
Physiological activity during encoding must participate in transforming experience into a lasting trace that can later be recalled and recognized. Subsequent memory paradigms, in which online measures of neural activity as people encode items are segregated by the subsequent ability to recall or recognize those items, have helped to characterize the activity of different brain regions during memory formation (Paller and Wagner, 2002).
In subjects faced with remembering a list of items, both evoked potentials and hemodynamic activity measured during encoding exhibit reliable differences between subsequently recalled and not recalled items (Rugg and Allan, 2000; Paller and Wagner, 2002). This effect has been termed the subsequent memory effect (SME). Recording intracranially in the medial temporal lobe (MTL), Fernandez et al. (1999) have shown that the differences that predict recall appear both as a negative deflection in the rhinal cortex event-related potential (ERP) ∼400 msec after stimulus exposure and as a positive hippocampal ERP beginning ∼800 msec after stimulus onset. This finding provides important constraints on the MTL generator of the SME observed at the parietal scalp (Rugg and Allan, 2000) and perhaps also the MTL SME observed using functional magnetic resonance imaging (fMRI) (Brewer et al., 1998; Wagner et al., 1998; Kirchhoff et al., 2000; Strange et al., 2002).
Recently, researchers have begun to examine the role of oscillations, inherent in both scalp and intracranial EEG (iEEG) potentials, in memory function. In iEEG recordings from MTL, Fell et al. (2001) observed that increased gamma synchrony between the rhinal cortex and the hippocampus, but decreased gamma power, predicted subsequent recall of words. The role of oscillations in cognitive processing is not confined to the gamma frequency band. In particular, oscillations in the 4-8 Hz theta band have been found to increase with cognitive demands during spatial learning (Kahana et al., 1999; Caplan et al., 2001), spatial navigation (de Araujo et al., 2002; Caplan et al., 2003), verbal and spatial working memory (Gevins et al., 1997; Krause et al., 2000; Tesche and Karhu, 2000; Raghavachari et al., 2001; Rizzuto et al., 2003), and item recognition (Klimesch et al., 1997; Klimesch, 1999).
Oscillations at lower frequencies, including theta, have also been linked to episodic recall. Klimesch et al. (1996) found significant increases in scalp-recorded theta power during episodic encoding of words that were subsequently recalled. Similarly, Weiss and Rappelsberger (2000) found increased oscillatory coherence between frontal and posterior scalp electrode sites, as well as significant increases in 1-4 Hz power, for both visually and auditorally presented, subsequently recalled nouns. Fell et al. (2003) also found increased theta coherence between intracranially recorded electrodes during encoding of successfully recalled words but did not observe any significant change in theta power.
Extensive animal work also points to the functional importance of theta oscillations. Long-term potentiation depends on the phase of hippocampal theta activity, with enhanced synaptic plasticity at the peak of the theta oscillation and decreased plasticity at the trough (Pavlides et al., 1988; Huerta and Lisman, 1993; Hölscher et al., 1997). During a working memory task, hippocampal theta appears phase locked to stimuli (Givens, 1996), providing a possible mechanism for sensory encoding (Hasselmo et al., 2002). Furthermore, blocking theta oscillations by lesioning the medial septum causes severe memory impairments (Givens and Olton, 1990; Leutgeb and Mizumori, 1999).
These findings concerning the general involvement of theta and gamma oscillations in both animal and human memory function lead us to ask whether cortical brain oscillations play a direct role in the successful encoding of information into long-term memory. Specifically, we investigated whether changes in oscillatory activity during encoding predict successful recall. We examined oscillatory power across a wide range of frequencies as participants with implanted electrodes performed an episodic memory task. We asked subjects to study a list of individually presented words, and then, after a filled delay period, subjects attempted to recall as many words as possible, in any order. This type of free-recall task has been shown to be especially well suited for studying the SME (Rugg and Allan, 2000).
Materials and Methods
Subjects. We tested 10 subjects (ages 8-17 years; four female) who had been surgically implanted with subdural electrodes on the cortical surface. The clinical team determined the placement of these electrodes so as to best localize epileptogenic regions. We recorded from a total of 961 electrodes; 168 of these were (1) in the epileptic focus, (2) overlying regions of radiographically evident structural brain damage, or (3) exhibited epileptiform EEG (i.e., spikes and/or sharp waves as determined by the clinical team). We restricted our analyses to the remaining 793 electrodes. Table 1 reports each participant's age, gender, region of resection, total number of electrodes, number of excluded electrodes, and average recall performance. All participants were tested for their reading ability, and none had difficulty reading the nouns used as stimuli in our study. The variability in their recall performance was reasonable considering their varied ages and the fact that they were tested at their hospital bedside. All subjects had normal range personality and intelligence. Our research protocol was approved by the institutional review board at Children's Hospital Boston, and informed consent was obtained from the subjects and their guardians.
Procedure. Subjects studied lists of words for a delayed free-recall task. Lists were composed of 20 common nouns, chosen at random and without replacement from a pool of high-frequency nouns (mean frequency of 49.0; http://fechner.ccs.brandeis.edu/wordpools.php). Over the course of one to four sessions, subjects received 12-60 study-test lists (the number of trials obtained depended on the patient's interest and availability for testing). A computer controlled stimulus presentation and recorded subjects' responses. At the start of each trial, a plus sign appeared at the center of the screen to alert subjects to the upcoming word presentation and to encourage them to fixate on the center of the screen. The plus sign appeared for 1600 msec, followed by a 800-1200 msec blank interstimulus interval (ISI). The computer then displayed each list item in capital letters for 1.6 sec, followed by an 800-1200 msec blank ISI. This temporal jitter served to decorrelate the physiological responses from successive word presentations. To ensure that each word was attended to, we asked that subjects read each word aloud as soon as it appeared. All subjects readily complied with these instructions.
Immediately after each list presentation, subjects were given a series of simple arithmetic problems. This end-of-list distractor task served to reduce the large advantage accorded to end-of-list items during recall (Howard and Kahana, 1999). Each problem took the form of A + B + C = ??, where A, B, and C were randomly chosen positive integers from the set one through nine. Subjects were asked to respond vocally as soon as they knew the answer. The experimenter typed their answer into the keyboard. Each entry evoked one of two tones from the computer speaker, depending on whether it was correct or incorrect. After subjects solved arithmetic problems for 16 sec, the subject's last arithmetic problem was immediately followed by a row of asterisks accompanied by a 300 msec tone signaling the start of the recall period. Subjects were given 46 sec to recall list items in any order (standard free-recall instructions). After each session, vocal responses, digitally recorded during the trial, were scored for analysis.
iEEG recordings. The iEEG signal was recorded from platinum electrodes (3 mm diameter) with an inter-electrode spacing of 1 cm. The signal was amplified, sampled at 256 Hz, and bandpass filtered between 0.3 and 70 Hz (apparatus from Bio-Logic Systems, Mundelein, IL). Data were subsequently notch filtered using Sleppian sequences at 60 and 44 Hz to eliminate electrical line and equipment noise.
To synchronize the electrophysiological recordings with behavioral events, the experimental computer sent pulses through the parallel port via an optical isolator into an unused recording channel. The time stamps associated with these pulses enabled the alignment of the experimental clock of the computer with the iEEG clock to a precision well under the sampling interval of the iEEG recording (<4 msec). For all subjects, the locations of the electrodes were determined using coregistered postoperative computed tomographies and preoperative MRIs by an indirect stereotactic technique and converted into Talairach coordinates.
Oscillatory power analyses. As in previous iEEG work (Kahana et al., 1999; Caplan et al., 2001, 2003; Howard et al., 2003; Rizzuto et al., 2003), we used the Morlet wavelet transform (with wave number = 6) to compute the spectral power as a function of time for all of our iEEG signals. Frequencies were sampled logarithmically at 41 intervals between 2 and 64 Hz. Because the wavelet scales with frequency, it allows for comparisons between time-varying high-frequency and low-frequency signals on an equal footing. This is especially important for EEG signals, which are generally nonstationary and have time-varying phase components (for additional details, see Caplan et al., 2001). Additionally, wavelets have Gaussian impulse functions that provide a true time-frequency distribution but avoid edge effects and issues of windowing that are inherent in fast Fourier transform methods.
A Wilcoxon rank sum test was then used to compare items that were later recalled with items that were not recalled using the mean of wavelet power during the encoding period (2 sec interval after presentation onset). This comparison was made separately for each electrode and at each frequency.
We generated an unbiased empirical estimate of the type I error rate using a bootstrap procedure (Efron, 1979), a method often used to handle correlations among statistical comparisons of electrophysiological measures (Lachaux et al., 1999; De Gennaro et al., 2000; Graimann et al., 2002; Howard et al., 2003; Rizzuto et al., 2003). The bootstrap procedure first entails generating 1000 random samples of the experimental data by randomly swapping items designated as recalled and not recalled for each subject. We used the same randomization across electrodes to control for between-electrode correlations. Next we performed the Wilcoxon rank sum test on the 1000 random shuffles of data, generating a distribution that allowed us to determine the Wilcoxon rank sum statistic, Ws, threshold that would set the type I error rate at 0.5% of a subject's electrodes per frequency. Although the same basic pattern of results can be seen for a range of type I error rates, using a more liberal threshold (e.g., 1.0% of a subject's electrodes) would make it difficult to interpret the brain maps (too many false positives). Similarly, using a more conservative threshold (e.g., 0.1% of a subject's electrodes) would have only picked out the most significant sites, making it impossible to compute aggregate time courses for the effects.
If Ws exceeded the empirically determined threshold, that particular frequency and electrode was deemed to exhibit a significant oscillatory SME. A positive SME at a given frequency for a given electrode indicates significantly greater power during the encoding of words that are later recalled; a negative SME indicates significantly greater power during encoding of words that are later not recalled.
To illustrate the time course of the oscillatory SMEs, we calculated time-frequency significance spectrograms of the difference between recalled and not recalled power. We first averaged power across significant electrodes within each subject. We then calculated a p value for the difference between recalled and not recalled power across subjects for each frequency from 2 to 64 Hz and each point in time from 0 to 2 sec after the onset of each item presentation.
Results
Out of 793 recording sites, 140 distinct sites exhibited positive oscillatory SMEs. This number of sites far exceeded the type I rate (p < 0.0001). Figure 1, a and b, illustrates two representative sites that exhibited positive oscillatory SMEs. Figure 1a, which illustrates data recorded from an electrode in the inferior frontal-temporal region of subject 3, had a peak frequency in the 4-8 Hz theta frequency band. Figure 1b plots data recorded from an electrode in the left temporal-occipital region of subject 4 with a peak frequency in the 28-64 Hz gamma band.
Figure 1c, which plots the number of sites exhibiting positive oscillatory SMEs at each frequency, shows that the dominant frequencies for this effect are in the theta and gamma frequency bands. Figure 1d illustrates the locations of the 51 distinct electrodes exhibiting a positive oscillatory SME in the theta band; Figure 1e illustrates the locations of the 45 distinct electrodes exhibiting a positive oscillatory SME in the gamma band. Electrodes exhibiting increases in theta oscillations that predicted successful recall clustered in right temporal and frontal areas (34 of 51 electrodes) compared with the left temporal and frontal areas (3 of 51 electrodes). This lateralization effect was statistically significant (p < 0.0001). Sites at which increased gamma oscillations positively correlated with subsequent recall were equally distributed across left and right brain regions (22 in the right temporal and frontal areas; 18 in the left temporal and frontal areas; NS).
We also found electrodes exhibiting significant negative SMEs, with decreased power during the encoding of words that were later recalled. Of 793 recording sites, 131 distinct sites exhibited negative oscillatory SMEs (this number exceeded the type I rate; p < 0.0001). Figure 2a, which illustrates data recorded from an electrode in the left inferior-temporal region of subject 8, had a peak frequency in the 9-28 Hz alpha/beta frequency band.
Figure 2b, which plots the number of sites exhibiting negative oscillatory SMEs at each frequency, shows that the dominant frequencies for this effect are in the alpha (9-12 Hz) and beta (13-28 Hz) frequency bands. Figure 2c illustrates the locations of the 79 distinct electrodes exhibiting a negative oscillatory SME in the 9-28 Hz band. Electrodes exhibiting decreases in alpha/beta oscillations that predicted successful recall clustered in both the right and left temporal and frontal areas but were equally distributed across the left and right hemispheres (36 right frontal and temporal; 38 left frontal and temporal; NS).
The foregoing analyses determined oscillatory SMEs based on the 2 sec period after each stimulus presentation. We next examined the time course of these effects by computing the average time-frequency spectrogram for the electrodes exhibiting positive theta, positive gamma, and negative alpha/beta oscillatory SMEs. As shown in Figure 3a, the positive theta SME was highly significant (p < 10-8) in the 600-1300 msec interval after stimulus onset. For the gamma SME sites (Fig. 3b), the average time-frequency spectrogram exhibits a similar time course, with significant differences between recalled and not recalled items beginning at 700 msec after word onset. This analysis indicates that the positive oscillatory SMEs occur relatively late in the encoding of a to-be-remembered word, suggesting a role for theta and gamma oscillations in associative or rehearsal processes.
The negative SME sites in the alpha/beta frequency range (Fig. 3c) only met our significance threshold during a short interval ∼1500 msec after stimulus onset. Some of these negative SME sites simultaneously contributed to the positive SME at lower frequencies. Because these sites were distributed widely throughout cortex and across frequency bands, it is likely that their timing was not as consistent as that of the positive SME sites. This might have contributed to their weak showing in the time-frequency spectrogram.
Discussion
This study explored the role of cortical brain oscillations in memory formation. Specifically, we tested whether changes in oscillatory activity during the encoding of an item can predict its subsequent recall. In recording from 793 sites across 10 subjects, we found that theta (4-8 Hz) activity during encoding of successfully recalled words significantly increased at 51 sites, predominantly in right temporal and frontal cortex. Increases in gamma (28-64 Hz) activity significantly predicted subsequent recall at 45 widely distributed cortical sites. Some sites also showed decreases in oscillations during encoding of successfully recalled items, an effect that was broadband and did not localize to any particular brain structure.
Our finding that theta oscillations in human cortex increase during successful encoding is reminiscent of studies showing that, in rats, hippocampal theta oscillations gate synaptic plasticity both in vivo and in vitro (Pavlides et al., 1988; Huerta and Lisman, 1993; Hölscher et al., 1997). The increased theta activation localizing to the temporal and frontal lobes in humans may signify the recruitment of those regions during successful memory encoding.
Whereas we report that increases in cortical theta and gamma power predict successful memory encoding, Fell et al. (2001, 2003) found no changes in hippocampal theta power and decreases in hippocampal gamma power during encoding of subsequently recalled words. Whereas Fell et al. recorded from depth electrodes in hippocampal and parahippocampal regions, we recorded from electrode grids placed subdurally on the cortical surface. It is possible that theta oscillations recorded in small regions throughout the cortex play a different role and are generated distinctly from the theta oscillations found in the hippocampus (Kahana et al., 2001).
Our finding of increased cortical gamma activity during successful encoding is consistent with the hypothesized role of gamma in top-down attentional processing. For example, gamma power at the scalp has been found to increase when subjects selectively attend to target stimuli (Tiitinen et al., 1993) and during the processing of rare, compared with common, targets (Debener et al., 2003). Tallon-Baudry et al. (1998) found that gamma power increased during the retention interval of a delayed match-to-sample task, interpreting the increase as related to rehearsal processes in working memory. Gamma power has also been examined intracranially and found to correlate with successful recognition of ambiguous objects (Rodriguez et al., 1999) and to increase with memory load during a working memory task (Howard et al., 2003). Thus, increased gamma power may be seen as reflecting increased allocation of attentional resources, which would in turn lead to better memory performance.
Lisman and Idiart (1995) and Fell et al. (2003) both discuss a possible interaction between theta and gamma oscillations underlying memory function. Specifically, Lisman and colleagues present a biophysical model of working memory in which simultaneously present theta and gamma oscillations allow for the segmentation of 7 ± 2 distinct patterns of neural activity (representing memories) that are repeated in each theta cycle (Lisman and Idiart, 1995; Jensen and Lisman, 1998). Our cortical results, however, show a very small positive correlation between electrodes exhibiting significant positive theta and gamma SMEs (discrete correlation Yule's Q = 0.23 ± 0.25; NS). This minimal degree of overlap between sites exhibiting significant theta and gamma SMEs indicates that different brain regions underlie these two physiological correlates of successful memory formation.
Our finding of numerous widespread sites exhibiting significant negative SMEs in the 9-28 Hz frequency range may be seen as being related to the finding of increased alpha (9-12 Hz) activity, primarily in posterior brain regions, during drowsiness or inattention. For example, Goldman et al. (2002) report that increases in alpha power when subjects close their eyes correlate with decreases in simultaneously recorded fMRI blood oxygen level-dependent signal in occipital lobe, superior temporal lobe, inferior frontal lobe, and cingulate cortex. In our task, decreases in alpha band oscillations during periods of heightened attention would produce a negative oscillatory SME in that frequency range. This does not, however, account for the negative SMEs found in the beta (13-28 Hz) frequency range. Although the negative alpha and beta SMEs are highly correlated (discrete correlation Yule's Q = 0.91 ± 0.17), the beta SMEs may reflect neurocognitive processes other than resting alpha, but until we have more detailed information concerning other behavioral correlates of oscillatory activity in the 13-28 Hz range, the present findings are best characterized phenomenologically.
Although we found a positive theta SME that lateralized to right frontal regions, fMRI studies typically find preferential activation of left prefrontal regions during episodic encoding (compared with retrieval) (Habib et al., 2003). It should be noted that we did not find significant lateralization of the positive gamma SME and that the negative SMEs in the alpha and beta frequency bands were slightly more prevalent in left compared with right prefrontal cortex. Given that our electrode positions were determined by clinical criteria and that we had relatively small numbers of subjects contributing electrodes in a given region, it is premature to interpret our findings of lateralization in the SME as being either consistent or inconsistent with the positron emission tomography and fMRI literature.
By examining intracranially recorded EEGs while subjects studied lists of words, we showed that increased oscillatory power during encoding predicts subsequent recall. Furthermore, this oscillatory subsequent memory effect localized to the 4-8 Hz (theta) and 28-64 Hz (gamma) frequency bands, with each effect exhibiting a unique brain topography. The increase in gamma activity at widespread cortical sites during successful encoding supports an emerging view of gamma as reflecting top-down attentional processes (Tiitinen et al., 1993; Debener et al., 2003). The increase in temporofrontal theta activity during successful encoding is consistent with physiological evidence concerning the role of theta in gating long-term potentiation. As such, this finding may help to bridge our understanding of the role of theta in synaptic plasticity with its role in human memory function.
Footnotes
This work was supported by National Institutes of Health Research Grants MH55687 and MH61975 (M.J.K.). This work was made possible by the cooperation of colleagues in the Epilepsy Program at Children's Hospital Boston, including P. M. Black. We thank Kelly Addis, Jeremy Caplan, Arne Ekstrom, Sri Raghavachari, and Dan Rizzuto for helpful discussions and feedback on this manuscript. We are grateful to Emily Dolan, David Seelig, and Michele Tully for data collection and processing.
Correspondence should be addressed to Dr. Michael Kahana, Volen Center for Complex Systems, Brandeis University, Waltham, MA 02454. E-mail: kahana{at}brandeis.edu.
Copyright © 2003 Society for Neuroscience 0270-6474/03/2310809-06$15.00/0