Abstract
In the rodent hippocampus, different phases of each theta activity cycle may be devoted to encoding and retrieval processes. These cycles of ∼3–8 Hz would allow equal processing time for each state and also provide temporal segregation to minimize their mutual interference. We show here that, by controlling the presentation asynchrony between verbal encoding and retrieval cues, theta-resolution (<100 ms) interference-free shifts between functional states are not expressed in hippocampally dependent, human “episodic” memory. Instead, retrieval attempts selectively and transiently interfere, for ∼450 ms, with the encoding of ongoing experiences. Analyses of scalp event-related potentials confirmed that the functional state of the brain during retrieval is largely unperturbed by concurrent encoding and also suggested that encoding impairments may last until a neocortical phase of retrieval can begin. The findings reveal the dynamic properties of interdependent encoding and retrieval functions that contribute to episodic memory in vivo and, moreover, show that, in humans, this form of memory does not operate with either the equality, or the rapidity, intrinsic to the theta model of rodent hippocampal function.
Introduction
The sharing of neural substrates between encoding and retrieval is a central organizing principle in current memory models (Cohen and Eichenbaum, 1993; McLelland et al., 1995; Squire et al., 2004). Here, we examine an unexplored consequence of the principle for human “episodic” memory function in vivo. We investigate whether competition for shared hippocampal substrates produces mutual interference when new memory traces are formed at the same time as attempts are made to retrieve existing traces.
Recent rodent neurophysiological work (Hyman et al., 2003) and computational simulations (Kunec et al., 2005) have begun to elucidate activity dynamics that allow the hippocampus to play a role in both encoding and retrieval. A key idea is that encoding and retrieval activities need temporal segregation to avoid their mutual interference within circuits that interconnect subfields CA1 and CA3 with their entorhinal input/output pathways. Effective temporal segregation could be achieved by synchronizing entorhinal inputs to cyclic fluctuations in hippocampal long-term potentiation (LTP) and recurrent CA3 and CA3–CA1 synaptic activity (Kunec et al., 2005; Rizzuto et al., 2003) [for an alternative model, see Meeter et al. (2004)]. To prevent interference from or with existing hippocampal traces, this model proposes that formation of a new hippocampal trace is facilitated by the arrival of entorhinal input at CA1 synapses when LTP is maximal and when recurrent CA3 and CA3–CA1 synaptic connections are least active, conditions observed to coincide in vivo and in vitro with peak phases of hippocampal theta (Huerta and Lisman, 1993; Pavlides et al., 1988; Hyman et al., 2003). Conversely, rapid access to existing traces is facilitated if retrieval cue information arrives at CA1 synapses when LTP is weakest, and input from CA3 is strong, conditions that coincide with troughs in hippocampal theta (Hyman et al., 2003).
Temporally segregating encoding and retrieval to different phases of the theta cycle may offset any adaptive disadvantages arising from the dual functional role of the hippocampus. In particular, this occurs by minimizing “hysteresis” effects when shifting from encoding to retrieval states and vice versa. However, if 3–8 Hz theta rhythms guide its transitions, the hippocampus may not distinguish and independently process entorhinal inputs representing to-be-encoded patterns or retrieval cues that arrive within the ∼100 ms period between consecutive theta peaks and troughs. Here, we performed two experiments, with healthy adult human participants, to discover whether there are, in fact, any limits on temporal segregation between encoding and retrieval in the hippocampally dependent episodic form of memory. Using a novel dual-task paradigm (see Fig. 1), we controlled the stimulus onset asynchrony (SOA) of encoding and retrieval cues to examine how their temporal overlap affects memory, relative to “base-line” conditions in which cues were given in isolation. Experiment 1 combined high-temporal resolution scalp event-related potential (ERP) measures with behavioral performance data to reveal the time course and “pattern” (mutual or unidirectional) of interference effects within the 200 ms theta period. Experiment 2 used an extended SOA range, up to 2 s, to reveal the full time course of interference effects.
Materials and Methods
Participants. In experiment 1, participants were 16 healthy right-handed adults (five males; 17–48 years of age) recruited from the University of Aberdeen undergraduate student population. In experiment 2, a different sample of 30 right-handed participants (11 males; 18–31 years of age) was recruited from the same student population.
Procedures. Both experiments used the same set of 400 low- to medium-frequency words (Wilding and Rugg, 1997). For auditory presentations, each word was digitized at 22 kHz with 16-bit resolution and then edited so that the start of the stored sound segment corresponded with the onset of the spoken word. All words were recorded in a male (average duration, 660 ms) and a female (average duration, 630 ms) voice. For visual presentation, words were displayed for 500 ms at the center of a 17 inch computer monitor in white 24-point Times New Roman font on a black background. Horizontal visual angle was never >5° at the viewing distance of 0.57 m, and vertical visual angle was 3°. All stimuli were randomly assigned to conditions before each experimental run and item order was also randomized at each stage.
The three-stage dual-task paradigm (see Fig. 1) was implemented using Presentation Control Language, the scripting language of Presentation (Neurobehavioral Systems, Albany, CA). Presentation allows all stimulus timing uncertainties and inaccuracies deriving from the personal computer (PC) hardware/operating system to be monitored. In experiment 1, SOA values began at 51 ms and incremented in nine steps of 17 ms to a maximum SOA of 204 ms (17 ms was the finest temporal resolution that we could reliably produce given the refresh rate of our PC monitor). In experiment 2, SOAs began at 51 ms and incremented in nine steps of 204 ms to a maximum SOA of 1887 ms. The average inaccuracy in onset times of auditory and visual stimuli across experimental runs was never greater than a few milliseconds.
During stage 1 (see Fig. 1 A), participants listened to a series of 80 words, with 40 spoken by a male voice and 40 spoken by a female voice, in a random order. Participants judged whether each word denoted an animate or inanimate object, and no overt response was required. Each trial began with a fixation cross, replaced after 1.0 s by a blank screen, followed 0.5 s later by presentation of the auditory stimulus. Four seconds of “encoding” time was given before the fixation cross was presented again for the subsequent trial, giving an intertrial interval of 5.5 s. Stage 1 lasted for ∼8 min.
During stage 2 (see Fig. 1 B), which comprised 200 trials in total, participants performed recognition and animacy tasks either singly [no interference (“I–”)] or jointly [interference (“I+”)]. In each of the 120 I–trials, participants either listened to 1 of 40 studied or 40 unstudied auditory items to decide whether they were presented during stage 1 (giving their decision on 1 of 2 mouse buttons), or they were shown 1 of 40 new items on a monitor screen and judged their animate/inanimate status (no overt response was required). The temporal structure of events on the I–trials was identical to that of the stage 1 trials described above. In each of the 80 stage 2 I+ trials, participants attempted both the recognition and the animacy tasks simultaneously. Forty I+ trials contained a stimulus pair comprising an auditory studied item and a new visual item, and the remaining 40 trials contained auditory unstudied items paired with a new visual item.
In both experiments, each of the 10 different SOA values were used in four I+ trials, with the encoding or the retrieval cues presented first at each SOA an equal number of times. I+ trial duration varied according to SOA to equate the time available for encoding and retrieval with the 4 s that is available on I–trials. I+ trial durations therefore ranged in experiment 2 between 5.55 s (at SOAs of 51 ms) and 7.36 s (at SOAs of 1887 ms). In experiment 1, because SOA values were never >204 ms, the duration of all I+ trials was fixed at 5.5 s. The order of stage 2 trial types (i.e., I–or I+) was randomized for each participant. Including a short rest after one-half of the trials, on average, stage 2 lasted ∼20 min.
In the third and final stage (see Fig. 1C), a visual recognition task was performed comprising 240 trials, with 120 containing items newly encoded during stage 2 (80 I+ and 40 I–) and 120 containing unstudied items. Recognition responses here were also given on mouse buttons. The temporal structure of each stage 3 trial was identical to that of stage 1 trials. Stage 3 lasted an average of 25 min, including a rest pause at the midpoint.
In experiment 1 only, EEG was recorded from 32 Ag-AgCl electrodes embedded in an elasticized cap (Easy Cap; Falk Minow Services, Herrsching-Breitbrunn, Germany) using an equidistant montage. Recordings were made with an on-line reference to the left mastoid and, subsequently, were algebraically re-referenced to linked left and right mastoids. Vertical and horizontal electro-oculograms (EOGs) were recorded from electrode pairs situated above and to the side of each eye on the outer canthi. EEGs and EOGs were recorded at a rate of 125 Hz with a bandwidth of 0.03–40 Hz (3 dB points) and digitized with a 12-bit resolution. Recordings were made continuously throughout all stages of experiment 1 and then epoched off-line, beginning 104 ms before stimulus onset for a duration of 2048 ms. Epochs containing horizontal or vertical eye movements or eyeblink artifacts were rejected. Epochs were also rejected if analog-to-digital saturation occurred or if baseline drift across the recording epoch exceeded ± 50 μV.
Here, we report analyses conducted on ERPs from each participant, formed to retrieval cues given correct responses during stage 2. ERPs were digitally smoothed (3 dB down at 19.4 Hz). Participants did not contribute <16 trials in critical conditions (per condition means for the numbers of trials in ERPs are indicated in Fig. 2). ERPs in each condition were quantified by measurement of their mean amplitude (with respect to mean prestimulus baseline) during specific latency regions. The selection of sites for analysis from our montage, and of latency regions, was based on previous ERP literature on episodic retrieval in comparable recognition tasks. Scalp topography analyses used normalized ERP difference scores, according to the method recommended by McCarthy and Wood (1985) and by Wilding (2005). ERP analyses, by ANOVA, used the Greenhouse–Geisser procedure to correct for sphericity violation. All F-ratios from ERP analyses are reported here with their corrected degrees of freedom. The ANOVAs used the factors of interference (I+ vs I–), response type [correct identification of studied items (HIT) vs correction rejection (CR) of unstudied items] and, where indicated, scalp electrode location factors [anterior–posterior, hemiscalp (left vs right), and inferior-superior].
Results
Experiment 1
The experiment comprised three distinct stages. In the first stage (Fig. 1 A), a series of auditory samples of single words was played via head-phones to each participant. An animacy judgment (living vs nonliving) was performed on each item. All auditory items presented during the first stage were thus encoded with full attention. Participants' ability to discriminate between studied and unstudied auditory items was then tested on single- and dual-task trials in stage 2 of the experiment (Fig. 1 B). Dual-task trials required the performance of the auditory recognition task concurrently with the animacy judgment task on visually presented words. On each dual-task trial, the SOA between the presentation of the auditory retrieval cues and the visual item to be encoded was controlled so that it ranged across 10 different values between 51 and 204 ms (see Materials and Methods). This SOA manipulation allowed us to examine whether auditory recognition and concurrent visual encoding mutually interfere with one another when their respective cues are delivered in the subtheta time scale. Item presentations were counterbalanced so that auditory and visual cues appeared first equally often across the dual-task trials. Within the constraints of the experimental design used here, this had to be done to ensure that a sufficient numbers of trials were presented at each SOA. The pattern of interference effects described below therefore reflects the temporal proximity of retrieval and encoding processes and not the temporal precedence of either one. Because the dual-task trials were designed to produce interference, performance measures for items presented on these trials were labeled I+.
To detect the presence of interference, it was necessary to obtain contrasting baseline measures of encoding and retrieval performance without interference. These baseline measures were obtained on single-task trials from stage 2 (Fig. 1 B). Speed and accuracy measures of item recognition on the single-task trials are labeled I– to denote no interference. The effect of SOA on dual-task recognition was assessed by comparing I+ performance at each of the 10 SOA values against the I– baseline. Interference from concurrent encoding was predicted to manifest as a reduction in recognition levels for I+ items, manifesting at all SOAs within the subtheta time scale. Additional single-task trials were included at stage 2 to obtain baseline measures of visual encoding ability without interference from concurrent auditory recognition attempts. On these trials, visual items were presented for the animacy judgment task. The impact of concurrent retrieval on encoding ability could then be assessed during the third and final stage of the experiment (Fig. 1C). Here, the visually presented items from stage 2 were shown again, along with an equal number of unstudied items, and participants had to make “old/new” recognition judgments.
Behavioral data
Measures of recognition accuracy and time to respond on the dual-task trials were taken from each participant. I+ performance could then be compared with measures obtained on the single-task trials in which recognition attempts proceeded without any concurrent distraction. The resulting group-average functions for the correct discrimination of studied and unstudied auditory items (i.e., the HIT and CR rates), are shown in Figure 2, A and B, along with their corresponding reaction times.
Figure 2 A shows that, at all SOAs, interference from concurrent encoding has essentially no effect on participants' speed and accuracy at recognizing studied auditory items. Figure 2 B depicts a similar lack of interference, at any SOA, on participants' speed and accuracy at identifying unstudied items. Figure 2C shows group-average recognition performance measures from stage 3 of the experiment, in which participants were asked to judge whether visual items had been studied previously or not during stage 2. The mean CR rate at stage 3 (data not shown in Fig. 2) was 71.0% (SD = 13.5). Recognition of visual items encoded on I+ trials was initially analyzed separately according to the studied/unstudied status of the concurrently presented auditory item. However, the status of the auditory item was not associated with differences in visual recognition during stage 3 and thus the data in Figure 2C are accordingly collapsed across this factor. It is evident from Figure 2C that the concurrent auditory recognition task, at all SOA values, had an impairing effect on the participants' ability to encode the visual items. The impairment is made manifest in the participants' reduced ability to subsequently recognize I+ visual items compared with the I– visual items.
To establish statistically that interference acted selectively on encoding and left retrieval essentially intact, HIT and CR performance measures at each SOA were subtracted from their corresponding baseline I– rate. The resulting baseline-corrected data were then subjected to ANOVA, using factors of experimental stage (2 vs 3) and SOA. ANOVA of the HIT rates gave a main effect of experimental stage, attributable to the reduction in performance relative to baseline found selectively at stage 3 (F(1,15) = 10.42; p < 0.0075). There was not a main effect or an interaction involving the factor of SOA, reflecting the consistent detriment to performance across all SOA values at stage 3. ANOVA of the baseline-corrected CR rates did not give rise to any significant main effects or interactions.
These findings demonstrate a clear pattern of interference effects generated when encoding and retrieval are performed in close temporal proximity, but this does not manifest as a mutual impairment to both stages of processing. Instead, retrieval appears to be immune to interference from encoding, to the significant detriment of participants' ability to encode other concurrent elements of their experience.
ERP data
Neural correlates of retrieval processing under single- and dual-task conditions were compared, revealing evidence of interference-induced changes in the functional state of the brain. These analyses were restricted to well studied ERP modulations called old/new effects (Friedman and Johnson, 2000; Mecklinger, 2000; Rugg and Allan, 2000), comprising differences in ERPs evoked by correct responses to studied and unstudied items (i.e., HITs and CRs, respectively). Endogenous cognitive ERP components, like the old/new effect, predominantly reflect changes in neocortical activity time-locked to the onset of evoking stimuli. Therefore, it is worth emphasizing that the temporal jittering of the encoding and retrieval cues in the dual-task trials ensures that the onset time of the visual stimuli was effectively randomized with respect to the onset time of the auditory recognition cues. This serves to reduce any contamination of the recognition ERPs by encoding-related neural activity. In addition, by restricting the analyses to the differences between HIT and CR ERPs, neural activity common to HITs and CRs in the I+ condition is held constant. Similarly, neural activity common to the HIT and CR ERPs is held constant in the I– condition. “Second-order” contrasts, i.e., between the I– and I+ old/new effects, are thus not confounded by neural activity specific to each condition.
Figure 3 shows the I+ and I– HIT and CR ERPs recorded at a subset of electrode sites situated over the anterior and posterior lateral scalp. For the I+ condition, ERPs comprise data from all dual-task trials, collapsing across the different SOA values. Within the I– and I+ conditions, differences appear as positive shifts in the HIT ERPs between ∼400 and 1200 ms after stimulus. Within the I– baseline condition (Fig. 3, top), HIT vs CR ERP differences were not statistically significant during the initial 400–600 ms period, but were reliable between 600 and 800 ms where, at the selected anterior sites, a main effect of response type was observed (F(1,15) = 11.64; p < 0.005), reflecting the greater relative positivity of the HIT compared with the CR ERPs. At the posterior sites, ANOVA of 600–800 ms mean amplitude I– data also gave a main effect of response type (F(1,15) = 7.80; p < 0.025), reflecting the HIT ERP positivity as well as a trend within the interaction between response type and hemiscalp (F(1,15) = 3.78; p = 0.071), attributable to relatively larger HIT positivity over the posterior left hemiscalp.
In the I+ interference condition (Fig. 3, bottom), ANOVA of the 400–600 ms mean amplitude data gave, at the selected anterior sites, a trend for a response type by hemiscalp interaction (F(1,15) = 3.86; p = 0.069), reflecting relatively larger positivity over the left anterior hemiscalp. By 600–800 ms, ANOVA of the I+ HIT vs CR ERPs at anterior sites gave significant main effects of response type, reflecting HIT ERP positivity (F(1,15) = 10.54; p = 0.005), and also a more robust condition by hemiscalp interaction (F(1,15) = 4.40; p = 0.053), reflecting the greater amplitude of the positivity over the left compared with the right anterior hemiscalp. During the same interval at the posterior sites, a main effect of response type (F(1,15) = 16.05; p < 0.001) was observed, attributable to the relative positivity of the HIT compared with the CR ERPs.
In both the I– and the I+ conditions, ANOVAs of mean amplitude data from 800 to 1000 ms gave main effects of response type [I– anterior (F(1,15) = 7.37; p < 0.025), I– posterior (F(1,15) = 8.06; p < 0.025), and I+ posterior (F(1,15) = 12.11; p < 0.005)], but no additional interactions involving the factors of condition and hemiscalp. The HIT and CR ERPs in both conditions do not differ from ∼1 s onward to the end of the epoch at 1944 ms after stimulus. Finally, direct contrasts between the mean amplitudes of the I– and I+ old/new effects during their entire time course failed to give rise to any main effects or interactions involving the factor of interference condition.
The scalp topographies of the old/new effects during 600–800 ms (Fig. 4) show differences in their left-sided maxima that are consistent with the different loci of the response type by hemiscalp interactions reported above. An additional set of scalp distribution analyses was performed to determine whether there was any evidence of differential neural generator activity in the dual-versus the single-task condition, indicating a qualitative functional change in retrieval processing that might accompany a shift in participants' retrieval strategy. For these distributional analyses, we first formed subtraction waves representing the difference between the HIT and CR ERPs for each condition at each electrode in the subset of anterior and posterior sites, which were then normalized by rescaling (see Materials and Methods). The ANOVA used the factors epoch (400–600, 600–800, and 800–1000 ms), condition (I– vs I+), anterior–posterior location, hemiscalp, and site. The ANOVA did not reveal any effects of interference condition on topography but did give a three-way interaction between the factors epoch, hemiscalp, and site (F(3.1,47.0) = 3.23; p < 0.05), reflecting the absence of the left-sided foci from each effect during the final 800–1000 ms interval. This interpretation was confirmed by subsidiary ANOVAs during each of the 400–600, 600–800, and 800–1000 ms periods, which only gave rise to significant main effects of hemiscalp during the first two intervals (F(1,15) = 5.83, p < 0.05; F(1,15) = 10.38, p < 0.01, respectively).
Experiment 2
Experiment 1 demonstrated a consistent impairment to encoding from concurrent retrieval attempts that existed throughout the <200 ms SOA range. In experiment 2, our aim was to replicate and extend the behavioral findings of experiment 1 by mapping out the full time course of this interference effect. The experiment used a larger sample of participants and an extended range of 10 SOAs in 204 ms increments from 51 to 1887 ms (for details, see Materials and Methods). This SOA range mirrors the ∼2 s time scale of the ERP data and therefore allows us to determine whether any temporal correspondences exist between the time courses of the stage 2 ERP old/new effects and the behavioral interference effects that are subsequently manifested during stage 3.
Figure 5 depicts the results from experiment 2, which appear to provide a good replication of the behavioral findings from experiment 1. Stage 2 HIT and CR rates and their reaction times do not appear to differ at any SOA value between the I+ and I– conditions (Fig. 5 A, B). As in experiment 1, recognition accuracy and reactions times did not differ significantly at any SOA between the two classes of I+ HIT items at stage 3; therefore, responses were again collapsed together to form a single stage 3 I+ condition, shown in Figure 5C. As is clear from Figure 5C, stage 3 impairments in recognition I+ HIT rates, compared with their I– baseline, were dependent on the SOA at which I+ items were presented when first encountered. The mean CR rate from stage 3 (data not shown in Fig. 5) was 68.5% (SD = 15.0).
As in experiment 1, baseline-corrected HIT rate data from stages 2 and 3 were analyzed first by ANOVA using the factors phase and SOA. The ANOVA gave a main effect of phase, resulting from the reduced performance, relative to baseline in stage 3 (F(1,29) = 24.34; p < 0.001) and a significant interaction between phase and SOA (F(1,9) = 2.53; p < 0.01). To follow up on the interaction, individual one-tailed paired sample t tests were performed to compare HIT rates on stage 3 items encoded at each SOA, which gave significant results only at SOAs of 51 ms (t(29) = 3.01; p = 0.005), 255 ms (t(29) = 5.80; p < 0.001), 459 ms (t(29) = 4.37; p < 0.001), and 663 ms (t(29) = 2.58; p < 0.025). Interference effects disappear from 867 ms onward [although they briefly reemerge at 1683 ms (t(29) = 2.43; p < 0.025)]. However, application of the conservative Bonferroni-type correction for these 10 multiple comparisons restricts the interval of significant interference effects to between 51 and 459 ms. Analysis of the baseline-corrected HIT reaction time data did not give rise to any significant main effects or interactions involving the factors stage or SOA. SOA also failed to affect the baseline-corrected CR rates from stage 2.
Discussion
Using a retrieval cue transiently impairs the ability to encode other concurrently presented items, as revealed by reductions in the ensuing recognition rates for these items compared with items presented under single-task conditions (Fig. 2C). By manipulating the asynchrony of encoding and retrieval cues, we showed that encoding was impaired at SOAs between 51 and 459 ms but began to improve at 663 ms SOA and had returned to baseline levels by 867 ms SOA (Fig. 5C). Encoding was impaired to the same degree throughout the 51–459 ms interval and was also equivalent regardless of whether retrieval was cued with a previously studied or an unstudied auditory item. Encoding, however, had no discernible behavioral effect on concurrent retrieval throughout the entire 51–1887 ms SOA range used across the present two experiments (Figs. 2A,B, 5A,B). It is important to note that, in both experiments, stimulus presentation order was counterbalanced so that encoding and retrieval cues appeared first equally as often. Hence, the encoding impairments do not occur simply because the retrieval cues temporally preceded other elements of an episode. Instead, these impairments are generated by close temporal proximity between a retrieval cue and another element of the ongoing event in which the cue is presented.
Although concurrent encoding had no demonstrable behavioral effect on recognition, there could still have been underlying changes in our participants' retrieval strategy as an adaptation to dual tasking. Hence, we used ERPs to covertly indicate whether or not successful recognition judgments were made on different functional bases under single- and dual-task conditions. Neural activity associated with successful recognition was revealed by contrasting ERPs in the HIT and CR conditions, which gave statistically robust differences during the 600–1000 ms interval after the presentation of retrieval cues. HIT ERPs during this interval exhibited positive-going shifts that were initially pronounced over the left hemiscalp. However, the exact foci of these early left asymmetries were slightly different in the I– and I+ conditions. Recognition during the single-task I– trials was associated with a temporoparietal left hemiscalp effect (Fig. 4A), whereas recognition during the dual-task I+ trials produced a more anteriorly distributed left hemiscalp effect [which began to manifest earlier (400–600 ms) than was the case in the I– condition] (Fig. 4B). However, no differences were detected in the underlying neural generators of the I– and I+ old/new effects, suggesting that interference from concurrent encoding did not qualitatively alter the functional basis on which correct recognition judgments were made.
Our experiments demonstrate that segregation between encoding and retrieval processes in human episodic memory breaks down when they are triggered within ∼450 ms of each other, well above the ∼100 ms resolution of theta-linked hippocampal cycles (Kunec et al., 2005). Our findings therefore imply that the ability of the human brain to avoid interference between each processing stage does not reflect segregation of encoding and retrieval to different phases of a theta cycle. We obviously do not claim to have demonstrated a failure to conserve such cycles. Instead, we showed that the functional architecture of episodic memory has slower segregation capabilities than we would expect if theta-linked cycles were the sole determining factor. Our findings are therefore more in agreement with models that posit slower-acting shifts between hippocampal encoding and retrieval states (Lisman and Otmakhova, 2001; Meeter et al., 2004; Lisman and Grace, 2005). An additional consequence of the theta-linked encoding/retrieval model is that cyclic changes in physiological state allow the hippocampus to efficiently serve both functions, without prioritizing one at the expense of the other (Hasselmo et al., 2002; Kunec et al., 2005). Our findings suggest, however, that such equality may not exist in human episodic memory, which appears to be able to prioritize retrieval processing at the expense of concurrent encoding, even when the retrieval task involves relatively simple old/new recognition judgments.
Evidence in humans for the prioritization of retrieval over other concurrent cognitive tasks has been obtained in previous dual-task studies (Craik et al., 2000; Naveh-Benjamin et al., 2000) (cf. Fernandes and Moscovitch, 2002; Fernandes et al., 2004), and our experiments here extend this work in two specific ways: first, by demonstrating the impact retrieval prioritization has on concurrent episodic encoding and second, by revealing the fine-grained temporal properties of the interdependency between encoding and retrieval processes. Straightforward adaptations to instructions and stimulus materials will allow future use of the three-stage dual task to explore the dynamics of encoding/retrieval interactions with considerable flexibility and experimental control. Moreover, the fine-grained temporal information obtained may be compared with detailed knowledge on the time course of neuronal activity from studies of memory function in nonhuman animals and from depth electrode recordings in human neuropsychological patients, as we show below.
The apparent prioritization of retrieval processing was a transient phenomenon that appeared to end when changes in neural activity associated with successful retrieval began to manifest in ERP recordings. We hypothesize that this temporal correspondence is a direct expression of dynamic resource sharing between encoding and retrieval. A critical question, then, concerns what happens at ∼600–800 ms SOAs such that encoding becomes free of interference. It has been repeatedly demonstrated, in previous intracranial electrode recording work with human epileptic patients, that the ∼500 ms period after stimulus onset is critical to effective medial temporal lobe encoding processes (Halgren et al., 1985; Heit et al., 1988, 1990; Fernandez et al., 1999, 2002; Fell et al., 2002; Paller and McCarthy, 2002) [for congruent findings in monkeys, see Ringo (1995) and Messinger et al. (2001)]. If hippocampal activity within this interval does play a crucial role in encoding, any disruption of that activity should have a negative impact on subsequent memory, whereas disruption outwith the interval should not (all else being equal). Which is essentially what we observed here in experiment 2.
Impairments to subsequent recognition ability attributable to disruption of hippocampal encoding activity have been reported in a recent experiment by Coleshill et al. (2004) that used subepileptogenic electrical stimulation (ES) applied directly to the hippocampus in human epilepsy patients. ES was time locked to the presentation of visual stimuli during the encoding phase of a recognition memory task similar to the one we used in the present experiments, resulting in recognition impairments for such items compared with items presented for encoding without concurrent ES. The effect of varying the SOA between ES and encoding stimuli was not explored; however, our own findings suggest that this could be a promising approach for future studies. In particular, our findings suggest that, in long-term recognition tasks, i.e., ones that do not engage a capacity-limited short-term working memory, the effect of ES may be dependent on its asynchrony with to-be-encoded stimuli. If proven to be correct, this hypothesis would provide substantial support for the existence of a critical encoding interval in hippocampal activity in vivo that immediately follows the onset of a stimulus. Similarly, the effect of SOA between hippocampal ES and the presentation of retrieval cues, using the present three-phase dual-task paradigm, could reveal much about the dynamic interdependence of encoding and retrieval activities within the hippocampus. Sobotka et al. (2005) also showed recently that delay-interval ES directly in the monkey hippocampus, persisting for 125–250 ms but delivered 1 s after onset of the sample stimulus, disrupts recognition in a short-term delayed matching-to-sample task. Their findings, in conjunction with our own and those of Coleshill et al. (2004), suggest that the hippocampus may be differentially engaged, or controlled, in long-term versus short-term (working) memory tasks.
The dynamics of hippocampal activity just discussed imply the existence of a critical window of approximately one-half of a second that immediately follows the onset of a stimulus to be encoded. Evidence on the precise timing of hippocampal retrieval-related activity and associated cortical changes in humans (Halgren et al., 1985; Heit et al., 1988; Paller and McCarthy, 2002) and in nonhuman animals (Naya et al., 2003) suggests that a similar critical interval of hippocampal activity follows the onset of a retrieval cue. For example, Naya et al. (2003) examined the informational content of activity in monkey inferotemporal area TE neurons during the delay interval in a paired-associate retrieval task. They observed that changes in neuronal activity in area TE began at ∼600 ms after the onset of cues that triggered the retrieval of previously learned associates from long-term memory. Naya et al. (2003) concluded that this delay period TE activation most likely reflects signaling from the hippocampus that is fed to TE via inferotemporal area 36. In the present experiments, we observed a similar onset time, of ∼600 ms, in the ERP old/new effects that were associated with successful retrieval. Furthermore, the onset time of the ERP old/new effects coincided remarkably well with the offset of interference effects on encoding.
We are led to the conclusion that the return of encoding to baseline levels that occurs when the ERP old/new effects emerge may signal the end of a prioritized hippocampal contribution to retrieval. This interpretation is consistent with current views on the functional significance of the ERP old/new effects, which link them to changes in neocortical activity triggered by the output from hippocampal pattern completion operations acting on retrieval cues (Wilding and Rugg, 1997; Allan et al., 1998; Mecklinger, 2000; Rugg and Yonelinas, 2003). This account adheres to the consensual view that distributed neocortical sites store the representational content that underlies the experiential quality of recollection, whereas the hippocampus, in effect, indexes the pattern of neocortical activity specific to each episode (Cohen and Eichenbaum, 1993; McLelland et al., 1995; Nadel and Moscovitch, 1997; Moscovitch and Nadel, 1998; Eichenbaum, 2000; Gilboa et al., 2004; Miyashita, 2004; Squire et al., 2004). The posterior left temporoparietal old/new component observed here in the single-task I– condition (Fig. 3, top) has particularly clear empirically established links to processing contingent on the reactivation of episodic traces (Wilding and Rugg, 1997; Allan and Rugg, 1998; Rugg and Allan, 2000; Donaldson et al., 2003; Paller et al., 2003; Curran, 2004). The functional status of the more anterior foci of the dual-task I+ old/new effect (Fig. 3, bottom) is less well established; however, it has been linked to recognition judgments based on item “familiarity” (Rugg et al., 1998; Friedman and Johnson, 2000; Mecklinger, 2000; Tsivilis et al., 2001; Donaldson et al., 2003), a more automatic form of memory that could be used when attentional resources are insufficient to support “full-blown” recollection (Yonelinas, 2002). The presence of the anterior component in the dual-task I+ condition thus fits well with, and provides additional support for, the familiarity account.
Klimesch et al. (2000, 2005) proposed recently that oscillatory neocortical changes in δ-band (2–4 Hz) activity may generate temporoparietal old/new effects. Such changes are sustained in time from the onset of retrieval cues, occur in response to new (unstudied) and studied cues (more so the latter), peak at ∼600–800 ms after stimulus, and coexist with a transient change in theta-band power that occurs ∼200–400 ms after stimulus. Based on this pattern of findings, Klimesch et al. (2005) suggested that the δ-band changes are associated with processes that detect the reactivation of a memory trace. In contrast, Klimesch et al. (2000, 2005) associated the more transient theta-band changes with neocortical trace reactivation induced via re-entrant corticohippocampal connectivity. Our present findings fit very well within this framework. As noted in Results, we found that studied and unstudied recognition cues produced equivalent interference effects on encoding, which persist until ERP old/new effects begin to manifest at ∼600 ms after stimulus. The δ-frequency range of ∼2–4 Hz has a sinusoidal period of 0.25–0.5 s. Hence, the duration of the interference effect is consistent with a process that manifests on a δ-linked time scale rather than the more rapid theta cycle. Our findings therefore suggest that episodic encoding is disrupted by the sustained “capture” of this δ-band activity in the service of retrieval, e.g., by a “reset” mechanism (cf. Rizzuto et al., 2003).
In sum, by manipulating the asynchrony of presentation times for encoding and retrieval cues, we showed that a transient interference could be created that selectively impairs encoding and leaves retrieval performance intact. The time course of the interference effect reveals limits on the human ability to efficiently segregate encoding and retrieval processes so that they do not interfere with one another. This ability was found to operate at a resolution slower than that possible for theta-linked changes in a physiological state that were proposed recently as a mechanism to keep encoding and retrieval separate within the rodent hippocampus. Instead, the interference from retrieval on encoding appears to last about as long as it takes to reactivate the neocortical memory trace of a target episode, as indicated by the time course of putative ERP correlates of this reactivation process. These findings indicate that, within the interval preceding the onset of the ERP old/new effects, hippocampal functioning may be targeted more toward the processing of retrieval cues than on the encoding of other concurrent elements of experience. This may occur if neocortical processes are engaged to sustain and focus attention on retrieval cues, drawing these resources away from other concurrent elements of experience. As a result, memory is significantly impaired for experiences that take place during the immediate peri-retrieval period.
Footnotes
This work was supported by British Biotechnology and Biological Sciences Research Council Grant 1/S15294. We thank two anonymous reviewers for their very useful comments on a previous version of this manuscript.
Correspondence should be addressed to Dr. Kevin Allan, Cognitive Electrophysiology and Memory Laboratory, School of Psychology, College of Life Sciences and Medicine, University of Aberdeen, Aberdeen AB24 2UB, UK. E-mail: k.allan{at}abdn.ac.uk.
DOI:10.1523/JNEUROSCI.1304-05.2005
Copyright © 2005 Society for Neuroscience 0270-6474/05/258122-09$15.00/0