Abstract
Visual short-term memory in humans and animals is frequently assessed using delayed matching to sample (DMTS) and delayed nonmatching to sample (DNMTS) tasks across variable delay intervals. Although these tasks depend on certain common mechanisms, there are behavioral differences between them, and neuroimaging provides a means of assessing explicitly whether this is underpinned by differences at a neural level. Findings of delay-dependent deficits, after lesions in humans and animals, suggest that the neural implementation of these tasks may also critically depend on the delay interval. In this study we determined whether there were differential neural responses associated with DMTS and DNMTS tasks at two different delay intervals using functional magnetic resonance imaging. Ten healthy volunteers were studied under four test conditions: DMTS and DNMTS at 5 and 15 sec delay. The main effect of DMTS compared with DNMTS across both delay intervals was associated with significant activation in bilateral head of caudate and medial orbitofrontal cortex. By contrast, DNMTS compared with DMTS was associated with significant activation in mediodorsal thalamus, bilateral lateral orbitofrontal cortex, and left premotor cortex. The main effect of short compared with long delay, across both tasks, was associated with significantly greater activity in occipital and parietal cortices. By contrast, long compared with short delay was associated with significantly greater activity in temporal and ventrolateral frontal cortices. We conclude that DMTS and DNMTS are not equivalent and furthermore that the precise neural implementation of these tasks is a dynamic function of delay interval.
- delayed matching to sample
- delayed nonmatching to sample
- orbitofrontal cortex
- caudate
- premotor cortex
- medial temporal lobe
- functional magnetic resonance imaging
Delayed matching to sample (DMTS) and delayed nonmatching to sample (DNMTS) paradigms are widely used to study visual memory in humans and animals. Both tasks require subjects to hold a visual stimulus “on line” over a delay interval before responding to a choice of stimuli. In DMTS, subjects must select the familiar stimulus; in DNMTS they must select the novel stimulus. Successful performance of both tasks depends on the integrity of inferior temporal regions (Gross, 1973; Mishkin, 1982; Horel et al., 1987), posterior perceptual cortex, and medial temporal lobe structures (Zola-Morgan and Squire, 1985; Murray, 1992; Mishkin and Murray, 1994). The entorhinal and perirhinal cortices have been particularly implicated in rats (Mumby and Pinel, 1994; Zhu et al., 1995) and monkeys (Gaffan and Murray, 1992; Meunier et al., 1993). Medial temporal damage in humans also results in various recognition memory deficits (Kimura 1963; Warrington and James 1967;Milner, 1968, 1970). A recent study using a computerized DMTS procedure, analogous to that used in monkeys, reported impairments in patients with temporal lobe lesions or bilateral amygdalo-hippocampectomies (Owen et al., 1995). There is also evidence implicating prefrontal cortex, particularly ventral and medial regions, in performance of DMTS and DNMTS tasks in monkeys (Bachevalier and Mishkin, 1986; Meunier et al., 1997) and humans (Milner, 1964; Milner et al., 1985; Owen et al., 1995).
Although common processes are doubtless evoked, and common neural substrates are involved (see above), there are certain key differences between the tasks. For example, Diamond (1991) discussed DNMTS in monkeys and human children as involving memory alone, whereas DMTS involves memory and inhibition of an innate tendency to respond to novel stimuli; thus DMTS is harder than DNMTS in monkeys and infants (Mishkin and Delacour, 1975; Gaffan et al., 1984; Diamond 1991). Interestingly, human adults show the reverse pattern, an important caveat in comparing the monkey and human literature. A primary aim of the present study was to compare explicitly neural responses in DMTS and DNMTS in humans. A further issue is the extent to which different delay intervals influence the neural implementation of DMTS and DNMTS tasks. In monkeys, there is evidence that lesions of temporal lobes (Alvarez et al., 1992, 1995) and prefrontal regions (Fuster, 1995) produce more significant deficits at longer delay intervals. Human patients with Alzheimer’s disease or temporal lobe damage show deficits on DMTS tasks with abstract stimuli that become more pronounced at longer delays (Sahakian et al., 1988; Owen et al., 1995;Buffalo et al., 1998). Delay-dependent deficits have also been observed in patients with prefrontal lesions (Owen et al., 1995).
In this study we used functional magnetic resonance imaging (fMRI) to compare evoked activity associated with DMTS and DNMTS at two different delay intervals. We hypothesized that, in addition to activations common to both tasks, there would also be task-specific neural responses. Furthermore, on the basis of recent human and animal data, we predicted that there would be a difference in the neural substrates at the two delay intervals, with temporal lobe and prefrontal regions more activated at longer delays.
MATERIALS AND METHODS
MRI scanning technique. Ten healthy right-handed volunteers, 6 male and 4 female, aged 22–40 years, were scanned. A Siemens (Munich, Germany) VISION system at 2 T was used to acquire both T1-weighted structural images and gradient echo, echo planar T2*-weighted images with blood oxygenation level-dependent (BOLD) contrast. Functional images were acquired in a single run, with a total of 312 volumes acquired for each subject. Each volume comprised 48 3 mm axial slices with 3 mm in-plane resolution, positioned to cover the whole brain. During each run, volumes were acquired continuously every 4.1 sec. The run began with six “dummy” volumes to allow for T1 equilibration effects; these volumes were subsequently discarded.
Cognitive activation paradigm. This experiment had a factorial design with four experimental conditions and two control conditions. In all, experimental conditions subjects were presented with a trial-unique test stimulus comprising a complex shape made up of four color subelements (based on stimuli from the Cambridge Automated Neuropsychological Test Battery; Robbins et al., 1994). After 1 sec the test stimulus was replaced by a blank screen. After a delay interval (see below) two choice stimuli were presented, one of which was the original stimulus and the other a novel stimulus. Subjects responded with a button press, pressing either a left or right key depending on whether the left or right stimulus was correct. They were given three seconds in which to make this response (all subjects made all responses within this time) before the next problem appeared. In the “match” conditions, subjects were told to respond by choosing the stimulus that matched the test stimulus, i.e., to select the familiar stimulus. In the “nonmatch” condition, subjects were told to select the novel stimulus. This constituted the first factor. The second factor involved both DMTS and DNMTS tasks being performed under two delay conditions; a “short” interval of 5 sec and a “long” interval of 15 sec. Although 15 sec is not truly a long delay in terms of the DMTS and DNMTS literature, and we did not expect significant rates of forgetting, using a substantially longer delay such that significant forgetting occurred would not have been practical in the confines of an fMRI design. If subjects are scanned over a protracted period, discomfort and motion artifacts become serious concerns. There is also an established theoretical framework suggesting that there may be important qualitative differences in human short-term memory at these delay intervals (Phillips, 1974), and the neuropsychological literature on DMTS would support this at a neural level.
Perceptuomotor control conditions. In these conditions subjects saw a random sequence of single stimuli and paired stimuli, none of which matched each other. Subjects were explicitly instructed not to attempt to remember the stimuli and were told to respond by pressing either response button whenever a pair of stimuli appeared. In the short and long control conditions, the temporal spacing of stimuli corresponded to that in the short and long experimental conditions, respectively. This control condition was chosen to match the perceptual and motor demands of the memory task as closely as possible. A consequence of this was to introduce possible confounds, which cannot be addressed in this study. For example, it is not possible to determine empirically the extent to which subjects complied with the instruction not to remember the stimuli. However, by randomizing the order of single and double stimuli, we attempted to obviate any tendency to remember single stimuli. Also, even were this to be a confound, it is not an issue for the key comparisons, which are between different memory conditions and therefore do not involve the control condition at all.
During each activation epoch, three problems, all from the same condition, were presented. Epochs were interleaved with blocks of rest during which subjects were simply required to fixate on a central cross. The order of conditions was randomized within and between subjects. Given the different delay intervals used, the number of scans per epoch was variable. However, the random effects approach to data analysis described below minimizes the statistical impact of this variability. The alternative to this design was to equate the length of epochs, but this would have meant that subjects completed more short than long trials, which we judged to be a more serious confound.
Data analysis. Data were analyzed using Statistical Parametric Mapping (SPM98; Wellcome Department of Cognitive Neurology) implemented in MATLAB 5 (Mathworks, Sherborn, MA) and run on a SPARC workstation (Sun Microsystems, Surrey, UK). Scans were realigned using the first as a reference and were subsequently transformed into a standard space corresponding to the stereotactic atlas of Talairach and Tournoux (1988) using Montreal Neurological Institute templates. These normalized images were smoothed with a 10 mm full-width, half-maximum isotropic Gaussian kernel. Analysis was performed using a random effects model with a delayed boxcar waveform. In this model, the relatively higher intersubject, compared with intrasubject, variability is taken into account by first reducing each subject’s data to a single mean image for each condition and then using these mean images as the basis for intersubject comparisons. Subject-specific low-frequency drift in signal was removed by modeling with low-frequency sine and cosine waves, and global changes were removed by proportional scaling (Holmes et al., 1998). Effects at each and every voxel were estimated, and regionally specific effects were compared using linear contrasts. The resulting set of voxel values for each contrast constituted a statistical parametric map of thet statistic (SPM{t}), which was then transformed to the unit normal distribution (SPM{Z}). Statistical inferences were based on the theory of random Gaussian fields (Friston et al., 1995). The stereotactic coordinates ofTalairach and Tournoux (1988) are used to report the activation foci, but descriptions of the anatomical localization of the foci were determined using individual structural MRIs of the group and the atlas of Duvernoy (1991).
RESULTS
Performance data
The mean percentages of problems solved correctly under the four memory conditions are shown in Table 1. These data were analyzed using a two-by-two factorial ANOVA. The only significant effect was a main effect of DMTS compared with DNMTS, with subjects performing better in the DMTS condition (F(1,36) = 4.6; p < 0.05).
Percentage of correct behavioral performance under the four memory conditions
Relative activations associated with all memory conditions compared with control
When all four memory conditions (DMTS and DNMTS across both delays) were compared with the perceptuomotor control conditions, significant evoked neural activity changes were seen in bilateral anterior cingulate (BA 32 and 24; significant at p < 0.05 corrected) and bilateral posterior middle frontal gyrus (BA 9), bilateral medial occipital gyrus (BA 19), right superior parietal cortex (BA 7), right posterior fusiform gyrus (BA 18), and left cerebellar hemisphere (all at p < 0.001 uncorrected; Fig. 1, Table2).
Activations associated with all four memory conditions compared with the perceptuomotor control condition in the form of a statistical parametric map of the t statistic (after transformation to an SPM{Z}) thresholded at uncorrected p < 0.001.
Areas where evoked activity was significantly different between memory conditions and perceptuomotor control
Reductions in evoked activity relative to control (allp < 0.001, uncorrected) were seen in cuneus (BA 19), right medial parietal cortex (BA 7), right lateral medial temporal gyrus (B 21), bilateral anterior frontal cortex (BA 10), left angular gyrus (BA 39), and the cerebellar vermis bilaterally (Table 2).
When the memory conditions were compared with the baseline fixation condition, a very similar pattern of evoked activity was seen as for the comparison with perceptuomotor control. However the reverse comparison, fixation compared with memory, showed no significant changes, which suggests that the reductions in evoked activity for memory relative to the perceptuomotor control were attributable to evoked activity in the control condition rather than decreased activity in the memory conditions per se.
Relative activations associated with DMTS compared with DNMTS
Significant evoked neural activity associated with DMTS compared with DNMTS was observed in the head of caudate nucleus on the left extending to the subgenual cingulate cortex (BA 24) and medial orbitofrontal cortex (BA 25 extending to 11) (p< 0.05 corrected; Fig. 2, Table3). Neural responses at a lower level of significance (p < 0.001 uncorrected) were seen in corresponding regions on the right.
Relative activations associated with DMTS compared with DNMTS. The statistical parametric map of the tstatistic (after transformation to an SPM{Z}) thresholded at p < 0.001 is shown rendered onto a standard MRI template and focused on the left head of caudate (x = −12; y = 24;z = 8).
Areas where evoked activity was significantly different for DMTS and DNMTS
Relative activations associated with DNMTS compared with DMTS
Significant evoked neural activity (p < 0.001 uncorrected) associated with DNMTS compared with DMTS was seen in left premotor cortex (BA 6), lateral orbitofrontal cortex bilaterally (BA 11) (Fig. 3), left dorsomedial thalamus, and left precuneus (BA 7) (Fig. 3, Table 3).
Relative activations associated with DNMTS compared with DMTS. The statistical parametric map of thet statistic (after transformation to an SPM{Z}) thresholded at p < 0.001 is shown rendered onto a standard MRI template, focused on the premotor (x = −22; y = 18;z = 60) and lateral orbitofrontal (x = 46; y = 44;z = −22) activations.
Correlations between neural response and behavioral performance
Because subjects performed significantly worse at DNMTS than DMTS, this raises the question of whether any of the activations specific to the individual tasks simply reflect this performance difference. To address this we repeated the above contrasts, modeling performance as a covariate of interest. There were no significant correlations between neural response and performance in any of the regions identified in the between-task comparisons.
Relative activations associated with short compared with long delay intervals
Short, compared with long, delays were associated with significant evoked activity in medial and inferior occipital gyri (BA 19), fusiform gyrus (BA 37), and cerebellar hemispheres, all bilaterally (p < 0.05 corrected). There was also evoked activity in superior parietal cortex (BA 7) bilaterally (p < 0.001 uncorrected; Fig.4, Table4).
Relative activations associated with different delay intervals in the form of statistical parametric maps of thet statistic (after transformation to an SPM{Z}) thresholded at p < 0.001. A, Regions more activated at short delays;B, regions more activated at long delays.
Areas where significant evoked activity differed for the two delay intervals
Relative activations associated with long compared with short delay intervals
Long compared with short delays were associated with significant evoked activity in temporal pole bilaterally (BA 21/38), medial temporal gyrus (BA 21) bilaterally, and inferior temporal gyrus (BA 20) and left hippocampus (all p < 0.05 corrected). There was also activation of inferior frontal gyrus (BA 44/45) bilaterally atp < 0.001 uncorrected (Fig. 4, Table 4).
Activations relative to control seen for both DMTS and DNMTS
Common task-related activations were determined from a conjunction analysis of DMTS compared with control and DNMTS compared with control (using separate blocks of the control task for each comparison) and defines areas commonly activated in both tasks. Significant evoked activity (at p < 0.001 uncorrected) was seen in anterior cingulate (BA 32) bilaterally, posterior medial frontal gyrus (BA 9) bilaterally, medial occipital gyrus (BA 19) bilaterally, left posterior parietal cortex (BA 7), and left cerebellum.
Modulation of activations associated with DMTS compared with DNMTS by different delays
These comparisons represent interaction terms in the factorial design. Activations of bilateral caudate and medial orbitofrontal cortex, associated with DMTS compared with DNMTS, were significantly greater at short than long delays (p < 0.001 uncorrected). Adjusted neural responses in caudate are shown in Figure5. Activation of precuneus, associated with DNMTS compared with DMTS, was significantly greater at short than long delays (p < 0.05 corrected) and there was also a significant activation of right amygdala (p < 0.05 corrected) not seen in the main effect expressed solely in DNMTS at short delays. The right anterior cingulate (BA 32), although activated in all four conditions, showed augmented activation for DNMTS at long delays (p< 0.001 uncorrected).
Adjusted neural response in the head of caudate under the four different conditions and low-level rest. The adjusted response is based on normalized counts, and this graph shows the relative counts associated with each condition. Because the numbers represent relative counts, there are no associated units.
DISCUSSION
The key findings of this study are of significant differences in neural response to DMTS and DNMTS tasks and, furthermore, significant differences in neural response to short and long delays. DMTS was associated with greater activation in caudate and medial orbitofrontal cortex. By contrast, DNMTS was associated with activation in premotor cortex, lateral orbitofrontal cortex, and mediodorsal thalamus. Comparisons between 5 and 15 sec delay intervals showed that neural responses in posterior perceptual regions were greater at shorter delays, whereas responses in temporal lobes and ventrolateral frontal cortices were greater at longer delays. Activations common to all four conditions were seen in anterior cingulate, medial frontal gyrus, and various posterior cortical regions. These regions overlap with those reported in earlier positron emission tomography (Swartz et al., 1994;Faillenot et al., 1997; Elliott and Dolan 1998) and fMRI (Courtney et al., 1996, 1997; Klingberg et al., 1997) studies using DMTS or related tasks.
The finding of differential neural responses to DMTS and DNMTS explicitly confirms that these tasks are not equivalent at the neural implementational level in humans. At debriefing, subjects reported their approach to DNMTS as identical to DMTS up until the final stage of response execution. For both tasks they attempted to remember the sample during the delay and identified the familiar stimulus at the choice stage. DNMTS diverges from DMTS in that rather than responding to this familiar stimulus, they respond to the other. On the basis of these subjective reports, it is plausible that DNMTS involves an additional processing stage to DMTS, necessitating inhibition of responses to familiar stimuli. It should be noted that this is the reverse of the situation in monkeys (Gaffan et al., 1984) and human infants (Diamond, 1991) in which the instinctive preference is for novel objects, so that DMTS rather than DNMTS involves the extra inhibition stage.
The premotor activation in DNMTS is consistent with this theory, because premotor cortex is associated with response inhibition in monkeys (Sawaguchi, 1996) and humans (Casey et al., 1997; Kiefer et al., 1998). However, ventral regions of the prefrontal cortex have most reliably been associated with response inhibition (Jones and Mishkin, 1972; Kowalska et al., 1991; Fuster, 1997), in accord with our observation of lateral orbitofrontal activation. Monkeys with lateral orbitofrontal lesions fail to inhibit responding on no-go trials of a go/no-go task (Iversen and Mishkin, 1970). In an fMRI study using a go/no-go task, Casey et al. (1997) demonstrated activation in several prefrontal regions, including BA 11, a region also activated in our present study. Casey et al. (1997) also found that activation of this region, but not dorsolateral prefrontal cortex, was significantly correlated with behavioral performance. Thus, more successful inhibition of inappropriate responses predicted greater activity in this region. There is also evidence that damage to the mediodorsal thalamus in humans produces deficits that may be interpreted in terms of response inhibition (Leng and Parkin, 1988; Joyce and Robbins, 1991;Forstl and Sahakian, 1993). The contention that the prefrontal and thalamic activations are associated with response inhibition in our study is supported, to some extent, by an absence of delay-related modulation of activity in these regions. If activations were delay-related, such modulation would be expected. The lack of delay dependency in lateral orbitofrontal cortex may be consistent with the observation that ventral prefrontal lesions in primates impair components of matching tasks that are not delay-related (Rushworth et al., 1997), although it should be noted that this study lesioned inferior convexity rather than orbitofrontal cortex. InterestinglyRushworth et al. (1997) used DMTS rather than DNMTS; however, as discussed, in monkeys it is DMTS that involves the additional inhibition stage.
DMTS compared with DNMTS was associated with activation of caudate and medial orbitofrontal cortex. The caudate has previously been implicated in animals performing various delayed response tasks with similarities to the paradigm used here (Divac, 1967; Levy et al., 1997). Human patients with basal ganglia disorders show deficits on delayed response tasks (Partiot et al., 1996) and a computerized DMTS task similar to that used in the present study (Sahakian et al., 1988; Owen et al., 1993; A. D. Lawrence, personal communication). These deficits are independent of delay interval, consistent with our finding that caudate activation does not increase at the longer delay. It has been suggested (Packard and White, 1991; MacDonald and White, 1993) that the dorsal striatum is particularly involved in memory tasks with a win–stay component, in which a key requirement is to make stimulus–response associations. Similarly, Rolls (1994) has argued that caudate neurons respond to stimuli that signal behavioral responses. Medial orbitofrontal cortex has also been implicated in tasks requiring subjects to associate stimuli with appropriate behavioral responses (Bechara et al., 1994; Rolls et al., 1994). It has been argued, based on primate lesion studies, that the involvement of medial orbitofrontal cortex in DMTS and DNMTS may be attributable to learning of associations between stimuli and rewarded responses (Meunier et al., 1997).
Thus, a possible explanation for caudate and medial orbitofrontal activations associated with DMTS, but not DNMTS, is that they represent neural responses to stimuli that specify the actual response. Although these tasks are more complex than classic stimulus–response habit tasks, they do involve a component of mapping stimuli to correct responses. In DMTS, as soon as subjects see the sample stimulus, they know the stimulus they must subsequently respond to. By contrast, in DNMTS the sample stimulus specifies a nonresponse. Thus in DMTS there is a greater association between a stimulus, or its representation during the delay, and the specification of the correct behavioral response. Neural responses in the caudate and medial orbitofrontal cortex may mediate this association. These activations are modulated by delay, being greater at shorter delays, maybe reflecting the fact that stimulus–response mapping is strongest with closer temporal linkage. Also, as we will argue, at short delays representation of a more veridical form of the stimulus, rather than a relatively abstract description, is linked to the required response.
Our other main finding was the distinct pattern of activation associated with varying delay intervals. Although both 5 and 15 sec would be considered within the scope of “short-term” memory, there are pronounced differences in the associated neural substrates. At 5 sec, greater activation is seen in occipital and medial parietal cortices relative to 15 sec delays. At 15 sec, greater activation is seen in temporal lobes and ventrolateral frontal cortices. The temporal lobe activations include medial temporal structures of hippocampus and neighboring rhinal cortices that have been strongly implicated in primate studies of DMTS (Gaffan and Murray, 1992; Meunier et al., 1993). Our delay-dependent findings are consistent with observations in animals that both temporal and frontal lobe lesions produce more severe deficits on DMTS and DNMTS as delay increases (Alvarez et al., 1992,1995; Bauer and Fuster, 1976). They also accord with observations that in humans with temporal or frontal lesions, there is a trend toward more pronounced DMTS impairment at longer delays (Owen et al., 1995). Patients with dementia of the Alzheimer type, characterized by pronounced temporal lobe pathology, also show delay-dependent DMTS deficits (Sahakian et al., 1988).
The differential neural responses at the two delay intervals suggest different processing requirements under these conditions. The predominant perceptual cortex activations at shorter delays suggest a greater dependence on primarily visual representations of the stimulus during the delay interval. Phillips (1974) characterized a distinct short-term visual memory system in which recent visual events are actively visualized and persist for delays of up to 10 sec, in the absence of intervening distracters. Memory over longer delays requires conversion of visual representations into more structural descriptions. Our findings suggest that such a short-term posticonic memory system may be subsumed by posterior cortical perceptual systems. Once the delay exceeds the limits of this system, other regions are recruited to mediate more abstract representations. Consistent with this account is subjects’ reported reliance on a “snapshot” approach at short delays, whereas at longer delays, they rehearsed a verbal description of the stimuli. Increased activation in temporal inferior frontal cortices would accord with a greater reliance on verbal coding. However, it should be noted that increasing delay also implicates these regions more strongly in monkey studies, and therefore it seems unlikely that verbal mediation is the only factor involved.
In conclusion, this study provides interesting insights into the neural substrates of classic short-term memory tasks. We have demonstrated that, although there are common substrates that mediate DMTS and DNMTS in humans, there are also pronounced differences. We have proposed that the differential activations seen in DNMTS compared with DMTS may be associated with the requirement to inhibit prepotent responses to familiar stimuli. Activations in the reverse comparison may reflect associations between representations of the test stimulus and anticipated behavioral response. The neural substrates of both tasks depend critically on the delay period, with performance at shorter delays dependent on early perceptual cortices, whereas longer delays involve temporal and frontal cortices, perhaps reflecting more abstract stimulus representation.
Footnotes
R.J.D. is supported by the Wellcome Trust. We are grateful to Prof. Trevor Robbins (University of Cambridge, Cambridge, UK) for helpful comments and discussion.
Correspondence should be addressed to Dr. Rebecca Elliott, Wellcome Department of Cognitive Neurology, 12 Queen Square, London WC1N 3BG, UK.