Abstract
Humans adaptively control reaching movements to maintain good performance in the presence of novel forces acting on the arm. A recent study suggested that motor memories of different force conditions are not transformed from fragile to stable states, but rather are always vulnerable to interference from newly learned conditions (Caithness et al., 2004). This is contrary to the results of previous studies (Brashers-Krug et al., 1996; Shadmehr and Brashers-Krug, 1997), although all of these studies followed similar methods. Here, we show that a seemingly insignificant and inconsistently applied methodological detail may reconcile this discrepancy. Catch trials, in which the novel forces are removed, may be randomly interspersed among the more frequent force trials to assess how a subject is learning to predict the pattern of forces. In the absence of an interfering condition, subjects retained their learning until retest a day later regardless of whether they experienced catch trials. But in the presence of an interfering condition, only the subjects who had experienced forces intermittently retained their learning and thereby showed resistance to the interference. Thus, intermittent rather than constant practice conditions appear to be critical for dynamic motor memory stabilization.
Introduction
The memory consolidation hypothesis states that a memory is transformed from a labile to a stable state in a gradual, time-dependent manner (McGaugh, 2000). Consolidation in the motor system was originally demonstrated by Shadmehr and coworkers (Brashers-Krug et al., 1996; Shadmehr and Brashers-Krug, 1997) using a task in which subjects learned to reach in novel dynamical environments, or “force fields,” created by a robotic manipulandum.
To track adaptation to these force fields, the initial studies would intermittently turn the force field off (i.e., present a null field) during random trials referred to as “catch trials.” This manipulation was not thought to impact memory consolidation, but was merely used as a tool to assay field-specific learning (Brashers-Krug et al., 1996; Shadmehr and Brashers-Krug, 1997). In a more recent study, Caithness et al. (2004) failed to show consolidation in the force field task, but rather found continued sensitivity to an interfering condition even at 1 d or more after learning. But Caithness et al. (2004) also did not use catch trials as in the original literature. Instead, they presented the force field relatively continuously during each learning session, assessing specificity of learning with a block of null-field trials only after learning was completed.
Here, we tested whether the presence or absence of catch trials during the course of force field learning could explain the different outcomes of these studies. In particular, we tested whether subjects' force field learning could survive exposure to an interfering force field and whether such retention varied depending on their experience of catch trials or continuous forces. Our results indicate that subjects practicing with or without such interruptions of the force field do indeed demonstrate markedly different patterns of skill retention.
Materials and Methods
Subjects
This study involved 40 right-handed, English-speaking subjects (mean, 23.0 years old; 21 males). Participants were screened for history of neurological illness, epilepsy, seizures, head injuries, and neurosurgery. Ethical approval was obtained through the Massachusetts Institute of Technology Committee on the Use of Humans as Experimental Subjects. Subjects were required to sleep at least 6 h in the two nights preceding and the night during the experiment. The 40 participants were assigned to four experimental groups of 10 subjects each in a 2 × 2 design, defined by (no or one) interfering epoch between learning and retest epochs and (0 or 20%) catch trials in all force epochs.
Paradigm
Task.
Subjects held onto a robotic manipulandum and used it to make visually guided reaching movements. During these movements, the forearm was held approximately level by means of a ceiling-mounted sling. The targets were white 1-cm-wide squares appearing on a black background, as displayed on a vertically oriented monitor above the planar manipulandum apparatus. Target distances were 10 cm both in real space and as shown on the monitor. Motions of the handle were represented on the screen as continuous movements of a 0.8-cm-wide white crosshair. The six peripheral targets were spaced hexagonally around a seventh, central target to ensure equidistant separation of neighboring targets. Participants were given 0.5 ± 0.5 s to complete each movement, starting from the moment the target was presented and lasting until their cursor entered the target. The target remained on the screen even if subjects failed to reach it in the allotted time. On acquiring the target, an additional 0.5 s within-target hold time had to be observed in order for the trial to be a “success.” Trial success was indicated to the subject by a transition in the color of the target from white to green just before its disappearance. If either of the conditions was not met, the trial failure was instead indicated by a target transition from white to red.
Design.
Subjects first completed a “baseline” epoch in which no force field was applied, followed 5 min later by a “learning” epoch in which a clockwise velocity-dependent rotational force field was applied. An “interference” epoch with a counterclockwise force field was then presented 6 h later to two of the groups. Finally, all subjects returned 24 h after the end of their previous epoch for a “retest” epoch in the clockwise force field. The epochs themselves required ∼8 min to complete. We did not control the activities of the participants in the remaining intervals, except to require that the 24 h interval include at least 6 h of sleep (verified by subject report).
Instructions.
The basic task was described, and subjects were asked not to “anticipate” the targets but merely to move as “naturally” as possible. They were informed that targets would often be difficult to reach in the allotted time, particularly at the beginning of the experiment. The trial “success” feedback was described, but subjects were told it was given merely to help them make their movements as consistently as possible, and not to evaluate how successful they were. As a safety precaution, participants were warned before each epoch that they might experience “forces” generated by the robot.
Target sequence.
Each epoch was defined by a pseudorandom sequence of 216 targets that was the same for all subjects. All participants completed each movement specified by the target sequence; trials were not aborted if they failed to reach the target on time. After each trial and an additional 0.1 s period, a new target appeared and the former target became the “origin” for the next movement. The first origin in each epoch was always the central square. From each of the six peripheral origins, the possible targets were restricted to one of the three neighboring equidistant targets. From the central origin, movements to only three of the six candidate directions were allowed. The target sequence was constrained to have equal frequencies of both the six movement directions possible from the set of peripheral targets and the three directions allowed from the central target. In addition, we equalized the frequency of each movement origin/target combination as much as possible, by requiring that each target sequence contain at least seven trials in each of these 21 movement path possibilities.
Forces.
All subjects experienced a null (0 N · s/m) force field during the baseline epoch, a velocity-dependent clockwise field in the learning and retest epochs, and a counterclockwise field in the interference epoch. The curl forces, of magnitude 10 N · s/m, were calculated on-line as where (clockwise) or (counterclockwise) and ẋ was the endpoint (i.e., hand) velocity.
Catch trials.
On these trials (experienced by one-half of the subjects), the motors were turned off suddenly, silently, and with no warning to the subject, just as the target was presented. Catch trials occurred on the same trials for both groups of catch trial subjects. These trials were preprogrammed pseudorandomly such that in each epoch, exactly two catch trials occurred among each of the 21 movement paths defined by the allowable origin and target combinations (see above, Target sequence). In addition to these 42 catch trials per epoch, on the “zeroth trial” of each epoch all catch and noncatch trial subjects moved the cursor to the central origin in the absence of forces, to begin the epoch proper. Thus, of the 217 trials including the zeroth trial, 43 (or ∼20%) were performed without forces by the catch trial subjects. There was no additional constraint that they were to occur at a regular rate within each epoch, but at least one catch trial was present in each 18-trial bin.
Analysis
Velocity vector correlation. Adopting a traditional method of gauging force field adaptation (Shadmehr and Brashers-Krug, 1997; Caithness et al., 2004), we computed the similarity between subjects' velocity profiles in the force field epochs with their velocities late in the baseline epoch. Velocity data were obtained from manipulandum tachometer signals, translated into endpoint coordinates. Given an [x,y] pair of velocity vectors n for the null-field baseline epoch and a corresponding vector pair f for a force field trial, the measure computes the correlation coefficient as follows: Following Caithness et al. (2004), we calculated the baseline x and y velocity vectors as the average of the last four baseline trials performed in each of the (21) allowable movement paths. Note that this procedure differs from that of Shadmehr and Brashers-Krug (1997), who did not calculate average baseline speed vectors but rather selected individual baseline speed vectors that were optimally correlated to subsequent vectors. Because a 0.5 ± 0.5 s time limit was applied to the sum of subjects' reaction plus movement time, rather than a tighter 0.5 ± 0.05 s time limit applied to the movement time alone (Caithness et al., 2004), we adjusted the method to allow for greater variability in the timing and duration of the velocity profiles. Specifically, we aligned the profiles on the time of peak speed rather than the time that a minimum speed threshold was met, and we used 1 s rather than 0.5 s velocity vectors (Caithness et al., 2004). In doing so, we averaged together or correlated the vectors only over the portions of the vectors that overlapped one another.
Learning index. An additional measure of performance allowed us to gauge the degree to which catch trial subjects learned to anticipate the forces, rather than react to them by cocontraction (Donchin et al., 2002). This “learning index” was based on the signed (counterclockwise–clockwise) perpendicular displacement (PD) of each movement at 300 ms after reaction time. Position data were derived from manipulandum joint encoder position signals, translated into endpoint coordinates. Displacement was defined relative to a straight line connecting the centers of the origin and target squares and thereby defining an idealized trajectory. Reaction time was defined as the time the speed profile reached 20% of its peak on the trial. Several other measures of trajectory deviation (e.g., signed peak perpendicular deviation, deviation angle or area) yielded qualitatively similar results. To summarize the catch trial subjects' internalization of each field, we computed a learning index (LI) that expressed performance in each trial bin as a ratio of PDs on catch trials and all trials: The measure is like that of Donchin et al. (2002), except that we allow the numerator to be signed to make clear the distinction between the two curl fields.
Comparisons.
Trials in which the subject failed to reach the target in the allotted 0.5 ± 0.5 s time limit were still included in the analysis, unless the reaction time was <0.2 s or the combined reaction plus movement time was >1.5 s. Trials were binned by 18 trials in each epoch and condition. Catch trials were excluded from the trial bins for the CC measure. We do not present data from the baseline epoch because performance, as described in the preceding two sections, was either defined relative to this epoch (CC) or based on catch and fielded trials (LI). Statistical results were based on the within-bin-averaged CC or LI values. The transfer of clockwise field learning across epochs was estimated by comparing the mean scores of the first two 18-trial bins in the retest epoch with the last two trial bins in the learning epoch. Main and interaction effects of between-subjects interference and catch trial frequency and within-subjects time bin (or bin average, in the case of learning transfer) were assessed using repeated-measures ANOVAs. All significant effects (at the p < 0.01 level) are reported.
Results
Subject trajectories were deviated in the patterns typical of rotational force fields (Fig. 1A). After the introduction of a clockwise force field in the learning epoch, movements from the center to the periphery became deviated in a clockwise direction. Essentially the opposite pattern of errors was evident in movements performed with the counterclockwise field during the interference epoch 6 h later, but clockwise-deviated movements were again expressed at retest 24 h later still. The two curl fields were also associated with multimodal velocity profiles (Fig. 1B), particularly in the learning epoch, indicating corrective submovements that became attenuated with practice.
To quantify performance in the task, we computed the velocity vector correlation between force trials (in the learning, interference, retest epochs) and null trials (baseline epoch; see Materials and Methods). Results from the two control groups (with and without catch trials) experiencing no interference are shown in Figure 2A, whereas the two groups who did experience an interference condition are shown in Figure 2B.
Within the learning epoch, all four groups experienced a comparable increase in performance as measured by the velocity vector correlation. In an interference by catch trial by time bin repeated-measures ANOVA of this epoch, the only significant effect was that of time (F(11,396) = 17.72; p < 0.0001). Within the interference epoch, both 0 and 20% catch trial groups demonstrated similar performance in the counterclockwise field, with the only significant effect being that of time (F(11,198) = 2.89; p = 0.0053). Within the retest epoch, there was again a significant effect of time across the four groups (F(11,396) = 11.11; p < 0.0001), but both the presence of an interfering epoch and the presence of catch trials influenced recall (interference by time: F(11,396) = 3.15, p = 0.0016; interference by catch trial by time: F(11,396) = 3.62, p = 0.0003). The effect of catch trials in the retest epoch was limited to the interference groups (Fig. 2B): only among these subjects was there a significant catch trial by time interaction (F(11,198) = 3.03; p = 0.0033), in addition to a significant main effect of time (F(11,198) = 10.17; p < 0.0001). Noninterference groups (Fig. 2A) only showed the effect of time (F(11,198) = 3.76; p = 0.0002).
To describe these recall differences in reference to initial clockwise field learning, we compared the average of the final two learning-epoch trial bins with the average of the first two retest bins, for all groups, using an interference by catch trial by bin average ANOVA. The relative retest performance of the group given an interference epoch but no catch trials appeared to be deviant relative to the other groups (Fig. 2A,B). Indeed, we found significant effects not only of bin average (F(1,36) = 21.20; p < 0.0001) but of interference by catch trial by bin average (F(1,36) = 6.07; p = 0.0090).
The retest difference between the 0 and 20% catch trial subjects (Fig. 2B) did not appear to be attributable to coactivation by the latter subjects. Subjects given 20% catch trials (with or without an interference epoch) appeared to internalize the clockwise force field in the learning epoch (Fig. 2C), as measured by an increasing LI (i.e., aftereffects on catch trials expressed as a fraction of total, absolute trajectory error; see Materials and Methods). Indeed, an interference by time bin repeated-measures ANOVA revealed a significant effect only of time on the LI (F(11,19) = 14.84; p < 0.0001). A comparable development of clockwise LI characterized the interference epoch, where a repeated-measures (time bin) ANOVA on the interference subjects alone again revealed a significant effect of time (F(11,9) = 5.77; p < 0.0001). In the retest epoch, an interference by time bin ANOVA again found only a significant effect of time (F(11,19) = 9.88; p < 0.0001). A small difference in the LI initially apparent between the groups given or spared an interfering field (Fig. 2C) may have underlain a nonsignificant trend toward an interference by time interaction (p = 0.0310).
Discussion
Our results indicate that participants who had experienced the force field intermittently, because of the presence of catch trials, appeared uniquely able to retain their initial learning regardless of whether they had subsequently experienced an interfering force field. Catch trials have previously been shown to lead to an undercompensation for forces during initial learning (Thoroughman and Shadmehr, 2000), but their effect on learning retention has not previously been appreciated. In the following discussion, we consider several features of the catch trial subjects' experience that could account for the relative stability of their motor memory: 1) the characteristic error feedback they generated; 2) the presence of interruptions of the force field; and 3) the unpredictability of these interruptions.
First, the distinct errors experienced on catch trials might well emphasize the different dynamic conditions experienced on force trials, but how precisely this error feedback impacts learning retention is uncertain. Our paradigm bears resemblance to those used by Wada et al. (2003) and Osu et al. (2004), who alternated two oppositely directed velocity-dependent fields within epochs. These studies found participants could recall each field when the fields were presented with distinct sensory cues. It is conceivable that our subjects learned to associate their distinctive visual and proprioceptive error on catch and force trials with the null and curl fields, respectively. Distinct error feedback could also have caused subjects to consciously attend to the perturbations they experienced (Malfait and Ostry, 2004). However, Klassen et al. (2005) found that subjects were able to recall the force field of a previous day regardless of whether they had been introduced to the full force field gain from the first trial of learning or had had the field presented incrementally. This study implies that salient movement feedback errors are not required for the recall of dynamic motor learning, even if they may facilitate cognitively mediated interlimb transfer of the adaptation (Malfait and Ostry, 2004). Our results are consistent with this latter finding insofar as both the 0 and 20% catch trial subjects were identically able to recall their uninterfered clockwise force field learning from the previous day (Fig. 2A), despite the characteristic aftereffect errors of the 20% catch trial subjects (Fig. 2C).
Second, the catch trial subjects' apparent memory stabilization did not likely benefit simply from interruptions of the force field. At a cellular level, visual activity-induced synaptic modifications are afforded resistance against subsequent interference from white-noise stimuli by intermittent, rather than continuous, presentation of the initial stimuli (Zhou et al., 2003). In the motor system, our results parallel previous findings showing long-term benefits of distributed rather than massed practice (Schmidt and Lee, 1999). But such interruptions, in and of themselves, may not be sufficient to stabilize motor memories. Research into saccadic adaptation by primates with either a block of catch trials or an equivalent period of rest without visual inputs suggests that actual practice under null conditions is required to cause active unlearning and relearning of the novel condition (Kojima et al., 2004).
Third, this “active” learning, and memory stabilization, may have benefited from the relative unpredictability of the catch trials. Monotonous repetition of a motor behavior leads to superior immediate performance, but practice with some degree of randomness facilitates long-term retention (Shea and Morgan, 1979; Proteau et al., 1994; Osu et al., 2004). Similarly, in operant conditioning experiments, it has long been known that variable reinforcement schedules lead to associations that are more resistant to extinction than those generated by continuous reward regimens (Skinner, 1953). Although the common interpretation is that subjects are more motivated to continue producing a behavior if they have even a slight, probabilistic expectation of reward (Myers, 1998), we suggest that the durability of these associations could also be a result of their greater stabilization in memory.
Note that the unpredictability of the forces experienced by the catch trial subjects did not lead them to use a muscular coactivation strategy, as demonstrated by their pattern of increasing aftereffects on catch trials in each epoch (Fig. 2C). Although cocontraction is a reasonable response when the force field gain on each trial is genuinely unpredictable (Takahashi et al., 2001), we did not expect such a strategy in our experiment, given both that the field was still present on a majority of trials, and that a 1/5 catch trial rate is comparable with the 1/6 rate previously associated with force field internalization (Shadmehr and Brashers-Krug, 1997). However, coactivation by the participants given no catch trials was also possible, and cannot be ruled out because they did not experience the catch trials by which we could gauge such coactivation.
Although the above discussion assumes that intermittent rather than continuous presentation of forces was associated with protection from retrograde interference, anterograde interference effects are also possible. Indeed, among catch trial subjects, the time course of their aftereffects appeared to be different in the learning and interference epochs, with prominent aftereffects delayed until about halfway through the latter (Fig. 2C). This pattern suggests that 6 h after learning the clockwise field, the catch trial subjects still experienced anterograde interference (Miall et al., 2004) when acquiring the counterclockwise field. At retest, these subjects might have experienced additional anterograde interference, given their initially lower, if statistically similar, performance relative to noninterference subjects (Fig. 2C). Such anterograde interference has been observed previously even 24 h after an interfering condition (Shadmehr et al., 1998; Donchin et al., 2002). It is possible that the 0% catch trial subjects' relatively poor recall after interference (Fig. 2B) could have been attributable to greater anterograde interference at retest relative to the 20% catch trial subjects, rather than greater retrograde interference (Miall et al., 2004).
Whatever the relationship between intermittent practice and motor memory stabilization, and whatever the direction of interference between epochs, our empirical findings at least suggest that subjects given uninterrupted force fields were uniquely susceptible to an interfering field, even at 6 h after learning. The presence or absence of catch trials during force field learning can thus reconcile the differences between Brashers-Krug et al. (1996) and Caithness et al. (2004). This is the case although our paradigm is a distinctly more generalized version of the standard center-out reaching task, with more variability in movement paths and their ordering, and in movement speed. As such, our paradigm provided a test for the generality, as well as the validity, of the original motor consolidation result (Brashers-Krug et al., 1996).
Although our experiment was designed to test the stability of motor memories, it did not explore the second aspect of consolidation theory, namely the time course of stabilization. Additional work, with careful attention to methodological detail, will be needed to probe determinants of this time course (Krakauer et al., 2005).
Footnotes
-
This work was supported by National Institutes of Health Grant K23-MH65434 (D.Z.P.). We thank Martin Ramos Rizo-Patron for engineering assistance.
- Correspondence should be addressed to Dr. Daniel Z. Press, Assistant Professor in Neurology, Beth Israel Deaconess Medical Center and Harvard Medical School, 330 Brookline Avenue, Boston, MA 02215. dpress{at}bidmc.harvard.edu