Abstract
Temporal orienting of attention plays an important role in our day-to-day lives and can use timing information from exogenous or endogenous sources. Yet, it is unclear what neural mechanisms give rise to temporal attention, and it is debated whether both exogenous and endogenous forms of temporal attention share a common neural source. Here, older adult nonmusicians (N = 47, 24 female) were randomized to undergo 8 weeks of either rhythm training, which places demands on exogenous temporal attention, or word search training as a control. The goal was to assess (1) the neural basis of exogenous temporal attention and (2) whether training-induced improvements in exogenous temporal attention can transfer to enhanced endogenous temporal attention abilities, thereby providing support for a common neural mechanism of temporal attention. Before and after training, exogenous temporal attention was assessed using a rhythmic synchronization paradigm, whereas endogenous temporal attention was evaluated via a temporally cued visual discrimination task. Results showed that rhythm training improved performance on the exogenous temporal attention task, which was associated with increased intertrial coherence within the δ (1-4 Hz) band as assessed by EEG recordings. Source localization revealed increased δ-band intertrial coherence arose from a sensorimotor network, including premotor cortex, anterior cingulate cortex, postcentral gyrus, and the inferior parietal lobule. Despite these improvements in exogenous temporal attention, such benefits were not transferred to endogenous attentional ability. These results support the notion that exogenous and endogenous temporal attention uses independent neural sources, with exogenous temporal attention relying on the precise timing of δ band oscillations within a sensorimotor network.
SIGNIFICANCE STATEMENT Allocating attention to specific points in time is known as temporal attention, and may arise from external (exogenous) or internal (endogenous) sources. Despite its importance to our daily lives, it is unclear how the brain gives rise to temporal attention and whether exogenous- or endogenous-based sources for temporal attention rely on shared brain regions. Here, we demonstrate that musical rhythm training improves exogenous temporal attention, which was associated with more consistent timing of neural activity in sensory and motor processing brain regions. However, these benefits did not extend to endogenous temporal attention, indicating that temporal attention relies on different brain regions depending on the source of timing information.
Introduction
Attention is known to be deployable in both space and time. The allocation of attention, and hence the prioritization of the input of information, at a particular point in time is called temporal attention. Temporal attention plays an important role in our day-to-day lives as the precise and accurate prediction of timing results in optimized performance across numerous cognitive domains. For example, temporal attention is used to anticipate the onset of phonemes and words, which facilitate language comprehension (de Diego-Balaguer et al., 2016). It is also used to help time the gaps in traffic when attempting to cross a busy street. Temporal attention can be further categorized into endogenous (top-down) and exogenous (bottom-up) mediated processes. Exogenous temporal orienting of attention takes place in response to rhythmic stimuli or stimuli with a predictable temporal structure, which is also referred to as beat-based or rhythmic timing (Coull and Nobre, 1998, 2008; Repp et al., 2008; Cravo et al., 2017). Thus, exogenous temporal attention refers to the process by which attention is oriented to the environment to drive internal timing representations. Endogenous temporal orienting of attention, on the other hand, relies on symbolic cues or nonrhythmic stimuli to infer timing information, and is also referred to as event-based, duration-based, or memory-based timing (Large and Jones, 1999; Olson and Chun, 2001; Nobre and van Ede, 2018). As such, endogenous temporal attention refers to the process by which attention is oriented internally for timing information. Interestingly, both exogenous and endogenous temporal attention becomes impaired with age, and negatively affects cognitive abilities that rely on the precise orienting of attention in time (Zanto et al., 2010, 2011, 2019; Bollinger et al., 2011; Padgaonkar et al., 2017). In this study, we aim to assess whether temporal attention in healthy older adults can be improved through musical rhythm (exogenous temporal attention) training, and to characterize the neuroplastic changes that underlie such improvements.
During musical performances, exogenous temporal attention is used to synchronize the musicians playing with rhythmic sources of timing information, such as a metronome, a conductor, or music from other band members. On the other hand, endogenous temporal attention is needed when rhythmic timing information is not available (as in a solo performance) and when a musician must rely on their own internal (memory-based) timing mechanisms. Thus, musical performances use both endogenous and exogenous temporal attention, albeit under different contexts. It is therefore not surprising that previous research has shown a musicianship advantage in both exogenous and endogenous timing abilities (Franĕk et al., 1991; Rammsayer et al., 2012; Cameron and Grahn, 2014; Janzen et al., 2014; Zanto et al., 2019). Yet, it remains unclear whether exogenous and endogenous temporal attention relies on common neural mechanisms, which would enable training on one to improve the other. Here, we directly address this question by providing musical rhythm training for older adults with specific demands on exogenous temporal attention ability (i.e., synchronizing movements to beats). We then characterize the mechanisms that give rise to improvements in exogenous temporal attention and assess whether such changes transfer to improved endogenous temporal attention.
In support of the notion that exogenous and endogenous temporal attention utilizes a common neural network, a recent meta-analysis of exogenous and endogenous timing tasks have shown that both task types evoke activity in overlapping brain regions (Teghil et al., 2019). These regions include the supplementary motor area, intraparietal sulcus, inferior frontal gyrus, insula, and basal ganglia. Indeed, patients with basal ganglia degeneration have shown deficits in both exogenous and endogenous timing tasks (Cope et al., 2014), suggesting a critical role of the basal ganglia in timing ability. Furthermore, both exogenous and endogenous temporal attention has been associated with oscillatory neural activity within the δ (1-4 Hz) (Stefanics et al., 2010; Gomez-Ramirez et al., 2011; Cravo et al., 2013; Arnal et al., 2015; Wilsch et al., 2015; Arnal and Kleinschmidt, 2017; Breska and Deouell, 2017), α (8-12 Hz) (Praamstra et al., 2006; Rohenkohl and Nobre, 2011; Zanto et al., 2011; Breska and Deouell, 2014), and β (12-30 Hz) (Snyder and Large, 2005; Zanto et al., 2005, 2020; Androulidakis et al., 2007; Fujioka et al., 2009; Todorovic et al., 2015; Morillon and Baillet, 2017; Breska and Ivry, 2020; Graber and Fujioka, 2020) bands, although it remains unclear whether this oscillatory neural activity emanates from a common neural source. Nonetheless, these data collectively support models of a unified timing mechanism (Treisman, 1963; Miall, 1989; Ivry and Schlerf, 2008; Teki et al., 2012), which would enable musical training focused on exogenous temporal attention to yield benefits in endogenous temporal attention.
Conversely, there are numerous studies that suggest endogenous and exogenous temporal attention relies on separate neural mechanisms (Pashler, 2001; McAuley and Jones, 2003; Rohenkohl et al., 2011; Bouwer et al., 2020). There is a rich literature demonstrating that the cerebellum is uniquely involved in endogenous, not exogenous, timing tasks (Ivry et al., 2002; Spencer et al., 2005, 2007). Indeed, a double dissociation has been identified between patients with cerebellar or basal ganglia degeneration, such that the former exhibits a deficit in endogenous, not exogenous, timing, whereas the latter exhibits a deficit in exogenous, not endogenous, timing (Breska and Ivry, 2018). Endogenous temporal attention has also been uniquely associated with the inferior olive, whereas exogenous temporal attention has been uniquely associated with the putamen, caudate nucleus, and thalamus (Teki et al., 2011). This research supports models suggesting that timing is a distributed process that relies on modality- or context-specific neural regions (for review, see Mauk and Buonomano, 2004; Ivry and Schlerf, 2008; Paton and Buonomano, 2018). As such, if musical training that emphasizes exogenous temporal attention does not result in improvements in endogenous temporal attention performance, this supports the idea that these timing abilities rely on different neural mechanisms.
Understanding the source of temporal attention ability remains an active area of investigation. Although numerous brain regions and oscillatory signatures have been previously associated with temporal attention (as noted above), much less research has been dedicated to understanding which of these brain regions gives rise to this oscillatory activity. Here, we assessed changes in oscillatory EEG data during an exogenous temporal attention task both before and after rhythm training. Analysis focused on δ phase coherence and α/beta power in line with prior research that associated these frequency bands with temporal attention. Source localization was then conducted to elucidate which neural regions give rise to training-related changes in oscillatory signatures of exogenous temporal attention. Finally, we assessed whether rhythm training also improved endogenous temporal attention ability, which would address whether endogenous/exogenous temporal attention utilizes a common mechanism.
To achieve these goals, older adult (aged 60-79 years) nonmusicians were randomized to receive either rhythm training or active control training (word search). Older, rather than younger, adults were selected because we have previously demonstrated that older adults exhibit deficient performance on our assessments of exogenous (Zanto et al., 2019) and endogenous (Zanto et al., 2011) temporal attention. Selecting this population diminishes the likelihood that ceiling effects would mask potential training-related improvements. Pre-/post-training, exogenous temporal attention was assessed with a rhythmic synchronization paradigm that we have previously validated (Zanto et al., 2019). This paradigm has been found to separate adults based on age and musicianship, such that young musicians exhibit the highest performance, followed by young nonmusicians, older musicians, and older nonmusicians in that order. We hypothesized that only the rhythm training group would exhibit improvements in exogenous temporal attention because only their training paradigm placed demands on exogenous temporal attention ability. To control for the possibility that sensory or motoric changes may have occurred following training, a sensorimotor response task was also implemented before and after training. Finally, to assess whether musical rhythm training benefits endogenous temporal attention, a temporally cued visual discrimination task was used before and after training. We have previously demonstrated that only younger, not older, adults benefit from predictive temporal cues during this task (Zanto et al., 2011). Here, we hypothesized that if exogenous and endogenous temporal attention share a common neural mechanism, then the rhythm training group would show an improvement in their ability to use predictive temporal cues and exhibit a similar change in oscillatory neural activity across the exogenous and endogenous temporal attention tasks.
Materials and Methods
Experimental design and statistical analysis
In this preregistered study (Open Science Framework; https://osf.io/fynpd), participants were randomized to either a musical rhythm training group (Rhythmicity) or a word search training group (Control). Each group engaged with the training paradigm for 8 weeks (5 d per week, ∼20 min per day) in their own home. Pre- and post-training, participants were assessed in-laboratory on their endogenous and exogenous temporal attention abilities, which served as the primary outcome measures for this study. Short-term memory performance was also collected, but not reported here as it is outside the scope of the temporal attention hypotheses and was published previously (Zanto et al., 2022). While engaged in the outcome assessments, EEG data were recorded.
Statistical analysis of behavioral performance used an ANOVA with a Greenhouse–Geisser correction when appropriate. To assess exogenous temporal attention, a rhythmic synchronization paradigm was used (described below) and analysis focused on performance asynchrony and variability. These data were submitted to separate ANOVAs with Group (Rhythmicity, Control) as a between-subjects factor and Time (Pre, Post) and Tempo (Slow, Medium, Fast) as within-subjects factors. Although circular statistics could be used to quantify performance asynchrony and variability, the calculation of circular statistics incorporates the tempo of the stimuli. As such, it is a biased measure when comparing across different tempos. Therefore, we opted to use linear statistics to quantify performance asynchrony and variability, which we have previously shown to be a more sensitive method to differentiate performance between age groups, tasks, and stimulus types (Zanto et al., 2019).
To assess endogenous temporal attention, a temporally cued visual discrimination task was conducted (described below). Analysis focused on accuracy and response time performance. These data were submitted to separate ANOVAs with Group (Rhythmicity, Control) as a between-subjects factor and Time (Pre, Post) and Condition (Predictive, Neutral) as within-subjects factors.
Finally, analysis of EEG data focused on event-related spectral perturbations (ERSPs) and intertrial coherence (ITC) in sensor space, as well as ITC in source space. t tests were used to compare between groups and time points (pre-/post-training). To correct for multiple comparisons, p values were cluster-corrected across electrodes/voxels based on Monte Carlo simulations. Corrected p values < 0.05 were considered significant.
Participants
Forty-seven older adult nonmusicians aged between 60 and 79 years were recruited for this study (24 females). Nonmusicians were defined as having <3 years of formal instrument, song, or dance instruction, and have not engaged in playing an instrument within the last year. All participants gave informed consent as approved by the University of California San Francisco Institutional Review Board. Participants were randomized into 1 of 2 groups: a musical rhythm training group (Rhythmicity) or a word search training group (Control). Both participants and researchers were blinded to the group assignments. Participants had no history of neurologic or psychiatric disease, no history of brain tumors, were not taking medications that modulate brain excitability (e.g., neuroleptic, antidepressant, stimulant, hypnotic), no amblyopia, strabismus, or color blindness, and had normal or corrected-to-normal vision and hearing. To ensure participants were cognitively healthy, all participants scored within 2 SDs of standardized scores on 12 tests of neuropsychological and physical function: California Verbal Learning Test-II, animal fluency, digit symbol, Patient Health Questionnaire, Delis-Kaplan Executive Functioning System Trails, Number and Number-Letter, Stroop, Measurement of Everyday Cognition, Ishihara Color Deficiency test, physical assessments (chair sitting and standing speed), hearing, and visual acuity. Five Rhythmicity participants were dropped from the study (3 for noncompliance, 2 for device malfunctions that prevented training/schedule adherence), resulting in 18 participants used for data analysis in the Rhythmicity group (mean = 67.7 years, SEM = 1.2, range = 60-79 years, 7 female). Five Control participants were dropped from the study (4 for noncompliance, 1 for technical problems during data collection), resulting in 19 participants used for data analysis in the Control group (mean = 69.0 years, SEM = 1.4, range = 62-78, 11 female). This number of participants in the final analysis was sufficient to achieve >80% power at a 95% significance threshold, based on our prior research with these temporal attention assessments in older adults (Zanto et al., 2011, 2019). Specifically, we calculated power using the smallest effect size from previously observed three-way interactions, which included 1 between-subject factor and 2 within-subject factors as implemented here. Participants received $15 per hour for participation and a $50 bonus for completion of the study.
Training paradigms
Musical rhythm training was conducted using a custom-designed video game, Rhythmicity (Fig. 1A). Rhythmicity was created at Neuroscape and incorporates closed-loop adaptive algorithms to consistently challenge cognitive function, which is thought to optimize training benefits (Mishra et al., 2016) and can be used to improve multiple cognitive abilities (Anguera et al., 2013; Mishra et al., 2014; Ziegler et al., 2019). It was played on a Microsoft Surface three tablet that permits tapping on a screen akin to certain types of drumming. Participants tapped the screen of the tablet in synchrony with the musical “beat,” which was also visually cued by moving targets (aliens). The game is designed to challenge rhythm and timing abilities, such that a nonmusician may learn to tap a steady rhythm. Adaptivity was built into Rhythmicity so that with practice, the rhythms become increasingly difficult, but if performance falters, the rhythmic demands become easier. As difficulty increases, participants subsequently accumulate more points. Difficulty of rhythms increased along three dimensions: tempo, complexity, and precision. Tempo refers to the speed of the rhythm, complexity refers to the number of taps required before the rhythm repeats (2, 4, or 8 taps), and precision refers to the amount of temporal offset allowed for a tap to be considered “on-beat.” During weeks 5-8 of training, the visual cues were presented so that participants may encode the rhythmic pattern into memory (as in weeks 1-4), but then the cues disappeared, and participants continued tapping the previously cued rhythmic pattern. In doing so, participants relied more on their working memory to produce the rhythmic pattern.
Training and Experimental paradigms. Screenshots from (A) Rhythmicity and (B) Worder (Control) training paradigms. Experimental design for (C) exogenous temporal attention task and (D) endogenous temporal attention task.
The Control group trained on a word search game, Worder (Fig. 1B), which was played on an iPad. Worder is a word search game that gets more difficult as the training progresses by expanding the field of letters to search. For example, Worder starts with a 3 × 3 grid of letters and advances to a 4 × 4 grid on sufficient progress. Hints were offered in case a participant gets stuck, but the number of daily hints was limited to maintain a challenging environment. Importantly, Worder places no demands on timing abilities, and so it was hypothesized that only Rhythmicity would result in enhanced temporal attention ability.
Exogenous temporal attention assessment
To assess exogenous temporal attention, participants were given a rhythmic synchronization paradigm (Fig. 1C), which was presented on a Microsoft Surface Pro 3 tablet, as previously described (Zanto et al., 2019). Briefly, this paradigm required participants to tap different metronome-like sequences on the screen of the tablet across 27 levels. These levels parametrically manipulated three variables: tempo of the metronome, the audiovisual information provided, and the rhythmic task performed. The tempo varied between slow (750 ms), medium (525 ms), and fast (350 ms) interonset intervals (IOIs). The stimuli presented varied between a visual-only stimulus where the movement of a ball between two lines on each side of the screen denoted the metronome “beat,” an audio-only stimulus where a distinct tone denoted the metronome “beat,” and an audiovisual stimulus where these cues were integrated. Last, participants were asked to perform three tasks: (1) on-beat: tap along with each stimulus event (i.e., beat: sound onset and/or when the ball touched the lines at either side of the screen); (2) off-beat: tap half-way between each stimulus event; or (3) continuation: after four stimulus events (i.e., 4 beats), the stimuli were discontinued, and participants had to continue the metronomic rhythm by tapping for 4 beats without disrupting the tempo. After the 4 beat “silent period” where participants were to tap, stimuli were resumed for another 4 beats and followed by another 4 beat “silent period” where participants were instructed to tap. The stimuli and silent periods continued to alternate for the duration of the level. Therefore, the “on-beat” task places the strongest demands on exogenous temporal attention as the stimuli references time points that index both the temporal duration to be encoded and used for subsequent temporal predictions, but also coincides with the time point where action is required. It is this convergence of action and perception based on extrinsic cues that enables exogenous temporal attention to play a central role in the on-beat task. Conversely, the “off-beat” and “continuation” tasks separate the action from the extrinsic timing cues. It could therefore be argued that this dissociation results in some reliance on endogenous temporal attention abilities, as memory for the encoded time interval cannot be updated at the time of action. Nonetheless, all three tasks (on-beat, off-beat, continuation) are useful to assess exogenous temporal attention abilities because, in each task, participants continually rely on extrinsic cues for timing information.
Together, the rhythmic synchronization paradigm consisted of 27 levels (3 tempos × 3 stimulus types × 3 tasks) each lasting ∼30 s. To characterize exogenous temporal attention, performance asynchrony and variability were assessed. Asynchrony was calculated as the absolute offset in milliseconds from the instructed tap onset. Variability was calculated as the SD of tap offsets. Data were averaged over stimulus types and tasks to obtain an estimate of temporal attention ability.
Sensorimotor response assessment
A sensorimotor response assessment was conducted to determine whether changes in the rhythmic synchronization task were because of alterations in temporal attention or sensory/motor processing changes. Participants were instructed to respond as quickly as possible by pressing a button as soon as they perceived a stimulus. Participants were to fixate on a white fixation cross (60 point font) presented on a black background. Stimuli were presented for 100 ms randomly every 500-2500 ms. Stimuli were either an auditory 1000 Hz sine tone (auditory stimulus), a white circle (visual stimulus), or auditory-visual stimuli presented simultaneously. All visual stimuli were presented with a 200 pixel diameter at central fixation by temporarily replacing the fixation cross. Auditory stimuli were presented at a comfortable listening level using insert earphones (Cortech Solutions), which was held constant before and after intervention. Each stimulus type (auditory, visual, audiovisual) was presented in blocks of 50 trials. Each block of 50 trials was presented twice (once for each hand), resulting in 100 trials per stimulus type. Stimulus blocks were presented randomly across participants, but the order was held constant for each participant before and after intervention. Analysis focused on response times and response variability as measured by the SD of response times. Of note, one additional participant from the Control group was excluded from this specific analysis because of a technical error in data collection.
Endogenous temporal attention assessment
To assess endogenous temporal attention, a temporally cued visual discrimination task was conducted (Fig. 1D), as previously described (Zanto et al., 2011). A white circle with a 7 cm inner (8 cm outer) diameter was centered on a black background during the assessment. Participants were instructed to fixate on a dot (0.5 cm diameter) located in the center of the circle. The fixation period was between 1300 and 1800 ms (selected randomly). A 100 ms cue appeared and contained one of three letters: S, L, or N. These letters indicated the duration of the foreperiod (S = short = 600 ms; L = long = 1400 ms; N = neutral = 600 or 1400 ms). Thus, only S and L cues predicted when the target would appear and will be referred to as PS (predictive short) and PL (predictive long), respectively. The neutral cues will be referred to as NS and NL, depending on whether the N preceded a short or long foreperiod, respectively. One of the four cue types (PS, PL, NS, NL) was selected randomly on each trial, each with a 25% probability. Following the foreperiod, a target appeared with either an x or a + inside the white circle, each with a 50% probability of occurrence. A forced-choice discrimination required participants to press the left button for x targets and the right button for + targets. Participants were instructed to respond with a button press as quickly and accurately as possible to the target. Thus, by using the temporally predictive cues, participants would be able to rely on their endogenous temporal attention abilities to facilitate performance. Specifically, engaging in the task enables participants to encode the cued time periods and form predictions for stimulus onset. It is this temporal encoding and prediction that enables attention to be allocated to target onset, resulting in speeded performance. Analysis focused on accuracy and response times to the PS and NS stimuli, in line with our previous observation of an age-related deficit in temporal attention ability (Zanto et al., 2011).
EEG
EEG data were recorded from a BioSemi ActiveTwo system with 64 Ag/AgCl active electrodes (Cortech Solutions). Signals were amplified and digitized at 2048 Hz with a 24-bit resolution and no online filter. All electrode offsets were maintained between ±20 mV. EEG signal preprocessing and subsequent analyses were performed using EEGLAB (Delorme and Makeig, 2004) and MATLAB (The MathWorks). Raw EEG data were detrended to remove linear drift, bad electrodes were identified and corrected using spherical interpolation, and each channel was rereferenced against averaged EEG data. Independent component analysis was performed to remove eye blink and eye movement artifacts. Each EEG channel data was then downsampled to 512 Hz and epoched into 2.2 s segments centered at stimulus onset. Epochs containing artifacts >±75 µV were excluded from further analysis.
Spectral data (1-50 Hz) were acquired via complex Morlet wavelets applied to the epoched data. The log-transformed ERSP (Makeig, 1993) was then computed from the wavelet coefficients using a prestimulus baseline (−150 to −50 ms). For analysis of the exogenous timing task, a 300 ms time window was selected (75-375 ms after stimulus onset), which corresponded to peak ERSP activity when averaged over groups, conditions, and pre-/post-training sessions. ITC was computed from the phase angles obtained from the wavelet coefficients. δ ITC was calculated using the following formula:
Where N = total number of trials, w = instantaneous amplitude, and θ = instantaneous angle. w and θ are derived from Morlet wavelet coefficients of time-series data. ITC values were obtained by averaging over each frequency band. Analysis of ITC data relied on the same time window as that used for ERSP analysis.
For source localization of ITC, the LORETA-KEY (Pascual-Marqui, 2002) software package was used. Epoched data were band pass filtered in the δ band (1-4 Hz). Each of the filtered trials was then source localized using standardized low resolution electromagnetic tomography. The cartesian coordinate source localized current density output was then converted to spherical coordinates to obtain the azimuth and elevation of the dipole within each voxel. The circular mean of the vector lengths from the azimuth and elevation angles were then calculated and averaged together to obtain the ITC of the source localized trials filtered in δ frequency band.
Results
Exogenous temporal attention performance
To assess exogenous temporal attention, performance asynchrony (offset in milliseconds from the instructed tap onset) and variability (SD of tap offsets) data (Table 1) were submitted to separate ANOVAs with Group (Rhythmicity, Control) as a between-subjects factor and Time (Pre, Post) and Tempo (Slow, Medium, Fast) as within-subjects factors. Results from the asynchrony data yielded a main effect of Tempo (F(2,70) = 138.82, p < 0.001, ηp2 = 0.80). Specifically, asynchrony was larger during slow compared with medium tempo (t(36) = 10.12, p < 0.001), and the medium tempo exhibited larger asynchrony than the fast tempo (t(36) = 9.61, p < 0.001). No other main effects were observed (Group: F(1,35) = 0.52, p = 0.47, ηp2 = 0.01; Time: F(1,35) = 2.09, p = 0.16, ηp2 = 0.16). Moreover, no interactions were observed between Group × Time (F(1,35) = 0.23, p = 0.64, ηp2 = 0.01), Group × Tempo (F(2,70) = 0.20, p = 0.82, ηp2 = 0.01), Time × Tempo (F(2,70) = 1.06, p = 0.35, ηp2 = 0.03), or Group × Time × Tempo (F(2,70) = 0.04, p = 0.96, ηp2 = 0.00). Thus, training did not affect performance asynchrony.
Performance asynchrony and variability data from the exogenous temporal attention taska
Analysis of the performance variability data exhibited a main effect of Time (F(1,35) = 7.02, p = 0.012, ηp2 = 0.17) such that performance variability decreased post-training. A main effect of Tempo was also observed (F(2,70) = 151.13, p < 0.001, ηp2 = 0.81), indicating that variability was larger during slow compared with medium tempo (t(36) = 9.65, p < 0.001), and the medium tempo exhibited larger variability than the fast tempo (t(36) = 10.91, p < 0.001). No main effect was observed for Group (F(1,35) = 1.84, p = 0.18, ηp2 = 0.05). Importantly, significant interactions were observed for Group × Time (F(1,35) = 8.91, p = 0.005, ηp2 = 0.20) (Fig. 2A) and Group × Time × Tempo (F(2,70) = 4.81, p = 0.022, ηp2 = 0.12) (Fig. 2B). Assessment of the Group × Time interaction showed no differences in performance variability between groups at baseline (t(35) = 0.24, p = 0.81), but the Rhythmicity group exhibited decreased performance variability post-training compared with the Control group (t(35) = 2.32, p = 0.026; Fig. 2A). Assessment of the three-way interaction showed that performance improved (i.e., less variable) following rhythm training during the slow (t(17) = 3.70, p = 0.002) and medium (t(17) = 3.81, p = 0.001), but not fast (t(17) = 0.86, p = 0.40), tempos (Fig. 2B). No such improvement was observed in the Control group pre- to post-training (slow: t(18) = 0.60, p = 0.56; medium: t(18) = 1.23, p = 0.23; fast: t(18) = 0.19, p = 0.85). Direct comparisons between groups on the change (pre-post) in performance variability shows the Rhythm training group yielded an improvement in performance (i.e., less variable) during the slow (t(35) = 2.69, p = 0.011) and medium (t(35) = 2.19, p = 0.036), but not fast (t(35) = 0.68, p = 0.42) tempos (Fig. 2B). No other interactions were observed (Group × Tempo: F(2,70) = 2.49, p = 0.11, ηp2 = 0.07; Time × Tempo: F(2,70) = 2.92, p = 0.081, ηp2 = 0.08). Together, these results show that musical rhythm training facilitated exogenous temporal attention as indexed by lowered performance variability.
Response time variability performance during exogenous temporal attention task. Bar diagrams for interaction between (A) Group × Visit (lower values indicate better performance) and (B) Group × Visit × Tempo (higher values indicate greater improvements).
Of note, the analyses above collapsed over three sensorimotor synchronization tasks to assess exogenous temporal attention in a manner that is less task-dependent. However, it could be argued that the continuation and off-beat synchronization tasks incorporate some aspects of endogenous temporal attention. Therefore, in an exploratory analysis, we repeated the above ANOVAs on performance variability for each of the three task types (averaged over stimulus type) to assess whether the two-way (Group × Time) or three-way (Group × Time × Tempo) interactions would still be observed (Table 2). Results from the on-beat task replicated both the Group × Time (F(1,35) = 6.60, p = 0.015, ηp2 = 0.16) and Group × Time × Tempo (F(2,70) = 3.49, p = 0.043, ηp2 = 0.09) interactions. Results from the continuation task exhibited a trend toward a significant Group × Time interaction (F(1,35) = 3.92, p = 0.056, ηp2 = 0.10), but not a Group × Time × Tempo interaction (F(2,70) = 0.29, p = 0.75, ηp2 = 0.01). Results from the off-beat task exhibited no Group × Time interaction (F(1,35) = 1.56, p = 0.22, ηp2 = 0.04); however, the Group × Time × Tempo interaction trended toward significance (F(2,70) = 2.94, p = 0.081, ηp2 = 0.08). Importantly, for all three tasks, even when interactions were not significant, the largest improvements pre- to post-training occurred in the Rhythmicity group at the slow and medium tempos. Nonetheless, this exploratory analysis indicates that improvements in exogenous temporal attention were driven by the task that had the largest exogenous (least endogenous) component.
Performance variability data from the exogenous temporal attention taska
For completeness, another exploratory analysis was conducted where we repeated the same ANOVAs as our original assessment of performance variability, but here we do it for each of the three stimulus types (averaged over task). We were specifically interested in whether the two-way (Group × Visit) or three-way (Group × Visit × Tempo) interactions would still be observed as in the original analysis that averaged over all stimulus and task types. Results from the visual-only stimuli replicated both the Group × Visit (F(1,35) = 5.96, p = 0.020, ηp2 = 0.15) and Group × Visit × Time (F(2,70) = 4.13, p = 0.035, ηp2 = 0.11) interactions. Results from the audio-only stimuli exhibited no significant interactions for either Group × Time (F(1,35) = 1.83, p = 0.18, ηp2 = 0.05) or Group × Time × Tempo (F(2,70) = 0.87, p = 0.42, ηp2 = 0.02). Results from the audiovisual stimuli exhibited trends toward significance for both the Group × Time (F(1,35) = 1.56, p = 0.22, ηp2 = 0.04) and the Group × Time × Tempo interactions (F(2,70) = 2.94, p = 0.081, ηp2 = 0.08). Importantly, for all three stimulus types, even when interactions were not significant, the largest improvements pre- to post-training occurred in the Rhythmicity group at the slow and medium tempos.
Sensorimotor response performance
Sensorimotor response performance was evaluated to test whether changes in performance from the temporal attention assessment may be attributed to sensorimotor changes. One additional participant from the control group was not included in this analysis because of errors in data collection. Response time and response time variability data (Table 3) were submitted to a mixed ANOVA with Group (Rhythmicity, Control) as a between-subjects factor, and Stimulus (Auditory, Visual, AudioVisual) and Time (Pre, Post) as within-subjects factors. Results from the analysis of response time data exhibited a main effect of Stimulus (F(2,68) = 27.04, p < 0.001, ηp2 = 0.44) such that response times were slower to Auditory stimuli compared with Visual (t(35) = 5.35, p < 0.001) and AudioVisual stimuli (t(35) = 5.67, p < 0.001). No significant difference in response times were observed between Visual and AudioVisual stimuli (t(35) = 1.41, p = 0.17). No main effects were observed for Group (F(1,34) = 0.00, p = 0.99, ηp2 = 0.00) or Time (F(1,34) = 0.49, p = 0.49, ηp2 = 0.01). Additionally, no interactions were observed between Group × Time (F(1,34) = 0.94, p = 0.34, ηp2 = 0.03), Group × Stimulus (F(2,68) = 0.71, p = 0.50, ηp2 = 0.02), or Stimulus × Time (F(2,68) = 1.03, p = 0.36, ηp2 = 0.03). Similarly, no three-way interaction was observed between Group × Time × Stimulus (F(2,68) = 1.41, p = 0.25, ηp2 = 0.04).
Data from the sensorimotor response assessmenta
Results from the response variability data yielded no main effect of Group (F(1,34) = 0.25, p = 0.62, ηp2 = 0.01), Time (F(1,34) = 0.42, p = 0.52, ηp2 = 0.01), or Stimulus (F(2,68) = 0.91, p = 0.41, ηp2 = 0.03). However, a significant interaction was observed between Group × Time (F(1,34) = 4.91, p = 0.034, ηp2 = 0.13). This interaction was driven by improvements in the Control (control) group pre- to post-training (t(17) = 2.28, p = 0.036). No such improvement in performance was observed in the Rhythmicity group (t(17) = 1.02, p = 0.32; see Table 3). No other interactions were significant: Group × Stimulus (F(2,68) = 0.51, p = 0.60, ηp2 = 0.01), Stimulus × Time (F(2,68) = 0.10, p = 0.90, ηp2 = 0.00), or Group × Time × Stimulus (F(2,68) = 0.52, p = 0.60, ηp2 = 0.02). Together, results from the response time and response variability data indicate that rhythm training did not affect the sensorimotor response.
Endogenous temporal attention performance
To test whether musical training also improved endogenous temporal attention, response time and accuracy data from the temporally cued discrimination task were submitted to a mixed ANOVA with Group (Rhythmicity, Control) as a between-subjects factor, and Condition (PS, NS) and Time (Pre, Post) as within-subjects factors (Table 4). Results from the analysis of response time data exhibited no main effects for Group (F(1,35) = 0.43, p = 0.52, ηp2 = 0.01), Time (F(1,35) = 0.23, p = 0.64, ηp2 = 0.01), or Condition (F(1,35) = 1.85, p = 0.18, ηp2 = 0.05). Additionally, no interactions were observed between Group × Time (F(1,35) = 2.09, p = 0.16, ηp2 = 0.06), Group × Condition (F(1,35) = 0.00, p = 0.96, ηp2 = 0.00), or Condition × Time (F(1,35) = 0.31, p = 0.58, ηp2 = 0.01). Critically, no three-way interaction was observed between Group × Time × Condition (F(1,35) = 1.01, p = 0.32, ηp2 = 0.03).
Response time and accuracy data from the endogenous temporal attention taska
Analysis of the accuracy data yielded a main effect of Time (F(1,35) = 7.23, p = 0.011, ηp2 = 0.17), such that accuracy was greater post-training. No main effects were observed for Group (F(1,36) = 2.03, p = 0.16, ηp2 = 0.05) or Condition (F(1,35) = 0.54, p = 0.47, ηp2 = 0.02). Additionally, no interactions were observed between Group × Time (F(1,35) = 0.52, p = 0.48, ηp2 = 0.01), Group × Condition (F(1,35) = 0.89, p = 0.35, ηp2 = 0.02), or Condition × Time (F(1,35) = 0.01, p = 0.92, ηp2 = 0.00). Critically, no three-way interaction was observed between Group × Time × Condition (F(1,35) = 0.11, p = 0.74, ηp2 = 0.00). Together, results from the response time and accuracy data provided no evidence that rhythm training transferred any benefit to endogenous temporal attention as indexed by the ability to use predictive temporal cues.
Although improvements in the ability to use the predictive temporal cues were not observed at the group level, we assessed whether changes in exogenous temporal attention were related to changes in endogenous temporal attention, specifically within the Rhythmicity group. To do this, we first calculated an index of endogenous temporal attention as the difference in response times following the neutral and predictive cues (i.e., neutral minus predictive). Next, we conducted a Pearson's correlation between the change (post-pre) in this index and the change (pre-post) in performance variability. Two correlations were conducted: one for each tempo where significant group effects were observed (i.e., slow and medium tempos). Results indicated no significant relationship between changes in exogenous and endogenous temporal attention abilities (slow tempo: r = 0.09, p = 0.74; medium tempo: r = −0.05, p = 0.85). These results converge with the analyses above to provide no evidence that exogenous temporal attention transferred any benefit to endogenous temporal attention ability.
ERSP: exogenous temporal attention
To understand whether oscillatory power changes may have occurred following rhythm training, ERSP was computed for the Rhythmicity and Control groups, pre- and post-training. The frequency bands of interest were δ, α, and β because of their association with exogenous and endogenous temporal attention. Apart from these frequency bands, theta and γ bands were also assessed as a part of an exploratory analysis. ERSP was computed for each of these frequency bands, for each of the three tempos (slow, medium, and fast), averaged across all three conditions (on-beat, off-beat, and continuous). Visual inspection of the ERSP spectrogram shows δ power desynchronization and α synchronization peaks corresponding with the interstimulus interval for slow and medium tempos, but less so for the fast tempo (Fig. 3). Given the Group × Time interaction observed in the performance variability data, statistical analysis of ERSP data focused on Group × Time interactions (i.e., Rhythmicity Post-Pre ≠ Control Post-Pre) that arose from differences within group (Post ≠ Pre) and between groups (Rhythmicity ≠ Control). Results showed a significant interaction for medium tempo in the alpha band. However, this was because of reduced α ERSP post-training in the control group, and no significant difference was observed between groups pre- or post-training. Importantly, no other significant interactions were observed during the slow or medium tempos in any frequency band. Thus, ERSP data were not altered in a way that would explain training-related changes in performance.
ERSP time-frequency map of electrode Cz for data from (A) the Rhythmicity group and (B) the Control group. The rows of each subplot are data from the different tempos. Columns represent pre-/post-training. Warmer colors represent increased ERSP. Vertical dashed line indicates stimulus onset.
ITC: exogenous temporal attention
Next, we assessed ITC for both Rhythmicity and Control groups, pre- and post-training, averaged across all conditions (on-beat, off-beat, and continuous) for each of the three tempos (slow, medium, and fast). Our main frequency band of interest was δ because of evidences that phases of slow-frequency δ oscillations are linked with temporal attention (Stefanics et al., 2010; Cravo et al., 2013; Wilsch et al., 2015; Arnal and Kleinschmidt, 2017; Breska and Deouell, 2017). However, we analyzed other frequency bands as exploratory measures: theta, α, β, and γ. Visual inspection of the ITC clearly showed increased coherence within the δ and theta frequency bands following stimulus onset within both groups (Fig. 4), although the Rhythmicity group appears to show a greater increase in ITC after training (Fig. 4, bottom row).
ITC time-frequency map of electrode Cz during the slow tempo. Warmer colors represent increased ITC. Vertical dashed line indicates stimulus onset.
To formally assess changes in ITC, we compared topographies between and within groups, in line the ERSP analysis, with a focus on Group × Time interactions. Results showed a significant interaction (Post-Pre Rhythmicity vs Post-Pre Control) in δ ITC for both slow and medium tempos (Fig. 5A,B, bottom right corner). These interactions were driven by changes within the Rhythmicity group pre- to post-training (see top row: Fig. 5A for slow tempo, Fig. 5B for medium tempo). Comparisons between groups showed no group differences before training, but greater ITC in the Rhythmicity group after training (Fig. 5A,B, middle columns). Because ITC is a measure of phase consistency across trials, these findings reveal that rhythm training results in more consistent neural activity patterns within the δ band. Results from the other frequency bands showed no significant interactions in the β or γ bands. However, theta and α bands exhibited an interaction during the slow, but not medium, tempo. Although these interactions were driven by changes in the Rhythmicity group pre- to post-training, no group differences were observed at either time point. Thus, changes in δ band ITC are best aligned with the observed changes in performance.
ITC topographies in the δ band during the (A) slow tempo and (B) medium tempo. First row represents topographies from the Rhythmicity group pre-training (left), post-training (middle), and a pre- versus post-significance map (right; p < 0.05). Second row represents topographies from the Control group, whereas third row represents significance maps when comparing between groups. Bottom right corner, Topography indicates a Group × Time interaction. Top left corner, The four topographies are scaled according to the color bar. Bottom row and right column are significance maps where red represents significance (p < 0.05) and blue represents not significant. All significance maps are cluster-corrected.
Neuro-behavioral correlation
Since only the rhythm training group showed improved performance and increased ITC for the slow and medium tempos, we conducted a correlation between the change in performance and change in ITC to assess whether there was a relationship between these measures. Results from the slow tempo data showed a negative correlation between the change (post-pre) in ITC and the change (post-pre) in performance variability (r = −0.72, p < 0.001), such that, as ITC increases (i.e., becomes less variable), performance also becomes less variable (Fig. 6). No such correlation was observed during the medium tempo (r = 0.16, p = 0.53).
Neuro-behavioral correlation. A, Cluster-corrected topography showing significantly correlated regions (shaded in red; p < 0.05) between post-pre ITC values and post-pre variability of taps. Dark blue regions represent no significance. B, Scatter plot between change in performance variability (SD) and change in ITC, averaged over significant electrodes shown in A.
To understand whether the observed changes in δ ITC during the slow tempo may be because of stimulus-evoked activity, we also analyzed ITC during the prestimulus period of −150 to −50 ms. Results mirrored that of the post-stimulus onset data reported above, such that a Group × Time interaction was observed that was driven by changes in the Rhythmicity group pre- to post-training. Furthermore, this anticipatory δ band ITC data showed a significant negative correlation with the change (post-pre) in performance variability (r = −0.71, p = 0.001).
Source localization
Given the significant neuro-behavioral correlation between ITC and behavioral data during the slow tempo, we sought to localize the source of phase synchrony to better understand which brain regions were responsible for the improvement in task performance. ITC was computed on δ band filtered source localized EEG data from the post-stimulus time period. Following cluster correction using a Monte Carlo simulation (p = 0.0001), ITC data were localized to three main regions: premotor cortex (MNI coordinate X = 7, Y = −21, Z = 53), anterior cingulate cortex (MNI coordinate X = −5, Y = 30, Z = 28), and inferior parietal lobule extending into postcentral gyrus (MNI coordinate X = −63, Y = 23, Z = 29) (Fig. 7).
Slices show axial (top row) and sagittal (bottom row) views of source localized δ band ITC in the Rhythmicity group during the slow tempo.
Discussion
In this study, our aim was to assess whether temporal attention in healthy older adults can be improved through musical rhythm training that specifically focused on exogenous temporal attention, and to understand the neural basis of such improvements, if any. We also assessed whether improvements in exogenous temporal attention would induce improvements in endogenous temporal attention. Behavioral findings indicated improved synchronization (less variance) to an external rhythm specifically in the rhythm training group, while no such change was observed in the control group. Given that no such changes were observed in a basic response time task, which controls for sensory and motoric processes, this result was interpreted as an improvement in exogenous temporal attention. We next analyzed electrophysiological data to understand the neuroplastic changes associated with improved exogenous temporal attention. Results showed an increase in ITC pre- to post-training in the δ band, specifically in the musical rhythm training group. Importantly, the change in δ ITC correlated with changes in performance in the rhythm training group, such that those participants who exhibited the greatest improvement in performance (more consistency) were the ones who exhibited the largest increase in δ ITC (more consistency). Source localization of the δ ITC data indicated neural generators within the anterior cingulate cortex, premotor cortex, inferior parietal lobule, and postcentral gyrus. Finally, assessment of performance from the endogenous temporal attention task exhibited no changes in either group post-training. Thus, there was no transfer of improvements to endogenous temporal attention after being trained on an exogenous temporal attention task. Together, these results indicate that musical rhythm training facilitates exogenous, not endogenous, temporal attention, which is associated with increased neural phase consistency (ITC) within a sensorimotor network.
To characterize training related changes in exogenous temporal attention, we used a synchronization task, which places demands on temporal attention as well as sensory and motor function. Therefore, it can be argued that the improvement in synchronization performance was because of changes in sensory or motoric processing. To test this possibility, we conducted a sensorimotor control task, which showed that performance did not change (Table 3). This indicates that the improvement in synchronization, as indexed by lowered variance (more consistency), was because of changes in temporal attention and not because of alterations in sensory or motor processing.
Interestingly, synchronization performance improved the most at the slowest tested tempo (750 ms IOI), with a smaller (albeit still significant) improvement at the medium tempo (525 ms IOI). Yet, performance at the fastest tempo (350 ms IOI) exhibited no improvements. We speculate that diminished signs of benefit from slow to fast tempos may be attributed to preferred tempos. The slowest tempo tested was closest to the preferred tempo for adults in this age range, whereas the medium and fast tempos are preferred among young adults and children, respectively (Mcauley et al., 2006). Although we did not test for preferred tempos, these results lead to an interesting possibility that tempo preferences may be related to exogenous temporal attention.
Previous research has shown that exogenous timing abilities can improve with training. This has been seen in studies comparing musicians and nonmusicians, where years of musical training have improved timing abilities (Repp, 2010). However, in most cases, these musicians have undergone many years of formal training, with many of them starting their training from early childhood. In this study, we demonstrated that, even with a smaller duration of musical rhythm training (2 months) at late-stages of life, age-related declines in temporal attention can be remediated. Furthermore, these results show that rhythmic timing ability remains plastic throughout the lifespan and dampens the concern that musicians are a self-selected group because of their inherent timing abilities. Rather, the choice to pursue musical training may stem from differences in self-concept (Demorest et al., 2017) or motivation (McAuley et al., 2011).
The improvement in temporal attention was neurally reflected by an increase in δ band ITC from pre- to post-training. Because ITC is a measure of phase consistency across trials, the correlation between ITC and performance indicates that more consistent neural activity patterns are related to more consistent synchronization performance. Notably, ITC was found to increase in anticipation of stimulus onset, as well as post stimulus onset. This indicates that increased ITC cannot be simply be attributed to changes in stimulus-evoked phase resetting, which would otherwise suggest a change in sensory processing. Rather, the increased δ band ITC more likely reflects temporal attention processes that underlie the timing aspects of motor planning. Indeed, prior research has shown that low-frequency δ band (1-4 Hz) oscillatory brain activity is phase synchronized with the temporal pattern of an external rhythm, which in turn facilitates the prediction of target stimuli (Lakatos et al., 2008; Schroeder and Lakatos, 2009). Importantly, δ band activity, particularly its phase synchronization, has been increasingly associated with exogenous temporal attention ability (Stefanics et al., 2010; Gomez-Ramirez et al., 2011; Cravo et al., 2013; Arnal et al., 2015; Wilsch et al., 2015; Arnal and Kleinschmidt, 2017; Breska and Deouell, 2017) and this δ phase synchronization may be influenced by cerebellar activity (Breska and Ivry, 2020). In previous studies, neural entrainment to external rhythmic activity has been shown to be a mechanism of enhanced processing efficiency (Schroeder and Lakatos, 2009; Zion Golumbic et al., 2013; Kayser et al., 2015; Lakatos et al., 2016). Here, we suggest such processing efficiency is related to temporal attention, which in turn, may affect sensory or motor-based processes.
Source localization of the brain regions associated with the increased δ ITC show significant changes in premotor cortex, anterior cingulate cortex, inferior parietal lobule, and postcentral gyrus from pre- to post-training. Although previous studies have localized, these results are in general agreement with previous fMRI research assessing neural regions associated with temporal attention (Coull and Nobre, 2008). The premotor cortex is known to be involved in motor planning (Hoshi and Tanji, 2006; Nakayama et al., 2008; Li et al., 2016). It is critically involved in rhythm production (Halsband et al., 1993) and is associated with the processing and perception of rhythmic stimuli, such as speech (Meister et al., 2007; Kotz and Schwartze, 2010). Conversely, the anterior cingulate cortex is associated with error detection and correction (Carter et al., 1998; Kiehl et al., 2000; Garavan et al., 2002; Swick and Turken, 2002; Veen and van Carter, 2002), such as needed during rhythmic synchronization tasks (Jantzen et al., 2018). On the other hand, the inferior parietal lobule has been associated with tasks involving temporal predictions, including synchronization tasks (Janata and Grafton, 2003; Pecenka et al., 2013; Andreou et al., 2015; De Pretto and James, 2015). Interestingly, the inferior parietal lobule has also been associated with motor intentions, motor predictions, and the feeling of “wanting to move” (Rizzolatti et al., 2006; Desmurget and Sirigu, 2012). Similar to the inferior parietal lobule, the postcentral gyrus has been linked to temporal predictions (Andreou et al., 2015; Jantzen et al., 2018), but it is also associated with somatosensory processing (Nelson and Chen, 2008) and is commonly observed in synchronization tasks (Chauvigné et al., 2014). Together, our findings suggest that the source localized areas for δ ITC contributed to temporal attention processes through temporal predictions in conjunction with error detection and correcting for timing deviations during the synchronization task. Importantly, the regions identified as being involved in temporal attention are commonly associated with the sensorimotor network (Stephan et al., 2002; Jantzen et al., 2007; Bueti et al., 2008; Anwar et al., 2016). Given that sensory and motoric processes were not affected by rhythm training, these results indicate the sensorimotor network is involved in critical timing functions (i.e., temporal attention) needed for sensorimotor synchronization. This is in line with the Action Simulation for Auditory Prediction hypothesis, which suggests that motor planning regions provide essential timing information to help predict upcoming beats (Patel and Iversen, 2014).
In this study, we also assessed whether improvements in exogenous temporal attention (orienting attention to external stimuli for timing information) transferred to endogenous temporal attention (orienting attention internally for timing information). However, we did not find any such supporting evidence. This lack of transfer supports models that suggest timing is a distributed process that may rely on modality- or context-specific neural regions (for review, see Mauk and Buonomano, 2004; Ivry and Schlerf, 2008; Paton and Buonomano, 2018). Temporal attention abilities improve in accordance with the networks that are most often engaged (Janzen et al., 2014). As such, if our musical rhythm training paradigm had included a component that taxed endogenous temporal attention ability, it likely would have improved performance on our endogenous temporal attention task. In support of this notion, research has shown that musical training in older adults that focused on endogenous temporal attention (solo piano performance) has shown improved endogenous timing ability as measured via continuation tapping, but not exogenous timing as indexed by synchronization tapping (Fujioka and Ross, 2017). Thus, improvement in one form of timing ability will not necessarily transfer to the other.
These results are in agreement with mounting evidence that cognitive interventions generally do not improve untrained cognitive domains (Simons et al., 2016; Swaminathan and Schellenberg, 2021). Rather, transfers of benefit from cognitive interventions are often observed in tasks that may be different from the trained task but use the same cognitive function. It has been long debated whether learned behaviors can be applied to novel contexts (Singley and Anderson, 1989). It has also been suggested that the content of what can be transferred can be classified along three dimensions: (1) procedure, representation, and principles are transferred during learned skills; (2) speed, accuracy, and approach are transferred during performance change; and (3) execution, recognition, and recall are transferred during memory demands (Barnett and Ceci, 2002). Transfer is most likely to occur when the training has been offered on generalized principles, when outcome measures are similar to that of the trained tasks and when it requires minimal memory retrieval from the trained task (Barnett and Ceci, 2002; Zelinski, 2009). Although we did not observe transfer between exogenous and endogenous temporal attention abilities, this is not to say that learned timing abilities do not transfer within exogenous or within endogenous temporal attention tasks. Indeed, training on tasks that rely on endogenous temporal attention has shown transfer within modality such that the stimuli used during training to encode the timing interval can change within modality and still yield comparable benefits in timing. This has been observed when changing auditory features (Wright et al., 1997), somatosensory (skin) locations (Nagarajan et al., 1998), or visual fields (Westheimer, 1999). Transfer within endogenous temporal attention tasks has also been observed across modalities, including from somatosensory to auditory (Nagarajan et al., 1998), from visual to auditory (Bueti et al., 2012), and from auditory to visual (Bartolo and Merchant, 2009). Moreover, even perceptual training can improve motor timing (Planetta and Servos, 2008; Meegan et al., 2000). Thus, transfer of learned timing abilities may certainly occur within specific constraints.
Yet, such transfer within endogenous temporal attention tasks is not always observed (Grondin and Ulrich, 2011). This is particularly true when it comes to the specific duration of the trained interval, as transfer to other interval durations is not common (Bueti and Buonomano, 2014). Interestingly, this may help explain why the Rhythmicity group exhibited improvements specifically at the slow and medium tempos in the exogenous temporal attention task because the training took place largely at tempos similar to the slow and medium tempos of the task. If true, the prediction would be that participants who trained at a fast tempo would exhibit improvements during the fast tempo of the exogenous temporal attention task. Unfortunately, we cannot test this with the current data because participants did not train long enough to reach performance levels that would enable training at a tempo that is comparable to the fast tempo of the exogenous temporal attention task. Through extensive pilot testing, we observed that only musicians (10+ years of training) were able to reach Rhythmicity levels that required performance at comparably fast tempos (i.e., 350 ms intertap interval).
A potential limitation of this study is that it was conducted in healthy older, not younger, adults. It could be argued that a lack of transfer to the endogenous temporal attention task may reflect age-related changes in brain function that are not specifically linked to a potential generalized neural mechanism of timing. For example, in the endogenous temporal attention task, neither group of participants benefitted from predictive cues before or after training. Therefore, it could be argued that a lack of transfer to the endogenous temporal attention task arose from an age-related change in strategy where older adults did not attempt to use temporal attention to boost performance, thereby eliminating any possible benefits of transfer. However, we have previously demonstrated that this lack of benefit in performance is not indicative of a strategy shift that abandons temporal attention as older adults modulate anticipatory neural activity that indexes temporal attention, but to a lesser extent compared with young adults (Zanto et al., 2011). Moreover, in our assessment of exogenous temporal attention, we showed that the training-related improvements in performance were driven by the on-beat task, which has the most reliance on exogenous temporal attention. The off-beat (syncopation) and continuation tasks did not exhibit as large of a training effect, likely because these tasks include some aspects of endogenous temporal attention. This provides additional (albeit less direct) assessments of endogenous temporal attention, further supporting the idea that exogenous and endogenous temporal attention relies on distinct neural mechanisms. Thus, it seems unlikely that age played a role in the lack of transfer because a common neural mechanism should be equally affected by age regardless of task.
In conclusion, in this study, our aim was to understand if and how rhythm training can improve temporal attention in a healthy aging population. A group of nonmusician older adults performed a rhythm training task, and another group of nonmusician older adults completed an active control task. The group that received rhythm training showed significant improvement in their exogenous temporal attention abilities, as indexed by decreased variability in task performance. Moreover, rhythm training increased intertrial phase coherence (ITC), which correlated with improved performance. Further analysis localized these changes in ITC to a sensorimotor network, including premotor cortex, anterior cingulate cortex, postcentral gyrus, and inferior parietal lobule. These regions have been previously found to be associated with motor planning, error detection, and temporal predictions. Together, these results support the hypothesis that musical rhythm training can remediate age-related declines in exogenous temporal attention, which occurs via alterations in δ phase coherence within a sensorimotor network. However, no evidence was found indicating a transfer of benefit from exogenous to endogenous temporal attention abilities, providing support for separate neural mechanisms underlying these subtypes of temporal attention.
Footnotes
This work was supported by the Hart Foundation for Deep Rhythm, the Lounsbery Foundation, the GRAMMY Foundation, Aroha Philanthropies, and the Neuroscape Network. We thank Dennis Lambert for assistance on data collection; and Studio Bee, Rob Garza, Mickey Hart, Jo Gazzaley, and Roger Anguera for assistance in creating the Rhythmicity training paradigm.
A.G. is co-founder, shareholder, BOD member, and advisor for Akili Interactive, which is a company that makes therapeutic video games, but Akili was not involved in the current study. The remaining authors declare no competing financial interests.
- Correspondence should be addressed to Theodore P. Zanto at theodore.zanto{at}ucsf.edu