Abstract
The more we anticipate a response to a predictable stimulus, the faster we react. This empirical observation has been confirmed and quantified by many investigators suggesting that the processing of behaviorally relevant stimuli is facilitated by probability-based confidence of anticipation. However, the exact neural mechanisms underlying this phenomenon are largely unknown. Here we show that performance changes related to different levels of expectancy originate in dynamic modulation of delta oscillation phase. Our results obtained in rhythmic auditory target detection tasks indicated significant entrainment of the EEG delta rhythm to the onset of the target tones with increasing phase synchronization at higher levels of predictability. Reaction times correlated with the phase of the delta band oscillation at target onset. The fastest reactions occurred during the delta phase that most commonly coincided with the target event in the high expectancy conditions. These results suggest that low-frequency oscillations play a functional role in human anticipatory mechanisms, presumably by modulating synchronized rhythmic fluctuations in the excitability of large neuronal populations and by facilitating efficient task-related neuronal communication among brain areas responsible for sensory processing and response execution.
Introduction
Oscillatory brain activity represents rhythmic changes in baseline excitability of neuronal populations at a variety of temporal and spatial scales (Başar, 1980; Buzsáki and Draguhn, 2004; Buzsáki, 2006) affecting perception and action. Very slow fluctuations (∼0.1 Hz) in EEG activity, cyclically modulating cortical gross excitability (Vanhatalo et al., 2004) have been shown to predict stimulus detection performance and amplitudes of 1–40 Hz oscillations (Monto et al., 2008). Amplitudes of prestimulus medial temporal theta oscillations in a memory task were higher for later-recalled than for later-forgotten words (Guderian et al., 2009). Momentary alpha amplitude (Romei et al., 2008; van Dijk et al., 2008) and phase (Busch et al., 2009; Mathewson et al., 2009) in visual areas were found to predict visual perceptual performance. In the human motor system, the phase of beta oscillations was observed to modulate input gain (van Elswijk et al., 2010). The large variability in evoked cortical responses to the same stimulus can be explained by the momentary state of ongoing neuronal activity (Arieli et al., 1996; Kruglikov and Schiff, 2003; Lakatos et al., 2005; Haslinger et al., 2006; Womelsdorf and Fries, 2006; Haider et al., 2007; Hasenstaub et al., 2007). Phase entrainment of rhythmic electric brain activity to periodic stimulation occurs spontaneously (Adrian and Matthews, 1934; Walter et al., 1946; Galambos et al., 1981; Will and Berg, 2007). Additionally, the entrainment of cortical delta-band (0.5–3 Hz) oscillations has been recently described and suggested as a key mechanism of selective attention in macaque monkeys (Lakatos et al., 2008). In their study, Lakatos et al. showed that the phase locking of low-frequency cortical oscillations to attended rhythmic auditory or visual stimulus streams can result in elevated response gain and faster reaction times (RT).
Attention and expectation are two complementary mechanisms of processing large amounts of stimulus information using limited capacities (Summerfield and Egner, 2009). They both lead to more efficient detection and recognition of behaviorally relevant stimuli. Expectation influences the processing of stimuli on the basis of prior likelihood, whereas attention modifies stimulus processing on the basis of motivational relevance. “Top-down” attentional influences are directed toward task-relevant features and often interact with temporal expectations (Nobre et al., 2007). One possibility is that temporal predictability allows the brain to optimize the timing of the processing of the expected stimulus and thus make them faster and more efficient (Bendixen et al., 2009). It has recently been proposed (Schroeder and Lakatos, 2009) that in various ecologically valid stimulus configurations, such as biological motion and vocal communication (reviewed by Schroeder et al., 2008), when the attended stimuli have a rhythmic or predictable temporal structure, the brain operates in a “rhythmic mode,” in which low-frequency rhythms provide a general oscillatory framework for sensory processing. However, the brain mechanisms underlying predictive processing are still poorly understood.
Here we tested the hypothesis that oscillatory neural activity may mediate the effects of expectation on the processing of task-relevant stimuli. We applied an auditory target detection paradigm, in which four cue tones of different frequencies predicted the probability of the next stimulus to be the target event while EEG was recorded. To investigate the effect of expectancy on delta activity, we detected the phase of ongoing delta oscillation at target stimulus onset. Delta rhythm was found to significantly entrain to target onset preceding stimulus processing, with increasing “accuracy” at higher expectancy levels. Our results provide the first evidence that the entrainment of cortical delta oscillations to a predicted stimulus is a key determinant of response gain in healthy humans. Furthermore, we show that the strength of phase-entrainment varies systematically with the probability of the target event, thus, it is not only a mechanistic consequence of the rhythmic structure within the stimulus stream, but it may depend on the behavioral significance of the expected events.
Materials and Methods
Participants.
Two auditory target detection tasks were performed by two groups of 13 and 11 healthy human volunteers (for age and gender composition, see supplemental Table 1, available at www.jneurosci.org as supplemental material). None of the subjects had any history of neurological or hearing disorders. The protocol of the study was approved by the Ethical Committee of the University of Pécs. The recordings were performed at the Electrophysiological Laboratory of the Department of Experimental Zoology and Neurobiology, University of Pécs, Hungary. All subjects gave their written informed consent after the nature of the experiment had been fully explained.
Stimulation.
The auditory stimuli were presented via headphones at 70 dB sound pressure level while EEG was recorded. In the first experiment, tones were delivered with random stimulus onset asynchronies (SOAs) varying between 1200 and 1500 ms at a mean repetition rate of 1350 ms. Four cue tones of different frequencies (1046, 1174, 1318, and 1480 Hz; 150 ms duration; 10 ms rise/fall times) predicted the probability (10%, 37%, 64%, 91%) of the next stimulus to be the target tone (1975 Hz frequency; 50 ms duration; 10 ms rise/fall times) (Fig. 1A). The cue tones' frequencies were chosen to be separated by one semitone, whereas the target tone was two semitones higher than the highest cue tone, which enabled the subject to carry out the task easily. In our second experiment, two cue tones of distinct frequencies (1046 and 1318 Hz; 150 ms duration; 10 ms rise/fall times) were used to predict the probability (20% and 80%) of the target stimulus (1975 Hz; 50 ms duration; 10 ms rise/fall times) to appear at either of two fixed time points (early target at 1350 ms and late target at 2700 ms) (see Fig. 1C; supplemental Fig. 1, available at www.jneurosci.org as supplemental material). In both experiments, the participants were informed about the meaning of the cues and they were instructed to quickly depress a button for target tones. A 5-min training block was applied before the experiments to allow subjects to learn the task. A total of 400 responses to targets (100 for each cue condition) were collected from each subject in the first and 1000 in the second experiment (500 for each cue condition).
Recording.
EEG was recorded from F3, Fz, F4, C3, Cz, C4, P3, Pz, P4, T3, T4, O1 and O2 sites (according to the international 10–20 system) and two bipolar EOG electrodes placed above and below the left and right outer canthi, respectively. EEG was referred to the nose, an electrode placed on the forehead served as the ground. Impedances were kept below 5 kΩ. Recording was continuous with an analog bandpass from 0.16 Hz to 150 Hz at a sampling rate of 1 kHz in the first and 0.5 kHz in the second experiment. We report the Fz, Cz, and Pz electrodes throughout the paper. Data from Cz was selected for creating the plots as it is the most representative electrode for our findings. Cz and frontocentral electrodes in general are the most favorable scalp locations for auditory experiments as sources in the auditory cortex project onto this area. We used the left central (C3) and right central (C4) electrodes for testing potential lateralization of the delta phase effect.
Behavioral data analysis.
Participants performed both tasks easily. ANOVA was calculated from behavioral data. In experiment 1, the one-way ANOVA (condition [10% vs 34% vs 64% vs 91%]) revealed a significant effect (F(3,36) = 32.819, p < 10−9) of expectancy on reaction time, which was explained by a gradient of significant differences between 10% < 34% < 64% (Tukey's test). Figure 1B shows the observed significant differences in reaction time between conditions. Subjects responded to 99.76% of the targets in experiment 1, the ANOVA of the number of missed trials (condition [10% vs 34% vs 64% vs 91%]) revealed no significant differences between conditions (p = 0.36). ANOVA of reaction times in experiment 2 (one-way ANOVA (condition [cue 1—early target vs cue 1—late target vs cue 2—early target vs cue 2—late target]) revealed a significant effect (F(3,30) = 5.963, p < 0.01) showing that responses were faster when expectancy was higher. A post hoc Tukey's test revealed that the effect was caused by significantly higher reaction times for the less expected targets in the first position following cue 1 (cue 1—early target condition) compared with the more expected target in the second position following cue 1 (cue 1—late target condition, p < 0.05) and the target in the first position following cue 2 (p < 0.01). Comparison with the cue 2—late target showed a marginal p value of 0.08 (Fig. 1D). It should be noted, however, that due to perfect contingency of the target tone to both cues, in those trials when an early target was not delivered, participants dynamically updated their subjective expectancy for the late targets (100%), which explains the similarity of response speed for late targets to the early targets predicted with high confidence (80%). Participants responded to 98.78% of the targets in experiment 2, the ANOVA on the proportion of missed trials yielded no significant effects.
EEG data analysis.
All data were analyzed off-line on a PC using built-in and self-developed functions as well as the freeware EEGLAB toolbox (Delorme and Makeig, 2004) in the Matlab (MathWorks) development environment. Epochs extending from −1500 ms before to 1500 ms after the time-locking event were created in the first experiment and −500 ms before to 4050 ms after the time-locking event in the second. Trials with a potential change below 0.1 μV or voltage exceeding ±75 μV were marked for rejection from further analysis. Additionally, data were visually inspected and trials containing artifacts, which survived the automatic artifact rejection procedure, were also marked for rejection.
Fz, Cz, and Pz electrodes were selected for analysis. We established the presence of prestimulus delta oscillations by visualizing filtered and unfiltered ERP (supplemental Figs. 2–10, available at www.jneurosci.org as supplemental material; Fig. 2M,N) and wide-band filtered single-trial data (supplemental Figs. 12, 13, available at www.jneurosci.org as supplemental material). Oscillatory activity in the prestimulus period was also supported by the calculation of frequency spectra (supplemental Fig. 11, available at www.jneurosci.org as supplemental material). Zero-phase shift filtering of the continuous raw EEG waveforms in the delta band (0.5–3 Hz) was performed using a finite impulse response filter (Matlab built-in fir1.m and filtfilt.m) and several other filters were tested to exclude the possibility of filter artifacts. We performed additional analysis and simulations to rule out the possibility that filtering artifacts significantly affected our results (see supplemental Notes, supplemental Figs. 14–16, and supplemental Tables 4–6, available at www.jneurosci.org as supplemental material). Instantaneous phase values of the filtered data were calculated using the Hilbert-transform (Gabor, 1946; Hurtado et al., 2004). Filtered and Hilbert-transformed data were epoched as described above and epochs marked as artifacts were rejected.
For statistical analysis, Rayleigh's test for uniformity of phase data (Fisher, 1993) was used to test whether phase value distributions at various time instances show any phase preference. Single-trial delta-peak amplitude and latency data were defined as the maximum amplitude and corresponding latency in the 200–480 ms poststimulus time window centered on the average delta-peak measured at Cz. Linear-circular correlation (Fisher, 1993) was calculated between delta phase and reaction time, delta-peak latency and amplitude. Linear partial correlation between X and Y variables with a single controlling variable Z (RX,Y|Z) can be decomposed to linear correlation terms (RX,Y, RX,Z, RY,Z) as follows: Linear-circular partial correlation was performed by replacing linear correlation terms to linear-circular correlation (Fisher, 1993). Significance of phase concentration (κ) differences between conditions was assessed by a permutation test with Monte Carlo randomization (10,000 permutations).
Results
Probability-based confidence of anticipation modulates delta phase entrainment to target stimuli
In two experiments, we measured the effects of the predictive value of cue stimuli on behavioral responses and the entrainment of delta-band oscillations to the expected occurrence of targets. In experiment 1 (Fig. 1A), four cue tones of different pitch predicted the occurrence of the target tone within the next 1200–1500 ms (average: 1350 ms) with different probabilities (p = 0.1, p = 0.37, p = 0.64, p = 0.91). The cue was either followed by the target tone or another cue within the predicted period. Healthy human participants (n = 13) were required to depress a response key for targets as fast as possible. In line with previous observations (Hick, 1952; Hyman, 1953; Gehring et al., 1992; Ortiz et al., 1993; Nobre et al., 2007; van Elswijk et al., 2007), reaction times (RT) decreased with increasing target probability, indicating that participants relied on the predictive power of cue information (Fig. 1B).
Delta-band oscillations extracted from the EEG signal (see Materials and Methods) recorded at midline electrode locations (Fz, Cz, Pz; figures show data from Cz) preceded the target onset and also followed the ERP elicited by the target stimulus (Figs. 2M, 3A–D; supplemental Figs. 2–7, 13A–D, available at www.jneurosci.org as supplemental material). Delta oscillation was significantly entrained to the target in all participants as was shown by the significant phase preference observed in the distribution of delta-oscillation phase values calculated at the target onset (Rayleigh's tests with p < 0.05, supplemental Table 2, available at www.jneurosci.org as supplemental material). Strikingly, phase-locking significantly increased from 0.37 to 0.91 target probabilities (permutation tests with at least p < 0.05; Fig. 2A–H). Thus, the more predictable the target stimulus was, the more precisely the phase of delta oscillations was locked to the expected time of stimulus onset.
Phase entrainment of delta oscillation to the time of expected target events
Given that in experiment 1, different cues appeared with different sequential probabilities, in the second experiment, the probabilities of the cues within the sequence were set equal and we manipulated the cue-to-target temporal relationship (Fig. 1C; supplemental Fig. 1, available at www.jneurosci.org as supplemental material). Cue 1 predicted that the target would either follow after 1350 ms (early target; p = 0.2) or after 2700 ms (late target; p = 0.8). Cue 2 predicted an early target with p = 0.8 or a later target with p = 0.2. Participants (n = 11) were instructed to respond to targets as fast as possible. As was expected on the basis of the results of experiment 1, significantly shorter RTs were obtained for higher- than for lower-probability early targets (Fig. 1D).
Experiment 2 allowed us to investigate phase entrainment without confounding stimulus-evoked and motor-related activity by analyzing instantaneous delta-phase values at the time of the expected delivery of early targets on late-target trials (i.e., when no early target was delivered). Delta-band oscillations extracted from the EEG signal recorded at midline electrode locations (Fz, Cz, Pz; figures show data from Cz) again preceded the expected target onset (Figs. 2N, 3E,F; supplemental Figs. 8–10, available at www.jneurosci.org as supplemental material). Significant phase entrainment was observed in both conditions (Rayleigh's test, p < 0.05, supplemental Table 2, available at www.jneurosci.org as supplemental material). Similarly to experiment 1, more concentrated phase distribution was found with p = 0.8 than with p = 0.2 early-target probability (permutation test, p < 0.05; Fig. 2I–L). Thus the accuracy of phase locking was stronger at higher target probability.
Task performance correlates with prestimulus delta oscillatory phase
We showed that the strength of phase entrainment of delta-band oscillations depends on the level at which a stimulus is predicted. Can this difference in the entrainment of cortical oscillatory activity be directly linked to task performance or is it the consequence of neuronal mechanisms, which are not involved in forming the overt response to the target? To test whether or not the phase entrainment of delta-band oscillations plays a role in target detection, using single trials in experiment 1, we examined the correlation between the phase of the delta-band oscillation at the onset of the target tone and 1) reaction time as well as 2) latency and 3) peak amplitude of the prominent delta-peak constituent of the P300 ERP response (referred to as the delta-peak; see Fig. 2M). All three measures significantly correlated with the phase of the prestimulus delta oscillation at each target probability (Fig. 4, see circular-linear correlation coefficients and significance values). Moreover, delta phase-reaction time correlations exhibited only subtle changes and remained highly significant when circular-linear partial correlations were calculated with P300 latency or amplitude as controlling variables (supplemental Tables 9, 10, available at www.jneurosci.org as supplemental material). These results are consistent with a scenario in which prestimulus delta phase exerts an independent influence on both reaction time and P300 response. Figure 4 also shows that fast reaction times coincided with short-latency high-amplitude delta-peaks characterizing best task performance. This combination of behavioral and EEG measures was obtained on trials in which the target was delivered during the rising slope (close to the negative peak) of the delta-band filtered EEG signal (Fig. 2M). The phase values related to optimal responses (Fig. 4) were close to the mean delta phase values at the expected onset time of the targets in high expectancy conditions (Fig. 2G,H).
Finally, we tested the laterality of the delta phase effect, as significant lateralization would suggest that motor readiness is a central factor of phase entrainment. We performed the comparison of phase locking of prestimulus delta waves recorded from the left central (C3) and right central (C4) leads in experiment 1. We could not show any significant difference either in the depth of phase modulation (supplemental Fig. 12, available at www.jneurosci.org as supplemental material and supplemental Table 7, available at www.jneurosci.org as supplemental material) or in the strength of correlation between prestimulus delta phase and (1) reaction time, (2) P300 delta peak latency, (3) P300 delta-peak amplitude (supplemental Table 8, available at www.jneurosci.org as supplemental material). This lack of lateralization indicates that motor readiness may not be the central driving force of the entrainment; however, the entrainment of motor cortical areas to predictable visual stimuli (Saleh et al., 2010) probably does contribute to the entrained delta oscillations analyzed.
Discussion
The reliability of cue-based conscious expectation improves with increasing predictive value of the cue. Here we report that expectation strengthens delta-band phase-entrainment to and speeds up the detection of relevant stimulus events. Our results further reveal that target events expected with high confidence are likely to 1) coincide with the negative-going phase of the frontocentral delta oscillations, 2) elicit short-latency and high-amplitude delta-peak (P300) ERP responses, and 3) are quickly detected consciously. Therefore, we suggest that the entrainment of cortical delta oscillations to expected stimulus onsets plays an important role in mediating the effects of anticipation on target detection in humans. Our study extends the recent finding that low-frequency oscillations entrain to rhythmic stimuli in macaques (Lakatos et al., 2008) to humans. Moreover, our results go beyond the previous understanding about passive phase-entrainment, providing evidence that phase concentration—and its beneficial effects on event processing—depends on the likelihood of occurrence of the behaviorally significant event, suggesting that phase entrainment is not merely a mechanistic consequence of periodic stimulation.
Our results were obtained by using a rhythmic auditory target detection task. One might argue that this paradigm imposes a temporal pattern on neural activity, however, so do most naturalistic stimuli, including biological motion and vocal communication. Under these predictable circumstances, neuronal oscillations can entrain to the structure of the stimulus stream (Schroeder et al., 2008). It has also been proposed recently that, at least in a set of tasks, where the temporal structure of attended stimuli is rhythmic or predictable, the brain operates in a rhythmic mode, in which low-frequency rhythms provide a general oscillatory framework for stimulus processing (Schroeder and Lakatos, 2009) (see also supplemental Notes, available at www.jneurosci.org as supplemental material). We established the presence of low-frequency oscillations during an auditory target detection task by visualizing delta-band filtered and wide-band single-trial EEG recordings (Fig. 3; supplemental Fig. 13, available at www.jneurosci.org as supplemental material). Because in experiment 1, cue stimuli were always followed by another (cue or target) stimulus in 1200–1500 ms, only a few (usually two, sometimes three) oscillatory cycles could be observed between two stimuli. Thus, one might argue that a higher minimal number of oscillatory cycles is needed to assess the presence of oscillatory activity. Nevertheless, we followed the definitions and terms of previous works related to phase entrainment that implemented similar temporal arrangement of periodic stimuli (Lakatos et al., 2007, 2008, 2009). Moreover, slow oscillations during normal human sleep often comprise only short bouts of low-frequency oscillation, usually consisting of only a few cycles (Lee-Chiong, 2006; Parrino et al., 2006). In many reports, the detection of a single cycle or even a positive or negative half wave justifies the use of the term “oscillation” (Massimini et al., 2004; Mölle et al., 2006). Additionally, that we consider rhythmic low-frequency amplitude fluctuations as oscillations despite the limited number of cycles in experiment 1 is further supported by longer stimulus-free segments in experiment 2, in which interstimulus interval was 2700 ms in late target trials. In these cases, the continuation of delta-band oscillations after expected, but not delivered target stimuli is clearly visible (Fig. 3E,F).
Although digital filtering can cause the distortion of the original signals and such effects cannot be completely ruled out in experiment 1, the design of experiment 2, in which no stimulus-evoked activity could contaminate the prestimulus signals, allowed us to control for such undesired phenomena. Moreover, we performed additional data analysis and simulations to exclude the possibility that filter artifacts substantially affected our results of both experiments, which are detailed in the supplemental material (supplemental Notes, supplemental Figs. 14–16, and supplemental Tables 4–6, available at www.jneurosci.org as supplemental material).
Delta oscillations (0.5–3 Hz) are present in the background EEG activity not just in sleep, but in awake state and are thought to be generated by neocortical and thalamocortical networks. These oscillations play an important role in large-scale cortical integration (Bruns and Eckhorn, 2004; Jerbi et al., 2007), especially in the frontocentral parietal anticipatory network (Babiloni et al., 2006; Padilla et al., 2006). Event-related responses are assumed to arise at least partly by superposition of event-related oscillations in several frequency ranges (Karakaş et al., 2000; Schürmann et al., 2001; Gruber et al., 2005; Klimesch et al., 2007). The human P300 ERP component (for review, see Polich, 2007) was observed in a variety of tasks that require stimulus discrimination. In addition to being linked to context update processes (Donchin and Coles, 1988), it was suggested to originate from the neural linkage between stimulus perception and the initiation of the response (Verleger, 1997; Verleger et al., 2005). Although P300 is a compound response consisting of delta, theta and alpha frequency constituents, it is dominated by the delta-band component (Schürmann et al., 2001; Bernat et al., 2007; Ergen et al., 2008; Ishii et al., 2009). Parts of our findings replicate well documented phenomena, including larger P300 response to lower sequence probability events (Mars et al., 2008) (supplemental Figs. 5–7, available at www.jneurosci.org as supplemental material, show that cue events with lower probability of occurrence evoke larger P300 response). However, less frequently occurring cue tones were more probably followed by the target tone, and the more predictable the target was, the larger P300 response it has evoked (Fig. 2M shows that targets following cue events with lower sequential probability evoke larger delta-band response). The latter effect of different levels of expectancy was also apparent in experiment 2, where the probabilities of the cues within the sequence were set equal.
Our present results suggest that the entrainment of cortical delta oscillations to periodic and expected stimulus onsets plays an important role in mediating the effects of anticipation on target detection in humans when the target onset is predictable, and when events in the attended task occur at time intervals that correspond to the wavelength of delta oscillations (∼1 s). Other neural correlates of predictive behavior have been suggested by Riehle et al. (1997), reporting that the synchronization of single neurons in the monkey motor cortex is related to predictable stimuli. Although it is not straightforward to make speculative comparisons between human scalp EEG recordings and monkey single-unit data, the synchronization of neuronal ensembles proposed by Riehle et al. could reflect large-scale network synchrony leading to an entrainment of oscillatory activity (see also below). This is in line with recent results indicating large-scale delta phase-coupling between primary motor cortex and multiple brain regions including structures of the frontoparietal circuit and cerebello-thatamo-cortical pathway during the performance of a continuous visuomotor task (Jerbi et al., 2007).
Rhythmic changes in the local field potential reflect the responsiveness of cortical neurons (Lakatos et al., 2005, 2008; Haider et al., 2007). As an example, visual perceptual performance fluctuates along with the phase of background EEG activity (Busch et al., 2009; Mathewson et al., 2009). Coherent oscillations can indicate synchronization of excitable phases within a large-scale network comprising distant neuronal ensembles. Phase synchronization was suggested as a key mechanism of neuronal communication by dynamically linking neurons into functional networks, which process task-relevant information and select behavior during sensory-motor integration (Womelsdorf and Fries, 2006; Womelsdorf et al., 2007). Consequently, the entrainment of the phase of cortical delta oscillations to the predicted time of a stimulus event may facilitate efficacious neuronal communication between the brain areas of sensory processing and response execution. Thus the prestimulus modulation of the delta rhythm may serve as a neural mechanism underlying the faster processing of expected events.
Footnotes
This research was supported by the Hungarian Scientific Research Fund (Grant OTKA K-81357 to I.U.) and the National Institutes of Health (Grant R21 DC010415 from the National Institute on Deafness and Other Communication Disorders to P.L.). We thank George Karmos and Viktor Varga for critical discussions and invaluable comments on the manuscript.
- Correspondence should be addressed to Gábor Stefanics, Institute for Psychology, Hungarian Academy of Sciences, 83-85 Szondi Street, H-1068, Budapest, Hungary. gstefan{at}cogpsyphy.hu