Abstract
Higher cognitive functions, such as human perceptual decision making, require information processing and transmission across wide-spread cortical networks. Temporally synchronized neural firing patterns are advantageous for efficiently representing and transmitting information within and between assemblies. Computational, empirical, and conceptual considerations all lead to the expectation that the informational redundancy of neural firing rates is positively related to their synchronization. Recent theorizing and initial evidence also suggest that the coding of stimulus characteristics and their integration with behavioral goal states require neural interactions across a hierarchy of timescales. However, most studies thus have focused on neural activity in a single frequency range or on a restricted set of brain regions. Here we provide evidence for cooperative spatiotemporal dynamics of slow and fast EEG signals during perceptual decision making at the single-trial level. Participants performed three masked two-choice decision tasks, one each with numerical, verbal, or figural content. Decrements in posterior α power (8–14 Hz) were paralleled by increments in high-frequency (>30 Hz) signal entropy in trials demanding active sensory processing. Simultaneously, frontocentral θ power (4–7 Hz) increased, indicating evidence integration. The coordinated α/θ dynamics were tightly linked to decision speed and remarkably similar across tasks, suggesting a domain-general mechanism. In sum, we demonstrate an inverse association between decision-related changes in widespread low-frequency power and local high-frequency entropy. The cooperation among mechanisms captured by these changes enhances the informational density of neural response patterns and qualifies as a neural coding system in the service of perceptual decision making.
Introduction
Temporally synchronized neural firing patterns are advantageous for efficiently representing (Von der Malsburg, 1981; Gray and Singer, 1989) and transmitting (Fries, 2005) information within and between neural assemblies (for recent reviews, compare Buzsáki, 2006; Wang, 2010). However, computational (e.g., Rolls et al., 2003), empirical (e.g., Chalk et al., 2010; Schneidman et al., 2011), and conceptual accounts (Hanslmayr et al., 2012) hold that neural firing rates are more redundant when neural synchronization is increased. The latter observation seems to contradict the claim that synchronized firing patterns improve information representation and transfer. Nevertheless, the two positions can be reconciled by assigning different roles to slow (<20 Hz) versus fast (>30 Hz) neural activity (von Stein and Sarnthein, 2000; Schroeder and Lakatos, 2009). The exact temporal interplay between slow rhythms and fast neural firing is well suited to represent stimulus-specific information (Sauseng et al., 2009; Whittingstall and Logothetis, 2009; e.g., Axmacher et al., 2010). Furthermore, population activity, such as measured by local field potentials or EEG, and single-neuron firing patterns may convey independent sources of information about visual inputs (Belitski et al., 2008; Montemurro et al., 2008). Hence, different neural codes can be expressed over a wide range of timescales and might be advantageous for encoding complementary information, ultimately enhancing the informational density of activity patterns (e.g., Panzeri et al., 2010).
Reaching decisions about action alternatives based on information from the environment is an important higher level cognitive function (Lee, 2013). In this context, the term “perceptual decision making” refers to the process of transforming sensory signals into a percept that is closely linked to an associated response (Shadlen and Newsome, 2001; Gross and Ploner, 2009). To decide between alternative behavioral responses, lower level sensory regions need to represent evidence about the identity of a given percept (e.g., Hebart et al., 2012). In addition, the sensory signal needs to be accumulated and compared against evidence for decision alternatives (Ratcliff, 1978). The latter probably involves higher-level cortical regions (e.g., Heekeren et al., 2004). Hence, the decision for one or the other action based on available sensory evidence requires the interaction of a widespread neural network comprising sensory-specific areas, as well as control regions in the parietal and prefrontal cortex in conjunction with motor areas (for an overview, see Heekeren et al., 2008). Accordingly, previous results suggest that accumulation of sensory evidence in support of behavioral decisions can be considered a broadband phenomenon resulting in neural activity patterns across various frequency ranges (e.g., Bauer et al., 2006; Donner et al., 2007, 2009; van Vugt et al., 2012). Hence, perceptual decision making presents a paradigmatic case to study the cooperative dynamics of rhythmic neural activity across frequency ranges and brain networks.
Accordingly, the present study, aimed to unravel the single-trial mechanisms by which temporally structured low-frequency neural activity patterns interact with local increases in neural firing activity, approximated as the entropy of high-pass filtered (>30 Hz) EEG traces (see, e.g., Buzsáki et al., 2012), to support processing of sensory information about decision alternatives.
Materials and Methods
Participants.
The current report used data from a subsample of the COGITO study (compare Schmiedek et al., 2010), conducted at the Max Planck Institute for Human Development (Berlin). Participants were recruited through newspaper advertisements, word-of-mouth recommendation, and flyers circulated in Berlin, for a longitudinal study on training and day-to-day variability of cognitive performance. The ethics committee of the Max Planck Institute for Human Development approved the study. All participants gave written informed consent.
In the COGITO study, a relatively large sample of individuals in an intervention group (n = 204) practiced a broad battery of perceptual speed, working memory, and episodic memory tasks for over 100 one hour sessions. A no-contact control group (n = 83) took part in an extensive cognitive assessment before (pretest) and after (posttest) the 100 d of cognitive training only. Approximately 30% of the parent sample also volunteered and were eligible for participating in EEG recordings that were conducted in the context of the pretest and post-test assessments. Extensive descriptives for the EEG subsample as well as comparisons with the COGITO parent sample were provided previously (Grandy et al., 2013a, b).
To be eligible, participants had to have normal or corrected-to-normal vision and be right-handed. In addition, they had to report not to suffer from any of the following conditions: cardiovascular disease, except for treated hypertension; diabetes; neurological or psychiatric conditions; use of antiseizure or antidepressant drugs; or drug or alcohol abuse. For the present report, we analyzed data from the pretest assessment. Hence, the sample size could be increased by aggregating across experimental and control groups (N = 40, 21 female; mean ± SD age, 25.17 ± 2.81 years).
Tasks and stimuli.
Behavioral performance in the present report is derived from choice reaction time (CRT) tasks that were administered during EEG recordings. Participants performed three versions of the task: numerical, verbal, and figural (for a schematic of stimuli, see Fig. 1). In the numerical version, they had to indicate whether a presented digit was odd or even, in the verbal version, whether a presented letter was a consonant or a vowel, and in the figural version, whether the presented figure was symmetric or asymmetric with respect to a vertical axis. The three CRT tasks were based on the same stimulus layout: the seven lines of the number “8” as displayed on pocket calculators (Fig. 1). Target stimuli were masked with a configuration that combined this “calculator 8” with extending lines in all 10 possible directions. The stimuli were presented on a 17 inch LCD screen with a frame rate of 60 Hz for ∼50 ms (i.e., 3 frames) followed by a mask for 1950 ms. The interstimulus interval was set to 2380 ms. Additionally, one-third of the trials consisted of “empty” (null) stimuli; that is, only an empty frame was shown for 50 ms followed by the mask. Each version consisted of 192 stimuli plus 96 null stimuli, split into 12 blocks each, which were presented in random order. Participants were instructed to respond as quickly and accurately as possible to which category the stimulus belonged by pressing a button on the keyboard with either their right index or right ring finger. They were asked not to respond and wait for the next stimulus when a null stimulus was presented. Stimulus presentation and recording of behavioral responses were controlled with E-Prime 1.2 software (Psychology Software Tools).
EEG data acquisition.
EEG was recorded continuously with BrainAmp amplifiers (Brain Products) from 64 Ag/AgCl electrodes. Sixty scalp electrodes embedded in an elastic cap (EASYCAP) were organized according to the 10% system (compare Oostenveld and Praamstra, 2001). AFz served as ground. Two electrodes were placed on the outer canthi (horizontal EOG) and one electrode below the left eye (vertical EOG) to monitor eye movements. During recording, all electrodes were referenced to the right mastoid electrode, whereas the left mastoid electrode was recorded as an additional channel. Electrode impedances were maintained <5 kΩ before recording. The EEG was recorded with an analog pass band of 0.1–250 Hz and digitized with a sampling rate of 1 kHz.
EEG data preprocessing.
Preprocessing and analysis of EEG data were performed using the EEGLAB (Delorme and Makeig, 2004) and FieldTrip toolboxes (developed at the F.C. Donders Centre for Cognitive Neuroimaging, Nijmegen, The Netherlands; http://fieldtrip.fcdonders.nl/) (Oostenveld et al., 2011), as well as custom-written MATLAB (MathWorks) code. EEG data were rereferenced to mathematically linked mastoids, filtered with a fourth-order Butterworth filter and a pass band of 0.5–100 Hz, and segmented into epochs of 3.5 s around stimulus onset (−1 to 2.5 s). Segments were visually inspected, and all epochs containing artifacts other than eye blinks and eye movements were excluded from further analyses. After manual artifact rejection, an extended infomax-independent component analysis (Bell and Sejnowski, 1995) was conducted to correct for eye blinks and movements. Afterward and to attenuate effects resulting from volume conduction, Laplacian current source density (CSD) (Perrin et al., 1989) was calculated for all channels of the artifact-controlled EEG recordings (order of splines: 4; maximum degree of Legendre polynomials: 10; λ: 1e-5) using the CSD-toolbox (Kayser and Tenke, 2006; Kayser, 2009). Time-varying power and phase information for each trial and electrode were extracted by applying the Hilbert-transform to bandpass filtered CSD signals (fourth-order Butterworth filter) in the α (8–14 Hz) and θ (4–7 Hz) range (Le Van Quyen et al., 2001). To assure comparability of changes in power across individuals, frequency bands, and tasks, each trial was normalized with regard to a prestimulus baseline (−0.2 to 0 s). Afterward, the oscillatory power (P) in each trial was expressed as a relative signal change with regard to the prestimulus baseline (e.g., Pfurtscheller and Aranibar, 1977). For each electrode (e) and time point (t) in trial j, the relative signal change was computed as follows:
EEG single-trial analysis of oscillatory power.
To reveal the within-trial dynamics of rhythmic neural activity patterns for each individual (e.g., Rousselet et al., 2011; Cohen and Cavanagh, 2011), single-trial correlation analyses were performed. More precisely, we computed correlations between reaction time (RT) and the relative change in signal power at each point in electrode-time space for each participant. As normality cannot be guaranteed for power and derived measures (Kiebel et al., 2005), robust Spearman correlations were used (Rousselet and Pernet, 2012). The resulting correlation coefficients (ρ) were converted to z-scores by means of the Fisher-Z transform. The resulting z-values represent a first level statistic for testing the strength of the association between relative power changes and RT at each electrode-time point within individuals (e.g., van Dijk et al., 2010; Werkle-Bergner et al., 2012 for similar approaches).
Only artifact-free trials with correct responses were entered in the single-trial analyses. Furthermore, trials with response times exceeding a threshold of 3 SD of the distribution for a given task were excluded from the analysis. Detection of RT outliers was done iteratively; that is, after exclusion of outlier trials in step i-1, the mean and SD were recomputed for the remaining trials on step i and again checked for trials exceeding the threshold. The procedure stopped when no further outliers were detected.
Time-varying entropy analysis.
The amount of information about a given stimulus contained in a neural response pattern can be quantified by measures derived from information theory (Quian Quiroga and Panzeri, 2009). The amount of information conveyed in a time-varying signal can be estimated by measuring the uncertainty about the values that might be taken by the random variable underlying the time series. The Shannon Entropy H(T) of the distribution P(v) of values v in a given time-frame T (e.g., Bishop, 2006) provides such as measure: To estimate changes in information content within a given trial and electrode, the entropy measure was calculated within sliding time windows of 256 points (i.e., 256 ms) centered at the time point of interest, between −0.2 and 1.4 s, for every available sample point (i.e., step size of 1 ms). The stability of entropy estimates for short time windows has been demonstrated previously (Bezerianos et al., 2003).
The entropy of a signal is dependent on the range of the distribution P(v), in a way that sharply peaked distributions tend toward lower entropy compared with widely spread distributions with a broad range of values (Bishop, 2006). One may argue that time-varying changes in entropy simply reflect signal compression/expansion because of variability in the high-amplitude low-frequency components that typically dominate the EEG signal, as reflected in the ubiquitous 1/fa frequency spectrum (e.g., He et al., 2010). Moreover, simultaneous intracranial and EEG recordings in monkeys have revealed that neural firing activity, reflecting stimulus-specific information, is strongly correlated with low-amplitude high-frequency EEG activity (Whittingstall and Logothetis, 2009). Hence, high-frequency activity in the scalp EEG can be regarded as a viable noninvasive approximation to modulations of local neural firing activity (Buzsáki et al., 2012). Therefore, preprocessed (see description above) artifact-free CSD-transformed (Perrin et al., 1989; Kayser and Tenke, 2006; Kayser, 2009) single-trial EEG traces were high-pass filtered (cutoff frequency: 30 Hz; fourth-order Butterworth filter) to estimate changes in information content independent of the fluctuations in the α and θ frequency ranges.
For each single trial, the poststimulus entropy (Epst) was compared against prestimulus entropy (Epre; t = [− 0.2,0]), by converting poststimulus estimates to z-values. For each electrode (e) and time point (t) in trial j, the z-values were computed as follows: The z-values represent a level 1 statistic, reflecting the poststimulus change in entropy, controlling for possible electrode and trial-specific noise confounds.
Comparison of stimulus-present versus stimulus-absent conditions.
To reveal neural mechanisms specific to evidence accumulation, trials that required a perceptual decision (STIM; stimulus present) were compared against trials were no accumulation was possible because of the absence of a stimulus (NOSTIM). Similar to the correlation analyses, a first level statistic was computed by comparing STIM versus NOSTIM trials within participants by means of independent-samples t tests at each electrode time point. The resulting t-scores were converted to z-values. This set of analyses was primarily motivated by the temporal dynamics over frontocentral and occipitoparietal electrode regions identified in the single-trial correlation analyses (for definition of electrode clusters, see Results). Accordingly, the z-values were averaged across electrodes and subjected to second-level analyses across time. STIM and NOSTIM conditions were compared for measures of oscillatory power in the α and θ frequency range.
EEG group level analyses of power and entropy: cluster-based permutation tests.
To assess the reliability of effects derived from the single-trial analyses at the group level, cluster-based permutation tests as implemented in FieldTrip (Maris and Oostenveld, 2007) were used. More precisely, for analyses across the entire scalp, at each electrode time point, the individual z-maps were subjected to a mass-univariate cluster-based t test (testing for significant differences from zero, two-sided, 1000 iterations; minimum number of neighboring channels per cluster = 2, neighborhood extent = 40 mm; cluster entry level: p < 0.01; cluster-significance level: p < 0.05). In essence, this procedure identifies time–electrode clusters for which the assumption of no association between power and RT can be rejected at the group level, and effectively controls for multiple comparisons (Maris and Oostenveld, 2007; Groppe et al., 2011). Similarly, for analyses comparing the time course at electrode regions of interest, clusters were identified as neighboring time points for which the difference between conditions, reflected in z-scores, was reliably different from zero at the group level (two-sided, 1000 iterations; cluster entry level: p < 0.01; cluster significance level: p < 0.05). The same procedure was used to assess the reliability of poststimulus changes in high-frequency entropy at the group level.
Results
Behavioral results
Overall, participants were highly efficient in detecting the stimulus-absent (NOSTIM) trials reflected in a mean accuracy of 99.9% (SE 0.02). A summary of the behavioral performance in stimulus present (STIM) trials separately for the three task conditions is provided in Table 1.
The decision accuracy, measured as the percentage of correct responses across all STIM trials of a given task condition, varied across content domains (F(2,78) = 37.5, p < 0.05) as revealed by an one-way repeated-measures ANOVA with the factor task type (numerical vs figural vs verbal). The accuracy in the verbal task was lower compared with both the numerical (t(39) = 7.9, p < 0.0167) as well as the figural task (t(39) = 6.8, p < 0.0167), whereas the latter two did not differ (t(39) = 1.49, p > 0.14).
The speed of the decision process was measured as the median RT across all STIM trials for a given task condition, separately for correct and incorrect responses. A two-way repeated-measures ANOVA with factors task type (numerical vs figural vs verbal) and response type (correct vs incorrect) revealed reliable main effects (task type: F(2,78) = 11.9, p < 0.05; response type: F(1,39) = 80.6, p < 0.05), although the interaction did not reach significance (F(2,78) = 2.5, p > 0.08). Overall, correct responses (mean ± SD, 702 ± 10 ms) were faster than incorrect ones (mean ± SD, 881 ± 19 ms). Decisions were taken faster in the figural compared with both the numerical (t(39) = 3.1, p < 0.0167) and the verbal task (t(39) = 5.7, p < 0.0167), whereas response speed was comparable for the latter (t(39) = 0.9, p > 0.39).
Single-trial analyses of oscillatory power
Previous reports point to frontocentral θ oscillations as likely neural implementation of general accumulators in decision making tasks (van Vugt et al., 2012; Guitart-Masip et al., 2013). But frontocentral θ networks do not work in isolation; they depend on the availability of task-related sensory information. Thereby both stimulus characteristics (compare Philiastides and Sajda, 2006, 2007) and the appropriate top-down states (e.g., Capotosto et al., 2009) determine the quality of available sensory information. In this regard, the dynamics of α band activity surrounding stimulus onsets carries information about controlled sensory processing states (Busch et al., 2009; Mathewson et al., 2009; Romei et al., 2010; Haegens et al., 2011a). We reasoned that fast decisions should be supported by efficient generation of sensory evidence about decision alternatives. As synchronized states of low-frequency neural activity are generally seen to reflect local inhibition (e.g., Klimesch et al., 2007), we expected decision speed to be related to increased signal entropy (e.g., Hanslmayr et al., 2012) as indexed by the duration of desynchronized low-frequency states over sensory-specific regions.
Figure 2 depicts the electrode-time points for which the single-trial correlation analyses resulted in reliable associations between changes in signal power and RT at the group level for the α and θ band, respectively. With regard to the α frequency band, for all three tasks, one significant positive and one significant negative cluster were obtained, respectively (p < 0.05, cluster level). For the θ band, only one negative cluster was obtained, except for the figural task, where an additional positive cluster reached significance (p < 0.05, cluster level).
The topographical distribution of the effects is illustrated in Figure 3. Overall, the pattern was qualitatively comparable across all tasks. A larger signal power compared with baseline was associated with slower response times (negative clusters) at electrodes over frontocentral locations for both α and θ oscillations. The time course of these effects differed between the two frequency bands. In the α band, the time period during which lower power was associated with faster decisions lasted from ∼350 ms to ∼600 ms after stimulus onset; in the θ band, the corresponding effect was observed later and persisted over a longer time period (∼450–1400 ms). In contrast, lower poststimulus power was associated with slower decisions for a cluster of electrodes located over parietal regions. This cluster was primarily observed in the α band, and extended from ∼ 550 to 1400 ms after stimulus onset. Only for the figural task, an additional positive cluster was observed in the θ range.
A close inspection of the temporal evolution of the group effects suggests an interactive within-trial dynamic between α and θ rhythms (Fig. 2). The negative α cluster appears most pronounced before execution of the manual response, given that it is mainly expressed before the median RT (marked by the red dotted line). Conversely, the positive α and negative θ clusters emerge simultaneously past the median RT.
To further unpack the underlying within-trial dynamics of α and θ oscillations, mean signal changes for a representative central (FC1, FCz, FC2, C1, Cz, C2) and posterior (P7, P5, P3, Pz, P2, P4, P8, PO7, PO3, POz, PO4, PO8, O1, Oz, O2) cluster were computed for each trial (Fig. 3, highlighted electrode locations). Afterward, single trials were sorted across all participants according to their RT (e.g., Makeig et al., 2002, 2004).
As shown in Figure 4, the RT-related within-trial dynamics of α and θ oscillations was similar across tasks. A strong stimulus-induced power increase (warm colors) was observed within the first 300 ms over posterior recording locations for both α and θ frequency ranges. This power increase was unrelated to speed of responding and possibly reflects processes related to stimulus encoding (Philiastides and Sajda, 2006; Philiastides et al., 2006). In the α band, the initial power increase at posterior electrodes was followed by a power decrease (e.g., Pfurtscheller and Lopes da Silva, 1999; Hanslmayr et al., 2012) (α desynchronization) that extended until response execution and rebounded afterward. Hence, the relation between stronger power increase and shorter RT observed in the single-trial correlation analyses can be explained by an earlier termination of the α desynchronization and a faster resynchronization afterward. Conversely, the observation of an association between prolonged RT and a relative increase in signal power was reflected in a steady increase of oscillatory neural activity until response execution over central regions that breaks down afterward (van Vugt et al., 2012). Of note, the strong increase in oscillatory θ activity over central regions was taking place simultaneously with the power decreases in α activity over posterior regions (Fig. 5).
Low-frequency EEG dynamics for stimulus-present versus stimulus-absent conditions
To reveal the specificity of neural activity in the α and θ bands for evidence accumulation, trials that required a behavioral decision (stimulus present, STIM) were compared against trials that did not require evidence accumulation because of the absence of a task-relevant stimulus (stimulus absent, NOSTIM). The analysis was focused on those central and parietal recording locations that revealed robust associations between changes in signal power and individual RT distributions (see Fig. 3; description above).
The results revealed comparable differences in the time course of power changes between STIM and NOSTIM for all three CRTs (for a graphical summary, see Fig. 5). Within the α frequency range, the processing of stimulus-related information during STIM trials was associated with a lower increase in signal power over central regions (p < 0.05, cluster level), starting at ∼0.4 s after stimulus. Similarly, over posterior electrode sites, the absence of decision-related sensory information in NOSTIM trials was related to a shorter desynchronization and a faster rebound, reflected in lower signal power for STIM trials during the resynchronization phase (p < 0.05, cluster level).
For the θ frequency range, NOSTIM trials were associated with a stronger and earlier power increase over central locations reflected in reliably larger power changes compared with STIM trials (p < 0.05, cluster level) in the first 0.5 s after stimulus onset. Only for the figural task, this difference was inverted in a later time window (∼1.1–1.4 s after stimulus, p < 0.05, cluster level). For the posterior electrode cluster, STIM trials were associated with a delayed decrease in signal power in the numerical and figural task in a time window from ∼0.35 to 0.5 s (p < 0.05, cluster level). During later time windows (>0.5 s), NOSTIM trials elicited larger power increases in the numerical and verbal task (p < 0.05, cluster level).
Changes of high-frequency signal entropy accompany low-frequency dynamics in opposing direction
Finally, we addressed the prediction that states of synchronized or desynchronized local low-frequency activity should coincide with decreased/increased signal entropy (Hanslmayr et al., 2012). For this analysis, we examined whether reliable changes in entropy during the poststimulus time interval were found when compared with a prestimulus epoch (−0.2 to 0 s). This was achieved by testing within-person z-values reflecting the change from prestimulus baseline by cluster-based permutation analyses on the group level for a central (FC1, FCz, FC2, C1, Cz, C2) and a posterior (PO7, PO3, POz, PO4, PO8, O1, Oz, O2) electrode region of interest, respectively. The main results are summarized in Figure 6.
In all three tasks, the increase in θ power over central regions was paralleled by a long-lasting decrease in high-frequency signal entropy (p < 0.05, cluster level) that peaks around the mean RT. Inspection of single-trial dynamics (Fig. 7) indeed revealed a decrease of signal entropy that was maximally expressed around response execution and only slowly returned to baseline afterward.
For posterior recordings, locations a different picture emerged. Here, high-frequency signal entropy increased in an early time window (∼0.15–0.4 s) after stimulus onset in all three tasks (p < 0.05, cluster level). Only for the numerical task, a late cluster of signal entropy increases also was statistically reliable (p < 0.05, cluster level). The single-trial dynamics of posterior entropy changes (Fig. 7) suggests that the reliable early increases reflect stimulus-related processing. A direct visual comparison of the temporal evolution of changes in posterior α power and high-frequency signal entropy (Fig. 8) shows that the increase in entropy sets in only after the initial increase in stimulus driven α power. It follows that the stimulus-related increase in high-frequency signal entropy mainly accompanies the early phase of the posterior α power desynchronization.
Discussion
The coordinated dynamics of neural activity in the α and θ frequency ranges are tightly linked to the timing of decisions about stimulus identity in numerical, verbal, and figural CRTs. The observed dissociations in time and topography point to different roles for neural operations in these frequency bands (Figs. 2 and 4).
Frontocentral θ band activity is characterized by a gradual increase until response execution. By contrast, after an initial stimulus induced peak, posterior α activity is reduced before response execution and rebounds only afterward. The comparison between STIM and NOSTIM trials (Fig. 5) suggests that these neural mechanisms are involved in evidence accumulation. The NOSTIM trials present a condition where no task-relevant sensory information is available. Accordingly, no evidence accumulation is needed. Hence, the activity patterns observed in the α and θ bands presumably reflect interactions in cortical networks related to the controlled processing of sensory evidence about decision alternatives.
The frontocentral increases (θ) and posterior decreases (α) in signal power were paralleled by inverse changes in the signal entropy of EEG frequency components >30 Hz. Overall, the present results suggest that local changes in neural firing patterns (as approximated by high-frequency signal entropy) are controlled by network activity operating in the α and θ frequency ranges (compare Hanslmayr et al., 2012). The topographical and temporal similarity across the three task domains underscores the generality of the observed dynamics between decision-relevant mechanisms operating at low and high frequencies.
Frontocentral θ activity as a possible neural accumulator
In the present study, the process of decision formation was accompanied by an increase of neural activity in the θ frequency range over frontocentral regions (see also van Vugt et al., 2012; Guitart-Masip et al., 2013). Neural activity in the θ frequency range has been suggested to coordinate and integrate information from various sources (e.g., Sauseng et al., 2010; Colgin, 2011; Lisman and Jensen, 2013). One recent study (van Vugt et al., 2012) directly identified activity in the θ range as an index of evidence accumulation. In addition, using a multivariate GLM framework, van Vugt et al. (2012) observed that α regressors were related to individual differences in drift rates (i.e., the diffusion model parameter that represents efficiency of evidence accumulation) (see Ratcliff, 1978). These findings were interpreted as reflecting a “bleeding in” of θ activity into the α frequency range resulting from wavelet filtering (van Vugt et al., 2012). In the present report, we demonstrate that the dynamics of neural oscillations in the α and θ frequency range can indeed be dissociated based on their temporal and spatial profile.
The existence of neural activity patterns in the θ range is well documented for the hippocampal formation (e.g., Green and Arduini, 1954; Vanderwolf, 1969; Bland, 1986). Nevertheless, the exact neural sources for frontocentral EEG θ activity remain unclear. One likely mechanism entails the possibility that corticohippocampal feedback loops induce θ oscillations measurable at the scalp level (Miller, 1991). Given the associative binding capabilities of medio-temporal lobe (MTL) regions in conjunction with the adaptive control functions of PFC (Miller and Cohen, 2001), the interplay between MTL and PFC/ACC regions may allow for the implementation of a general accumulator (see Bland and Oddie, 2001). In line with these suggestions, previous results report sources in ACC and MTL areas, the latter being strongly synchronized with several prefrontal regions (Guitart-Masip et al., 2013). The coordinated processing of information generated in sensory regions may then be synchronized by temporally structuring neural activity in the θ frequency range (e.g., Hyman et al., 2005; Siapas et al., 2005; Benchenane et al., 2010; Fujisawa and Buzsáki, 2011).
In summary, in line with previous results, the present findings suggest that increases in frontocentral θ activity during response formation reflect controlled processing states in an interactive PFC-ACC-MTL network in the service of adaptive behavioral responses.
Posterior α activity in the service of controlled sensory processing
Most studies demonstrating fluctuations in cortical excitability by sensory input focused on the role of α synchronization as a marker for the timing of inhibition (Thut et al., 2006; Hanslmayr et al., 2007; Dijk et al., 2008; Haegens et al., 2011b, c). The modulations of posterior α power observed in the present study are well in line with the proposition that fluctuations in neural α activity represent a general mechanism for the timing of neural inhibition (Klimesch et al., 2007), and the gating of information flow in the brain (Jensen and Mazaheri, 2010).
In particular, the single-trial fluctuations in signal power offer important insights for the role of activity in the α frequency range during perceptual decision making (Fig. 4). An initial peak in α power is found immediately after stimulus onset, but this peak is unrelated to the duration of the decision process as indexed by RT. In contrast, the following relative decrease in α power covaries with RT and is, in turn, followed immediately by a strong α resynchronization. This sequence of events is in good agreement with the notion that sensory information is actively processed during the desynchronized α state (Spaak et al., 2012) and inhibited once a decision has been made (e.g., Romei et al., 2008; Haegens et al., 2011c).
In addition, the initial decrease in α power was accompanied by higher signal entropy. The present study provides first evidence from noninvasive EEG recordings to support the assumption that relative decreases in α activity occur with increases in signal entropy (Hanslmayr et al., 2012) and may thus be used to indicate changes in cortical excitability related to active stimulus processing (Lange et al., 2013). The strongest increase in high-frequency entropy was found during the initial part of α desynchronization. This pattern points to a release from local inhibition (Klimesch et al., 2007; Jensen and Mazaheri, 2010), which is reestablished around the time of response execution. Our interpretation is consistent with the observation that lower α power relates to better memory formation and retrieval (Klimesch et al., 1996, 1997; Waldhauser et al., 2012), presumably again for the reason that α desynchronization reflects active information processing (Hanslmayr et al., 2012).
In conclusion, we found that decision-related changes in widespread low-frequency power and local high-frequency entropy are inversely related to each other. We conclude that the cooperation among mechanisms captured by these changes enhances the information density of neural response patterns (e.g., Belitski et al., 2008; Scheeringa et al., 2011) and qualifies as an ensemble of neural codes, or a “neural coding system,” in the service of perceptual decision making. Future studies need to explore the generality of this system for other forms of goal-directed behavior.
Footnotes
This work was supported by the Max Planck Society (including the Innovation Fund Grant M.FE.A.BILD0005), the Sofja Kovalevskaja Award to M.L., administered by the Alexander von Humboldt Foundation and donated by the German Federal Ministry for Education and Research, the German Research Foundation (Deutsche Forschungsgemeinschaft KFG 163), and the German Federal Ministry for Education and Research [Center for Advanced Imaging CAI]. U.L. was supported by the Deutsche Forschungsgemeinschaft Gottfried Wilhelm Leibniz Award. We thank Douglas Garrett for valuable discussions about the measurement of entropy; Myriam Sander for critical comments on an earlier version of the manuscript; and Colin Bauer, Annette Brose, and all research assistants involved in data collection.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Markus Werkle-Bergner, Center for Lifespan Psychology, Max Planck Institute for Human Development, Lentzeallee 94, 14195 Berlin, Germany. werkle{at}mpib-berlin.mpg.de