Abstract
Hemodynamic research has recently clarified key nodes and links in brain networks implementing inhibitory control. Although fMRI methods are optimized for identifying the structure of brain networks, the relatively slow temporal course of fMRI limits the ability to characterize network operation. The latter is crucial for developing a mechanistic understanding of how brain networks shift dynamically to support inhibitory control. To address this critical gap, we applied spectrally resolved Granger causality (GC) and random forest machine learning tools to human EEG data in two large samples of adults (test sample n = 96, replication sample n = 237, total N = 333, both sexes) who performed a color–word Stroop task. Time–frequency analysis confirmed that recruitment of inhibitory control accompanied by slower behavioral responses was related to changes in theta and alpha/beta power. GC analyses revealed directionally asymmetric exchanges within frontal and between frontal and parietal brain areas: top-down influence of superior frontal gyrus (SFG) over both dorsal ACC (dACC) and inferior frontal gyrus (IFG), dACC control over middle frontal gyrus (MFG), and frontal–parietal exchanges (IFG, precuneus, MFG). Predictive analytics confirmed a combination of behavioral and brain-derived variables as the best set of predictors of inhibitory control demands, with SFG theta bearing higher classification importance than dACC theta and posterior beta tracking the onset of behavioral response. The present results provide mechanistic insight into the biological implementation of a psychological phenomenon: inhibitory control is implemented by dynamic routing processes during which the target response is upregulated via theta-mediated effective connectivity within key PFC nodes and via beta-mediated motor preparation.
SIGNIFICANCE STATEMENT Hemodynamic neuroimaging research has recently clarified regional structures in brain networks supporting inhibitory control. However, due to inherent methodological constraints, much of this research has been unable to characterize the temporal dynamics of such networks (e.g., direction of information flow between nodes). Guided by fMRI research identifying the structure of brain networks supporting inhibitory control, results of EEG source analysis in a test sample (n = 96) and replication sample (n = 237) using effective connectivity and predictive analytics strategies advance a model of inhibitory control by characterizing the precise temporal dynamics by which this network operates and exemplify an approach by which mechanistic models can be developed for other key psychological processes.
Introduction
Interference control during goal-directed behavior is a key cognitive operation. Many aspects of daily life require rapid yet accurate control of an indefinite amount of information by finite processing resources. Two overlapping lines of research pursuing such mechanisms are studies of large-scale cerebral connectivity and of neuronal oscillations. Cognitive operations might be better understood by incorporating knowledge of the network of anatomical connections and of the periodicity of neuronal activity that structures information to identify mechanisms of processing and their disruption in psychopathology. However, parameterization and interpretation of such data are often difficult due to the enormously high degrees of freedom inherent in functional brain data.
In a recent hemodynamic neuroimaging study, our group used graph theory, a formal description of a network and its emergent properties, to identify a particular network constellation optimized for specialized processing during increased demands for inhibitory control (Spielberg et al., 2015). This network was characterized by increased connectivity orchestrated primarily by dorsal ACC (dACC) and inferior frontal regions showing a flexible organization predictive of behavioral outcome. Given the temporal limits of conventional fMRI, that study could not determine possible causal relationships between nodes within this network, such as whether dACC has top-down influence on inferior frontal regions in the network.
Two large samples were combined to address these unresolved issues using recent advances in EEG analysis complemented by machine-learning techniques. EEG data from 96 participants in the fMRI study performing the same task during dense-array EEG recording typically within 2 weeks of the fMRI session were combined with EEG data from a replication sample of 237 participants to develop a mechanistic account of internodal network communication. Initial analyses characterized the time–frequency profiles of network constituents and quantified their relationship to behavioral performance. Spectrally resolved Granger causality (GC) analyses quantified key parameters of a given relationship, its directionality and frequency. Distinct frequency bands are theorized to represent coordinated state changes across local networks, so the present study evaluated the role of theta (4–8 Hz) and alpha/beta (10–30 Hz) in networks associated with top-down control and behavioral performance (reaction time, RT). Finally, random forest classification analysis was used to identify the importance of neural activity per network node for predicting individual brain states characterized by high versus low demand for inhibitory control.
Evidence suggests that top-down control can be accomplished via oscillatory activity <30 Hz (Cohen et al., 2012; van Kerkoerle et al., 2014; Bastos et al., 2015; Chmielewski et al., 2016; Michalareas et al., 2016; Wang et al., 2016; Babapoor-Farrokhran et al., 2017; Popov et al., 2017; Richter et al., 2017; Schoffelen et al., 2017). More specifically, theta activity likely reflects the coordination of a distributed network engaged in inhibitory control (Chmielewski et al., 2016; Babapoor-Farrokhran et al., 2017) and memory (Anderson et al., 2010) processes. Accordingly, we predicted that connectivity between PFC and dACC (per Spielberg et al., 2015) would be directed and frequency specific via theta activity.
Multiple frequency bands are expected to be implicated in a given task, modulating task-specific networks (Engel and Fries, 2010), and experimental and theoretical models have suggested upregulation of the strength of the desired response as a critical feature of inhibitory control (Munakata et al., 2011). Therefore, the network configuration was predicted to shift toward increased influence of PFC. Specifically, this upregulation would be reflected in modulation of alpha/beta power in motor regions. Beta-band activity has been theorized to signal maintenance of a sensorimotor set to facilitate efficient processing of feedback and recalibration of the sensorimotor system (Engel and Fries, 2010). Likely indicating a change in the status quo, decreased beta power in primary motor cortex has been observed when inhibitory control is exerted in stop signal tasks (Swann et al., 2009; Wheaton et al., 2009). Therefore, a decrease in network beta power was anticipated to be related to slower response time, particularly in the context of incongruent trials.
Materials and Methods
Participants.
A total of 333 (208 female, age 26.25 ± 10.28 years) participants were recruited from the community. Exclusion criteria were as follows: left handedness determined by the Edinburgh Handedness Inventory (Oldfield, 1971), head trauma with loss of consciousness, psychoactive medication, abnormal color vision, claustrophobia, recent drug/alcohol use, excessive caffeine intake, and lack of sleep.
Experimental design.
Experimental procedures were those reported in our previous research, with overlapping samples (Silton et al., 2010; Spielberg et al., 2015). A total of 256 trials were presented in 16 blocks (four congruent, four incongruent, and eight neutral) with intertrial intervals of 2000 ± 225 ms (onset to onset). Neutral trials were intermixed (50%) in congruent and incongruent blocks to prevent word reading strategies. A trial consisted of one word presented in one of four ink colors (red, yellow, green, or blue). Congruent (CON) trials were characterized by word meaning matching ink color (e.g., “GREEN” with green ink), whereas incongruent (INC) trials were characterized by word meaning differing from ink color (e.g., “GREEN” in red ink). In neutral (NEU) trials, word meaning and colors were unrelated (e.g., “CENTURY” in green ink). Control of stimulus order effects was achieved by each participant receiving one of eight stimulus orders. Participants were asked to respond via button press to the color of the word while ignoring the meaning of the word.
Data acquisition and preprocessing.
Data acquisition was performed while participants were seated in a comfortable chair in a quiet room (for details, see Silton et al., 2010). EEG was recorded with a custom Falk Minow 64-channel cap with equidistant Ag/AgCl electrodes. Electrode digitization followed cap placement for source localization purposes. The left mastoid served as the online reference for all sites. Impedances were below 20 kΩ, which is appropriate given the high input impedance of the amplifier (James Long). Half-power amplifier band pass was 0.1–100 Hz with a sampling rate of 250 Hz. Data analyses were performed with the MATLAB (The MathWorks) FieldTrip toolbox (Oostenveld et al., 2011) and the scikit-learn toolbox (Pedregosa et al., 2011) complemented by custom-written software. Trials with saccades or muscle artifact were excluded from analysis. On average 30.8 ± 1.7 CON trials and 28.3 ± 3.2 INC trials per subject entered the analysis. An independent components analysis (ICA) (Jung et al., 2001) was applied after demeaning and removing the linear trend from the data. Components associated with eye blink and cardiac activity were removed using ICA.
Spectral analysis.
Spectral analysis was computed for each trial using a fast Fourier transformation with a 500 ms sliding window multiplied by a Hanning taper. The time window advanced in 50 ms increments and spectral resolution was 2 Hz. Spectral power was calculated for each trial and then averaged across trials. Average baseline from −500 to −250 ms before stimulus onset was subtracted and the data converted to units of decibels change from prestimulus baseline.
Source analysis.
Anatomical MRI and EEG data were coregistered to a common coordinate system (Montreal Neurological Institute, MNI) by applying an initial coarse registration between the fiducial locations (nasion, left and right pre-auricular points) and subsequent refined matching between the scalp surface extracted from the MRI and digitized electrode positions. A three-compartment (skin, skull, and brain) pseudorealistic boundary element forward solution was constructed on the basis of a standard MRI (MNI152 linear template). Scalp EEG time series were projected into brain source space by applying a time-domain spatial filtering algorithm (linearly constrained minimum variance; Van Veen et al., 1997). This algorithm uses the covariance matrix of the EEG data to construct a spatial filter for a given location (voxel). These spatial filters were estimated for each participant on the basis of all trials and for the 1000 ms interval after stimulus onset. Subsequently, these filters were applied to the data to estimate the time series for each of the 11 network node locations previously reported as a network exhibiting greater hemodynamic coupling during high demands for inhibitory control (Spielberg et al., 2015). The other 237 participants completed the same protocol in the same laboratory at another time, providing a large replication sample. MNI coordinates (x y z) and labels for these locations were as follows: right posterior middle frontal gyrus (MFG)/precentral, (44 10 40); right anterior insula, (38 19 6); right IFS, (47 35 20); right posterior middle temporal gyrus/superior temporal gyrus, (57 −24 −3); left posterior MFG/precentral, (−48 8 37); left inferior frontal gyrus (IFG) pars opercularis, (−52 21 18); right IFG pars opercularis, (53 14 20); medial precuneus, (1 −61 55); right amygdala/hippocampus, (23 −10 −15); left posterior superior frontal gyrus (SFG), (−17 14 65); and medial dACC, (0 26 34).
GC analysis.
Spectrally resolved GC analysis (Granger, 1969; Ding et al., 2006) was used to identify potential directionality in internodal communication. Briefly, GC analyses represent the result of a comparison between two models: a univariate autoregressive model, in which the future behavior of time series x is predicted from past values of time series x, and a bivariate autoregressive model, in which the prospective values of x are predicted from the past values of time series x and another time series y. Initially formulated in the time domain, GC can also be estimated in the frequency domain (Kamiński et al., 2001; Chen et al., 2006; Bastos and Schoffelen, 2015; for detailed review, see Bastos and Schoffelen, 2015), which requires the estimation of the spectral transfer matrix between a set of signals. The total interdependence fx,y(f) between two signals x and y is computed by their power spectral functions Sxx(f) and Syy(f) and the corresponding crossspectral function Sxx(f) as follows: The natural logarithm is denoted by ln and S(f) = Sxx(f)Syy(f) − Sxy(f)Syx(f) using the coherence Cxy(f) between the two signals defined as follows: the total interdependence can be expressed as follows: Subsequently, the total interdependence can be decomposed into three causality terms as follows: The first two terms denote the directed influence (over time) from x to y and y to x, respectively, and the third term denotes the instantaneous dependence. The latter is often a consequence of, for example, common input from a third signal, nonlinear interactions between x and y, and shared noise.
The time series were resegmented to include 500–1500 ms after stimulus onset to circumvent the presence of transient evoked activity at stimulus onset. For each trial, the Fourier coefficients were computed for the entire spectrum up to the Nyquist frequency after padding the data with zeros for 3 s at the beginning and the end of each trial to provide improved frequency resolution. A nonparametric matrix factorization of the crossspectral density matrix (Wilson, 1972; Dhamala et al., 2008) was applied to estimate the spectral transfer matrix.
It has been demonstrated that: and where Σ2 and Γ2 are the variance of the noise term in the bivariate model of signal x and y, H̃xx is the spectral transfer function of the signal x, and * is the conjugate (Ding et al., 2006). Therefore, the instantaneous causality is represented by the difference in total interdependence of x and y and the sum of the causality terms fx→y and fy→x. For a given pair of locations, GC was computed conditional on the presence of the remaining locations similar to the procedures applied previously (Wen et al., 2013; Bastos et al., 2015). Significant differences in GC between locations were determined on the basis of nonparametric cluster permutation tests (see next section). In addition, GC influences were computed twice: on the original and on the time-reversed time series. The latter strategy, proposed previously (Haufe et al., 2013; Winkler et al., 2015), accounts for the presence of so-called weak asymmetries as a possible interpretational confound, contributing to an apparent dominant directional drive between two areas. Briefly, as opposed to strong asymmetries, which are caused by actual time-lagged relations between signals, weak asymmetries are the consequence of differences in univariate signal properties (e.g., the local signal-to-noise ratio). Time reversal of the signals does not affect these univariate signal properties, whereas it should reverse the dominant direction of interaction. In other words, if the direction of the GC asymmetry is unchanged after the time reversal, then the GC is most likely artificial, whereas a reversal of the direction of the asymmetry implies a true time-lagged, and thus directed, relationship. Original versus time-reversed GC analysis outputs were compared statistically using the nonparametric permutation technique described above.
Statistical analysis.
Statistical evaluation of the neuronal data from the sample described above was performed by a cluster-based approach based on permutations (Maris and Oostenveld, 2007). This approach identifies clusters of activity on the basis of whether the null hypothesis can be rejected while controlling for multiple comparisons. The test statistic for comparing condition A and condition B was built on the basis of relative changes defined as follows: test statistic = [A − B]/[(A + B)]. Behavioral performance (RT) was expressed as a difference score, ΔRT = RTIC − RTC. Relationships between neuronal and behavioral data were assessed by means of Spearman's rank-order correlations (ρ). Effect sizes and exact p-values are reported when appropriate (i.e., parametric statistical tests).
Classification analysis.
Classification with random forest models (Breiman, 2001) was used to distinguish the predictive strength of different variables, such as network characteristics and RT, on inhibitory control demands.
The random forest approach is able to handle highly correlated features notably well (Cutler et al., 2009). This specialty makes random forests well suited for the analysis of MEEG data, in which highly correlated variables are an often encountered problem (Lehmann et al., 2007; Donos et al., 2015; Westner et al., 2018). Furthermore, it provides a ranking of the informational value that each predictor had for the model (Breiman et al., 1984; Cutler et al., 2009), thereby enabling the direct comparison of predictor variables.
The random forest algorithm is an ensemble method aggregating the predictions of a large number of decision-tree classifiers (Breiman et al., 1984). Each decision tree partitions the data by adopting binary splits, aiming to reduce the impurity in the resulting subsets; that is, as many observations of a class as possible should be concentrated in one subset. To yield stable results even with outliers or highly correlated predictor variables, the random forest model introduces randomness at two levels. First, each tree in the ensemble is fit on a bootstrap sample of the data. Second, at each split within a tree, only a random subset of all predictors is searched to find the best predictor and split point to minimize the impurity in the resulting subsets. The high variance among a large number of such classification trees converges to a robust prediction when aggregating them across the ensemble.
Random forest classification was done using the scikit-learn toolbox (Pedregosa et al., 2011) in Python. Every model incorporated 5000 trees and considered randomly drawn predictors at each split. Classifier performance was evaluated by using prediction accuracy within a fivefold cross-validation framework. Cross-validation prevents biased accuracy estimates because the model is always tested on data it was not trained on. Accuracy estimates were tested against chance level using a binomial test (Combrisson and Jerbi, 2015). To obtain a measure of the relative importance of each variable within the classification model, the Gini index was computed (Breiman et al., 1984; Cutler et al., 2009). This measure is based on the impurity reduction generated across all trees by a predictor and reflects the importance of this variable for the classifier's prediction.
The first classification model was fit to predict congruent and incongruent class membership using theta-power values at network nodes as predictors. For each participant, baseline-adjusted theta power (averaged over 4–8 Hz and 500–800 ms) was averaged within conditions and these averages were provided to the classifier, yielding 666 observations in total (333 participants × 2 conditions).
In addition to the theta-power model, random forest classifiers were computed using baseline-adjusted beta power (averaged over 15–25 Hz and 800–1200 ms), RT, and all possible combinations of these entities as predictors.
This approach was chosen over linear correlation analyses to compare different models (theta, beta, beta + RT, etc.). These comparisons are difficult to achieve via pure correlations between predictors. Moreover, linear models are not suitable for the aforementioned comparison because of the highly correlated nature of the predictors of interest.
Results
As expected, RT was longer on INC trials, with higher demand for inhibitory control, than on CON and NEU trials (M/SD INC = 831.5/146.5, CON = 636.8/103.2, NEU = 645.2/92.5; F(2,996) = 297.7, p < 0.001, Huynh–Feldt ε = 0.7; INC vs NEU t(332) = 33.1, p < 0.001, Cohen's d = |1.52|; INC vs CON t(332) = 32.3, p < 0.001, Cohen's d = |1.53|; CON vs NEU t(332) = −2.9, p < 0.003, Cohen's d = |0.086|).
Global network configuration of inhibitory control is dominated by theta and alpha/beta neural oscillations
Using the same participants (n = 96) and a replication sample of 237 participants (see Materials and Methods) to evaluate the oscillatory dynamics in the cerebral subnetwork (Fig. 1A) identified via fMRI (Spielberg et al., 2015), EEG data (same task recorded in a separate session; see Silton et al., 2010) were back-projected to the brain locations identified in that network. Time–frequency analysis of each EEG source revealed reliable modulations of theta (4–7 Hz) and alpha/beta (10–30 Hz) oscillations (Fig. 1B), which is consistent with previous reports (Hanslmayr et al., 2008; Cavanagh and Frank, 2014; Cohen, 2014; Popov et al., 2015). Theta activity was higher in the INC condition than in the CON condition, starting ∼300 to 400 ms after word onset and lasting several hundred milliseconds. This increase was accompanied by a condition-specific (INC > CON) decrease in alpha/beta activity starting around 500 ms after word onset and lasting for ∼1000 ms. Whole-brain source analysis revealed that theta generators were predominantly confined to frontal regions (Fig. 2A), whereas beta activity was most pronounced in parietal areas (Fig. 2B).
Inhibitory demands drive theta and alpha/beta oscillations
Initial contrasts were performed with the n = 96 sample reported previously (Spielberg et al., 2015) and shown in the top row of Figure 3. Figure 3A illustrates significant (INC − CON) condition differences in alpha/beta power (p < 0.01, nonparametric cluster permutation approach, see Materials and Methods). As predicted, higher demand for inhibitory control was associated with an alpha/beta power decrease from prestimulus baseline. This decrease was inversely related to behavioral performance (Fig. 3B): slower responses were related to greater decrease in alpha/beta power (Fig. 2C). The n = 237 sample replicated these effects (bottom row of Fig. 3). This much larger sample also showed more theta power in the high-demand INC condition (p < 0.01) (this effect was evident in the first sample but did not reach significance).
In summary, high demand for inhibitory control was related to reliable changes in theta and alpha/beta power. The latter in turn related to slower responses. The general consistency across the panels of Figure 1B suggests that power changes were distributed throughout the entire network, precluding inferences about internodal communication from these initial analyses.
SFG and dACC are primary nodes of prefrontal inhibitory control
Internodal communication was assessed in the combined sample via spectrally resolved GC analysis (see Materials and Methods). Conditional GC was computed on original and reversed time series (Chen et al., 2006; Wen et al., 2013). Robust connectivity relationships are illustrated in Figure 4A. There were directionally asymmetric connections, especially in the theta range between SFG and dACC and between SFG and IFG, revealing a strong top-down influence of SFG over both dACC and IFG (GC spectra, p < 0.001; Fig. 4B). Moreover, there was an exchange between frontal and parietal brain areas from precuneus to IFG and MFG in low theta-alpha frequency range (7–10 Hz; p < 0.01, nonparametric cluster permutation; Fig. 4B, middle). Finally, dACC exerted control over MFG at a range of frequencies, although this was particularly pronounced in the theta range (p < 0.001; Fig. 4B, right). These corticocortical relationships were reversed after time reversal of the original time series (Fig. 4C), which confirms that the reported lagged relationships between any two brain regions likely reflect unbiased directed effective influence of A on B beyond mere correlation. In summary, there was a dynamic exchange within frontal and between frontal and parietal brain areas: prefrontal control manifested in theta and parietal to frontal communication evident at higher frequencies overlapping with common posterior alpha oscillations (8–13 Hz).
Inhibitory control is best predicted by combination of brain and behavioral variables
The analyses presented above indicate that theta and beta oscillations reliably differed in high- and low-demand conditions and revealed effective connectivity patterns within the inhibitory control network. These results raise the question of the extent to which theta and beta power dynamics reflect high versus low demands for inhibitory control and perhaps mediate effects of these demands on overt performance.
The random forest machine learning algorithm was used to evaluate the predictive strength of theta, beta, and behavioral performance and their combinations for predicting the condition membership (INC or CON) of averaged data at the single-subject level. Random forest is an ensemble technique that aggregates the results of many decision trees into one classifier. These trees partition the data by implementing binary splits (Breiman, 2001). Every decision tree is built on random samples of observations and predictors. This ensemble approach makes random forest a robust technique able to handle data with highly correlated predictors particularly well. Figure 5A provides the classification results. Condition could be predicted above chance on the basis of any subset of the three variables, with all three in combination achieving 77% accuracy. Figure 1B shows that both theta and beta oscillatory activity were distributed across the entire network, with some variability in amplitude. The information value of a particular node for prediction of inhibitory control demands was estimated by random forest models (Fig. 4B). Estimates of theta oscillatory activity derived from SFG were most important for accurate classification of condition membership (Fig. 5B, left). In the beta power model, parietal oscillations (precuneus; Fig. 5B, right) were the most important.
Figure 5A suggested an interesting pattern: although theta by itself provided the lowest accuracy, the combination of theta and RT had higher classification accuracy than the combination of beta and RT. This suggests more variance shared by beta and RT. This post hoc hypothesis was addressed by first extracting the envelope of beta and theta activity as a function of time for each participant. These individual time courses were sorted according to mean RT across trials of each participant. The shift from beta decrease to increase coincident with button press in Figure 5C suggests a close coupling of beta activity immediately preceding button press, whereas the time courses of theta and button press were more loosely associated, consistent with more variance shared by beta activity and RT.
In summary, the application of predictive analytics to neuroimaging data revealed the following: (1) neither behavioral nor brain-derived variables, but rather, a combination of both is the best predictor of cognitive state; (2) SFG and not dACC theta provides numerically higher feature importance during classification of low versus high demand for inhibitory control; and (3) posterior beta has the highest classification value and reliably tracks the onset of a behavioral outcome (RT).
Discussion
The present study evaluated the effective connectivity within a previously reported (Spielberg et al., 2015) network of inhibitory control during a color–word Stroop task. In a sample of 333 human volunteers, network nodes exhibiting greater interconnectivity during periods of higher demands for inhibitory control (Spielberg et al., 2015) are dominated by oscillatory activity in the beta- and theta-frequency bands (Fig. 1). A broad frequency range around beta was related to RT (Fig. 3), whereas internodal effective connectivity was most pronounced around theta (Fig. 4), although the full dissociation was not tested. Participant brain state during conditions with high versus low demand for inhibitory control can be predicted by incorporating knowledge of both brain and behavior better than either alone (Fig. 5). Frontal (SFG) and parietal (precuneus) nodes were found to be most informative for inferences about inhibitory control, underscoring the functional specificity of network nodes in time, frequency, and space.
Spectrotemporal dynamics of the inhibitory control network
The present study used the color–word Stroop task to evaluate the temporal course of critical nodes associated with inhibitory control processes. In addition to recruiting top-down inhibitory control mechanisms, the task required motor output (button press) to respond to the color of the word while ignoring the meaning of the word. Therefore, sensorimotor regions (e.g., premotor, primary motor, supplementary motor, and somatosensory cortices) were presumably recruited after conflict resolution and response selection to implement a behavioral response. Based on a large body of evidence (Cavanagh and Frank, 2014), we hypothesized that inhibitory control demands would be reflected in an increase in theta (4–8 Hz) oscillatory activity (Chmielewski et al., 2016), whereas behavioral performance (RT) would be predominantly associated with power modulations in alpha/beta (10–30 Hz) activity (Engel and Fries, 2010).
Present analyses (Figs. 1, 2) are consistent with the literature linking theta oscillatory activity to inhibitory control (Cohen et al., 2008; Cavanagh et al., 2009; Hanslmayr et al., 2012; Cohen and Donner, 2013; Pastötter et al., 2013; Cavanagh and Frank, 2014; Cohen, 2014; Oehrn et al., 2014; Zavala et al., 2014; Chmielewski et al., 2016) and beta activity to conflict resolution (Fan et al., 2007; Cohen et al., 2008; Engel and Fries, 2010; Brittain et al., 2012; Pastötter et al., 2013). Replicating previous findings, theta activity was greater during higher inhibitory demand. This activity was spatially distributed, involving both frontal and parietal brain regions and not confined to a particular node (e.g., dACC). Paralleling theta findings, power decrease in the alpha/beta range was greater during higher inhibitory demand and had similar spatial distribution because both were reliably present in each of the nodes displayed in the network in Figure 1B. In addition, a larger condition difference in alpha/beta power decrease was correlated with a larger RT condition difference. Typically, power decreases in the alpha/beta range are considered reflective of efficient cognitive performance (Hanslmayr et al., 2012; Frey et al., 2015), in which “efficiency” is typically operationalized behaviorally by higher accuracy and/or response speed (i.e., shorter RT). Present results extend these findings by demonstrating that RT could be a potentially misleading parameter in the context of inhibitory control tasks. Rather than response speed, reduction of alpha/beta activity was a better index of task engagement and effort. Despite being characterized by a clear midfrontal (theta) and posterior (alpha/beta) topography, both rhythms were evident throughout the entire network. The present study aimed to identify whether these rhythms fostered effective internodal information exchange.
Effective network communication: a mechanism of upregulation of the desired response
A number of connections exhibited significant effective connectivity even after controlling for spurious connections (see Materials and Methods). Theta evidenced a directed relationship from SFG to dACC and IFG (Fig. 4), whereas left hemisphere MFG was under dACC control via theta. These findings provide a novel view of the role of dACC in inhibitory control: that theta carries the information fed forward from SFG to dACC, perhaps communicating information about the interference itself; for example, the mismatch between the task-relevant verbal content and the interfering color. dACC appears to signal the need for maintenance of verbal information to left MFG, fostering response accuracy at the expense of speed. It is worth noting that communication within the frontal portion of this inhibitory control network was dominated primarily by theta activity despite the presence of alpha/beta activity. A directed parietofrontal influence was evident from precuneus to the middle and inferior portions of the frontal gyrus at theta frequencies (Fig. 4B, middle).
A potential alternative explanation for the present corticocortical interactions could be prolonged motor inhibition. A control analysis subsampling trials in INC and CON conditions to eliminate the difference in RT yielded largely similar corticocortical interactions, evidence against that nonspecific explanation. The present results are consistent with a recent proposal based on animal and human evidence arguing that cerebral communication feedback operates at alpha/beta frequencies, whereas feedforward exchange is propagated via theta- and gamma-frequency ranges (Bastos et al., 2015). According to this view, sensory information regarding both task-relevant and interfering information is initially propagated from parietal to frontal cortex. Within the frontal module, information about the desired response is actively propagated within a loop comprising SFG-dACC and MFG. GC analyses were performed for the entire spectrum from 0 to the Nyquist frequency of 125 Hz and no effects were observed for frequencies above 40 Hz.
At a behavioral level, this processing is achieved at the expense of RT, in support of theories highlighting the selection and upregulation of the desired response rather than mere suppression of interfering input (Munakata et al., 2011). The present findings in the alpha/beta range provide further evidence for this view. First, accurate yet slower responses during high inhibitory demand were associated with greater decrease of alpha/beta activity. Although not part of the model evaluated here, it is well documented that decreases in alpha/beta activity before motor output originate mainly from primary motor regions (Davis et al., 2012; Pape and Siegel, 2016). Second, this motor cortex beta modulation is prolonged (Fig. 5C), nicely tracking the behavioral outcome (RT) on a subject-by-subject basis. Therefore, in addition to theta-mediated selection within the frontal subcircuit, a further upregulation of the desired response is reflected in deliberate, accuracy-directed, motor activation manifested in alpha/beta activity.
Prediction of inhibitory control demands based on neural oscillations
A further focus of the present study was the predictability of inhibitory demand on the basis of neural activity. Such a research question is often hampered by low sample size, but the large sample size of the present study enabled state-of-the-art predictive analyses. Both theta and beta activity were informative enough to classify high versus low levels of inhibitory demand above chance (Fig. 5). Classification performance was lowest for theta followed by beta and reached almost the level of RT alone when the two bands were combined. However, although beta-based classification was better than theta-based classification, the combination of beta and RT was less accurate than the combination of theta and RT (Fig. 5A). This frequency-dependent classification specificity is further supported by the fact that theta in SFG and alpha/beta in precuneus were the most important brain regions for accurate classification. Therefore, combining three (theta, beta, and RT) out of, in principle, an infinite number of psychological and biological variables that can be associated with a given individual allows for accurate prediction of need for inhibitory control in at least three out of four trial averages (Fig. 5A).
Conclusions
In the present study, a large sample of healthy human participants facilitated identification of mechanisms within a recently identified brain network supporting inhibitory control. Combining two imaging modalities, fMRI and EEG, led to several conclusions. Information flow during demand for inhibitory control is performed primarily by theta and alpha/beta oscillations. Theta activity establishes directed frontoparietal communication. Extending common views of the role of dACC as a key structure in inhibitory control, the present results indicate that dACC is not simply a control region; it exerts control, yet is itself subject to control within a prefrontal subcircuit connecting with SFG and directing information to MFG. Inhibitory control thus involves a dynamic routing process during which the desired response is upregulated via theta-mediated effective connectivity within key nodes of the PFC and beta-mediated motor preparation. The present findings provide mechanistic insights into the biological implementation of a psychological phenomenon.
Footnotes
This work was supported by the National Institute of Mental Health–National Institutes of Health (Grants P50 MH079485, R01 MH61358, and R21 DA14111) and the Deutsche Forschungsgemeinschaft (Grant Ro805/17).
The authors declare no competing financial interests.
- Correspondence should be addressed to Tzvetan Popov, University of Konstanz, PF 905, 78464 Konstanz, Germany. tzvetan.popov{at}uni-konstanz.de