Abstract
The ability to stop motor responses depends critically on the right inferior frontal cortex (IFC) and also engages a midbrain region consistent with the subthalamic nucleus (STN). Here we used diffusion-weighted imaging (DWI) tractography to show that the IFC and the STN region are connected via a white matter tract, which could underlie a “hyperdirect” pathway for basal ganglia control. Using a novel method of “triangulation” analysis of tractography data, we also found that both the IFC and the STN region are connected with the presupplementary motor area (preSMA). We hypothesized that the preSMA could play a conflict detection/resolution role within a network between the preSMA, the IFC, and the STN region. A second experiment tested this idea with functional magnetic resonance imaging (fMRI) using a conditional stop-signal paradigm, enabling examination of behavioral and neural signatures of conflict-induced slowing. The preSMA, IFC, and STN region were significantly activated the greater the conflict-induced slowing. Activation corresponded strongly with spatial foci predicted by the DWI tract analysis, as well as with foci activated by complete response inhibition. The results illustrate how tractography can reveal connections that are verifiable with fMRI. The results also demonstrate a three-way functional–anatomical network in the right hemisphere that could either brake or completely stop responses.
- cognitive control
- inferior frontal cortex
- presupplementary motor area
- subthalamic nucleus
- tractography
- stop signal
Introduction
A fundamental question is how thoughts and actions are controlled. It is likely that the prefrontal cortex is the source of control, exerting its effect by biasing posterior cortical and subcortical structures (Fuster, 1997; Knight et al., 1999; Miller and Cohen, 2001; Miller and D'Esposito, 2005). Here we investigated whether specific frontal and subcortical regions, known to relate to cognitive control, could be directly connected via white matter tracts.
Our tract analysis focused on brain regions important for the rapid countermanding of initiated motor responses, as revealed by the stop-signal paradigm (Logan and Cowan, 1984). On Go trials, subjects respond as fast as they can to an imperative stimulus. On Stop trials, a stop signal is presented at some delay after the Go signal, and subjects try to inhibit their response. Functional magnetic resonance imaging (fMRI) has shown that going activates a premotor–striatal–pallidal–motorcortical circuit (Aron and Poldrack, 2006), whereas stopping activates prefrontal regions, especially the right inferior frontal cortex (IFC) (Rubia et al., 2003; Aron and Poldrack, 2006; Pliszka et al., 2006; Chevrier et al., 2007). These findings are supported by neuropsychological studies indicating that the right IFC is critical for stopping (Aron et al., 2003; Chambers et al., 2006). Neurophysiological, lesion, and imaging studies also implicate the right IFC in stopping in the related Go/NoGo paradigm (for review, see Aron et al., 2004; Buchsbaum et al., 2005; Garavan et al., 2006; Sakagami et al., 2006)
The right IFC, or some other frontal region, may block the execution of the Go response via the basal ganglia. Both patients and experimental animals with basal ganglia damage have trouble stopping (Eagle and Robbins, 2003; Rieger et al., 2003). In particular, a likely role for the subthalamic nucleus (STN) in response inhibition has been revealed by deep brain stimulation in human patients (van den Wildenberg et al., 2006), lesions in rodents (Eagle et al., 2004), local field potential changes (Kuhn et al., 2004), and high-resolution fMRI (Aron and Poldrack, 2006). Activation of the STN may lead to response suppression insofar as the STN projects to the globus pallidus (Mink, 1996), which subsequently may inhibit the thalamo-motorcortical program initiated by the Go response (Coxon et al., 2006).
This model of frontal control over the STN needs an account of the connecting circuitry. Here we used diffusion-weighted imaging (DWI) tractography in humans to examine whether the right IFC projects directly to the ipsilateral STN region, via a so-called “hyperdirect” pathway, as suggested by nonhuman tract tracing (Nambu et al., 1997). We also developed a novel “triangulation” method to examine whether these two regions project to others. We found that they did overlap at another (third) focus, in the presupplementary motor area (preSMA). Identifying a new node in this way predicts the locus of fMRI activation for a task that engages the network functionally. We hypothesized that the preSMA could play a conflict-detection or monitoring role within the overall network, and, if so, the locus of preSMA activation would correspond with the focus predicted by DWI analysis. To evaluate this, we report a second experiment with fMRI. We used a conditional stop-signal test, yielding behavioral and neural signatures of conflict-induced engagement of the posited network.
Materials and Methods
Experiment 1: DWI
Subjects.
Ten healthy young subjects participated (six males; age range, 20–28 years). All subjects gave informed written consent in accordance with ethical approval from the Oxford Research Ethics Committee.
DWI acquisition.
DWI was performed using a 1.5T Siemens (Erlangen, Germany) Sonata at a resolution of 2 × 2 × 2 mm, with 15 repeats of the b = 0 (no diffusion weighting image) and 3 repeats of each of 60 gradient directions at b = 1000 [repetition time (TR), 9 s; echo time (TE), 89 ms]. Diffusion data were corrected for eddy currents and head motion by using affine registration to the first b = 0 reference volume (Jenkinson et al., 2002). Data from the three acquisitions of each diffusion direction were averaged to improve the signal-to-noise ratio.
DWI analysis overview.
DWI analysis consisted of several steps for each individual subject: calculating probability distributions on fiber direction at each voxel, running probabilistic tractography to identify a possible tract between seed regions, and thresholding the result to remove noise. Then, we used a simple metric to determine whether sufficient evidence existed for the posited tract at the group level. We verified our results using the novel triangulation method.
DWI analysis in detail.
We calculated probability distributions on fiber direction at each voxel using previously described methods (Behrens et al., 2003). Then, we ran probabilistic tractography with a model allowing for crossing fibers, using seed points defined in standard [Montreal Neurological Institute (MNI)] space (Behrens, 2006). We used seed regions for the right IFC and putative STN [sphere of 3 mm radius centered on voxel 42, 26, 14 for the IFC and 10, −14, −4 for the STN region, based on a previous fMRI study of stopping (Aron and Poldrack, 2006)]. Although our previous study had used high-resolution fMRI to localize stopping activation to a midbrain hypointense region corresponding to the STN (Aron and Poldrack, 2006), we note that limitations in resolution mean that such fMRI studies cannot unequivocally locate activation in the STN itself [e.g., the substantia nigra and red nucleus are close nearby (Dormont et al., 2004)]. Thus, although circuitry considerations (Mink, 1996) as well as adjunct evidence (Eagle et al., 2004; Kuhn et al., 2004; van den Wildenberg et al., 2006) strongly suggest that the STN itself is recruited by response inhibition, we nevertheless refer hereafter to the “STN region.”
Probabilistic tractography proceeded at the single-subject level to identify a putative IFC–STN tract. For each subject, 5000 tract-following samples were initiated, resulting in a probabilistic map of connectivity. Tracts were only accepted if they passed through both seed areas. The resulting map for each subject was then thresholded at a low level to remove noise (this was 0.02% of the maximum value in the connectivity map, which was a value of 10, in arbitrary units, for IFC and STN regions).
A first analysis at the group level consisted in binarizing the thresholded maps, averaging them, thresholding the result at 0.5, and inspecting it to examine whether a continuous tract was present (i.e., if at least 5 of the 10 subjects had a continuous tract). However, this “maximum overlap” method is overly conservative because it can only show the maximum number of subjects who have a connecting tract between two regions that overlaps in the same point in standard space. Instead, we inspected the individual thresholded subject maps one by one to determine whether a continuous tract existed in each individual, and we then reported the number of subjects (of 10) who met this criterion. This “individual inspection” method is capable of showing the number of subjects who have a continuous tract between two regions, regardless of the standard space location of the trajectory.
We used a novel triangulation method, which proceeded at the single-subject level. We generated tract maps (with 5000 samples, as described above), seeded in the STN region with no target, and seeded in the IFC with no target. Each map was thresholded at 0.02% maximum value (as described above) and binarized. The maps were then multiplied, to show overlap between tracts, and the average overlap map was produced at the group level. This group-level map was thresholded at 0.5 (i.e., at least 5 of 10 subjects had overlap at a given point) and masked using a gray matter standard space mask to identify regions of overlap that were not (trivially) located in white matter.
Experiment 2: fMRI with the conditional stop-signal paradigm
Subjects.
Fifteen right-handed, healthy English-speaking subjects (10 males; mean age, 28.1 ± 4.1 SD) participated in the fMRI study. All subjects were free of neurological or psychiatric history and gave informed consent according to a University of California Los Angeles (UCLA) Institutional Review Board protocol. All 15 subjects had also performed the standard stop-signal paradigm within the same scanning session. We administered conditional and standard paradigms in an approximately counterbalanced order. Behavioral and imaging data for the standard paradigm were reported previously for 13 of these subjects (Aron and Poldrack, 2006).
The conditional stop-signal task.
The stop-signal paradigm consists of a Go task and a Stop task. On each trial, a left- or right-pointing arrow stimulus was displayed on a computer screen. For the Go task, the subject responded as fast as possible with a left or right key press (using index and middle fingers of the right hand). For the Stop task (25% of trials), the subject attempted to stop his/her response when a stop signal was sounded at a particular stop-signal delay (SSD) subsequent to the arrow stimulus, but only if the arrow was pointing in the critical direction: for 8/15 of the subjects, this was leftward pointing; for 7/15 of the subjects, this was rightward pointing (see procedure below). There were 32 Stop and 96 Go trials per scan (128 trials total). In every four trials, there was one Stop trial and three Go trials, and the number of leftward- and rightward-pointing arrows was equal. The SSD value for the Stop trial was sampled from one of the four staircases in turn (see below). Null events were interposed between every Stop or Go trial. The duration of null time ranged between 0.5 and 4 s (mean, 1 s; sampled from an exponential distribution truncated at 4 s). We used a custom Matlab code to select sequences of Go, Stop, and null events and to select the distribution of null time in a way that optimized the detection of hemodynamic responses for the critical contrast of Stop and Go events (cf. Liu et al., 2001).
A Go trial started with a white circular fixation ring (subtending 4.3 × 6.2° when viewed with scanner goggles) appearing in the center of a black background screen. This was followed after 500 ms by a white arrow within the fixation ring. The fixation ring and arrow remained on the screen for up to 1 s (limited hold), after which they disappeared, and the background screen was shown for the null period. When the subject responded within the limited hold window, the fixation ring and arrow disappeared, and the background was shown for the remainder of the limited hold and for the null period. If the subject pressed the wrong key in response to the arrow, the computer recorded it as an erroneous response. A Stop trial was identical to a Go trial, except that a tone (900 Hz; duration, 500 ms) was played at some SSD after the arrow stimulus. If the subjects inhibited their response, then the arrow and fixation ring remained onscreen for the duration of the limited hold. If the subject responded, then the arrow and fixation ring disappeared, and the background screen was displayed for the remainder of the time. The SSD changed dynamically throughout the experiment, depending on the subject's behavior with respect to the critical key. If the subject inhibited successfully on a Stop trial, then inhibition was made more difficult on a subsequent Stop trial by increasing the SSD by 50 ms; if the subject did not successfully inhibit, then inhibition was made easier by decreasing the SSD by 50 ms. Four step-up and step-down algorithms (staircases) were used in this way to ensure convergence to P(inhibit) of 50% by the end of the experiment. The four staircases started with SSD values of 100, 150, 200, and 250 ms, respectively. In each block/scan of trials, there were 96 Go trials and 32 Stop trials, with 16 of these Stop trials for the critical direction. Hence, each staircase moved four times within each block for these critical direction trials. For Stop trials with the noncritical direction, SSDs were yoked to the critical direction values. The staircases were independent but were randomly interleaved (i.e., each particular Stop trial belonged to one particular staircase, but the order of the staircases was random trial by trial).
Conditional stop-signal procedure.
Before scanning, the subjects were trained to perform Go and Stop trials. We made clear to the subjects that stopping and going were equally important and that it would not always be possible to stop. Within the scanner, subjects responded with the index and middle fingers of their right hands on an MR-compatible button box. For eight subjects, the instruction was to only inhibit an index finger response if the stop signal occurred. For the other seven subjects, the critical finger was the middle one. Stop tones were played through headphones at a level that was both sufficient to exceed scanner noise and comfortable to the subject. Each scan was preceded by an instruction screen and was followed by feedback in the form of median-correct reaction time (RT) and the number of discrimination errors on the Go trials.
Behavioral analysis.
Several measures were computed for each subject: mean Go RT for both critical and noncritical trials; percentage of Stop trials with inhibition (StopInhibit) for the critical direction; number of Stop trials with inhibition for the noncritical direction; and mean RT on Stop trials without inhibition (StopRespond) for both critical and noncritical directions. Our main comparison of interest was noncritical Stop trials versus noncritical Go trials, which indexes “conflict-induced slowing.” To better characterize the difference between these conditions, we also examined the SD of responses (within subjects) as well as the overall shape of the distributions (pooling observations for all subjects in each condition). In addition, we examined successful inhibition itself by applying the standard Race Model (Logan and Cowan, 1984) to compute the stop-signal reaction time (SSRT). Because the dynamic tracking of SSD (see above) was varied to yield ∼50% P(inhibit), we estimated SSRT by subtracting the average SSD from the mean correct Go RT (Logan and Cowan, 1984; Band et al., 2003). We computed the average SSD for each subject, using the values of the four staircases after the subject had converged on 50% P(inhibit). To give a stable SSD estimate, we averaged the values for the last six moves in each of the four staircases (see above). This proved to be highly reliable within subjects, as assessed by comparing the average of staircases 1 and 2 with staircases 3 and 4 (r = 0.81; p < 0.001).
fMRI data acquisition.
The experiment was run using a 3T Siemens Allegra MRI scanner at the UCLA Ahmanson-Lovelace Brain Mapping Center. Each scanning run acquired 166 functional T2*-weighted echoplanar images (EPIs) [4 mm slice thickness; 33 slices; TR, 2 s; TE, 30 ms; flip angle, 90°; matrix, 64 × 64; field of view (FOV), 200; in-plane resolution, 3.125 mm]. The first two volumes in each run were discarded to allow for T1 equilibrium effects. Additionally, for registration purposes, a T2-weighted matched-bandwidth high-resolution anatomical scan (same slice prescription as EPI) and magnetization-prepared rapid gradient echo (MP-RAGE) were acquired for each subject. The MP-RAGE acquisition parameters were as follows: TR, 2.3; TE, 2.1; FOV, 256; matrix, 192 × 192; sagittal plane; slice thickness, 1 mm; 160 slices. Stimulus presentation and the timing of all stimuli and response events were achieved using Matlab (Mathworks, Natick, MA) and the Psychtoolbox (www.psychtoolbox.org) on an Apple PowerBook running Mac OS 9.
fMRI preprocessing and registration.
Data were preprocessed using the Functional Magnetic Resonance Imaging of the Brain (FMRIB) Software Library (FSL) (www.fmrib.ox.ac.uk/fsl). The functional time series were realigned to compensate for small head movements (Jenkinson et al., 2002). Translational movement parameters never exceeded 1 voxel in any direction for any subject or scan. All images were denoised using MELODIC independent components analysis within the FSL (Poldrack et al., 2005). Data were spatially smoothed using a 5 mm full-width half-maximum Gaussian kernel. The data were filtered in the temporal domain using a nonlinear high-pass filter with a 66 s cutoff. We used a three-step registration procedure whereby we first registered the EPIs to the matched-bandwidth high-resolution scan, then to the MP-RAGE structural image, and finally into standard space by using 12-parameter affine transformation to the MNI152 template with the FSL FLIRT registration tool (Jenkinson and Smith, 2001).
fMRI modeling.
Three different models were fit, with slightly different regressors. These were as follows: (1) basic model: Go_critical, Go_noncritical, StopInhibit_critical, StopRespond_critical, StopRespond_noncritical, and a nuisance event consisting of Go trials in which subjects did not respond or made errors and/or of StopInhibit_noncritical trials; (2) parametric analysis: this consisted in the basic model plus two extra regressors, Go_noncritical_param and StopRespond_noncritical_param, which added RT for these trial types; and (3) verification analysis: this consisted of the basic model in which Go_noncritical and StopRespond_noncritical were split into three regressor types each: fast, medium, and slow according to a division of RT into three bins (with equal numbers of observations) for each subject.
fMRI statistical analysis.
For the basic model, for each subject, and for each of three scans, we computed two contrast images: StopInhibit_critical − Go_critical, and StopRespond_noncritical − Go_noncritical. For the parametric analysis, we computed the following contrast: StopRespond_noncritical_param − Go_noncritical_param. For the verification analysis, we computed the following: StopRespond_noncritical − null and Go_noncritical − null, all done separately for fast, medium, and slow RT conditions.
Analysis was performed using FEAT (FMRI Expert Analysis Tool) version 5.1, part of FSL. Higher-level analysis (one-sample t test) was performed using OLS (ordinary least squares simple mixed effects). Z (Gaussianized T/F) statistic images were thresholded using clusters determined by Z > 2.3 and a (corrected) cluster significance threshold of p = 0.05 [using Gaussian Random Field Theory (GRFT)].
A regression of SSRT against StopInhibit_critical − Go_critical activation was performed using a small-volume correction for anatomically defined regions of interest (ROIs) for the right IFC, preSMA, and the STN region. For this analysis, we corrected activation maps for multiple comparisons by using small-volume correction based on GRFT (Z > 2.3; corrected cluster threshold, p < 0.05).
ROIs were as follows: the right IFC consisted of combined pars opercularis and pars triangularis from the Automated Anatomical Labeling (AAL) template (Tzourio-Mazoyer et al., 2002); the right STN region consisted of a box of size of 10 × 10 × 10 mm, centered at MNI coordinates 10, −15, −5, which provides full STN coverage according to an atlas of subcortical structures (Lucerna et al., 2002); and the preSMA comprised the SMA (y > 0) region from the AAL template.
We performed a conjunction analysis (Nichols et al., 2005) for the parametric conflict-induced slowing effect and outright inhibition by taking the parametric contrast of StopRespond_noncritical − Go_noncritical (Z > 2.3; p < 0.05, corrected for multiple comparisons), binarizing it, and multiplying it by the StopInhibit − Go contrast (Z >2.3; p < 0.05, corrected for multiple comparisons; also binarized).
Results
Experiment 1: tractography analysis of a frontal–subcortical network
We discovered that a continuous tract between the right IFC and the right STN region was evident in 9 of 10 subjects after low-level thresholding to exclude noise (Fig. 1 A). Because there is currently no formal test to establish the statistical significance of a tractography pathway, we used a novel triangulation approach to increase confidence in our results. Given any two potential nodes in a network, this approach made predictions about other potential connected nodes. If a third node was found, then we repeated the triangulation analysis to validate the existence of the original nodes.
When we performed triangulation analysis with tracts originating in the IFC and in the STN region, there was only a single gray matter region where the tracts overlapped in >50% of subjects: This was a region of the superior frontal gyrus, consistent with the anterior preSMA (Fig. 1 B, top row). This indicates a putative network involving the preSMA, the IFC, and the STN region in the right hemisphere. To confirm this, we examined whether tracts existed between the preSMA and the STN region and between the preSMA and the IFC. We found a continuous tract between the preSMA and the STN region in 7 of 10 subjects and a continuous tract between the preSMA and the IFC in 8 of 10 subjects.
Next, we used the triangulation method to examine whether the STN region and the preSMA could predict the IFC region and whether the IFC and the preSMA could predict the STN region. When pathways seeded in the STN region and the preSMA were overlapped, there were two areas outside the seeds that survived thresholding: one of these was the IFC (Fig. 1 B, middle row), and the other was in the anterior prefrontal cortex (data not shown). When pathways seeded in the IFC and the preSMA were overlapped, the only gray matter region outside the seed areas to survive in 50% of subjects was one that included the STN region and the centromedial nucleus of the thalamus (Fig. 1 B, bottom row).
These results clearly establish a network in the right hemisphere including the preSMA, IFC, and STN region. This is unlikely an artifact of choice of IFC and putative STN seed regions because, in each case, independently choosing two seed regions showed overlap almost uniquely in the vicinity of the third. This DWI analysis also predicted the potential functional importance of the preSMA region, something we examined in experiment 2 using fMRI.
Experiment 2: functional imaging of conflict-induced slowing
The tractography analysis in experiment 1 established that the right IFC and STN are connected and, moreover, that they are both connected to the preSMA. Our finding of connectivity with the preSMA is striking because a recent study reported that patients with damage to the right superior medial frontal lobe had deficits in stop-signal response inhibition (Floden and Stuss, 2006). It is also noteworthy that microstimulation studies in human subjects have identified a “negative” motor region in the anterior SMA (preSMA) where stimulation produces hesitation, arrest, or control of manual or vocal responses (Penfield and Welch, 1951; Luders et al., 1988; Fried et al., 1991; Stuphorn and Schall, 2006; Isoda and Hikosaka, 2007).
Although the right IFC could play a role in implementing the inhibition at the neural level (Aron, 2007), via inputs to the STN, the role of the preSMA remains unclear. One possibility is that the preSMA detects/monitors/resolves conflict and then either generates control or else interacts with the control functions of the lateral prefrontal cortex, consistent with much research (Bush et al., 2000; Gehring and Knight, 2000; Schall et al., 2002; Botvinick et al., 2004; Ridderinkhof et al., 2004; Rushworth et al., 2004; Yeung et al., 2004; Isoda and Hikosaka, 2007). The hypothesis that a preSMA-based conflict function may interact with a response control system was also motivated by computational modeling work, which suggests that the same putative STN mechanism that supports outright response inhibition could be recruited to slow responses in high-conflict situations (Frank, 2006).
Although we did find robust preSMA activation in our previous fMRI study of standard stopping (Aron and Poldrack, 2006), it was not possible to test the conflict hypothesis because there was no behavioral index of conflict detection/monitoring/resolution. Therefore, to test the hypothesis that the preSMA plays a conflict role in a network including the IFC and the STN, we used a conditional version of the stop-signal paradigm (Logan, 1994; De Jong et al., 1995). For each subject, one finger was designated as “critical,” and another was designated as “noncritical” (Fig. 2). The instruction was to inhibit the response only if the stop signal occurred and the response involved the critical finger. If the response involved the noncritical finger and the stop signal occurred, then the subject was required to respond anyway. In this design, a noncritical Stop trial involves conflict between a salient stop signal and a rule that says to ignore this stop signal for this particular stimulus context.
Based on previous results (De Jong et al., 1995), we expected that responses on noncritical trials would not be canceled by the stop signal but that they would, nevertheless, be significantly slowed (relative to noncritical trials without a stop signal), something we refer to as the “conflict-induced slowing” effect. Furthermore, we hypothesized that if the slowing effect was generated by conflict detection/monitoring/resolution, then there would be significant activation of the preSMA region. Moreover, if the preSMA region performed a functional role within the circuit between the preSMA, the IFC, and the STN region, then there would also be activation for the IFC and the STN region, and critically, activation at these three foci would correspond to the locations predicted by the DWI tractography analysis.
Behavior
Behavioral data are reported in Table 1. There was a significant conflict-induced slowing effect of 41 ms, on average (StopRespond_noncritical, 416 ± 51 ms; Go_noncritical, 375 ± 44 ms; t (14) = 6.19; p < 0.001). Although the presence of stop signals on noncritical trials slowed responses, it almost never canceled them; on average, there was less than one such trial for each subject. To better characterize the difference between StopRespond and Go on noncritical trials, we examined the SD of responses as well as the shape of the underlying distributions. StopRespond RT was significantly more variable than Go RT (t (14) = 6.77; p < 0.001). This might suggest that subjects sometimes completely stopped their responses on such StopRespond trials, only to restart them again. This would predict that the shape of the underlying distributions would be different, with a longer tail for StopRespond than Go. However, this was not borne out when examining the shape of the underlying distributions, which was very similar (supplemental Fig. 1, available at www.jneurosci.org as supplemental material).
With respect to critical trials, subjects responded quickly (465 ± 85 ms) and accurately (making only 2.7 discrimination errors, on average). The dynamic adjustment of SSD converged on a probability of inhibition that was close to 50% (actual value, 54 ± 11.5%). Using the Race Model, SSRT was estimated at 266 ± 53 ms.
Functional imaging
Unless stated otherwise, all contrasts reported here were corrected for multiple comparisons at the whole-brain level by using the cluster-based GRFT (see Materials and Methods).
Conflict-induced slowing
To examine conflict-induced slowing, we contrasted noncritical trials with a stop signal (StopRespond) with noncritical trials that had no stop signal (Go). We found significant activation in the bilateral auditory cortex (the stop tone was presented binaurally), the right preSMA, IFC, the STN region, and in other regions (supplemental Table 1, available at www.jneurosci.org as supplemental material) (Fig. 3 A). If activation of the preSMA, the IFC, and the STN region is specifically related to conflict-induced slowing, activation of these regions may change in proportion to the degree of RT slowing. To address this, our parametric analysis compared whether activation on noncritical trials with a stop signal correlated with RT to a greater degree than activation on noncritical trials without a stop signal (see Materials and Methods). We noted a significantly greater correlation with RT for noncritical Stop versus Go trials in a mainly right-lateralized network that included the preSMA, the IFC, and the STN region (Fig. 3 B). A confirmatory analysis examined mean activation of these regions, anatomically defined (see Materials and Methods), for noncritical trials with and without stop signals across three quantiles of RT (fast, medium, and slow). Activation on stop signal trials increased significantly more with increasing RT than for trials without stop signals (interaction between trial type and RT level: F (2,28) = 11.1; p < 0.0001) (Fig. 3 C). We examined the spatial correspondence between activation of the preSMA, the IFC, and the STN region with the foci predicted by the DWI tractography by overlaying the tracts on the parametric image (Fig. 3 B). There was a strong correspondence in location (see Table 2 for coordinates). This validates the identification of new tracts between the preSMA and the IFC, the preSMA and the STN region, and the IFC and the STN region because the predictions about the nodes making up the network were verified by the functional activation pattern.
These results are consistent with a role for the preSMA in conflict monitoring/detection/resolution, as well as with the hypothesis that the STN could be recruited to slow responses, pending the resolution of conflict (Frank, 2006). Both the STN region and the IFC were significantly activated, moreover, in proportion to the degree of response slowing. This result suggests that the same system that outright inhibits responses could also be recruited to temporarily withhold them (i.e., to put on the brakes). If so, IFC activation and activation of the STN region should be evident for outright stopping, the locus of activation should correspond with activation for the IFC and the STN region with conflict-induced slowing, and the degree of activation at these foci should predict how fast subjects can stop themselves. We evaluated these predictions by examining outright inhibition for critical trials.
Comparing outright stopping with conflict-induced slowing
We examined the neural correlates of successful (outright) response inhibition for critical direction trials. We contrasted successfully stopped trials (StopInhibit_critical) with Go trials (Go_critical). This contrast controlled for the initiation of a motor response (which occurs in both cases) and looks at the effect of the stop process (triggered some 200 ms later, on average) (cf. Aron and Poldrack, 2006). There was significant activation of the bilateral auditory cortex (the stop tone was presented binaurally), the right STN region, the bilateral caudate, the right preSMA, the right parietal cortex, the right globus pallidus, the right middle frontal gyrus, and the right IFC extending from the insula into the pars opercularis/triangularis region (Brodmann's area 44/45) (see Fig. 4 A,B and supplemental Table 2, available at www.jneurosci.org as supplemental material). Although the STN is a small region, the locus of activation was very similar to standard stopping in our previous study using identical MRI parameters and the same scanner (Aron and Poldrack, 2006) (see supplemental Fig. 2, available at www.jneurosci.org as supplemental material, for a conjunction analysis). Moreover, in that previous study, we verified activation of the STN region using the standard stopping paradigm and a high-resolution imaging protocol.
We performed a conjunction analysis to examine whether the IFC, the STN, and the preSMA regions that were activated by outright response inhibition all correspond to those activated by conflict-induced slowing. The conjunction map reflected regions that were significantly activated for (parametric) conflict-induced slowing and for outright inhibition (see Materials and Methods). The preSMA, IFC, and STN regions were present in the conjunction image, and the location of activation corresponded strongly with the DWI tractography results (Fig. 3 D, Table 2).
To relate behavioral stopping to activation, SSRT was regressed against mean activation on critical trials (StopInhibit − Go) across subjects, separately for anatomically defined right IFC, right STN, and right preSMA regions (using a small-volume correction for multiple comparisons). There was no significant relationship between the preSMA activation and SSRT. However, at very similar foci to those we reported for standard stopping (Aron and Poldrack, 2006), those subjects who inhibited more quickly activated the IFC (MNI: 44, 22, 12; Z = 3.84; p = 0.0080, small-volume correction) and the STN region (MNI: 14, −8, −4; Z = 3.02; p = 0.049, small-volume correction) significantly more (Fig. 4 C). Also, mean activation in the IFC and the STN region was significantly correlated (r = 0.53; p = 0.04). Overall, these results confirm that functional activation at the right IFC and the STN region relates to the behavioral index of stopping.
Discussion
The results provide converging tractography and functional imaging evidence for a frontal–subcortical network for response control. DWI tractography demonstrated a three-way white matter network between the preSMA, the IFC, and the STN region in the right hemisphere. The conditional-stopping paradigm taxed this network functionally. The presentation of a stop signal on noncritical trials led to a slowing of RT (relative to trials with no such signal) and to a concomitant increase in activation of right-lateralized preSMA, the IFC, and the STN region. There was a strong correspondence between activation of this functional network and the nodes predicted by triangulation analysis of tractography data in different subjects. Moreover, this functional network overlapped heavily with that recruited by outright stopping. This suggests that conflict-induced slowing may operate by temporarily suppressing the motor system.
Our tract analysis was motivated by previous nonhuman primate tracing that had demonstrated a connection between the ventral premotor cortex and the STN (Nambu et al., 1997). Although we find evidence for a connection between the IFC and the STN region, this does not necessarily imply that the IFC (as implicated here) should be considered part of the human premotor cortex. As the primate tracing data show, there are multiple connections between the STN and ipsilateral sectors of the frontal cortex (Nambu et al., 2002).
By seeding the IFC and the STN-originating tracts in the triangulation analysis, we revealed a unique gray matter overlap in the preSMA (Fig. 1). This allowed us to assess putative tracts between the IFC and the preSMA and between the preSMA and the STN region. It also allowed us to repeat our triangulation analysis for each pair of regions. The IFC and the preSMA-originating tracts overlapped in the STN region as well as in the centromedial nucleus of thalamus. Overlap in the thalamus is expected (because the thalamus projects to multiple frontal regions), but this probably represents a different focus from the STN region. We note that the locus of the STN predicted by triangulation analysis corresponded very well with activation of the STN region by stopping (see also Aron and Poldrack, 2006) and that engagement of the STN by stopping is predicted by circuitry considerations (Mink, 1996) as well as adjunct evidence from other stop-signal and NoGo studies (Eagle et al., 2004; Kuhn et al., 2004; van den Wildenberg et al., 2006).
Tracts originating in the STN region and the preSMA also overlapped in the IFC as well as in an anterior prefrontal cortex focus. The finding of a tract between the preSMA and the STN region is supported by nonhuman tract tracing (Inase et al., 1999), whereas the finding of a preSMA–IFC tract is consistent with a previous DWI study in humans (Johansen-Berg et al., 2004). Importantly, the DWI triangulation analysis revealed a network that predicted the loci of activation in the fMRI experiment.
The conditional-stopping paradigm required subjects to hold in mind a conditional rule concerning what to do when a stop signal sounded depending whether they had initiated critical or noncritical responses (Fig. 2). As expected, although the stop signal was irrelevant on noncritical trials, subjects were significantly slowed by it (Table 1). One possible explanation is that the subject could always outright inhibit the response and then restart it if needed. However, this appears unlikely because the conflict-induced slowing effect was only 41 ms, on average, which is too quick for a response to be reinitiated. Another possibility is that responses are outright inhibited on just a few trials (producing the activation profile of response inhibition) and reinitiated on a few trials, which would produce the observed small average increase in StopRespond RT. If so, the shape of the distributions of noncritical StopRespond and Go trials would be different, which was not the case. Instead, the data are more consistent with a “braking” account. Conflict-induced slowing may be induced on noncritical Stop trials because of conflict between a salient stop signal and a rule that says to ignore this stop signal for this particular stimulus context. This could induce some degree of inhibition until the conflict is resolved. Presumably, conflict resolution requires retrieving the relevant conditional rule, which takes time. We found that the degree of such slowing predicted the degree of activation in a mainly right-hemisphere network including the preSMA, the IFC, and the STN region, even when controlling for a simple relationship between activation and RT (Fig. 3). This result suggests that the neural braking system is held on for longer, producing greater metabolic activity, when the conflict effect is greatest.
There was a striking overlap of activation in the preSMA, the IFC, and the STN region for conflict-induced slowing and outright response inhibition, verified by conjunction analysis (Fig. 3 D). This suggests that the braking system for conflict could recruit the same system by which responses are suppressed outright. Although the data reveal an underlying anatomic network between these three regions, they cannot speak to the necessity, causality, or directionality of the functional connections between regions. The functional projection between the preSMA and the STN region could be direct (via a putative preSMA–STN tract) or indirect (via a putative preSMA–IFC–STN route). But if conflict-induced slowing is generated via a preSMA–STN route, then it is unclear why the right IFC should be activated. It is unlikely that the role of the right IFC is merely to implement stop-signal processing or attentional functions. A recent study found robust right IFC activation for NoGo response inhibition even when controlling for oddball frequency (Chikazoe et al., 2007) (see also Heinen et al., 2006), and neurophysiological recording studies in the lateral prefrontal cortex have demonstrated neural responses specific to the meaning of the NoGo cue, rather than the stimulus properties (Sakagami et al., 2001). Based on diverse data, we have speculated that the role of the right IFC in the stop-signal paradigm is one of implementing an active mechanism of neural inhibition (Aron et al., 2004; Aron, 2007). The current finding of a direct tract between the right IFC and the STN region supports this account, as does the finding that those subjects who inhibited more quickly activated the right IFC and STN regions significantly more (Fig. 4). The loci showing the correlation between stopping activation and SSRT were almost identical to those found in our previous study with standard stopping (Aron and Poldrack, 2006), and they corresponded well with the nodes predicted by triangulation analysis (Fig. 3 B).
Although such a relationship between SSRT and activation was not evident in the preSMA (at a corrected threshold), neural inhibition could still depend, in some way, on the preSMA. A recent study found that SSRT was significantly longer in nine patients with right dorsomedial frontal damage relative to controls (Floden and Stuss, 2006). Microstimulation studies in human subjects have identified a “negative” motor region in the anterior SMA (preSMA) where stimulation produces hesitation or arrest of manual or vocal responses (Penfield and Welch, 1951; Luders et al., 1988; Fried et al., 1991). Furthermore, microstimulation of the superior medial frontal cortex (including specifically the preSMA) in nonhuman primates has been shown to exert executive control over saccades, moreover at a time scale consistent with a causal role in behavioral control (Stuphorn and Schall, 2006; Isoda and Hikosaka, 2007). The preSMA could thus implement an active mechanism of neural inhibition (perhaps by inputting directly to the STN) or it could play some other role such as monitoring or control recruiting, consistent with an extensive literature on the dorsomedial frontal cortex (Bush et al., 2000; Gehring and Knight, 2000; Schall et al., 2002; Botvinick et al., 2004; Ridderinkhof et al., 2004; Rushworth et al., 2004; Yeung et al., 2004). Future research may elucidate the precise functional role of the preSMA within the wider network of preSMA, IFC, and STN regions apparently underlying behavioral stopping.
The current results accord with computational simulations of basal ganglia circuitry (Frank, 2006). These model how cortical conflict signals could generate a temporary delay of motor output via activation of the STN. Simulated STN lesions led to premature responding and impaired high-conflict decisions. This is consistent with observations that lesions of the STN in rodents produce premature responding (Baunez et al., 2001; Desbonnet et al., 2004) and that deep-brain stimulation of the STN can impair cognitive control (Hershey et al., 2004). A recent study shows that stimulating the STN interferes with patients' ability to modulate their RTs by the degree of response conflict, thereby causing impulsive behavior (M. J. Frank, J. Samanta, and S. J. Sherman, unpublished observation).
We thus find that the preSMA, the IFC, and the STN region are key nodes in a network that is apparently recruited across a range of motor control, from outright inhibition to conflict-induced slowing. The results are relevant for disorders such as stuttering and Parkinson's disease, which are associated with pathology of this same network (Brown et al., 2005; Burghaus et al., 2006; Grafton et al., 2006). The results also illuminate everyday phenomena such as “motoric freezing” or hesitation when one's stride, swing, or conversation is interrupted by conflict or surprise. These experiences could arise from a temporary braking of motor output by the same means as responses are completely stopped. Future studies could examine whether this braking system could explain other cognitive psychological phenomena such as Stroop and interference effects. The study also has methodological import. Currently, it is difficult to interpret results from diffusion tractography studies that posit the existence of a previously unknown connection. We propose a method by which tractography can form specific hypotheses which are testable in an independent experiment with fMRI.
Footnotes
-
This work was supported by a 21st Century Science Award from the James S. McDonnell Foundation (R.A.P.) and by the United Kingdom Medical Research Council (T.E.B.). We thank David Flitney for three-dimensional rendering, Joe Devlin for helpful comments on this manuscript, and Bill Lanouette for editing assistance.
- Correspondence should be addressed to Dr. A. R. Aron, Department of Psychology, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093. adamaron{at}ucsd.edu