Abstract
Standard fear extinction relies on the ventromedial prefrontal cortex (vmPFC) to form a new memory given the omission of threat. Using fMRI in humans, we investigated whether replacing threat with novel neutral outcomes (instead of just omitting threat) facilitates extinction by engaging the vmPFC more effectively than standard extinction. Computational modeling of associability (indexing surprise strength and dynamically modulating learning rates) characterized skin conductance responses and vmPFC activity during novelty-facilitated but not standard extinction. Subjects who showed faster within-session updating of associability during novelty-facilitated extinction also expressed better extinction retention the next day, as expressed through skin conductance responses. Finally, separable patterns of connectivity between the amygdala and ventral versus dorsal mPFC characterized retrieval of novelty-facilitated versus standard extinction memories, respectively. These results indicate that replacing threat with novel outcomes stimulates vmPFC involvement on extinction trials, leading to a more durable long-term extinction memory.
SIGNIFICANCE STATEMENT Psychiatric disorders characterized be excessive fear are a major public health concern. Popular clinical treatments, such as exposure therapy, are informed by principles of Pavlovian extinction. Thus, there is motivation to optimize extinction strategies in the laboratory so as to ultimately develop more effective clinical treatments. Here, we used functional neuroimaging in humans and found that replacing (rather than just omitting) expected aversive events with novel and neutral outcomes engages the ventromedial prefrontal cortex during extinction learning. Enhanced extinction also diminished activity in threat-related networks (e.g., the insula, thalamus) during immediate extinction and a 24 h extinction retention test. This is new evidence for how behavioral protocols designed to enhance extinction affects neurocircuitry underlying the learning and retention of extinction memories.
Introduction
Animals are exceptionally good at learning and retaining associations between environmental cues and threatening events. However, it is difficult to change these associations if threat cues are later experienced as safe. This imbalance between expressions of threat and safety is captured by Pavlovian threat (fear) conditioning, wherein conditioned defensive behaviors reemerge following extinction (Bouton, 2002). As the principles of extinction form the basis for exposure therapy (Foa and Kozak, 1986; Milad and Quirk, 2012; Vervliet et al., 2013; Ball et al., 2017), there is motivation to develop behavioral strategies that more effectively prevent the relapse of maladaptive behavior (Craske et al., 2008, 2014; Dunsmoor et al., 2015a). But it remains unclear how behaviorally enhanced extinction strategies affect neural processes underlying the learning and retention of extinction memories.
In a set of cross-species behavioral threat conditioning experiments (Dunsmoor et al., 2015c), we showed that replacing an expected aversive outcome (electric shock) with a novel, surprising, and neutral outcome (a tone) on extinction trials enhanced the long-term effects of extinction training. That is, compared with merely omitting shocks, replacing shocks with a neutral outcome (a procedure we referred to as novelty-facilitated extinction [NFE]) improved 24 h retention of extinction, evidence by diminished conditioned skin conductance responses (SCRs) in humans and freezing in rats to the conditioned stimulus (CS) (see also Lucas et al., 2018). One potential mechanism by which replacing threat with novel outcomes improved extinction retention concerns the general role played by surprise in associative learning (Rescorla and Wagner, 1972). Most associative learning models describe extinction as new learning generated by the surprising absence of an expected unconditioned stimulus (US) (Pearce and Hall, 1980; Wagner, 1981; Larrauri and Schmajuk, 2008). In popular computational learning models (Pearce and Hall, 1980), surprise generated by the omission of the US governs the rate and effectiveness of extinction by modulating a property of the CS referred to as “associability.” A core feature of associability is that the ability for the CS to enter into a new association is dynamically determined by the unsigned (absolute value) prediction error on the previous trial. In this way, an associability model would predict that maximizing surprise on extinction trials could accelerate and strengthen a secondary (nonthreat) association.
The generation of a secondary competing association during extinction is linked to interactions between the amygdala and ventromedial prefrontal cortex (vmPFC), a region necessary for the formation of extinction memories (Myers and Davis, 2002; Quirk and Mueller, 2008; Hartley and Phelps, 2010; Giustino and Maren, 2015). Electrical (Milad et al., 2004) and optogenetic (Do-Monte et al., 2015) stimulation of the vmPFC on extinction trials enhances learning and retention of extinction memories in rodents. Enhancing vmPFC activity immediately after extinction via dopamine administration also improves extinction retention (Haaker et al., 2013). It is possible that increasing associability to the CS is one way to (behaviorally) stimulate engagement of the vmPFC to accelerate new associative learning. Because the strength of associative memory retention is proportional to the effectiveness of learning (Miller and Laborda, 2011; Laborda and Miller, 2012), stimulating involvement of extinction-related brain regions should then lead to the formation of a more durable extinction memory to combat expression of the original threat memory at a future test (Bouton, 2002). The results of enhanced extinction might then be revealed at test via strengthened amygdala-vmPFC functional connectivity involved in balancing the expression of high or low fear states (Quirk et al., 2003; Likhtik et al., 2005; Krabbe et al., 2018).
We examined neural mechanisms by which enhanced extinction strengthens extinction learning and diminishes the return of threat. We predicted that computational modeling would characterize enhanced extinction as stimulating involvement of the vmPFC during new learning. Further, we predicted corresponding diminution of activity in canonical threat appraisal and expression regions (e.g., the insula, thalamus, and dorsal anterior cingulate cortex [dACC]) during extinction learning and retention test, and stronger connectivity between the amygdala and vmPFC at test.
Materials and Methods
Participants.
Forty-eight right-handed participants who were self-reportedly free of neurological and psychiatric disorders were scanned for this experiment. Data from 2 subjects were unusable because they did not return for the second day. The remaining 46 participants (29 female; mean age 23.43 years; SD = 4.76 years; age range 18–35 years) were randomly assigned to the standard extinction group (EXT; N = 23; 14 females) or NFE group (N = 23; 13 females). All subjects provided written informed consent approved by the University Committee on Activities Involving Human Subjects at New York University (Institutional Review Board #20162).
Task and procedure.
This was a between-subjects design that included Pavlovian threat conditioning, immediate extinction, and a 24 h test, based on the human behavioral experiment described by Dunsmoor et al. (2015c) (see Fig. 1A). The first day included 4 functional imaging runs of equal length: threat conditioning (2 runs) and extinction (2 runs); the second day included retention test. Before scanning on the first day, the electric shock was attached to the right wrist and calibrated to be at level deemed “highly annoying but not painful” (e.g., Dunsmoor et al., 2016). The CSs were two angry faces (Ekman and Friesen, 1976) with a jittered stimulus duration of 6.5 ± 0.5 s, followed by a jittered waiting period with a fixation cross on a blank screen for 10.5 ± 1.0 s. Subjects were not instructed about the CS-US relationship at any point and had to learn this association for themselves.
Threat conditioning included a total of 18 CS+ trials unpaired with the shock, 12 CS+ trials paired with the shock (reinforcement rate of 40%), and 18 CS− trials. The first two trials (1 CS+ and 1 CS−) were considered habituation trials and were not included in the analysis. The third trial was always a CS+ trial paired with shock. A 200 ms shock coterminated with the CS+. Shocks were delivered to the right wrist using pregelled MRI-compatible electrodes connected to a stimulator (Grass Medical Instruments). Trials with the shock were removed from analysis in keeping with past protocols (e.g., Dunsmoor et al., 2016). Extinction included a total of 24 CS+ and 24 CS− trials all unpaired with shock. During EXT, the shock was simply omitted. During NFE, the shock was omitted and each CS+ trial instead coterminated with a low-volume 500 ms 440 Hz tone. The next day, shock electrodes were reattached, and subjects underwent a test of extinction–retention (12 CS+ and 13 CS− trials) in the absence of the shock or the tone. The first trial during the 24 h test was always a CS− to account for initial orienting responses and was not included in analysis (compare Schiller et al., 2013; Kroes et al., 2017). The second and third trial were counterbalanced between subjects as either a CS+ then a CS−, or a CS− then a CS+.
Psychophysiology.
Autonomic arousal was measured throughout the scanning session by SCRs collected from pregelled MRI-compatible electrodes connected to the MP100 (BIOPAC Systems). Electrodes were attached to the hypothenar eminence of the left palm, and SCRs were calculated according to our previous criteria (Dunsmoor et al., 2015b). In brief, an SCR was considered related to stimulus presentation if the trough-to-peak deflection occurred within a time window that extended from 0.5 s following CS onset to the CS offset (jittered 6.5 ± 0.5 s), lasted between 0.5 and 5.0 s, and was >0.02 μS. If an SCR did not meet these criteria, then the trial was scored as 0. Responses were obtained using the MATLAB (The MathWorks) script Autonomate that extracted SCRs for each trial using these criteria (Green et al., 2014).
Subjects were not excluded from any analysis based on SCR results. That is, some prior studies have come under a bit of scrutiny for excluding subjects based on physiological performance (e.g., “poor learners” who do not reach a criterion of differential SCRs on CS+ vs CS− trials during some phase of the experiment). To circumvent these issues, we did not exclude subjects based on SCR performance. This necessarily presents a trade-off whereby subjects who evinced weak SCRs (for whatever reason) are included along with subjects who evinced robust SCRs, but the alternative of selectively removing subjects presents its own drawbacks that potentially outweigh including all subjects into the analysis (for a thorough description of this issue, see Lonsdorf et al., 2017).
Computational modeling.
In keeping with our prior computational model fitting approach (Raio et al., 2017), we fit and validated an associability model using individual subject SCRs. In the model, xn indicates the CS on trial n (CS+ or CS−) and rn is the US (1 for US, 0 for no US). Value (i.e., shock) predictions Vn(xn) were defined for each stimulus type (xn) and trial. The prediction error δn = rn − Vn(xn) measures the difference between the expected and predicted shock on trial n. We replaced the constant learning rate from the Rescorla–Wagner model with a dynamic learning rate that gates the speed of learning based on a Pearce–Hall associability rule (Sutton, 1992; Le Pelley, 2004; Li et al., 2011; Raio et al., 2017). The resulting model for threat conditioning was as follows:
where ηc indicates the weight assigned to the most recent absolute value of prediction error (indicating the accuracy of value prediction) in the conditioning phase and κ indicates a normalization factor. We reasoned that η might be modulated by replacing the aversive US with a novel neutral stimulus. We thus postulate that a new η might govern the updating of the rate of extinction learning (ηE) as follows:
We tested the fit of this hybrid model by minimizing the difference between model-predicted associability and the skin conductance responses in both sessions. We optimized the free parameters of the model ηC, ηE and k) by maximizing the posterior probability of observing the measured sequence of SCRs following each CS. This maximization was achieved via the maximum a posteriori method that has been widely used in individual model fitting to avoid extreme parameter estimation.
The initial expected value V0 and associability α0 were set to 0.5, and prediction error weights (i.e., ηC, ηE) were constrained to the range (0, 1) with a β (1.2, 1.2) prior distribution slightly favoring values in the middle of the range; normalization factor κ was constrained to be positive values with a γ (1.2, 1) prior distribution. Log posterior probability was calculated by the summation of the observed data log-likelihoods and parameter log-priors for each subject. Further additional details on this model fitting procedure were reported previously (Li et al., 2011; Raio et al., 2017).
Imaging parameters and preprocessing.
Whole-brain functional imaging was collected on a 3T Siemens Allegra head-only scanner at the Center for Brain Imaging at New York University. Preprocessing was conducted using SPM8 (Wellcome Trust Centre; www.fil.ion.ucl.ac.uk) implemented in MATLAB. Images were corrected for head motion using a 3 mm movement cutoff in any dimension. No subjects were removed for excessive head motion. Functional images were coregistered to each participant's high-resolution T1-weighted structural scan, spatially normalized into MNI space, voxel size resampled to 2 × 2 × 2 mm, and smoothed using an isotropic 8 mm3 Gaussian FWHM kernel. The first four volumes of each functional run were discarded for T1 equilibrium.
Imaging analysis.
Statistical analysis of preprocessed data was conducted using the GLM in SPM8. First-level (individual subject) analysis included covariates for the onset and offset of each CS modeled using an impulse function (0 s duration). Regressors for trial events were temporally convolved using a canonical hemodynamic response function. Covariates of no interest (i.e., nuisance regressors) included in the GLM were the CS+ trials paired with shock and the shock itself, tones (for the NFE), 6 head-motion parameters, and estimates of signal intensity in white matter and CSF. A high-pass 128 s filter was applied to account for low-frequency drifts.
Second-level (group) analysis in SPM8 included a full factorial model with CS type (CS+, CS−) as the within-subjects factor and group (EXT, NFE) as the between-subjects factor. Statistical thresholds for whole-brain analyses were set at p < 0.001 with an extent threshold of 65 voxels, calculated using 10,000 Monte Carlo simulations in AlphaSim (Cox et al., 2017). Threshold for small-volume correction in the amygdala was set at FEW-corrected p < 0.05 using bilateral anatomical masks from the PickAtlas toolbox (Maldjian et al., 2003). Anatomical labels used in the tables for regions of activation were provided by Talairach Client (www.Talairach.org) (Lancaster et al., 2000) based on peak coordinates in MNI space, converted to Talairach space using GingerALE 2.3.6 (www.brainmap.org) (Laird et al., 2010). Any statistical test conducted outside SPM8 was from extracted mean β parameters and analyzed in SPSS 25 (IBM). A priori ROIs included those with a purported role in threat learning and expression (Fullana et al., 2016) and threat inhibition (Fullana et al., 2018b) as shown in extant fMRI research on Pavlovian conditioning and extinction; this included the insula, dACC, thalamus extending into midbrain, vmPFC, and lateral PFC. The amygdala was an a priori ROI as well, but it is not commonly observed in human fear conditioning fMRI (for meta-analysis, see Fullana et al., 2016), and did not emerge from the threat conditioning analysis, consistent with numerous failures to see amygdala activity in human fear conditioning fMRI (Fullana et al., 2018a).
Parametric modulation analysis of associability and prediction errors.
To estimate regions tracking trial-by-trial changes in associability and prediction errors during the course of extinction learning, we used values derived from the Hybrid model (Li et al., 2011; Raio et al., 2017) as parametric regressors modulating CS onset (associability) and CS offset (prediction errors). For the first-level analysis, we concatenated all four runs from day 1 (2 runs of fear conditioning and 2 runs of extinction) and included session-specific constants (see also Boll et al., 2013) along with regressors for the shock, the 6 head-motion regressor, and estimates of signal intensity in white matter and CSF. To separately analyze associability and prediction error-related activity specific to extinction, we generated separate regressors for CS trials from threat conditioning and extinction. Regressors were convolved with the canonical hemodynamic response within the GLM at the first level and carried forward to the second level for whole-brain group level analysis using a threshold of p < 0.001, cluster-corrected for multiple comparisons. Contrast estimates of associability and prediction error-related activity during extinction were derived using a one-sample t tests for each group separately, and a two-sample t test to find differences in activity between the groups.
Psychophysiological interaction (PPI).
A PPI (Friston et al., 1997) was conducted using SPM8 to examine patterns of task-related functional connectivity during the 24 h retention test (day 2). The representative time course was extracted from source ROIs (described in Results) using a 4 mm sphere centered on the peak voxel within the ROI from the group-level analysis. The interaction term between the time series and the psychological context (PPI) was included in the GLM, using the CS+ versus CS− as the trial regressor and including the same nuisance regressors as described above at the single-subject level. Whole-brain results were identified at p < 0.001 cluster-corrected for multiple comparisons, and a small-volume correction of FWE < 0.05 for the amygdala.
Results
Computational modeling dissociates standard and NFE
Conditioned SCRs (Fig. 1B) replicated prior findings that the NFE procedure accelerates extinction and diminishes conditioned SCRs at a 24 h extinction retention test (Dunsmoor et al., 2015c; Lucas et al., 2018). Given that the focus of this report is on the computational modeling and imaging results, we limit discussion of the SCR results. In short, both groups showed heightened SCRs to the CS+ versus the CS− during threat conditioning (day 1), indicating successful acquisition of conditioned threat. Repeated-measures ANOVA using CS type (CS+, CS−) as a within-subjects factor and group (EXT, NFE) as a between-subjects factor showed a main effect of CS type (F(1,44) = 35.243, p < 0.001, η2 = 0.445), but not group (p = 0.192) and no CS type × group interaction (p = 0.363). Notably, while there was no difference in conditioning between groups, the EXT group on average showed nominally elevated SCRs to the CS+, especially during late conditioning. This is likely due to considerable interindividual variability in SCRs, especially collected within a scanning environment. Notably, we included all subjects in the analysis, and did not exclude subjects on the basis of SCR performance.
Experimental design and behavioral results. A, Two groups underwent threat conditioning with a picture of a face (CS+) paired with wrist shock on a partial CS-US pairing schedule, and a second picture (CS−) not paired with shock. Conditioning was followed immediately by either EXT, in which the shock was omitted on CS+ trials, or NFE, in which the shock was replaced by a tone at the end of each CS+ trial. Subjects returned 24 h later, and the CSs were presented in the absence of any shocks or tones. B, Conditioned SCRs replicate prior findings (Dunsmoor et al., 2015a), showing faster extinction and comparatively diminished SCRs 24 h later in the NFE group compared with the EXT group. C, Best-fit associability trace for CS+ trials illustrate accelerated updating during NFE compared with EXT. D, The weighting parameter (η) that governs the rate of associability updating was elevated during NFE compared with EXT. E, Individual differences in the associability weighting parameter during NFE was correlated with recovery of conditioned SCRs the next day, such that subjects who assigned more weight to the prediction error on NFE trials showed better retention of extinction 24 h later. Error bars indicate ± SEM. ***p < 0.001. **p < 0.01. *p < 0.05.
The primary index of 24 h return of threat responses (day 2) was the mean SCR to the early (first four) CS+ versus CS− trials. We focused on the mean of the early trials, rather than the entire testing phase, as this is the period of time when group differences in extinction retention tend to be most evident (Milad et al., 2009; Schiller et al., 2010; Kroes et al., 2016). After several unreinforced trials, subjects will begin to reextinguish, rendering it difficult to dissociate differences in extinction retention from variable rates of reextinction. The focus on early trials, and the first 4 trials in particular (e.g., Milad et al., 2009), is also in keeping with existing fMRI research using 24 h tests (Fullana et al., 2018b). Independent-samples t test on the SCR difference (CS+ − CS−) in both groups showed stronger differential SCRs in the EXT than NFE group (t(44) = 2.487, p = 0.017, 95% CI [0.048, 0.460]). The EXT group showed substantially elevated SCRs on the CS+ versus the CS− trials, as revealed by a one-sample t test on the CS+ versus the CS− (t(22) = 5.632, p < 0.001, 95% CI [0.276, 0.598]). Independent-samples t test on the CS+ trials alone showed heightened mean SCRs in the EXT versus NFE group (t(44) = 2.015, p = 0.050, 95% CI [−0.576, 0.0012]), but no difference in mean SCRs to the CS− (p = 0.719). While conditioned SCRs were diminished in the NFE group relative to the EXT group, conditioned arousal to the CS+ at a 24 h test was not eliminated by the NFE procedure, and SCRs were elevated to CS+ versus CS− (t(22) = 2.756, p = 0.012, 95% CI [0.045, 0.321]). Together, these physiological results confirm that omitting the shock and replacing it with a nonaversive tone improved the rate of extinction learning led to comparatively less return of threat than just omitting the shock, replicating previous findings (Dunsmoor et al., 2015c).
To examine whether NFE affects trial-by-trial extinction learning differently than EXT, we fit an associability model using subject's trial-by-trial SCRs from day 1, in keeping with prior implementation of this model during human fear conditioning (Li et al., 2011; Raio et al., 2017). Repeated-measures ANOVA of the best-fit associability trace (α) on CS+ trials (Fig. 1C) showed a main effect of group (F(1,44) = 5.062, p = 0.030, η2 = 0.103). Associability values were mostly undifferentiated between groups during threat conditioning for CS+ trials but showed accelerated updating for the CS+ during NFE compared with EXT. Repeated-measures ANOVA of the mean weighting factor (η) that governs the rate of trial-by-trial associability updating showed a phase (conditioning, extinction) × group interaction (F(1,44) = 10.708, p = 0.002, η2 = 0.196). Figure 1D shows that the mean weighting factor (η) was no different between groups during conditioning (independent-sample t test: p = 0.31), as expected given that the threat conditioning task was no different between groups. However, during extinction, the weighting factor η was significantly elevated in the NFE group versus the EXT group (independent-sample t test: t(44) = 3.150, p = 0.003, 95% CI [0.088, 0.401]).
We also investigated whether individual differences in the associability weighting parameter η at the time of NFE (day 1) had any predictive value in determining the strength of extinction retention 24 h later (Fig. 1E). That is, we asked whether subjects who evince strong updating of associability during NFE on day 1 are those who show less return of threat responses on day 2. There was an inverse relationship between the associability weighting parameter at the time of NFE learning and differential (CS+ − CS−) SCRs at the time of test the next day (r(21) = −0.427, p = 0.04, two-tailed). This suggests that those subjects for whom the surprising outcome on extinction trials most effectively increased associability were those who exhibited the strongest benefit, in terms of inhibiting future threat expression. There was no correlation between the associability weighting parameter derived from EXT and SCRs evinced 24 h later (r(21) = −0.177, p = 0.42, two-tailed).
Imaging results
We first characterized whole-brain group-level fMRI analysis of threat conditioning to ensure equivalent threat conditioning-related activity in both groups. Across groups, threat conditioning (CS+ > CS−) was associated with activity in the bilateral insula, striatum, thalamus extending into the midbrain, and dACC (Table 1). The inverse contrast (CS+ < CS−) revealed activity in the vmPFC, posterior cingulate cortex, and left angular gyrus. These regions are consistently implicated in fMRI studies of human conditioning (Sehlmeyer et al., 2009; Mechias et al., 2010; Fullana et al., 2016). Importantly, no regions emerged as showing a main effect of group, or a group × CS type interaction, confirming that activity related to initial acquisition was equivalent between groups, as expected.
Whole-brain ANOVA of CS type (CS+, CS−) and group (EXT, NFE), identified at p < 0.001 (cluster-corrected p < 0.05) during threat (fear) conditioning across all participants (n = 46)
Whole-brain analysis of extinction
Repeated-measures 2 × 2 ANOVA of whole-brain activity during extinction revealed a main effect of CS type in a number of areas that were also active during threat conditioning, including the dACC, bilateral insula, and striatum (for full report, see Table 2). This overlap in activity within threat learning and expression networks across conditioning and extinction has been reported in a meta-analysis of human fear extinction fMRI (Fullana et al., 2018b). The group × CS type interaction revealed group differences in the vmPFC, dACC, insula, and superior frontal gyrus (Fig. 2A; Table 3). To characterize the interaction, we extracted parameter estimates for the CS+ and CS− contrast from a priori ROIs (Fig. 2A). This confirmed that the interaction in ROIs associated with threat inhibition (the vmPFC and dorsolateral PFC; Fig. 2A,B) threat appraisal and expression (i.e., the insula and dACC; Fig. 2C,D) were driven by heightened differential activity between the CS+ and CS− during EXT compared with NFE. Notably, in the vmPFC and lateral PFC, the pattern in activity between the CS+ and CS− reversed from conditioning to extinction, such that deactivation to the CS+ was diminished. Thus, despite equivalent acquisition-related activity, NFE produced a dramatic and highly distinguishable pattern of BOLD fMRI in regions involved in threat inhibition and threat appraisal and expression.
Whole-brain ANOVA of CS type (CS+, CS−) and group (EXT, NFE), identified at p < 0.001 (cluster-corrected p < 0.05) during threat (fear) extinction across all participants (n = 46)
Whole-brain ANOVA and associability-modulated vmPFC activity during extinction. The group × CS type interaction of extinction revealed activations in vmPFC, superior frontal gyrus, dACC, and insula. A, Parameter estimates extracted from these regions characterized the interaction as deactivations to CS+ versus CS− in vmPFC and superior frontal gyrus during both conditioning and extinction in the EXT group but a switch in CS+ versus CS− activity during extinction in the NFE group. dACC and left insula exhibited heightened CS+ versus CS− differential activity during condition and extinction in the EXT group but diminished CS+ versus CS− activity during extinction in the NFE group. B, NFE exhibited stronger associability-modulated engagement of the vmPFC than EXT. This region of the vmPFC corresponds to an area of the sgACC considered homologous to the rodent infralimbic cortex. p.e., Parameter estimates, arbitrary units. Error bars indicate ± SEM. ***p < 0.001 (one-sample t test). *p < 0.05 (one-sample t test).
Whole-brain ANOVA of main effects of group (EXT, NFE) and CS type (CS+, CS−) × group interactions, during threat (fear) extinction, identified at p < 0.001 (cluster-corrected p < 0.05)
Associability-related activity in the vmPFC during NFE
We compared associability-related brain activity during extinction by using parametric modulation analyses of trial-by-trial associability values for each participant. To detect differences in associability signals at the whole-brain level, we conducted a two-sample t test on the associability regressor from the parametric modulation analysis. This revealed stronger associability-modulated activity in the vmPFC during NFE compared with EXT (Fig. 2E). This was the only region identified that survived whole-brain correction for multiple comparisons. A two-sample t test on the prediction-error regressor for extinction did not reveal any differences between groups.
Whole-brain analysis of retention test
A whole-brain 2 × 2 ANOVA of the 24 h test phase showed a group × CS type interaction in the vmPFC, a region of the vmPFC consistent with the subgenual ACC (sgACC, Brodmann area 25), and bilateral posterior parietal cortex (Table 4). Further analysis of the vmPFC confirmed that differences between CS+ and CS− in the EXT were driven by deactivations to the CS+ relative to the CS−, whereas the NFE showed a relative increase in activity to the CS+ that was near the baseline level of the learned safety signal, the CS− (Fig. 3A). The main effect of CS type also revealed activity in a number of regions that overlapped with areas identified in threat conditioning (e.g., bilateral insula, thalamus, striatum, ACC, posterior cingulate cortex, vmPFC; Table 5). Interestingly, this analysis also revealed activity in the left amygdala (MNI −18, 0, −12; 7 voxels, F = 17.62, PFWE-corrected = 0.01). Given the general interest in the role of the amygdala in extinction processes (Ehrlich et al., 2009; Pape and Paré, 2010), we examined activity in this amygdala ROI during the 24 h retention test and found that both groups exhibited enhanced CS+ versus CS− responses in the left amygdala (Fig. 3B). This amygdala ROI was used as a seed region for a subsequent functional connectivity analysis reported below.
Whole-brain ANOVA of main effects of group (EXT, NFE) and CS type (CS+, CS−) × group interactions, during 24 h extinction recall, identified at p < 0.001 (cluster-corrected p < 0.05)
Whole-brain ANOVA and functional connectivity at 24 h test. A, The group × CS type interaction revealed activity in the vmPFC, including a region in the sgACC. Parameter estimates extracted from the vmPFC/sgACC characterized the interaction as strong CS+ deactivations 24 h after EXT compared with NFE. B, The main effect of CS type revealed activation in left amygdala. C, Task-based functional connectivity analysis using the vmPFC/sgACC as a seed region showed stronger correlations between the vmPFC and amygdala 24 h after NFE compared with EXT. D, A complementary exploratory analysis using the left amygdala as a seed region showed dissociable patterns of connectivity between groups, with the NFE group exhibiting stronger connectivity with the vmPFC (mirroring the prior analysis) and the EXT group exhibiting stronger connectivity with the dACC 24 h later. p.e., Parameter estimates, arbitrary units. Error bars indicate ± SEM. **p < 0.01.
Whole-brain ANOVA of the main effect of CS type (CS+, CS−) identified at p < 0.001 (cluster-corrected p < 0.05) during 24 h extinction recall across all participants (n = 46)
Amygdala-vmPFC connectivity characterizes recall of NFE versus EXT
The vmPFC is often associated with successful extinction recall in humans (Phelps et al., 2004; Milad et al., 2007) and corresponds to an area of the mPFC consistent with the infralimbic cortex in rodents that is generally implicated in threat extinction processes (Quirk and Mueller, 2008). Connectivity patterns between the mPFC and amygdala have also been characterized in research on inhibitory networks controlling emotional and defensive responding in humans and rodents (Duvarci and Paré, 2014; Frank et al., 2014; Tovote et al., 2015; but see Bukalo et al., 2015). To further probe the role of this region during recall of standard versus NFE, we used this ROI from the extinction–retention test as a seed region in a whole-brain functional connectivity analysis (PPI). At the whole-brain level, using a somewhat liberal threshold of p < 0.001 uncorrected and a voxel extent of 5 voxels, the only two regions to emerge as exhibiting stronger task-based (CS+ > CS−) functional correlations with the vmPFC in the NFE versus EXT group during extinction recall were in right (MNI −20, −2, −22; 15 voxels, t = 3.72, Puncorrected < 0.001) and left (MNI 26, −4, −24; 8 voxels, t = 3.73, Puncorrected < 0.001) amygdala (Fig. 3C).
A secondary and complementary analysis used the left amygdala ROI identified from the main effect of CS type across both groups during extinction recall as the PPI seed region. At the whole-brain level, the only region to emerge as showing stronger positive correlations with the amygdala during the 24 h test in the NFE group versus the EXT group was the vmPFC (Fig. 3D), perhaps unsurprisingly mirroring the results using the vmPFC as the PPI seed region. At a more lenient exploratory threshold of p < 0.005, we found that the EXT group exhibited stronger amygdala connectivity with the dACC compared with the NFE group (two-samples t test).
Discussion
Behavioral strategies can enhance extinction to prevent relapse of extinguished behaviors (Craske et al., 2008, 2014; Laborda et al., 2011; Dunsmoor et al., 2015a). However, despite increasing knowledge on the neuroscience of fear extinction, the neural correlates underlying behaviorally enhanced extinction strategies are unclear. Here we found that enhanced extinction that involves replacing threats with novel outcomes (Dunsmoor et al., 2015c) diminished activity in threat appraisal and expression regions, including the insula, thalamus, and dACC. A computational learning model that emphasizes surprise-oriented attention and dynamically governs how easily the CS can form an association with its outcome characterized vmPFC activity during NFE, and individual differences in associability-modulated extinction correlated with diminished conditioned arousal the next day. Finally, vmPFC activity was functionally correlated with the amygdala during 24 h later, in line with rodent neurobiological models emphasizing prefrontal-amygdala pathways regulating the balance of threat and safety.
The whole-brain imaging results showed that enhancing extinction reduced BOLD activity in a number of regions associated with threat appraisal and expression, including the insula, dACC, and thalamus. Activity to the CS+ is traditionally maintained in these regions during EXT learning in human fMRI (Fullana et al., 2018b), perhaps because extinction tends to be slow and the mere omission of shock is not sufficient to reduce neural activity related to threat anticipation. It may be especially difficult to reduce threat anticipation through passive omission of the US following partial CS-US pairing (Grady et al., 2016). However, replacing threat outcomes on extinction trials appears to effectively attenuate activity in regions that are otherwise involved in maintaining and expressing threat expectations. One psychological effect of replacing the shock with a perceptible outcome (rather than just passively omitting it) might involve reducing threat uncertainty (Dunsmoor et al., 2015c; Morriss et al., 2016; Lucas et al., 2018).
Computational modeling showed that associability modulated vmPFC activity during NFE. Given that extinction is considered new associative learning, it is intriguing that a region involved in extinction learning (the vmPFC) was engaged by dynamic changes in associability. This suggests that novelty modulates the effectiveness of new associative learning in the vmPFC. In this framework, the vmPFC might provide a “teaching signal” that serves to instruct the formation of extinction memories (Bukalo et al., 2015) while simultaneously downregulating regions involved in maintaining threat appraisal and emotional expression, such as the insula, thalamus, brainstem, and dACC. This explanation fits with studies in rodents showing that electrical (Milad et al., 2004) or optogenetic (Do-Monte et al., 2015) stimulation of the infralimbic cortex facilitates extinction learning and strengthens extinction memories. This also fits with an investigation in rodents showing that replacing a shock US with an appetitive US increases activity in the infralimbic cortex and decreases long-term defensive responses (Correia et al., 2016). It is noteworthy, therefore, that the region of the vmPFC identified as tracing associability better in NFE than EXT was the sgACC (Brodmann area 25). This area is commonly thought of as homologous to infralimbic cortex (Ongür et al., 2003), which is crucial for extinction learning (Milad and Quirk, 2012).
A consequence of enhanced extinction learning might be the formation of a more durable extinction memory, which then provides stronger retrieval competition against the original threat association at test (Miller and Laborda, 2011; Laborda and Miller, 2012). This idea of strengthening retrieval competition is further supported by patterns of amygdala-vmPFC connectivity 24 h later. Specifically, activity between the vmPFC and amygdala was positively correlated on CS+ versus CS− trials 24 h after NFE, whereas activity between the dACC and amygdala was positively correlated 24 h after EXT (albeit at a more liberal statistical threshold). These connectivity results fit with rodent neurobiological research showing that distinct pathways between the amygdala and the ventral (infralimbic) and dorsal (prelimbic) regions of the mPFC mediate the balance between inhibition and expression of conditioned threat (Sierra-Mercado et al., 2011; Senn et al., 2014; Krabbe et al., 2018). A positive correlation between the vmPFC and amygdala might be related to prefrontal inhibition via excitatory inputs to the basolateral amygdala (Bloodgood et al., 2018) that in turn activate GABAergic intercalated cell masses that project onto and inhibit the central nucleus of the amygdala (Amano et al., 2010; Strobel et al., 2015), although it has to be noted that fMRI methods are unable to substantiate that speculation. Overall, enhancing extinction might engage the vmPFC to strengthen new associative learning during extinction, which may then lead to stronger interference with the threat memory at test (Bouton, 1993).
Further research testing the NFE protocol, or methodological offshoots, is warranted. For instance, the present study and prior behavioral study (Dunsmoor et al., 2015c) used an immediate extinction design in humans, although the NFE protocol in rats in Dunsmoor et al. (2015c) did incorporate a 3 d delayed extinction and 24 h extinction retention test. Immediate extinction may be a weaker form of extinction that produces less extinction retention than delaying extinction by at least a day (Maren, 2014). It is perhaps unsurprising then that the EXT group exhibited enhanced activity to the CS+ versus CS− throughout extinction, and showed weak extinction retention. At the same time, it is noteworthy that the NFE protocol reduced activity in regions associated with threat appraisal and improved extinction retention, despite the immediate extinction design. It will be of interest to test effects of an immediate versus delayed NFE protocol, as its possible NFE is even more effective at a delay, in keeping with findings of EXT standard extinction (Maren and Chang, 2006; Schiller et al., 2008; Huff et al., 2009). This study also used threat-relevant CSs (angry faces) that are more difficult to extinguish than neutral CSs (Ohman and Mineka, 2001). This again seems to support the idea that NFE is a more optimal technique that overcomes deficits in extinction to threat-relevant CSs, but it might be of interest to compare NFE to EXT of neutral CSs in humans. It also remains important to probe the similarity and differences between NFE, which incorporates neutral outcomes, and counterconditioning, which incorporates rewarding outcomes. We have speculated previously (Dunsmoor et al., 2015c) that NFE might be more effective than counterconditioning by effectively neutralizing the emotional significance of the CS, but there is also evidence that rewarded extinction is effective at diminishing future threat through an amygdala-ventral striatum circuit (Correia et al., 2016). Where these procedures overlap and differ is of theoretical interest. It is possible that the neutral tone used here to replace the shock acquires rewarding properties by being associated with omission of shock.
These results have implications for treatments of pathological anxiety based on associative learning theory. Exposure therapy is informed in large measure by basic principles of experimental extinction (Foa et al., 1989), and advances in the neuroscience of extinction offer potential insight into treatment for fear and anxiety disorders. Standard models of extinction rely on the repeated absence of the outcome to eventually produce a new CS association, at which point cue associability diminishes as the absence of the outcome is fully predicted (Pearce and Hall, 1980). Yet learning by omission seems to be a rather passive strategy to generate a competitive long-term extinction memory, as evidenced by a century of research on postextinction recovery of conditioned behaviors (Pavlov, 1927; Bouton, 2002; Rescorla, 2004). The imbalance between conditioning and extinction is especially evident in threat learning, where there is an evolutionarily conserved bias toward anxiety conservation (Solomon and Wynne, 1954; Bateson et al., 2011) to ensure that threat associations are maintained “just in case.” Likewise, in many real-world situations, the mere absence of an expected threat is not sufficient to reduce threat uncertainty, exemplified in specific phobias (e.g., fear of flying: “just because the plane didn't crash this time doesn't mean it won't crash next time.”). Extinction strategies that rely on passive US omission might simply be insufficient in some cases (e.g., following partial reinforcement) (Li et al., 2016) or in some populations (e.g., stress disorders) (Milad et al., 2009) to stimulate vmPFC involvement. An inability to properly learn from experiences that disconfirm negative expectations could also explain why humans have more difficulty retrieving explicit memories of safety after extinction (Dunsmoor et al., 2018). Amplifying the effects of surprise by replacing expected threat with novel outcomes may be a potential technique to strengthen new safety memories to outcompete reactivation of unwanted negative memories.
Footnotes
This work was supported by National Institutes of Health Grant R01 MH097085 to E.A.P. and Grant K99R00 MH106719 to J.E.D. M.C.W.K. was supported by an H2020 Marie Sklodowska-Curie Fellowship and a Branco Weiss Fellowship, Society in Science. We thank Michael Evans for assistance with data collection.
The authors declare no competing financial interests.
- Correspondence should be addressed to Joseph E. Dunsmoor at joseph.dunsmoor{at}austin.utexas.edu