Abstract
Learning can be guided by unexpected success or failure, signaled via dopaminergic positive reward prediction error (+RPE) and negative reward-prediction error (−RPE) signals, respectively. Despite conflicting empirical evidence, RPE signaling is thought to be impaired in drug addiction. To resolve this outstanding question, we studied as a measure of RPE the feedback negativity (FN) that is sensitive to both reward and the violation of expectation. We examined FN in 25 healthy controls; 25 individuals with cocaine-use disorder (CUD) who tested positive for cocaine on the study day (CUD+), indicating cocaine use within the past 72 h; and in 25 individuals with CUD who tested negative for cocaine (CUD−). EEG was acquired while the participants performed a gambling task predicting whether they would win or lose money on each trial given three known win probabilities (25, 50, or 75%). FN was scored for the period in each trial when the actual outcome (win or loss) was revealed. A significant interaction between prediction, outcome, and group revealed that controls showed increased FN to unpredicted compared with predicted wins (i.e., intact +RPE) and decreased FN to unpredicted compared with predicted losses (i.e., intact −RPE). However, neither CUD subgroup showed FN modulation to loss (i.e., impaired −RPE), and unlike CUD+ individuals, CUD− individuals also did not show FN modulation to win (i.e., impaired +RPE). Thus, using FN, the current study directly documents −RPE deficits in CUD individuals. The mechanisms underlying −RPE signaling impairments in addiction may contribute to the disadvantageous nature of excessive drug use, which can persist despite repeated unfavorable life experiences (e.g., frequent incarcerations).
Introduction
Reinforcement-learning theories posit that a reward-prediction error (RPE) signal reflects a discrepancy between a predicted and actual rewarding outcome that is used to update the expected value of the stimulus/action leading to the rewarding outcome (Schultz, 1998; Glimcher, 2011). According to both preclinical and human neuroimaging studies, these RPEs are produced by phasic dopaminergic signaling in the midbrain projecting to other regions within the mesostriatocortical network that increases in response to the delivery of an unpredicted reward [positive RPE (+RPE)] and decreases in response to either the omission of a predicted reward or delivery of negative outcome [negative RPE (−RPE); Mirenowicz and Schultz, 1994; Hollerman and Schultz, 1998; McClure et al., 2003; Liu et al., 2007; Cohen et al., 2012; Hart et al., 2014]. Consequently, these signals drive learning (Reynolds et al., 2001) that guides future behavior (Montague et al., 2004; Frank and Claus, 2006).
In the course of developing drug addiction, the pharmacologic actions of drugs of abuse enhance +RPE signal (Jones et al., 1995), facilitating pathological overlearning of drug-related cues (Di Chiara, 1998). The mechanism invokes dysregulated corticostriatal dopamine transmission (Rice and Cragg, 2004) that facilitates unrestricted habitual drug use (Everitt and Robbins, 2005) and results in gradual decline in dopaminergic functions (Koob and Le Moal, 2001), including reward prediction (Watanabe, 1996) and movement initiation (Schultz and Romo, 1992). Using associative learning and guessing/financial decision-making tasks, some studies have shown reduced RPE signals in nicotine (Rose et al., 2012), polysubstance (Tanabe et al., 2013), and stimulant (Rose et al., 2014) users, whereas others have shown intact RPE signals in alcohol (Park et al., 2010), nicotine (Chiu et al., 2008), and opioid (Gradin et al., 2014) users. Thus, whether RPE signaling is globally impaired in addiction remains an open question.
The feedback-related negativity [feedback negativity (FN), feedback-related negativity, feedback-error-related negativity, or medial-frontal negativity] component of the event-related potentials (ERPs) originates from the medial prefrontal cortex and the major termini of mesencephalic dopaminergic projections (Nieuwenhuis et al., 2004a; Carlson et al., 2011). FN is reported to be sensitive to phasic corticostriatal dopaminergic activity (Hauser et al., 2014) and tracks bidirectional (±)RPE signaling (Holroyd et al., 2003; Pfabigan et al., 2011). Previous studies in addiction have investigated FN as a marker of sensitivity to reward expectation (Franken et al., 2010; Euser et al., 2011; Muñoz et al., 2012) or of outcome evaluation (Fein and Chang, 2008; Euser et al., 2013), but importantly never as a marker of RPE.
Here, we reasoned that a targeted task could leverage FN to index RPE in addiction. Therefore, we evoked FN using a gambling task and hypothesized that cocaine-use disorder (CUD) will manifest deficits in computing both positive RPE (+RPE) and negative RPE (−RPE). Given previous reports of more severe executive function (Woicik et al., 2009) and reward processing impairments (Parvaz et al., 2012) in addicted individuals with less-recent drug use (i.e., CUD individuals with longer current abstinence length and less-frequent current drug use), we further hypothesized that less-recent cocaine use will be associated with more-severe RPE deficits.
Materials and Methods
Participants.
Fifty CUD individuals (46 male) and 25 healthy controls (23 male) were recruited through advertisements in local newspapers, word of mouth, and local treatment facilities. All participants underwent full physical and neurological examinations by a neurologist and a diagnostic interview by a clinical psychologist. The diagnostic interview included the Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Axis I Disorders (First et al., 1996); the Addiction Severity Index (McLellan et al., 1992); the Cocaine Selective Severity Assessment Scale (Kampman et al., 1998); the Cocaine Craving Questionnaire (Tiffany et al., 1993); and the Severity of Dependence Scale (Gossop et al., 1992). Based on this interview, all CUD individuals were classified as meeting diagnostic criteria for current cocaine dependence.
Participants were further screened to exclude for the following: (1) history of head trauma or loss of consciousness (>30 min) or any other neurological disease; (2) abnormal vital signs at time of screening and history of major medical conditions; (3) history of major psychiatric disorders, including current dependence or abuse of any substance (other than cocaine abuse or dependence for CUD and/or nicotine dependence for all participants); (4) severe levels of self-reported depression (Beck Depression Inventory; Beck et al., 1996; scores, >20); (5) history of gambling as measured with the South Oaks Gambling Scale (Lesieur and Blume, 1987; scores, ≥5); (6) except for cocaine in the CUD participants, positive urine screens for any other psychoactive drugs; and (7) current use of any medication, including psychotropic medications.
Similar to our previous studies (Moeller et al., 2010, 2013; Dunning et al., 2011; Parvaz et al., 2012), recent use of cocaine was objectively indexed by urine screening on study day using the triage urine panel (Biopsych), which detects drug use within the past 72 h. Individuals positive for cocaine in urine, though not currently intoxicated, were classified as CUD+ (N = 25), while those negative for cocaine in urine were classified as CUD− (N = 25). All CUD individuals used crack/cocaine in the previous 90 d. As anticipated, the two CUD subgroups differed on frequency of recent cocaine use (days per week in past 30 d, p < 0.001), duration of current abstinence (p < 0.001), self-reported craving (p < 0.001), and withdrawal from cocaine (p = 0.006; Table 1). With the exception of cigarette smoking, the combined group of CUD individuals and controls did not differ on any demographic characteristics (Table 1). Only 5 (of 25) controls were current smokers compared with the majority (32 of 50) of CUD individuals, and this represented a significant difference (p = 0.001). However, daily cigarette use among the current smokers did not differ between the groups. Cigarette smokers were not required to abstain from smoking to avoid acute nicotine withdrawal effects. Participants were fully informed of all study procedures and risks and provided written consent in accordance with the Stony Brook University's Institutional Review Board.
Task.
The overall design of the task is similar to Experiment 2 reported in a previous study by Hajcak et al. (2007). The task was administered using Presentation software (Neurobehavioral Systems) to control the presentation and timing of all stimuli. Participants' primary objective on each trial was to guess which of four doors presented horizontally in a color graphic hid a prize by pressing one of four keys on a keypad corresponding to each door (Fig. 1A). At the beginning of each trial, a white “1,” “2,” or “3” appeared on the screen for 1000 ms to inform participants the number of doors (out of four) that contained a monetary prize (60¢); thus, these cues indicated the probability of reward on the upcoming trial (25, 50, and 75%, respectively). The graphic of the doors appeared 1500 ms after cue offset, and remained until the participant made a selection to indicate their prediction of the winning door. Immediately following their choice, the question “Do you think you won or lost in this trial?” appeared on the screen and remained there until participants indicated via button press a predicted win or loss. Thus, participants were first presented with a cue that conveyed the objective probability of reward on the upcoming trial, then asked to make a choice and, finally, predicted whether or not they thought they chose correctly. Five hundred milliseconds following their prediction, a feedback stimulus appeared on the screen for 1000 ms: a green arrow pointing upward indicated a win (i.e., a win of 60¢), and a red arrow pointing downward indicated a loss (i.e., a loss of 30¢; Fig. 1A). Winning probability was always consistent with the cue type (e.g., 75% winning probability for three-cue trials). Participants' prediction on each trial was recorded and subsequently analyzed to validate the success of the task manipulations in biasing reward prediction. Such a task design specifically allows for modulating valence-related variability in FN by explicit prediction errors (Hajcak et al., 2007).
All stimuli were positioned in the center of the screen. The cue and feedback stimuli occupied ∼21° of visual angle horizontally and 21° vertically. A fixation cross was then presented in the intertrial interval for 1000 ms. The task consisted of six blocks of 50 pseudorandom trials (i.e., a total of 300 trials; 100 trials per cue type) interspaced by a brief break. Participants were told that the task earnings would be given to them at the end of the EEG session. Unbeknownst to the participants, the task earnings were not dependent on their choice and they were always paid $75 for the task.
Electroencephalogram recording and data reduction.
Electroencephalogram (EEG; Neuroscan) recordings were obtained using a 64 silver–silver chloride electrode cap positioned according to the International 10/20 system. Electro-oculogram electrodes at the left supraorbital and infraorbital sites and the right and left outer canthi recorded the blinks (and vertical eye movements) and horizontal eye movements, respectively. EEG recordings were sampled at 500 Hz and bandpass filtered from DC to 70 Hz. Electrode impedances were kept <10 kΩ.
Offline preprocessing of the EEG signal was performed using SPM8 (Wellcome Department of Cognitive Neurology, London, UK; http://www.fil.ion.ucl.ac.uk/spm) and custom Matlab (The MathWorks) scripts. Data were first bandpass filtered (0.01–30 Hz) and rereferenced to the averaged electrical activity from all 64 scalp sites. To evaluate the effect of participant's actual predictions on the feedback-locked ERP and to maximize the number of trials for each analysis, across the objective winning probabilities (25, 50, and 75%), averaged waveforms were created for all four feedback trial types (i.e., predicted win, unpredicted win, predicted loss, and unpredicted loss). Therefore, the continuous EEG data were segmented beginning 200 ms before the feedback onset and continuing for 1000 ms. For each trial, the 200 ms baseline was subtracted from the poststimulus data for baseline correction. Eye-blink and ocular corrections were performed using the partial signal space projection method proposed by Nolte and Hämäläinen (2001), such that the contribution to the estimated spatial structure of eye-blink artifact was removed only from the artifact-ridden epochs, leaving as much information as possible in the data (Nolte and Hämäläinen, 2001). This artifact-rejection procedure identified a voltage step of >75 μV between sample points and a peak-to-peak voltage difference of 150 μV within an epoch. Additional artifacts were identified through visual inspection, and the contaminated epochs were subsequently rejected. Finally, robust averaging was used to create artifact-free ERPs (Wager et al., 2005) separately for each of the four conditions.
Based on the scalp topography of the grand-averaged difference waveforms (loss minus win for predicted and unpredicted conditions; Hajcak et al., 2007), the FN was defined as the averaged maximum negative amplitude at the Fz, FC1, FCz, and FC2 electrode sites from 200 to 350 ms following feedback onset. FN was scored in two different ways: (1) based on subjective prediction and outcome (i.e., predicted win, unpredicted win, predicted loss, and unpredicted loss); and (2) based on objective win probability and outcome (i.e., 25% win, 25% loss, 50% win, 50% loss, 75% win, and 75% loss). We expected the former, but not the latter, to specifically target RPE as there might be incongruencies in objective win probability and the participants' own expectation of reward. Additionally, to show that RPE uniquely modulates FN, P300 was also isolated for each outcome based on winning probability and prediction type at sites where P300 was maximal (i.e., F1, Fz, F2, FCz sites). The P300 was defined as the most positive peak in the 400–600 ms window following feedback onset.
Using SPSS (version 20), task behavior (subjective win/loss prediction) was analyzed with a 3 [Winning probability (25, 50, and 75%)] × 2 [Predicted outcome (wins and losses)] × 3 [Group (Controls, CUD+, and CUD−)] mixed-model ANOVA. In contrast, FN and P300 amplitudes were each analyzed with (1) 2 [Prediction (predicted and unpredicted)] × 2 [Outcome (wins and losses)] × 3 [Group (Controls, CUD+, and CUD−)] mixed-model ANOVA, to specifically assess the RPE signaling; and (2) 3 [Winning probability (25, 50, and 75%)] × 2 [Outcome (wins and losses)] × 3 [Group (Controls, CUD+, and CUD−)] mixed-model ANOVA, to assess the specificity of objective win probability to target RPE. In instances where the sphericity assumption was violated, the Greenhouse–Geisser correction was applied. All significant main effects and interactions were followed with pairwise t tests.
Finally, due to our a priori hypothesis regarding more severe impairments as a function of longer current abstinence length and less-frequent current drug use, correlations between the dependent measures (ERPs and behavior) and variables indexing recent drug-use history and related symptoms (listed in Table 1) were examined within each CUD subgroup.
Results
Behavioral results
The 3 (Winning probability: 25, 50, and 75%) × 2 (Predicted outcome: Win and Loss) × 3 (Group: Controls, CUD+, and CUD−) mixed ANOVA revealed a significant main effect of predicted outcome (Win > Loss: F(1,72) = 54.22, p < 0.001) and a winning probability-by-predicted outcome interaction (F(2,71) = 58.54, p < 0.001), while all other main effects and interactions did not reach significance (p > 0.1). Follow-up paired t tests showed that all participants predicted losing more often than winning for the 25% winning probability (t(74) = 3.30, p = 0.002) and predicted winning more often than losing for both the 50% (t(74) = 6.83, p < 0.001) and 75% (t(74) = 12.00, p < 0.001) winning probabilities, confirming that the task manipulations were successful in biasing reward prediction (Fig. 1B).
ERP results
FN
The 2 (Prediction: Predicted and Unpredicted) × 2 (Outcome: Win and Loss) × 3 (Group: Controls, CUD+, and CUD−) mixed ANOVA for FN amplitude indicated a significant main effect for outcome (Win > Loss: F(1,72) = 102.4, p < 0.001), a significant prediction-by-outcome interaction (F(1,72) = 10.64, p = 0.002), and a significant prediction-by-outcome-by-group interaction (F(2,72) = 5.58, p = 0.006). Follow-up comparisons revealed that, in controls only, FN amplitude was significantly higher for unpredicted versus predicted wins (Unpredicted Win > Predicted Win: t(24) = 3.12, p = 0.005) and significantly lower for unpredicted versus predicted losses (Unpredicted Loss < Predicted Loss; t(24) = 2.50, p = 0.020) as would be expected if the FN was tracking both +RPEs and −RPEs.
CUD+ individuals also showed significantly higher FN amplitude for unpredicted versus predicted wins (Unpredicted Win > Predicted Win; t(24) = 2.38, p = 0.026) but no significant effects were observed for unpredicted versus predicted losses (p = 0.337). CUD− individuals did not show FN differences either between unpredicted and predicted wins (p = 0.719), or between unpredicted and predicted losses (p = 0.203). Thus, the current results show that, compared with healthy controls, both CUD subgroups manifest impairments in −RPE signaling as indexed by the lack of FN amplitude modulation to unpredicted versus predicted losses; CUD− individuals show additional impairments in +RPE signaling as indexed by the lack of FN amplitude modulation by unpredicted versus predicted wins (Table 2; Fig. 2A–D).
The exploratory 3 (Winning probability: 25, 50, and 75%) × 2 (Outcome: Win and Loss) × 3 (Group: Controls, CUD+, and CUD−) mixed ANOVA revealed a significant main effect of outcome (Win > Loss: F(1,72) = 69.72, p < 0.001) and a winning probability-by-outcome interaction (F(2,71) = 7.24, p = 0.001), while all other main effects and interactions did not reach significance (p > 0.1). Follow-up paired t tests showed that the loss minus win differential FN amplitude was significantly higher for the 50% condition compared with the amplitude for the 75% condition across all participants (t(74) = 3.82, p < 0.001).
P300
The 2 (Prediction: Predicted and Unpredicted) × 2 (Outcome: Win and Loss) × 3 (Group: Controls, CUD+, and CUD−) mixed ANOVA for P300 amplitude indicated a significant main effect for prediction (Unpredicted > Predicted: F(1,72) = 15.42, p < 0.001). All other main effects and other interactions did not reach significance.
The 3 (Winning probability: 25, 50, and 75%) × 2 (Outcome: Win and Loss) × 3 (Group: Controls, CUD+, and CUD−) mixed ANOVA revealed a significant winning probability-by-outcome interaction (F(2,71) = 17.6, p < 0.001), while all other main effects and interactions did not reach significance (p > 0.1). Follow-up paired t tests showed higher P300 amplitude to win compared with loss for the 25% winning probability (t(74) = 3.30, p = 0.002), no difference for the 50% condition (p = 0.65), and higher P300 amplitude to loss compared with win for the 75% winning probability across all participants (t(74) = 4.79, p < 0.001).
Examination of potential confounds
History of cigarette smoking significantly differed between the groups such that most CUD individuals (32 of 50) were current smokers, whereas only 5 of 25 healthy controls were current smokers. Therefore, a subanalysis was performed only in currently nonsmoking participants (20 controls and 18 CUD individuals). In this analysis CUD+ and CUD− individuals were assessed together as a CUD group due to statistical power constraints. For analyses based on subjective prediction, the results were consistent with findings in the entire sample such that there was a significant main effect for outcome (Win > Loss: F(1,36) = 42.05, p < 0.001) and a significant prediction-by-outcome interaction (F(1,36) = 7.81, p = 0.008). The three-way prediction-by-outcome-by-group interaction (F(1,36) = 2.74, p = 0.11) showed a trend in the same direction as in the larger sample, but did not reach significance, likely due to reduced power and fewer degrees of freedom (i.e., the group factor was reduced from three levels to two). Similarly, for analyses based on the objective win probability, the 3 × 2 × 2 mixed ANOVA revealed a significant main effect of outcome (Win > Loss: F(1,36) = 22.16, p < 0.001) and a winning probability-by-outcome interaction (F(1,36) = 3.70, p = 0.038), while all other main effects and interactions did not reach significance (p > 0.1).
Discussion
The current study examined +RPE and −RPE signaling as indexed by the modulation of the FN component of the ERPs by predicted and unpredicted wins and losses and the component's putative impairment in drug addiction. Our results showed increased FN amplitude in response to unpredicted reward relative to predicted reward and decreased FN amplitude in response to unpredicted loss relative to predicted loss in healthy controls, which is consistent with the proposed biphasic corticostriatal dopaminergic activity underlying RPEs (Liu et al., 2007; Maia and Frank, 2011). Indeed, these results corroborate previous findings of FN modulation by unpredicted wins or losses compared with predicted wins or losses (Hajcak et al., 2007; Oliveira et al., 2007; Holroyd et al., 2008) and strengthen the developing literature that highlights sensitivity of the FN amplitude to the bidirectional RPE computation in health (Wu and Zhou, 2009; Pfabigan et al., 2011; also see, Talmi et al., 2013).
The novel results of the current study show impaired RPE signaling in CUD individuals. Specifically, the current results show that FN amplitude in CUD individuals with less-frequent current cocaine use and longer abstinence (i.e., CUD−) did not differ in response to predicted and unpredicted outcomes (i.e., wins and losses), indicative of an overall deficit in RPE computation. Those individuals with more-frequent recent cocaine use and shorter abstinence (i.e., CUD+) showed intact +RPE computation similar to that of healthy controls and impaired −RPE similar to that of CUD− individuals. The results in CUD− individuals are consistent with those of a recent fMRI study that reported attenuated medial orbitofrontal cortex response to a computationally modeled absolute RPE measure in abstinent polysubstance users compared with controls (Tanabe et al., 2013).
The +RPE deficits in CUD− individuals perhaps point to an underlying reduction of tonic dopaminergic transmission in the striatum and reduced dopamine receptor availability resulting in reduced excitability of the reward system, as indicated by an increase in the threshold for brain stimulation for reward (Melis et al., 2005). Acute cocaine administration (as in CUD+ individuals) yields an enhanced dopamine surge, which lowers the threshold for exciting the reward system (Koob et al., 1998) and therefore may also normalize +RPE signaling in the addicted individuals with the most-frequent and most-recent cocaine use. Therefore, the current +RPE results in CUD individuals are consistent with the self-medication hypothesis of addiction (Khantzian, 1997). The temporary suppression of processing deficits by acute cocaine use is not surprising given similar prior results when acutely administering cocaine (Garavan et al., 2008) or methylphenidate (a dopamine partial agonist; Goldstein et al., 2010) to cocaine-addicted individuals. These results also extend our previous findings of more-severe deficits in executive functioning (Woicik et al., 2009) and reward-magnitude sensitivity (Parvaz et al., 2012) in CUD− individuals compared with CUD+ individuals. Note that these results cannot be attributed to more withdrawal in the CUD− individuals as craving and withdrawal were higher in CUD+ individuals.
Most interestingly, the current results highlight deficits in −RPE across all CUD individuals; that is, regardless of the frequency and recency of current cocaine use, there was no impact of prediction on the neural response to loss in CUD individuals. Although novel, these results are not surprising given previous neurochemical reports of sluggish activity in dopaminergic neurons (Volkow et al., 1997; Martinez et al., 2009), further contributing to individual vulnerability to relapse (Hyman et al., 2006; Schultz, 2011). This vulnerability may become more severe with longer abstinence (Tran-Nguyen et al., 1998; Ciccocioppo et al., 2001). Indeed, an impaired response to unpredicted loss in CUD individuals is consistent with findings in other disorders of inhibitory control, such as gambling, where, compared with healthy controls, pathological gamblers show reduced frontostriatal activity on a modified monetary incentive delay task specifically to a loss versus a neutral condition (Balodis et al., 2012).
Alternatively, dysregulation in the serotonin system may also underlie −RPE deficits in CUD individuals. It has previously been suggested that dopamine codes positive and negative prediction errors in tandem with a second opponent system specialized for the negative component of the prediction error, the dorsal raphe serotonin system (Daw et al., 2002). Acute tryptophan depletion, which reduces the availability of central serotonin (5-HT), selectively potentiates punishment prediction (i.e., leaving reward prediction intact; Cools et al., 2008). Interestingly, cocaine-dependent individuals exhibit reduced 5-HT1B receptor availability in several cortical and subcortical regions relative to controls (Matuskey et al., 2014), which may possibly reflect increased endogenous 5-HT levels or 5-HT1B receptor downregulation. Thus, impaired −RPE in CUD individuals may also reflect a primary dysfunction in the serotonin system, or in its interaction with the dopamine system. Lateral habenula (LHb) is one such extrastriatal region that is posited to influence both serotonin (Stern et al., 1981) and striatal dopamine, specifically in dynamic trial-by-trial aversive learning in humans (Lawson et al., 2014) and in nonhuman primates (Matsumoto and Hikosaka, 2007). Thus, the current results of −RPE deficits in CUD individuals are also consistent with recently implicated dysfunctional LHb activity in drug addiction (Velasquez et al., 2014).
Together, the −RPE deficits in both CUD subgroups may reflect an underlying impairment in dopaminergic regulation of learning from disadvantageous experiences, regardless of the recent drug-use pattern. Additionally, this dysregulated dopamine activity may contribute to problems in executive function, such as impaired cognitive control/flexibility (Schoenbaum et al., 2004; Ersche et al., 2008; Liu et al., 2008) and working memory (Tomasi et al., 2007; George et al., 2008; Porter et al., 2011) that are frequently reported in drug addiction. Unlike +RPE signaling, which was intact in CUD+ individuals, deficits in −RPE in all CUD individuals could highlight predisposing factors that may enhance an individual's proclivity toward drug addiction. Alternatively, impaired −RPE could also be a consequence of chronic drug use, but one that is not affected by recent drug use. Future studies could probe the fidelity of −RPE signaling in young adults and at-risk populations to explore whether there are RPE-related deficits that exist before the initiation of drug use or the transition to drug addiction. If this hypothesis is ultimately supported, −RPE could potentially serve as a biomarker for the propensity to develop addiction.
A potential limitation of the current study is that the 25 and 75% winning probability conditions greatly biased the prediction toward predicted loss and predicted win, respectively, and yielded very few unpredicted loss trials for the 25% condition and unpredicted win trials for the 75% condition. Future studies may use other winning probabilities (such as 0, 30, 60, 90, and 100%) to allow increased variation per condition so that RPEs could be evaluated separately for each probability. Second, the task requirement of making a dichotomous prediction on each trial may have changed the meaning of the feedback. For example, when participants correctly predicted a loss, the subsequent negative feedback could simultaneously indicate a correct prediction as well as a loss. However, previous studies have shown that while FN tracks the monetary value of the feedback, it is insensitive to its performance-related aspect (Nieuwenhuis et al., 2004b; Hajcak et al., 2007). Nevertheless, these are the first results using electrocortical biomarkers to elucidate RPE-related disruption in CUD individuals and its association with current drug use. This is important because the increased temporal resolution of neuronal activity afforded by EEG compared with fMRI may be particularly well suited to capture phasic neural processes, such as RPE.
In sum, the current results show absence of a modulation of a direct electrocortical measure of RPE (i.e., the FN) by the prediction of loss in CUD individuals. Similar deficits in both active and abstinent users may be indicative of an underlying trait-like vulnerability to addiction, although this speculation needs to be verified in future longitudinal studies. These results are consequential, as they reflect the neurophysiology underlying persistent disadvantageous behaviors in addicted individuals, such as frequent imprisonment, loss of family and friends, and relapse to drug use, which may reflect the inability of these individuals to learn from unfavorable outcomes. Although most reward processing-related studies investigate the underlying mechanisms of positive prediction error in addiction, the current results highlight the importance of investigating mechanisms underlying the processing of negative prediction error. Moreover, FN modulation by positive reward prediction differed as a function of recency of cocaine use such that it was intact in active users but impaired in cocaine-addicted individuals with recent abstinence. These deficits may thus contribute to relapse risk; that is, to the extent that cocaine may temporarily normalize underlying cognitive, behavioral, and emotional disruptions, abstinent users may revert back to drug use to normalize these impairments.
Footnotes
This work was supported by grants from the National Institute on Drug Abuse (F32DA033088 to M.A.P.; R21DA034954 to R.Z.G.) and from the National Institute on Mental Health (5T32MH019524 to A.B.K.; R01MH090134 to N.A.K.).
The authors declare no competing financial interests.
- Correspondence should be addressed to Rita Z. Goldstein, One Gustave Levy Place, New York, NY 10029. rita.goldstein{at}mssm.edu