Reward expectation modulates feedback-related negativity and EEG spectra

doi:10.1016/j.neuroimage.2006.11.056

NeuroImage

Volume 35, Issue 2, 1 April 2007, Pages 968-978

https://doi.org/10.1016/j.neuroimage.2006.11.056 Get rights and content

Abstract

The ability to evaluate outcomes of previous decisions is critical to adaptive decision-making. The feedback-related negativity (FRN) is an event-related potential (ERP) modulation that distinguishes losses from wins, but little is known about the effects of outcome probability on these ERP responses. Further, little is known about the frequency characteristics of feedback processing, for example, event-related oscillations and phase synchronizations. Here, we report an EEG experiment designed to address these issues. Subjects engaged in a probabilistic reinforcement learning task in which we manipulated, across blocks, the probability of winning and losing to each of two possible decision options. Behaviorally, all subjects quickly adapted their decision-making to maximize rewards. ERP analyses revealed that the probability of reward modulated neural responses to wins, but not to losses. This was seen both across blocks as well as within blocks, as learning progressed. Frequency decomposition via complex wavelets revealed that EEG responses to losses, compared to wins, were associated with enhanced power and phase coherence in the theta frequency band. As in the ERP analyses, power and phase coherence values following wins but not losses were modulated by reward probability. Some findings between ERP and frequency analyses diverged, suggesting that these analytic approaches provide complementary insights into neural processing. These findings suggest that the neural mechanisms of feedback processing may differ between wins and losses.

Introduction

To optimize behavior, organisms must evaluate outcomes of their actions, and use these evaluations to guide decision-making. The neural mechanisms of feedback evaluation are receiving increasing attention in cognitive neuroscience. In particular, researchers using event-related potentials (ERPs) have identified a component of the feedback-locked ERP that is sensitive to the valence of the feedback. This feedback-related negativity (FRN) is a relatively negative deflection in the ERP following losses or error feedback compared to wins or positive feedback. The FRN peaks at around 300 ms and is maximal at fronto-central scalp electrode sites (Hajcak et al., 2005, Holroyd et al., 2003, Yasuda et al., 2004). Convergent findings from source modeling, fMRI, and single-unit recording studies suggest that the FRN is generated in the medial frontal cortex, and probably in the anterior cingulate cortex (Amiez et al., 2005, Brown and Braver, 2005, Mars et al., 2005, Miltner et al., 2003, Niki and Watanabe, 1979, Paulus et al., 2004, Ridderinkhof et al., 2004, Shidara and Richmond, 2002, Tsujimoto et al., 2006, van Schie et al., 2004, Williams et al., 2004). Topographically and functionally similar feedback-locked ERP modulations have been called the medial frontal negativity and feedback error-related negativity (Gehring and Willoughby, 2002, Holroyd et al., 2003). These effects also share many similarities with the error-related negativity (ERN), a negative-going mid-frontally distributed potential elicited by erroneous responses on speeded response tasks. These potentials are thought to reflect activation of a reinforcement learning system that rapidly evaluates outcomes of decisions to guide reward-seeking behavior (Holroyd and Coles, 2002, Nieuwenhuis et al., 2004). This system is capable of rapidly determining whether feedback is better or worse than expected, and encodes this difference between expectations and actual outcomes as a reward prediction error. The anterior cingulate cortex might use these prediction errors to improve performance due to its role in cognitive control and action monitoring (Barber and Carter, 2005, Bokura et al., 2001, Botvinick et al., 2004, Kerns et al., 2004).

Given that a reward prediction error is the difference between an expected and received reward, differences in expectations of rewards should modulate the size of prediction error signals. Single-unit recording studies in nonhuman primates suggest that this is indeed the case, with more unexpected outcomes yielding larger neural responses in midbrain dopamine neurons (Fiorillo et al., 2003). It is unclear whether the magnitude of the FRN is also modulated by reward expectation, because previous studies have yielded inconsistent findings. In two studies (Holroyd et al., 2003, Yasuda et al., 2004), the magnitude of the FRN was larger when outcomes were unexpected. In another study, no statistically significant modulation was observed (Hajcak et al., 2005), although from visual inspection, it appears that the FRN was larger for unexpected than expected outcomes. Of the two studies that found a significant modulation, Yasuda and colleagues (2004) found that ERPs following both losses and wins were enhanced. In the Holroyd et al. (2003) study, however, it appears from visual inspection that only the win-related ERPs were modulated, although a statistical test of this asymmetry was not reported. We designed an experiment to investigate this issue further by examining not only how reward probability might modulate outcome-locked ERPs, but also how changes in reward expectation that occur during learning might further modulate ERPs.

Because the FRN (and ERPs in general) is measured by averaging single-trial EEG traces, this potential will not reflect oscillatory activity that varies in phase from trial-to-trial (particularly in high frequencies, such as gamma). Such event-related oscillations can be assessed using time–frequency decomposition analyses such as complex wavelet convolutions, from which one can obtain estimates of instantaneous power (i.e., energy at different frequencies) and inter-trial phase coherence (i.e., consistency of oscillation onset across trials). Recent findings using this approach have revealed novel insights into task-related cognitive processes beyond what is evident in averaged ERPs (Fell et al., 2004, Makeig et al., 2002, Salinas and Sejnowski, 2001). Although the frequency characteristics of feedback processing are largely unknown, research into the frequency characteristics of the response-related ERN (Bernat et al., 2005, Luu and Tucker, 2001, Luu et al., 2004, Trujillo and Allen, submitted for publication) suggests it reflects enhanced theta (i.e., 4–8 Hz) activity following incorrect compared to correct responses. Based on the idea that the ERN and FRN reflect similar mechanisms of monitoring and controlling behavior (Holroyd and Coles, 2002), we hypothesized that feedback processing would therefore induce increased EEG theta activity for losses compared to wins.

In the present study, we sought to investigate the effects of reward probability on ERP and oscillatory correlates of neural feedback processing. Subjects chose one of two targets on each trial, and received positive or negative feedback (± 10 cents) following each choice. In blocks of 80–150 trials, we manipulated the probability of winning and losing such that subjects had to learn which of the two targets rewarded more often in order to maximize their winnings. This design allowed us to examine neural responses to winning and losing as a function of the probability of wins and losses, using both conventional ERP and time–frequency analyses.

Section snippets

Subjects

Seventeen (6 males) subjects aged 20–30 from the University of Bonn community participated in this experiment. Subjects were paid the amount they earned in the experiment or 10 Euros per hour (whichever was higher), and typically earned around 25 Euros. Informed consent documents were signed prior to the start of the experiment, which was approved by the local ethics committee.

Experiment

On each of 1200 trials during the experiment, subjects saw two small targets on the left and right side of the screen,

Behavior

Although subjects were not told about changes in probabilities of rewards, they quickly adapted their behavior to find the optimal strategy: During blocks when the right-hand target rewarded 25%, 50%, and 75% of the time, subjects selected the right-hand target on 36.1%, 53.4%, and 71.4% of trials (SEM: 2.1%, 1.4%, 1.7%), respectively (Fig. 1b). A 3-way ANOVA revealed a main effect of probability (F_2,32 = 97.70, p < 0.0001), and planned comparisons of the simple effects confirmed that each

Discussion

In the present study, we examined whether and how expectations of rewards and losses affected ERP and oscillatory correlates of feedback processing. We found that ERPs, theta, and gamma activity following wins, but not losses, were modulated by the feedback probability manipulation. This was seen both across and within (i.e., learning effects) blocks of trials. Additionally, we found enhanced power and cross-trial phase coherence in the theta frequency band (4–8 Hz) for losses compared to wins,

Acknowledgments

We thank Erin McMorris for her help running subjects, Juergen Fell and Deborah Hannula for their insightful comments and discussions, and two anonymous reviewers for their comments and suggestions. MXC is supported by a NIDA NRSA.

References (58)

C. Basar-Eroglu et al.
Event-related theta oscillations: an integrative and comparative approach in the human and animal brain
Int. J. Psychophysiol.
(2001)
C. Basar-Eroglu et al.
P300-response: possible psychophysiological correlates in delta and theta frequency channels. A review
Int. J. Psychophysiol.
(1992)
E.M. Bernat et al.
Decomposing ERP time–frequency energy using PCA
Clin. Neurophysiol.
(2005)
H. Bokura et al.
Electrophysiological correlates for response inhibition in a Go/NoGo task
Clin. Neurophysiol.
(2001)
M.M. Botvinick et al.
Conflict monitoring and anterior cingulate cortex: an update
Trends Cogn. Sci.
(2004)
A. Delorme et al.
EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
J. Neurosci. Methods
(2004)
G. Hajcak et al.
The feedback-related negativity reflects the binary evaluation of good versus bad outcomes
Biol. Psychol.
(2006)
A. Keil et al.
Human large-scale oscillatory brain activity during an operant shaping procedure
Brain Res. Cogn. Brain Res.
(2001)
I.J. Kirk et al.
The role of theta-range oscillations in synchronising and integrating activity in distributed mnemonic networks
Cortex
(2003)
W. Klimesch
EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis
Brain Res. Brain Res. Rev.
(1999)

C.D. Fiorillo et al.

Discrete coding of reward probability and uncertainty by dopamine neurons

Science

(2003)

Cited by (465)

Exploration of the influence of the quantification method and reference scheme on feedback-related negativity and standardized measurement error of feedback-related negativity amplitudes in a trust game
2024, Cortex
Various approaches have been taken over the years to quantify event-related potential (ERP) responses and these approaches may vary in their utility connecting empirical research and scientific claims. In this work we compared different quantification methods as well as the influence of three reference methods (linked mastoids, average reference, and current source density) on the resulting ERP amplitude. We use the experimental effects and effect sizes (Cohen's d) to evaluate the different methodological variants and we calculate intraclass correlation coefficients (ICC). In addition, the bootstrapped standard error of the means (SME, Luck et al., 2021), which was recently suggested as a quality criterion for ERP research, is used for this purpose. Our example for an ERP is the feedback-related negativity (FRN) to feedback about trustee behavior in a trust game with participants in the trustor position. We found that the quantification methods concerning the FRN influenced the absolute value of condition effects in the experimental paradigm. Yet, the patterns of effects were detected by all chosen methods, except for the ‘individual difference wave’-based peak window approach. In addition, our findings stress the importance of checking the reference electrodes concerning effects of the experimental conditions. Furthermore, interactions of topographical distribution and reference choice should be considered. Finally, we were able to show that the SME is lower for more datapoints that are given in the quantification period of the FRN, and higher for more negative FRN amplitudes. These biases may lead to divergence of SME and effect size detection. Therefore, if the SME was used to compare different processing choices one should consider controlling for these important aspects of the data and possibly include other quality criteria like effect sizes.
Distinct spatiotemporal brainstem pathways of outcome valence during reward- and punishment-based learning
2023, Cell Reports
Learning to seek rewards and avoid punishments, based on positive and negative choice outcomes, is essential for human survival. Yet, the neural underpinnings of outcome valence in the human brainstem and the extent to which they differ in reward and punishment learning contexts remain largely elusive. Here, using simultaneously acquired electroencephalography and functional magnetic resonance imaging data, we show that during reward learning the substantia nigra (SN)/ventral tegmental area (VTA) and locus coeruleus are initially activated following negative outcomes, while the VTA subsequently re-engages exhibiting greater responses for positive than negative outcomes, consistent with an early arousal/avoidance response and a later value-updating process, respectively. During punishment learning, we show that distinct raphe nucleus and SN subregions are activated only by negative outcomes with a sustained post-outcome activity across time, supporting the involvement of these brainstem subregions in avoidance behavior. Finally, we demonstrate that the coupling of these brainstem structures with other subcortical and cortical areas helps to shape participants’ serial choice behavior in each context.
Working memory training for reward processing in university students with subsyndromal depression: The influence of baseline severity of depression
2023, Biological Psychology
Previous studies have tentatively suggested that working memory training (WMT) has the potential to improve reward processing, but it is not known how long this improvement lasts, whether there is a lag effect, or whether it is reflected in neurophysiological indicators. In this study, 40 university students with subsyndromal depression were randomly assigned to a training group or a control group and completed a 20-day working memory training task and a simple memory task, respectively. All participants completed the Temporal Experience of Pleasure Scale (TEPS) and a doors task with electroencephalogram (EEG) signals recorded simultaneously on a pre- and post-test and a 3-month follow-up. The reward-related positivity (RewP) amplitude, theta power, and their differences between conditions (i.e., ΔRewP and Δtheta power, respectively) in the doors task were the primary outcomes, and the score on TEPS was the secondary outcome. The results indicated no group-related effects were demonstrated in primary and secondary outcomes at post-test and 3-month follow-up. Furthermore, the differences in the pre- and post-test in Δtheta power were moderated by the baseline severity of depression. This was primarily driven by the fact that the change values in the control group increased with the severity of depression, while the change values in the training group had high homogeneity. Our findings did not provide support for the effect of WMT on reward processing across the whole sample, but without intervention, there would be high heterogeneity in the change in the cognitive control ability to loss feedback, which is detrimental to individuals with high depression severity.
Neural symphony of risky decision making in children with ADHD: Insights from transcranial alternating current stimulation and cognitive modeling
2023, Neurophysiologie Clinique
The ventromedial prefrontal cortex (vmPFC) and dorsolateral prefrontal cortex (dlPFC) are key brain regions involved in risky decision making, affected in individuals with attention deficit hyperactivity disorder (ADHD). This study aims to examine how entrainment of these areas impacts the process and outcome of risky decision making in children with ADHD.
Eighteen children with ADHD performed the balloon analogue risk-taking task (BART) during five different sessions of tACS (1.5 mA, 6 Hz), separated by one-week intervals, via (1) two channels with synchronized stimulation over the left dlPFC and right vmPFC, (2) the same electrode placement with anti-phase stimulation, (3) stimulation over the left dlPFC only, (4) stimulation over right vmPFC only, and (5) sham stimulation. Four-parameter and constant-sensitivity models were used to model the data.
The study showed that synchronized stimulation was associated with a reduction in positive prior belief, risk propensity, and deterministic selection. Desynchronized stimulation was associated with accelerated learning from initial selections. Isolated stimulation of the dlPFC leads to riskier decision enhanced learning updates and risk propensity, whereas isolated stimulation of the vmPFC facilitated faster learning and increased probabilistic selection.
The results highlight the important roles of the dlPFC and vmPFC and their communication in decision making, showcasing their impact on various aspects of the decision-making process. The findings provide valuable insights into the complex interplay between cognitive and emotional factors in shaping our choices.
Freedom of choice boosts midfrontal theta power during affective feedback processing of goal-directed actions
2023, Biological Psychology
Sense of agency, the feeling of being in control of one’s actions and their effects, is particularly relevant during goal-directed actions. During feedback learning, action effects provide information about the best course of action to reinforce positive and prevent negative outcomes. However, it is unclear whether agency experience selectively affects the processing of negative or positive feedback during the performance of goal-directed actions. As an important marker of feedback processing, we examined agency-related changes in midfrontal oscillatory activity in response to performance feedback using electroencephalography. Thirty-three participants completed a reinforcement learning task during which they received positive (monetary gain) or negative (monetary loss) feedback following item choices made either by themselves (free-choice) or by the computer (forced-choice). Independent of choice context, midfrontal theta activity was more enhanced for negative than positive feedback. In addition, free, compared to forced choices increased midfrontal theta power for both gain and loss feedback. These results indicate that freedom of choice in a motivationally salient learning task leads to a general enhancement in the processing of affective action outcomes. Our findings contribute to an understanding of the neuronal mechanisms underlying agency-related changes during action regulation and indicate midfrontal theta activity as a neurophysiological marker important for the monitoring of affective action outcomes, irrespective of feedback valence.
Effort and Appetitive Responding in Depression: Examining Deficits in Motivational and Consummatory Stages of Reward Processing Using the Effort-Doors Task
2023, Biological Psychiatry Global Open Science
Reward sensitivity is a dimensional construct central to understanding the nature of depression. Psychophysiological research on this construct has primarily focused on the reward positivity, an event-related potential (ERP) that indexes consummatory reward sensitivity. This study extended prior research by focusing on ERPs that index the motivational component of reward.
A novel effort-for-reward task was used to elicit motivational and consummatory ERPs. Groups consisting of 34 participants with depression and 32 participants without depression were compared across a range of reward-related ERPs.
Participants with depression exhibited reduced responsivity to effort completion cues following high effort expenditure, reduced anticipation of rewards after low effort expenditure (i.e., the stimulus preceding negativity), and reduced reward positivity following high effort expenditure. ERPs occurring prior to reward receipt accounted for unique variance in depression status and differentiated between subgroups of depressed individuals.
Findings support the utility of leveraging multiple ERPs that index separate reward processing deficits to better characterize depression and depressive subtypes.

View all citing articles on Scopus

View full text

Reward expectation modulates feedback-related negativity and EEG spectra

Abstract

Introduction

Section snippets

Subjects

Experiment

Behavior

Discussion

Acknowledgments

Int. J. Psychophysiol.

Int. J. Psychophysiol.

Clin. Neurophysiol.

Clin. Neurophysiol.

Trends Cogn. Sci.

J. Neurosci. Methods

Biol. Psychol.

Brain Res. Cogn. Brain Res.

Cortex

Brain Res. Brain Res. Rev.

Brain Res. Cogn. Brain Res.

Clin. Neurophysiol.

Clin. Neurophysiol.

NeuroImage

Biol. Psychol.

Neurosci. Biobehav. Rev.

Brain Res.

NeuroImage

Biol. Psychiatry

Biol. Psychol.

Neurosci. Res.

Neuroscience

NeuroImage

Anterior cingulate error-related activity is modulated by predicted reward

Eur. J. Neurosci.

Cognitive control involved in overcoming prepotent response tendencies and switching between tasks

Cereb. Cortex

Learned predictions of error likelihood in the anterior cingulate cortex

Science

Is the P300 component a manifestation of context updating?

Behav. Brain Sci.

Neural bases of cognitive ERPs: more than phase reset

J. Cogn. Neurosci.

Discrete coding of reward probability and uncertainty by dopamine neurons

Science