Elsevier

NeuroImage

Volume 35, Issue 2, 1 April 2007, Pages 968-978
NeuroImage

Reward expectation modulates feedback-related negativity and EEG spectra

https://doi.org/10.1016/j.neuroimage.2006.11.056Get rights and content

Abstract

The ability to evaluate outcomes of previous decisions is critical to adaptive decision-making. The feedback-related negativity (FRN) is an event-related potential (ERP) modulation that distinguishes losses from wins, but little is known about the effects of outcome probability on these ERP responses. Further, little is known about the frequency characteristics of feedback processing, for example, event-related oscillations and phase synchronizations. Here, we report an EEG experiment designed to address these issues. Subjects engaged in a probabilistic reinforcement learning task in which we manipulated, across blocks, the probability of winning and losing to each of two possible decision options. Behaviorally, all subjects quickly adapted their decision-making to maximize rewards. ERP analyses revealed that the probability of reward modulated neural responses to wins, but not to losses. This was seen both across blocks as well as within blocks, as learning progressed. Frequency decomposition via complex wavelets revealed that EEG responses to losses, compared to wins, were associated with enhanced power and phase coherence in the theta frequency band. As in the ERP analyses, power and phase coherence values following wins but not losses were modulated by reward probability. Some findings between ERP and frequency analyses diverged, suggesting that these analytic approaches provide complementary insights into neural processing. These findings suggest that the neural mechanisms of feedback processing may differ between wins and losses.

Introduction

To optimize behavior, organisms must evaluate outcomes of their actions, and use these evaluations to guide decision-making. The neural mechanisms of feedback evaluation are receiving increasing attention in cognitive neuroscience. In particular, researchers using event-related potentials (ERPs) have identified a component of the feedback-locked ERP that is sensitive to the valence of the feedback. This feedback-related negativity (FRN) is a relatively negative deflection in the ERP following losses or error feedback compared to wins or positive feedback. The FRN peaks at around 300 ms and is maximal at fronto-central scalp electrode sites (Hajcak et al., 2005, Holroyd et al., 2003, Yasuda et al., 2004). Convergent findings from source modeling, fMRI, and single-unit recording studies suggest that the FRN is generated in the medial frontal cortex, and probably in the anterior cingulate cortex (Amiez et al., 2005, Brown and Braver, 2005, Mars et al., 2005, Miltner et al., 2003, Niki and Watanabe, 1979, Paulus et al., 2004, Ridderinkhof et al., 2004, Shidara and Richmond, 2002, Tsujimoto et al., 2006, van Schie et al., 2004, Williams et al., 2004). Topographically and functionally similar feedback-locked ERP modulations have been called the medial frontal negativity and feedback error-related negativity (Gehring and Willoughby, 2002, Holroyd et al., 2003). These effects also share many similarities with the error-related negativity (ERN), a negative-going mid-frontally distributed potential elicited by erroneous responses on speeded response tasks. These potentials are thought to reflect activation of a reinforcement learning system that rapidly evaluates outcomes of decisions to guide reward-seeking behavior (Holroyd and Coles, 2002, Nieuwenhuis et al., 2004). This system is capable of rapidly determining whether feedback is better or worse than expected, and encodes this difference between expectations and actual outcomes as a reward prediction error. The anterior cingulate cortex might use these prediction errors to improve performance due to its role in cognitive control and action monitoring (Barber and Carter, 2005, Bokura et al., 2001, Botvinick et al., 2004, Kerns et al., 2004).

Given that a reward prediction error is the difference between an expected and received reward, differences in expectations of rewards should modulate the size of prediction error signals. Single-unit recording studies in nonhuman primates suggest that this is indeed the case, with more unexpected outcomes yielding larger neural responses in midbrain dopamine neurons (Fiorillo et al., 2003). It is unclear whether the magnitude of the FRN is also modulated by reward expectation, because previous studies have yielded inconsistent findings. In two studies (Holroyd et al., 2003, Yasuda et al., 2004), the magnitude of the FRN was larger when outcomes were unexpected. In another study, no statistically significant modulation was observed (Hajcak et al., 2005), although from visual inspection, it appears that the FRN was larger for unexpected than expected outcomes. Of the two studies that found a significant modulation, Yasuda and colleagues (2004) found that ERPs following both losses and wins were enhanced. In the Holroyd et al. (2003) study, however, it appears from visual inspection that only the win-related ERPs were modulated, although a statistical test of this asymmetry was not reported. We designed an experiment to investigate this issue further by examining not only how reward probability might modulate outcome-locked ERPs, but also how changes in reward expectation that occur during learning might further modulate ERPs.

Because the FRN (and ERPs in general) is measured by averaging single-trial EEG traces, this potential will not reflect oscillatory activity that varies in phase from trial-to-trial (particularly in high frequencies, such as gamma). Such event-related oscillations can be assessed using time–frequency decomposition analyses such as complex wavelet convolutions, from which one can obtain estimates of instantaneous power (i.e., energy at different frequencies) and inter-trial phase coherence (i.e., consistency of oscillation onset across trials). Recent findings using this approach have revealed novel insights into task-related cognitive processes beyond what is evident in averaged ERPs (Fell et al., 2004, Makeig et al., 2002, Salinas and Sejnowski, 2001). Although the frequency characteristics of feedback processing are largely unknown, research into the frequency characteristics of the response-related ERN (Bernat et al., 2005, Luu and Tucker, 2001, Luu et al., 2004, Trujillo and Allen, submitted for publication) suggests it reflects enhanced theta (i.e., 4–8 Hz) activity following incorrect compared to correct responses. Based on the idea that the ERN and FRN reflect similar mechanisms of monitoring and controlling behavior (Holroyd and Coles, 2002), we hypothesized that feedback processing would therefore induce increased EEG theta activity for losses compared to wins.

In the present study, we sought to investigate the effects of reward probability on ERP and oscillatory correlates of neural feedback processing. Subjects chose one of two targets on each trial, and received positive or negative feedback (± 10 cents) following each choice. In blocks of 80–150 trials, we manipulated the probability of winning and losing such that subjects had to learn which of the two targets rewarded more often in order to maximize their winnings. This design allowed us to examine neural responses to winning and losing as a function of the probability of wins and losses, using both conventional ERP and time–frequency analyses.

Section snippets

Subjects

Seventeen (6 males) subjects aged 20–30 from the University of Bonn community participated in this experiment. Subjects were paid the amount they earned in the experiment or 10 Euros per hour (whichever was higher), and typically earned around 25 Euros. Informed consent documents were signed prior to the start of the experiment, which was approved by the local ethics committee.

Experiment

On each of 1200 trials during the experiment, subjects saw two small targets on the left and right side of the screen,

Behavior

Although subjects were not told about changes in probabilities of rewards, they quickly adapted their behavior to find the optimal strategy: During blocks when the right-hand target rewarded 25%, 50%, and 75% of the time, subjects selected the right-hand target on 36.1%, 53.4%, and 71.4% of trials (SEM: 2.1%, 1.4%, 1.7%), respectively (Fig. 1b). A 3-way ANOVA revealed a main effect of probability (F2,32 = 97.70, p < 0.0001), and planned comparisons of the simple effects confirmed that each

Discussion

In the present study, we examined whether and how expectations of rewards and losses affected ERP and oscillatory correlates of feedback processing. We found that ERPs, theta, and gamma activity following wins, but not losses, were modulated by the feedback probability manipulation. This was seen both across and within (i.e., learning effects) blocks of trials. Additionally, we found enhanced power and cross-trial phase coherence in the theta frequency band (4–8 Hz) for losses compared to wins,

Acknowledgments

We thank Erin McMorris for her help running subjects, Juergen Fell and Deborah Hannula for their insightful comments and discussions, and two anonymous reviewers for their comments and suggestions. MXC is supported by a NIDA NRSA.

References (58)

  • Y. Kubota et al.

    Frontal midline theta rhythm is correlated with cardiac autonomic activities during the performance of an attention demanding meditation procedure

    Brain Res. Cogn. Brain Res.

    (2001)
  • P. Luu et al.

    Regulating action: alternating activation of midline frontal and motor cortical networks

    Clin. Neurophysiol.

    (2001)
  • P. Luu et al.

    Frontal midline theta and the error-related negativity: neurophysiological mechanisms of action regulation

    Clin. Neurophysiol.

    (2004)
  • R.B. Mars et al.

    Neural dynamics of error processing in medial frontal cortex

    NeuroImage

    (2005)
  • W.H. Miltner et al.

    Implementation of error-processing in the human anterior cingulate cortex: a source analysis of the magnetic equivalent of the error-related negativity

    Biol. Psychol.

    (2003)
  • S. Nieuwenhuis et al.

    Reinforcement-related brain potentials from medial frontal cortex: origins and functional significance

    Neurosci. Biobehav. Rev.

    (2004)
  • H. Niki et al.

    Prefrontal and cingulate unit activity during timing behavior in the monkey

    Brain Res.

    (1979)
  • J. Onton et al.

    Frontal midline EEG dynamics during working memory

    NeuroImage

    (2005)
  • M.P. Paulus et al.

    Anterior cingulate activation in high trait anxious subjects is related to altered error processing during decision making

    Biol. Psychiatry

    (2004)
  • J. Polich et al.

    Cognitive and biological determinants of P300: an integrative review

    Biol. Psychol.

    (1995)
  • T. Shibata et al.

    Event-related dynamics of the gamma-band oscillation in the human brain: information processing during a GO/NOGO hand movement task

    Neurosci. Res.

    (1999)
  • M. Steriade

    Grouping of brain rhythms in corticothalamic systems

    Neuroscience

    (2006)
  • J. Yordanova et al.

    Parallel systems of error processing in the brain

    NeuroImage

    (2004)
  • C. Amiez et al.

    Anterior cingulate error-related activity is modulated by predicted reward

    Eur. J. Neurosci.

    (2005)
  • A.D. Barber et al.

    Cognitive control involved in overcoming prepotent response tendencies and switching between tasks

    Cereb. Cortex

    (2005)
  • J.W. Brown et al.

    Learned predictions of error likelihood in the anterior cingulate cortex

    Science

    (2005)
  • E. Donchin et al.

    Is the P300 component a manifestation of context updating?

    Behav. Brain Sci.

    (1988)
  • J. Fell et al.

    Neural bases of cognitive ERPs: more than phase reset

    J. Cogn. Neurosci.

    (2004)
  • C.D. Fiorillo et al.

    Discrete coding of reward probability and uncertainty by dopamine neurons

    Science

    (2003)
  • Cited by (465)

    View all citing articles on Scopus
    View full text