Reward expectation modulates feedback-related negativity and EEG spectra
Introduction
To optimize behavior, organisms must evaluate outcomes of their actions, and use these evaluations to guide decision-making. The neural mechanisms of feedback evaluation are receiving increasing attention in cognitive neuroscience. In particular, researchers using event-related potentials (ERPs) have identified a component of the feedback-locked ERP that is sensitive to the valence of the feedback. This feedback-related negativity (FRN) is a relatively negative deflection in the ERP following losses or error feedback compared to wins or positive feedback. The FRN peaks at around 300 ms and is maximal at fronto-central scalp electrode sites (Hajcak et al., 2005, Holroyd et al., 2003, Yasuda et al., 2004). Convergent findings from source modeling, fMRI, and single-unit recording studies suggest that the FRN is generated in the medial frontal cortex, and probably in the anterior cingulate cortex (Amiez et al., 2005, Brown and Braver, 2005, Mars et al., 2005, Miltner et al., 2003, Niki and Watanabe, 1979, Paulus et al., 2004, Ridderinkhof et al., 2004, Shidara and Richmond, 2002, Tsujimoto et al., 2006, van Schie et al., 2004, Williams et al., 2004). Topographically and functionally similar feedback-locked ERP modulations have been called the medial frontal negativity and feedback error-related negativity (Gehring and Willoughby, 2002, Holroyd et al., 2003). These effects also share many similarities with the error-related negativity (ERN), a negative-going mid-frontally distributed potential elicited by erroneous responses on speeded response tasks. These potentials are thought to reflect activation of a reinforcement learning system that rapidly evaluates outcomes of decisions to guide reward-seeking behavior (Holroyd and Coles, 2002, Nieuwenhuis et al., 2004). This system is capable of rapidly determining whether feedback is better or worse than expected, and encodes this difference between expectations and actual outcomes as a reward prediction error. The anterior cingulate cortex might use these prediction errors to improve performance due to its role in cognitive control and action monitoring (Barber and Carter, 2005, Bokura et al., 2001, Botvinick et al., 2004, Kerns et al., 2004).
Given that a reward prediction error is the difference between an expected and received reward, differences in expectations of rewards should modulate the size of prediction error signals. Single-unit recording studies in nonhuman primates suggest that this is indeed the case, with more unexpected outcomes yielding larger neural responses in midbrain dopamine neurons (Fiorillo et al., 2003). It is unclear whether the magnitude of the FRN is also modulated by reward expectation, because previous studies have yielded inconsistent findings. In two studies (Holroyd et al., 2003, Yasuda et al., 2004), the magnitude of the FRN was larger when outcomes were unexpected. In another study, no statistically significant modulation was observed (Hajcak et al., 2005), although from visual inspection, it appears that the FRN was larger for unexpected than expected outcomes. Of the two studies that found a significant modulation, Yasuda and colleagues (2004) found that ERPs following both losses and wins were enhanced. In the Holroyd et al. (2003) study, however, it appears from visual inspection that only the win-related ERPs were modulated, although a statistical test of this asymmetry was not reported. We designed an experiment to investigate this issue further by examining not only how reward probability might modulate outcome-locked ERPs, but also how changes in reward expectation that occur during learning might further modulate ERPs.
Because the FRN (and ERPs in general) is measured by averaging single-trial EEG traces, this potential will not reflect oscillatory activity that varies in phase from trial-to-trial (particularly in high frequencies, such as gamma). Such event-related oscillations can be assessed using time–frequency decomposition analyses such as complex wavelet convolutions, from which one can obtain estimates of instantaneous power (i.e., energy at different frequencies) and inter-trial phase coherence (i.e., consistency of oscillation onset across trials). Recent findings using this approach have revealed novel insights into task-related cognitive processes beyond what is evident in averaged ERPs (Fell et al., 2004, Makeig et al., 2002, Salinas and Sejnowski, 2001). Although the frequency characteristics of feedback processing are largely unknown, research into the frequency characteristics of the response-related ERN (Bernat et al., 2005, Luu and Tucker, 2001, Luu et al., 2004, Trujillo and Allen, submitted for publication) suggests it reflects enhanced theta (i.e., 4–8 Hz) activity following incorrect compared to correct responses. Based on the idea that the ERN and FRN reflect similar mechanisms of monitoring and controlling behavior (Holroyd and Coles, 2002), we hypothesized that feedback processing would therefore induce increased EEG theta activity for losses compared to wins.
In the present study, we sought to investigate the effects of reward probability on ERP and oscillatory correlates of neural feedback processing. Subjects chose one of two targets on each trial, and received positive or negative feedback (± 10 cents) following each choice. In blocks of 80–150 trials, we manipulated the probability of winning and losing such that subjects had to learn which of the two targets rewarded more often in order to maximize their winnings. This design allowed us to examine neural responses to winning and losing as a function of the probability of wins and losses, using both conventional ERP and time–frequency analyses.
Section snippets
Subjects
Seventeen (6 males) subjects aged 20–30 from the University of Bonn community participated in this experiment. Subjects were paid the amount they earned in the experiment or 10 Euros per hour (whichever was higher), and typically earned around 25 Euros. Informed consent documents were signed prior to the start of the experiment, which was approved by the local ethics committee.
Experiment
On each of 1200 trials during the experiment, subjects saw two small targets on the left and right side of the screen,
Behavior
Although subjects were not told about changes in probabilities of rewards, they quickly adapted their behavior to find the optimal strategy: During blocks when the right-hand target rewarded 25%, 50%, and 75% of the time, subjects selected the right-hand target on 36.1%, 53.4%, and 71.4% of trials (SEM: 2.1%, 1.4%, 1.7%), respectively (Fig. 1b). A 3-way ANOVA revealed a main effect of probability (F2,32 = 97.70, p < 0.0001), and planned comparisons of the simple effects confirmed that each
Discussion
In the present study, we examined whether and how expectations of rewards and losses affected ERP and oscillatory correlates of feedback processing. We found that ERPs, theta, and gamma activity following wins, but not losses, were modulated by the feedback probability manipulation. This was seen both across and within (i.e., learning effects) blocks of trials. Additionally, we found enhanced power and cross-trial phase coherence in the theta frequency band (4–8 Hz) for losses compared to wins,
Acknowledgments
We thank Erin McMorris for her help running subjects, Juergen Fell and Deborah Hannula for their insightful comments and discussions, and two anonymous reviewers for their comments and suggestions. MXC is supported by a NIDA NRSA.
References (58)
- et al.
Event-related theta oscillations: an integrative and comparative approach in the human and animal brain
Int. J. Psychophysiol.
(2001) - et al.
P300-response: possible psychophysiological correlates in delta and theta frequency channels. A review
Int. J. Psychophysiol.
(1992) - et al.
Decomposing ERP time–frequency energy using PCA
Clin. Neurophysiol.
(2005) - et al.
Electrophysiological correlates for response inhibition in a Go/NoGo task
Clin. Neurophysiol.
(2001) - et al.
Conflict monitoring and anterior cingulate cortex: an update
Trends Cogn. Sci.
(2004) - et al.
EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
J. Neurosci. Methods
(2004) - et al.
The feedback-related negativity reflects the binary evaluation of good versus bad outcomes
Biol. Psychol.
(2006) - et al.
Human large-scale oscillatory brain activity during an operant shaping procedure
Brain Res. Cogn. Brain Res.
(2001) - et al.
The role of theta-range oscillations in synchronising and integrating activity in distributed mnemonic networks
Cortex
(2003) EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis
Brain Res. Brain Res. Rev.
(1999)
Frontal midline theta rhythm is correlated with cardiac autonomic activities during the performance of an attention demanding meditation procedure
Brain Res. Cogn. Brain Res.
Regulating action: alternating activation of midline frontal and motor cortical networks
Clin. Neurophysiol.
Frontal midline theta and the error-related negativity: neurophysiological mechanisms of action regulation
Clin. Neurophysiol.
Neural dynamics of error processing in medial frontal cortex
NeuroImage
Implementation of error-processing in the human anterior cingulate cortex: a source analysis of the magnetic equivalent of the error-related negativity
Biol. Psychol.
Reinforcement-related brain potentials from medial frontal cortex: origins and functional significance
Neurosci. Biobehav. Rev.
Prefrontal and cingulate unit activity during timing behavior in the monkey
Brain Res.
Frontal midline EEG dynamics during working memory
NeuroImage
Anterior cingulate activation in high trait anxious subjects is related to altered error processing during decision making
Biol. Psychiatry
Cognitive and biological determinants of P300: an integrative review
Biol. Psychol.
Event-related dynamics of the gamma-band oscillation in the human brain: information processing during a GO/NOGO hand movement task
Neurosci. Res.
Grouping of brain rhythms in corticothalamic systems
Neuroscience
Parallel systems of error processing in the brain
NeuroImage
Anterior cingulate error-related activity is modulated by predicted reward
Eur. J. Neurosci.
Cognitive control involved in overcoming prepotent response tendencies and switching between tasks
Cereb. Cortex
Learned predictions of error likelihood in the anterior cingulate cortex
Science
Is the P300 component a manifestation of context updating?
Behav. Brain Sci.
Neural bases of cognitive ERPs: more than phase reset
J. Cogn. Neurosci.
Discrete coding of reward probability and uncertainty by dopamine neurons
Science
Cited by (465)
Effort and Appetitive Responding in Depression: Examining Deficits in Motivational and Consummatory Stages of Reward Processing Using the Effort-Doors Task
2023, Biological Psychiatry Global Open Science