Midbrain dopamine neurons encode a quantitative reward prediction error signal

Hannah M Bayer; Paul W Glimcher

doi:10.1016/j.neuron.2005.05.020

Midbrain dopamine neurons encode a quantitative reward prediction error signal

Neuron. 2005 Jul 7;47(1):129-41. doi: 10.1016/j.neuron.2005.05.020.

Authors

Hannah M Bayer¹, Paul W Glimcher

Affiliation

¹ Center for Neural Science, New York University, New York, NY 10003, USA.

Abstract

The midbrain dopamine neurons are hypothesized to provide a physiological correlate of the reward prediction error signal required by current models of reinforcement learning. We examined the activity of single dopamine neurons during a task in which subjects learned by trial and error when to make an eye movement for a juice reward. We found that these neurons encoded the difference between the current reward and a weighted average of previous rewards, a reward prediction error, but only for outcomes that were better than expected. Thus, the firing rate of midbrain dopamine neurons is quantitatively predicted by theoretical descriptions of the reward prediction error signal used in reinforcement learning models for circumstances in which this signal has a positive value. We also found that the dopamine system continued to compute the reward prediction error even when the behavioral policy of the animal was only weakly influenced by this computation.

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Algorithms
Animals
Conditioning, Operant / physiology
Cues
Dopamine / physiology*
Macaca mulatta
Male
Mesencephalon / cytology
Mesencephalon / physiology*
Neurons / physiology*
Photic Stimulation
Reinforcement Schedule
Reward*
Saccades / physiology

Substances

Dopamine

Abstract

Publication types

MeSH terms

Substances

Grants and funding