 |
The Journal of Neuroscience, November 21, 2007, 27(47):12860-12867; doi:10.1523/JNEUROSCI.2496-07.2007
Previous Article | Next Article 
Behavioral/Systems/Cognitive
Reinforcement Learning Signals in the Human Striatum Distinguish Learners from Nonlearners during Reward-Based Decision Making
Tom Schönberg,1
Nathaniel D. Daw,2
Daphna Joel,1 and
John P. O'Doherty3
1Department of Psychology, Tel Aviv University, Tel Aviv 69978, Israel, 2Center for Neural Science and Department of Psychology, New York University, New York, New York 10003, and 3Division of Humanities and Social Sciences and Computation and Neural Systems Program, Caltech, Pasadena, California 91125
Correspondence should be addressed to Tom Schönberg, Department of Psychology, Tel Aviv University, Tel Aviv 69978, Israel. Email: tom{at}post.tau.ac.il
The computational framework of reinforcement learning has been used to forward our understanding of the neural mechanisms underlying reward learning and decision-making behavior. It is known that humans vary widely in their performance in decision-making tasks. Here, we used a simple four-armed bandit task in which subjects are almost evenly split into two groups on the basis of their performance: those who do learn to favor choice of the optimal action and those who do not. Using models of reinforcement learning we sought to determine the neural basis of these intrinsic differences in performance by scanning both groups with functional magnetic resonance imaging. We scanned 29 subjects while they performed the reward-based decision-making task. Our results suggest that these two groups differ markedly in the degree to which reinforcement learning signals in the striatum are engaged during task performance. While the learners showed robust prediction error signals in both the ventral and dorsal striatum during learning, the nonlearner group showed a marked absence of such signals. Moreover, the magnitude of prediction error signals in a region of dorsal striatum correlated significantly with a measure of behavioral performance across all subjects. These findings support a crucial role of prediction error signals, likely originating from dopaminergic midbrain neurons, in enabling learning of action selection preferences on the basis of obtained rewards. Thus, spontaneously observed individual differences in decision making performance demonstrate the suggested dependence of this type of learning on the functional integrity of the dopaminergic striatal system in humans.
Key words: prediction errors; computational models; fMRI; basal ganglia; instrumental conditioning; associative learning
Received June 1, 2007;
revised Oct. 5, 2007;
accepted Oct. 6, 2007.
Correspondence should be addressed to Tom Schönberg, Department of Psychology, Tel Aviv University, Tel Aviv 69978, Israel. Email: tom{at}post.tau.ac.il
This article has been cited by other articles:

|
 |

|
 |
 
T. V. Maia
Reinforcement learning, conditioning, and the brain: Successes and challenges
Cogn Affect Behav Neurosci,
December 1, 2009;
9(4):
343 - 364.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
S. J. Gershman, B. Pesaran, and N. D. Daw
Human Reinforcement Learning Subdivides Structured Action Spaces by Learning Effector-Specific Values
J. Neurosci.,
October 28, 2009;
29(43):
13524 - 13531.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. K. Krugel, G. Biele, P. N. C. Mohr, S.-C. Li, and H. R. Heekeren
Genetic variation in dopaminergic neuromodulation influences the ability to rapidly and flexibly adapt decisions
PNAS,
October 20, 2009;
106(42):
17951 - 17956.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Cools, M. J. Frank, S. E. Gibbs, A. Miyakawa, W. Jagust, and M. D'Esposito
Striatal Dopamine Predicts Outcome-Specific Reversal Learning and Its Sensitivity to Dopaminergic Drug Administration
J. Neurosci.,
February 4, 2009;
29(5):
1538 - 1543.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. W. Weickert, T. E. Goldberg, J. H. Callicott, Q. Chen, J. A. Apud, S. Das, B. J. Zoltick, M. F. Egan, M. Meeter, C. Myers, et al.
Neural Correlates of Probabilistic Category Learning in Patients with Schizophrenia
J. Neurosci.,
January 28, 2009;
29(4):
1244 - 1254.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. R Delgado, J. Li, D. Schiller, and E. A Phelps
The role of the striatum in aversive learning and aversive prediction errors
Phil Trans R Soc B,
December 12, 2008;
363(1511):
3787 - 3800.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|

|