The Journal of Neuroscience, October 1, 2008, 28(40):10062-10074; doi:10.1523/JNEUROSCI.0259-08.2008
Previous Article | Next Article 
Behavioral/Systems/Cognitive
A Local Circuit Model of Learned Striatal and Dopamine Cell Responses under Probabilistic Schedules of Reward
Can Ozan Tan and
Daniel Bullock
Cognitive and Neural Systems Department, Boston University, Boston, Massachusetts 02215
Correspondence should be addressed to Daniel Bullock, Cognitive and Neural Systems Department, Boston University, 677 Beacon Street, Boston, MA 02215. Email: danb{at}cns.bu.edu
Recently, dopamine (DA) neurons of the substantia nigra pars compacta (SNc) were found to exhibit sustained responses related to reward uncertainty, in addition to the phasic responses related to reward-prediction errors (RPEs). Thus, cue-dependent anticipations of the timing, magnitude, and uncertainty of rewards are learned and reflected in components of DA signals. Here we simulate a local circuit model to show how learned uncertainty responses are generated, along with phasic RPE responses, on single trials. Both types of simulated DA responses exhibit the empirically observed dependencies on conditional probability, expected value of reward, and time since onset of the reward-predicting cue. The model's three major pathways compute expected values of cues, timed predictions of reward magnitudes, and uncertainties associated with these predictions. The first two pathways' computations refine those modeled by Brown et al. (1999). The third, newly modeled, pathway involves medium spiny projection neurons (MSPNs) of the striatal matrix, whose axons corelease GABA and substance P, both at synapses with GABAergic neurons in the substantia nigra pars reticulata (SNr) and with distal dendrites (in SNr) of DA neurons whose somas are located in ventral SNc. Corelease enables efficient computation of uncertainty responses that are a nonmonotonic function of the conditional probability of reward, and variability in striatal cholinergic transmission can explain observed individual differences in the amplitudes of uncertainty responses. The involvement of matricial MSPNs and cholinergic transmission within the striatum implies a relation between uncertainty in cue–reward contingencies and action-selection functions of the basal ganglia.
Key words: dopamine; substance P; neuropeptide; reinforcement learning; substantia nigra; computational model
Received Jan. 19, 2008;
revised May 31, 2008;
accepted Aug. 29, 2008.
Correspondence should be addressed to Daniel Bullock, Cognitive and Neural Systems Department, Boston University, 677 Beacon Street, Boston, MA 02215. Email: danb{at}cns.bu.edu
This article has been cited by other articles:

|
 |

|
 |
 
T. V. Maia
Reinforcement learning, conditioning, and the brain: Successes and challenges
Cogn Affect Behav Neurosci,
December 1, 2009;
9(4):
343 - 364.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
C. O. Tan
Anticipatory Changes in Regional Cerebral Hemodynamics: A New Role for Dopamine?
J Neurophysiol,
June 1, 2009;
101(6):
2738 - 2740.
[Abstract]
[Full Text]
[PDF]
|
 |
|