A cAMP Pathway Underlying Reward Prediction in Associative Learning

Mazen A. Kheirbek; Jeff A. Beeler; Yoshihiro Ishikawa; Xiaoxi Zhuang

doi:10.1523/JNEUROSCI.4115-08.2008

Abstract

In associative learning, animals learn to associate external cues or their own actions with appetitive or aversive outcomes. Although the dopamine (DA) system and the striatum/nucleus accumbens have been implicated in both the pavlovian and instrumental form of associative learning, whether specific neuronal signaling mechanisms underlie one form or the other is unknown. Here, we report that the striatum-enriched isoform of adenylyl cyclase (AC), AC5, is selectively required for appetitive pavlovian learning. Mice with genetic deletion of AC5 (AC5KO) acquired instrumental responding yet were unable to use cues that predicted reward delivery. The specificity of this deficit was confirmed by an inability of AC5KO mice to learn a simple appetitive pavlovian conditioning task. Conversely, AC5KO mice showed intact aversive pavlovian learning, suggesting the deficit was specific for learning about appetitive outcomes. Our results suggest that AC5 is a critical component of DA-dependent strengthening of stimulus–reward contingencies.

Introduction

An animal's ability to associate environmental stimuli or their own actions with appetitive outcomes is essential for goal-directed behavior and environmental adaptation. Although these forms of appetitive learning, pavlovian and instrumental conditioning, are often integrated, they are dissociable under experimental conditions (Hall, 2002; Kelley, 2004). The distinct neural substrates that underlie these specific forms of appetitive associative learning, however, have not been fully determined and characterized.

The cAMP second messenger system is highly conserved and mediates some form of learning in nearly all organisms (Kandel, 2001). Nine membrane-bound isoforms of adenylyl cyclase (AC) are expressed in mammals, each with different expression patterns and regulatory properties (Iwami et al., 1995; Guillou et al., 1999; Hanoune and Defer, 2001). Of these, the calcium/calmodulin (CaCaM)-stimulated AC1 and AC8 have been extensively studied and shown to be critical for hippocampus-based learning and synaptic plasticity because they couple glutamate-mediated increases in intracellular calcium with cAMP production (Wu et al., 1995; Wong et al., 1999; Wang and Storm, 2003). In contrast, the role of AC5 has been less well characterized. AC5 is highly expressed in the striatum, an area strongly associated with reinforcement learning and a major target of dopamine (DA) innervation (Matsuoka et al., 1997). In the striatal regions in which AC5 is expressed, AC1 and AC8 expression is very low (Matsuoka et al., 1997; Cooper et al., 1998; Nicol et al., 2005). The high level of AC5 expression in the striatum suggests that this isoform may be important for certain forms of striatum-dependent learning.

Striatal DA has been demonstrated to be critical for both appetitive pavlovian and instrumental conditioning (Schultz et al., 1997; Reynolds et al., 2001; Dickinson and Balleine, 2002; Yin and Knowlton, 2006; Day et al., 2007) and is required for induction of synaptic plasticity at corticostriatal synapses (Calabresi et al., 1992, 2007; Wickens et al., 1996; Kreitzer and Malenka, 2007). It has been hypothesized that DA facilitates the learning of environmental contingencies by mediating plasticity mechanisms that strengthen or weaken corticostriatal inputs associated with reward delivery (Wickens et al., 1996, 2003; Schultz et al., 1997; Reynolds et al., 2001; Reynolds and Wickens, 2002; Schultz, 2006). AC5 has been shown to be a primary downstream effector of DA receptor signaling, because in mice deficient in AC5 (AC5KO), stimulation of the DA D₁ or D₂ receptors does not alter cAMP levels (Iwamoto et al., 2003). In addition, loss of AC5 causes a reduction in D₁ receptor levels in the striatum (Iwamoto et al., 2003), further suggesting that loss of this isoform may have critical effects on reward learning.

Using AC5KO mice, we examined the role of this AC isoform in instrumental and pavlovian conditioning. Although AC5KO mice acquired instrumental responding, they exhibited a severe impairment in appetitive pavlovian conditioning and as a result had a difficulty using cues that predicted the availability of reward. This deficit was specific for appetitive conditioning, because aversive conditioning was intact.

Materials and Methods

Mice

ADCY5-deficient (AC5KO) mice were generated as previously described (Iwamoto et al., 2003). AC5KO mice were backcrossed to C57BL/6 for eight generations. Heterozygote offspring were crossed with each other to obtain AC5KO homozygotes and wild-type (WT) controls. All mice tested were 8–12 weeks of age. All animals were group-housed (four to five per cage) in a temperature- and humidity-controlled barrier facility, with lights on/off at 6:00 A.M./6:00 P.M. All testing was conducted during the light phase. All experiments were approved by the Institutional Animal Care and Use Committee of the University of Chicago.

Behavioral procedures

Appetitive and instrumental conditioning experiments were conducted in mouse operant conditioning chambers that have two retractable levers, a house light, two signal lights above levers, a signal light and a nosepoke hole on the back wall, and a feeder with photobeam (MED Associates). All sessions began with the onset of the house light.

Instrumental conditioning.

In all experiments mice were fed ad libitum regular chow in their home cage for 2 h after testing. Naive food-restricted mice were first introduced to the conditioning chambers with two magazine training sessions, in which sucrose pellets were given not-contingently at a variable time of 180 s (VT180), and both levers were retracted. After magazine training, mice were trained on a fixed-interval 20 (FI20) schedule of reinforcement, in which the first lever press after 20 s was reinforced. Sessions ended in 1 h or when mice received 30 rewards. Mice were trained until they reached criterion of 30 rewards in a 1 h session. After all mice reached learning criterion, they were all run in a single FI20 session. After FI20 training, mice were trained for 1 d on a random-ratio 10 (RR10) schedule of reinforcement and 3 d on an RR20 schedule of reinforcement. During training, only the left lever was extended and mice were fed regular chow for 2 h after each session. An event recorder written into the program documented the time of each lever press, head entry, and reward during the session to generate behavioral raster plots.

Outcome devaluation.

Twenty-four hours after the last day of training, mice were tested for 2 consecutive days for sensitivity to outcome devaluation. Mice were placed in feeding cages and fed ad libitum either the reinforcer earned by lever press (sucrose pellets, devalued) or the ad libitum available reinforcer (regular chow, valued) for 1 h. The amount of each reinforcer consumed during prefeeding was recorded. Immediately after prefeeding, mice were tested for lever press behavior in a 5 min extinction session. Mice were counterbalanced for the order of valued or devalued conditions on either day.

Contingency degradation.

One week after testing for outcome devaluation, contingency assessment began. Water-restricted mice were trained to press the right lever for a water reward (25 μl) for 1 d on a fixed-ratio 1 (FR1) schedule of reinforcement, followed by 1 d of RR10 and 3 d of RR20. Then, both levers were extended and food and water-restricted mice were trained to press the left lever for sucrose pellet reward and the right lever for water reward for 4 d. Both levers provided rewards on an RR20 schedule of reinforcement. After two-lever training, testing for contingency was conducted for 5 d. During these sessions, the right lever gave water at an RR20 schedule of reinforcement and pellets were dropped noncontingent on a lever press on a random time (RT) 60 s schedule. Sessions ended after 30 pellets were dispensed.

Appetitive pavlovian conditioning.

Naive mice were food restricted before the first conditioning session. Pavlovian conditioning was conducted in the same chambers as instrumental conditioning, with both levers retracted. Sessions began with the onset of the house light. Mice were trained for 14 d, during which each session consisted of 15 daily trials with a 120 s variable intertrial interval (ITI). Each trial consisted of presentation of a 12 s, 85 dB, 2700 Hz tone [conditioned stimulus (CS)] followed by a click of the pellet dispenser and the drop of a single 20 mg sucrose pellet. The conditioned response was measured as head entries into the food receptacle. Head entries were recorded during the intertrial intervals, and during 2 s bins during tone presentation, immediately after pellet drop, and for 10 s after pellet drop. Data were presented as raw number of head entries during each of the 2 s bins of CS presentation and after pellet drop across all 15 trials in the session. ITI rate was calculated as total head entries during ITI divided by ITI time. Total head entry rate was calculated as total head entries in session divided by session time. A 0.33 s delay for detection of head entries was written into the program to reduce excessive head entry counts caused by twitching of the head in the receptacle. Mice were fed regular chow for 2 h after each session.

Aversive pavlovian conditioning.

The same mice were used for aversive conditioning as those used for appetitive conditioning. Four days after appetitive conditioning, mice were placed in fear conditioning chamber (Coulbourn Instruments) to test aversive conditioning. Baseline freezing was measured in response to context and cue before conditioning. During a 5 min training session, mice received two conditioning trials (60 s intertrial interval) of a 30 s, 90 dB, 2400 Hz tone followed by a 2 s, 0.5 mA footshock. Twenty-four hours later, mice were placed back in chamber and contextual freezing was scored for 2 min. The next day, mice were placed back in the chamber, altered for context by placing a gold-colored cardboard triangular cutout inside the fear conditioning chamber. The triangular cutout obscured the walls, changed the dimensions of the chamber by making it both smaller and triangular in shape, as well as covered the floor of the chamber. The tone CS was given to measure cued freezing. Freezing behavior was monitored every 5 s, and a freezing score was calculated as total number of 5 s bins the subject was immobile divided by total 5 s bins in the session. The cued percentage freezing was calculated as a percentage of total freezing observations during tone presentation.

Immunohistochemistry

Fifteen minutes after injection of either vehicle (0.9% saline) or 6-chloro-2,3,4,5-tetrahydro-1-phenyl-1H-3-benzazepine hydrobromide (SKF81297) (5 mg/kg; Sigma-Aldrich), mice were perfused transcardially with 4% paraformaldehyde, and brains were postfixed overnight in 4% paraformaldehyde. Brains were cryoprotected in 30% sucrose until they sank, and 40 μm coronal sections were cut on a cryostat, and then stored at −20° until use. Successive sections separated by 120 μm were processed for detection of p-ERK1/2 immunoreactivity. Sections were first washed in 0.1 m Tris-buffered saline followed by blocking in 4% donkey serum and 0.1% Triton. Sections were incubated overnight at 4°C in a 1:200 dilution of phospho-p44/42 extracellular signal-regulated kinase 1/2 (ERK1/2) antibody (Cell Signaling; no. 9101) in 4% donkey serum and 0.1% Triton. A biotinylated horse anti-rabbit IgG (1:500; Vector Laboratories) and peroxidase-conjugated avidin–biotin complex (VECTASTAIN Elite ABC kit; Vector Laboratories) were used, and the reaction was visualized by using SigmaFast DAB tablets (Sigma-Aldrich). For counting p-ERK1/2-positive neurons, six successive sections through the nucleus accumbens separated by 120 μm, beginning at ∼1.20 mm anterior to bregma, were used for counting. A 100 μm² counting window was drawn using Stereo Investigator 6 software (MicroBrightField) medial to the anterior commissure for nucleus accumbens (NAcc) shell counts and ventral to the anterior commissure for NAcc core counts. One count was made per section, and the total numbers of p-ERK-positive neurons in each of six consecutive sections were counted, and the average of the six counts was taken for each mouse. Correct location of counting windows was confirmed by referencing a mouse brain atlas (Paxinos and Franklin, 2001).

Statistical analysis

For the instrumental conditioning data, the latency to check food receptacle, bout length, head entry, lever press rate, and trials to reach criterion were analyzed with Student's t test. Outcome latency was analyzed using a two-way ANOVA with repeated-measures design. For appetitive pavlovian conditioning data, CS+ head entry behavior in Figure 2A was analyzed using a three-way ANOVA with repeated-measures design. Head entries after pellet dispenser activation, total head entry rate, and ITI rate were analyzed using a two-way ANOVA with repeated-measures design. Effect of outcome devaluation and contingency degradation were analyzed using a two-way ANOVA with repeated-measures design. Fear conditioning was analyzed using a two-way ANOVA with repeated-measures design, and baseline differences were analyzed using Student's t test. Cell counting data were analyzed using a two-way ANOVA with repeated-measures design. All p values and effects are indicated in the text. All error bars are ±SEM.

Results

AC5KO mice exhibit altered distribution of goal-directed behaviors in operant tasks

To determine the behavioral consequence of AC5 deficiency, mice were first tested for any overt locomotor deficits. AC5KO mice did not differ from WT littermates in distance traveled when tested in the open field, and showed normal dopamine-dependent locomotor activity (supplemental Fig. S1, available at www.jneurosci.org as supplemental material). Because the dorsal striatum and NAcc, areas with high AC5 expression (for AC5 expression pattern, see supplemental Fig. S2, available at www.jneurosci.org as supplemental material), play important roles in associative learning; we tested WT and AC5KO mice in an instrumental learning paradigm. Mice were trained to press a lever for a sucrose reward, and then tested on an RR20 schedule of reinforcement. Analysis of the distribution of responses on the last day of testing revealed marked differences between AC5KO and WT mice. Figure 1, A and B, are representative raster plots of WT and AC5KO mice that show their actions during a single session. Whereas WT mice focused their efforts on lever pressing and only periodically checked the food receptacle, AC5KO mice checked the receptacle frequently (Fig. 1A,B). Comparing the latencies between rewarded and unrewarded lever presses and checking the food receptacle indicated that WT mice discriminated the rewarded lever press from the unrewarded lever press, whereas AC5KO mice did not. WT mice exhibited a long latency to check after unrewarded lever presses and short latency after rewarded presses (Fig. 1C) (n = 8 WT; latency effect, p < 0.0001). In contrast, AC5KO mice showed no difference in latency to check the food receptacle between rewarded and unrewarded lever presses (Fig. 1C) (n = 8 AC5KO; latency effect, p = 0.47). In addition, the average length of a bout of lever pressing before checking the food receptacle was significantly shorter in AC5KO mice compared with WT controls [n = 8 per genotype (geno); WT, 8.012 (±2.77 SD); AC5KO, 2.176 (±0.833 SD); genotype effect, p < 0.0001], indicating that completing the required number of presses to obtain a pellet was disrupted in the mutants by unnecessary head entries [mean head entry rate, WT, 2.022 (±0.37 SD); AC5KO, 10.762 (±4.386 SD); genotype effect, p < 0.0001]. Although the AC5KO exhibited significantly more head entries into the food receptacle than WT mice, they pressed at a similar or slightly lower rate [WT, 10.274 (±5.47 SD); AC5KO, 6.65 (±2.9 SD); genotype effect, p = 0.126]. There were no significant differences in total goal-directed actions (lever press plus head entries) or overall reinforcement rate during sessions (supplemental Fig. S3, available at www.jneurosci.org as supplemental material). AC5KO and WT mice acquired the lever press response at an FI20 s schedule of reinforcement (Fig. 1D) (reward bin by geno, p = 0.98) and both groups required a similar number of sessions to reach learning criterion [WT, 3 (±2.62 SD); AC5KO, 3.125 (±1.73 SD); p = 0.912]. Yet they reached different asymptotic performance as AC5KO mice showed a greater latency to receive rewards (Fig. 1D) (genotype effect, p = 0.023), which is consistent with their inefficient performance caused by a higher head entry rate during these sessions (supplemental Fig. S4, available at www.jneurosci.org as supplemental material) (genotype effect, p = 0.0072).

Figure 1.

AC5KO mice have altered distribution of goal-directed behaviors in operant tasks. A, B, Behavioral raster plots of representative WT (A) and AC5KO (B) mice during lever pressing at RR20 schedule of reinforcement revealed AC5KO mice made excessive head entries into food receptacle. Each row represents a rewarded trial, and each tick represents the time point each event occurred. Each trial ended with a rewarded lever press. Green ticks, Unrewarded lever press; red ticks, head entry; white ticks, rewarded lever press. C, Average latency to enter food receptacle after each unrewarded and rewarded lever press. Unlike WT mice, AC5KO mice did not withhold head entries after an unrewarded lever press (n = 8 per genotype; WT latency effect, p < 0.0001; AC5KO, p = 0.47). D, AC5KO mice acquired the lever press behavior at an FI20 schedule of reinforcement at the same rate as WT mice (n = 8 per genotype; reward bin by geno, p = 0.98) but reached different asymptotic performance (genotype effect, p = 0.023). Each point represents average latency to receive reward, bins of five rewards. Error bars are ±SEM.

AC5KO mice lack reward prediction in appetitive pavlovian conditioning

The instrumental conditioning procedure used has both an instrumental component (learning the lever press action leads to reward outcome) and a pavlovian component [associating the click of the pellet dispenser and sound of pellet drop with the availability of sucrose pellet (Kelley, 2004)]. The instrumental performance of AC5KO mice suggested a deficit in the pavlovian component of the task, that is, an inability to use the cues that indicate reward availability to determine when to press and when to check the food receptacle. To directly assess this possibility, mice were tested in a pavlovian appetitive conditioning task. Mice were presented with a 12 s tone followed by pellet dispenser click and pellet drop (CS). Head entries into the feeder were counted and binned (2 s bins) in histograms around CS presentation and pellet delivery (Fig. 2A). Learning in WT mice was indicated by an increase in discriminative head entries in response to CS presentation (Fig. 2A). Across sessions, WT mice increased anticipatory head entries, whereas AC5KO mice did not (Fig. 2A) (n = 8 per genotype; days by bin by genotype interaction, p < 0.0001). Discriminative head entries immediately after the pellet dispenser activation and pellet drop further highlighted a significant learning curve in WT but not in AC5KO mice (Fig. 2B) (session by genotype interaction, p < 0.0001). Although no significant difference between genotypes was found for head entry rate during the ITI, WT but not AC5KO mice showed a trend of decreasing ITI head entries across sessions (Fig. 2D) (genotype effect, p = 0.27; session by genotype interaction, p = 0.68). Total head entry rate in the session did not differ between AC5KO and WT mice across days (Fig. 2C) (genotype effect, p = 0.58; genotype by session interaction, p = 0.97), indicating that the AC5KO mice have no motor or motivational impairments. These data, compared with those in Figure 1, suggest that AC5KO mice do not make excessive head entries; rather, they make indiscriminative head entries.

Figure 2.

AC5KO mice lacked reward prediction in appetitive pavlovian conditioning. A, Total number of head entries were collected in 2 s bins during and after cue presentation. Fifteen cues were presented in each session of appetitive pavlovian conditioning. Each point represents total number of head entries in a 2 s bin, bins 1–6 are during 12 s tone presentation, bin 7 is immediately after pellet dispenser activation and pellet drop, and bins 8–12 are posttrial responses. WT mice acquired CS-evoked head entries, whereas AC5KO mice did not (n = 8 per genotype; days by bin by genotype interaction, p < 0.0001). B, Total number of head entries in 2 s bin after pellet dispenser activation and pellet drop. WT mice increase head entries after CS presentation, whereas AC5KO mice did not (session by genotype interaction, p < 0.0001). C, Total head entry rate did not differ between genotypes across sessions (genotype effect, p = 0.58; genotype by session interaction, p = 0.97). D, Head entry rate during the intertrial interval did not differ between genotypes (genotype effect, p = 0.27; session by genotype interaction, p = 0.68). Error bars are ±SEM.

AC5KO mice form normal action–outcome contingencies

To test whether the AC5KO phenotype in instrumental conditioning derives solely from abnormalities in the pavlovian component of the task or whether they additionally have deficits forming action–outcome contingencies or estimating reward value, mice were assessed for changes in lever-pressing behavior in response to outcome devaluation or contingency degradation. First, mice were tested for their ability to suppress their responding when the reward is devalued by sensory-specific satiety. One day after RR20 training, subjects were fed ad libitum either sucrose pellets (devalued group) or regular chow (valued group) for 1 h before testing for the effect of prefeeding [amount consumed during prefeeding shown in supplemental Fig. S5 (available at www.jneurosci.org as supplemental material)]. Both groups of mice decreased their lever press rate when the outcome (sucrose pellets) had been devalued, suggesting a similar ability to associate the value of the outcome with their instrumental response and adjust responding accordingly (Fig. 3A) (n = 8 per genotype; genotype effect, p = 0.94; value effect, p = 0.02; geno by value, p = 0.86) (supplemental Fig. S6A, head entry rate, available at www.jneurosci.org as supplemental material). Next, mice were tested for the ability to suppress responding when the response outcome contingency is degraded, that is, when rewards are delivered independent of lever pressing. Food and water-restricted mice were trained to press one lever for a water reward and another for sucrose pellets on an RR20 schedule of reinforcement. After 4 training days with both levers, the sucrose contingency was degraded by random delivery of sucrose independent of lever pressing. Analysis of lever press rate during the session before contingency degradation compared with last session of contingency degradation showed that contingency degradation suppressed responding in both WT and AC5KO mice (Fig. 3B) (genotype effect, p = 0.28; degradation effect, p < 0.0001; geno by degradation, p = 0.063) (supplemental Fig. S6B, head entry rate, available at www.jneurosci.org as supplemental material). This effect was specific for the degraded sucrose lever, because the lever press rate for the nondegraded water lever was not significantly altered (Fig. 3C) (genotype effect, p = 0.09; degradation effect, p = 0.94; genotype by degradation, p = 0.32). Both groups of mice exhibited a lower lever press rate for water compared with sucrose, and AC5KO mice exhibited a slightly lower rate of lever pressing on the water lever than WT mice. This is similar to the lower rate of lever pressing for sucrose before contingency degradation (Fig. 3B) and in RR20 training, which could be attributable to the excessive head entries. The lower rate of lever pressing was not attributable to a difference in restriction protocols, or total water consumption, because both groups of mice drank a similar amount of water when restricted (supplemental Fig. S7, available at www.jneurosci.org as supplemental material). Yet, before degradation, AC5KO mice pressed the water on average once per minute, whereas in comparable instrumental learning experiments an inactive lever was only sampled once every 4 min (data not shown), suggesting their lever press behavior for water was goal-directed. Importantly, both AC5KO and WT mice showed a clear reduction in lever pressing for sucrose after contingency degradation, suggesting their pressing on the sucrose lever was goal-directed and under the control of a contingency between the action and outcome.

Figure 3.

AC5KO mice form normal action–outcome contingencies. A, AC5KO and WT mice suppress responding in 5 min probe extinction test after outcome devaluation by sensory-specific satiety (n = 8 per genotype; genotype effect, p = 0.94; value effect, p = 0.02; geno by value, p = 0.86). B, C, Both AC5KO and WT mice decreased pressing on lever in which contingency had been degraded (B) but not on nondegraded lever (C) (degraded lever, genotype effect, p = 0.28; degradation effect, p < 0.0001; geno by degradation, p = 0.063; nondegraded lever, genotype effect, p = 0.09; degradation effect, p = 0.94; genotype by degradation, p = 0.32). Error bars are ±SEM.

AC5KO mice show normal aversive pavlovian conditioning

To test whether the pavlovian conditioning impairment in AC5KO mice was specific for appetitive conditioning, we tested aversive conditioning in a fear conditioning paradigm. In the conditioning chamber, mice were presented with a 30 s tone (CS) followed by a 2 s, 0.5 mA footshock [unconditioned stimulus (US)]. After conditioning, learning was measured by increases in freezing behavior over preconditioning rates in response to either the contextual cues of the chamber or presentation of the tone CS. There was no significant difference in freezing behavior between AC5KO and WT mice in response to the chamber (context) or tone presentation before conditioning (Fig. 4B) (baseline, cue genotype effect, p = 0.3; context, p = 0.2). Twenty-four hours after training, mice were placed back in the chambers, and freezing as a result of context was measured (Fig. 4A). Contextual freezing did not differ between genotypes (Fig. 4A) (n = 8 WT, 7 AC5KO; genotype effect, p = 0.4; context effect, p = 0.0002; context by geno, p = 0.69). The next day, mice were placed in a modified chamber to eliminate contextual cues (see Materials and Methods) and the tone was presented to measure freezing in response to the CS. Freezing behavior in the altered context was minimal, and no significant difference was seen in freezing behavior before tone delivery (data not shown) (genotype effect, p = 0.13). In response to cue presentation, AC5KO and WT mice exhibited similar freezing behavior (Fig. 4B) (genotype effect, p = 0.3; cue effect, p < 0.0001; cue by geno, p = 0.52), suggesting a similar association between the CS and US was formed in the AC5KO and WT mice. These data also suggested intact sensory processing (i.e., normal ability to hear tones and perceive shock) in AC5KO mice, although the sound of pellet drop in the instrumental conditioning task is quieter than the tone in pavlovian conditioning tasks. In situ hybridization studies suggest AC5 levels are very low in the amygdala (Matsuoka et al., 1997) (supplemental Fig. S1, available at www.jneurosci.org as supplemental material), a neural substrate for fear conditioning (LeDoux, 2000), supporting the notion that AC5 deficiency selectively affects striatum/nucleus accumbens-dependent learning.

Figure 4.

AC5KO mice have normal aversive pavlovian conditioning. A, AC5KO and WT mice show similar freezing behavior in response to context paired with aversive footshock (n = 8 WT, 7 AC5KO; genotype effect, p = 0.4; context effect, p = 0.0002; context by geno, p = 0.69). B, AC5KO and WT mice show similar freezing behavior in response to auditory cue paired with footshock (genotype effect, p = 0.3; cue effect, p < 0.0001; cue by geno, p = 0.52). Error bars are ±SEM.

AC5KO mice have impaired D₁ receptor-mediated ERK activation in the NAcc

D₁ receptor antagonism can inhibit appetitive pavlovian learning (Eyny and Horvitz, 2003). Although AC5KO mice lack D₁-stimulated cAMP production, they retain D₁-stimulated locomotor activity, suggesting that specific D₁-mediated signaling pathways disrupted by loss of AC5 may be essential for appetitive pavlovian learning. Recent studies indicate a potential role of the ERK1/2 in the NAcc in appetitive pavlovian learning (Shiflett et al., 2008). We therefore examined the ability of a D₁ agonist to induce activation of ERK in WT and AC5KO mice. Mice were killed 15 min after an injection of either vehicle (0.9% saline) or SKF81297 (5 mg/kg), and brains were prepared for immunohistochemistry. No significant difference in ERK1/2 phosphorylation was seen between genotypes after vehicle injection. However, analysis of D₁ agonist-mediated phosphorylation of ERK1/2 in the NAcc revealed marked differences between AC5KO and WT mice. Although injection of D₁ agonist produced a robust increase in phosphorylated ERK1/2-positive neurons in the NAcc shell and core of WT mice, this response was severely diminished in AC5KO mice (Fig. 5A,B) (n = 3 per genotype per treatment; core, genotype effect, p = 0.012, drug effect, p = 0.0001, geno by drug interaction, p = 0.0009; shell, genotype effect, p = 0.025, drug effect, p = 0.0015, geno by drug interaction, p = 0.0062). No significant D₁-mediated increase in phosphorylated ERK1/2-positive neurons was seen in the dorsal striatum, consistent with published results (Gerfen et al., 2002). This suggests a profound decoupling of D₁ receptor activation from downstream activation of ERK1/2 in the NAcc of AC5KO mice, which may underlie the deficits seen in reward learning.

Figure 5.

AC5KO mice have impaired D₁ receptor-mediated activation of ERK in the NAcc. A, Systemic injection of the D₁ agonist SKF81297 increases p-ERK levels in the NAcc of WT mice, but not AC5KO mice. B, Number of p-ERK1/2-positive neurons in each counting window (100 μm²) of nucleus accumbens core and shell reveals that D₁ agonist significantly activates ERK1/2 in the NAcc core and shell in WT mice, but not in AC5KO mice (n = 3 per genotype per treatment; core, genotype effect, p = 0.012; drug effect, p = 0.0001; geno by drug interaction, p = 0.0009; shell, genotype effect, p = 0.025; drug effect, p = 0.0015; geno by drug interaction, p = 0.0062). Error bars are ±SEM.

Discussion

The current study indicates a critical role for the striatum-enriched AC5 in appetitive pavlovian learning. In appetitive pavlovian conditioning tasks and in appetitive instrumental conditioning tasks with a pavlovian component, AC5KO mice exhibited impairment in their ability to use cues to predict the availability of reward. In contrast, they acquired an instrumental response for food reward, and their instrumental responding was sensitive to changes in both outcome value and action–outcome contingency. AC5KO mice also showed normal fear conditioning, indicating intact aversive pavlovian learning. These data indicate that AC5 is specifically required for appetitive pavlovian learning.

Distinguishing learning and performance deficits is an enduring challenge in behavioral studies. Pavlovian deficits in AC5KO mice may be explained by a performance deficit rather than by a learning deficit per se. However, a number of observations suggest that this is unlikely. AC5KO mice make the head entry responses at rates similar to WT mice, indicating no motor impairment. Moreover, like WT, they show an increase in head entry behavior between the first and second sessions, indicating they increase their performance of this behavior in response to reward availability. They consume the same quantity of sucrose pellets in both the pavlovian and instrumental tasks, indicating there are no motivational deficits and that the reward is equally desirable for both groups of mice. In addition, AC5KO mice can adjust their instrumental performance in response to changes in the value of reward, suggesting mechanisms linking motivation and performance are intact. In summary, we observe no performance deficits in the AC5KO mice except that they are unable to use cues to predict the availability of reward.

The role of DA in appetitive pavlovian conditioning has been studied extensively (Schultz, 1998; Dalley et al., 2002; Eyny and Horvitz, 2003; Day et al., 2007). It has been demonstrated that DA cells increase their activity in response to unexpected reward, which has led to the “prediction error” hypothesis of DA (Schultz, 1998). In this model, a sudden burst of DA activity in response to unexpected reward serves as a teaching signal, reinforcing an association between the reward and the preceding stimuli so that the animal can better predict reward in the future (Schultz et al., 1997; Schultz, 2002). Current understanding of the dopaminergic modulation of corticostriatal plasticity is consistent with this model (Reynolds et al., 2001; Reynolds and Wickens, 2002). In the presence of low extracellular DA associated with tonic activity, coincident presynaptic and postsynaptic activity at medium spiny neurons (MSNs) results in long-term synaptic depression (Calabresi et al., 2007); however, with transient, high concentrations of DA achieved during phasic DA release, this same coincident activity results in long-term potentiation (Wickens et al., 1996; Reynolds and Wickens, 2002). This arrangement serves to integrate midbrain dopaminergic and cortical glutamatergic input in the striatum (Reynolds et al., 2001; Reynolds and Wickens, 2002) and provides a mechanism whereby DA can act as a teaching signal by facilitating synaptic plasticity in response to reward (Reynolds and Wickens, 2002). Reports that either D₁ or NMDA receptor antagonism can inhibit appetitive pavlovian learning (Di Ciano et al., 2001; Eyny and Horvitz, 2003) are consistent with this view. Previous studies have demonstrated a downregulation of D₁ receptors in the striatum of AC5KO, and modulation of cAMP levels in response to D₁ or D₂ stimulation is impaired in the AC5KO mice (Iwamoto et al., 2003). In addition, the data presented here indicated a loss of D₁-mediated increases in activation of ERK1/2. Thus, it is reasonable to hypothesize that these alterations in D₁-mediated signaling cascades in AC5KO mice may have significant effects on corticostriatal plasticity. The loss of pavlovian learning further suggests possible plasticity deficits in the striatum of AC5KO mice.

A competing perspective on the role of DA in reward is the “incentive salience” hypothesis (Berridge, 2007). In this view, DA can attribute incentive salience to the CS and therefore the CS serves to motivate behavior. Thus, it is possible that the AC5KO mice do form the CS–US association, but that the CS does not exert incentive control over head entry behavior. In the outcome devaluation experiment, however, the AC5KO mice modulate their lever-pressing behavior in response to changes in the value of the outcome, suggesting that motivational control of behavior is intact. In addition, both WT and AC5KO mice equally scale up their head entry behavior in response to reward (Fig. 2C); only the AC5KO do it indiscriminately, suggesting a deficit in reward prediction rather than incentive control of behavior.

The results presented here indicate a decoupling of D₁ receptor activation from phosphorylation of ERK1/2 in the NAcc of AC5KO mice. It has been suggested that D₁ regulates the phosphorylation of ERK1/2 via a cAMP-dependent regulation of DARPP-32, which inhibits protein phosphatase-1, which induces an activation of ERK1/2 (Valjent et al., 2005). Although our data extend previous observations of impaired D₁-cAMP signaling in the striatum of AC5KO mice (Iwamoto et al., 2003), and recent studies have reported an increase in ERK1/2 activation after pavlovian conditioning (Shiflett et al., 2008), additional studies will be required to determine whether loss of D₁-mediated ERK1/2 activation underlies the deficits in pavlovian conditioning seen in AC5KO mice.

Because appetitive pavlovian learning is an important component of many behaviors, impairments in pavlovian learning could potentially have many consequences. Impaired performance in instrument behavior is one such consequence, as demonstrated by our data. The reward pathway is often implicated in impulsive choice behavior (Belin et al., 2008). Although the excessive head entry behavior displayed by AC5KO mice in Figure 1 was most likely attributable to indiscriminative head entries, how impairment in appetitive pavlovian learning may affect impulsive choice and addiction remains to be examined in the AC5KO mice.

The present study showing that AC5 plays a significant role only in the pavlovian component of instrumental conditioning needs to be reconciled with published work reporting that inhibition of the cAMP pathway in the ventral striatum inhibits learning of an instrumental task (Baldwin et al., 2002). In one study, rats receiving protein kinase A (PKA) inhibitors acquired the instrumental lever press, but acquisition was slowed and performance was inhibited. However, the specific task in that study incorporated a strong pavlovian component, that is, a correct lever press was always followed by a pavlovian cue (3 s house light offset and red signal light onset) that was then followed by food delivery (Baldwin et al., 2002). In their design, impaired pavlovian learning would impair instrumental performance. In an alternative task design in which a lever press is immediately followed by reward delivery, minimizing the pavlovian component, PKA inhibitors had no effect on instrumental responding for food reward (Self et al., 1998).

The studies presented here indicate that AC5KO mice are sensitive to changes in outcome value and action–outcome contingency. However, previous studies have reported that lesions of the dorsomedial striatum (Yin et al., 2005) and the NAcc core (Corbit et al., 2001), areas with high AC5 expression, produce insensitivity to outcome devaluation, suggesting a role for these structures in this aspect of instrumental learning. The selectivity of the genetic manipulation used here, which preserves the integrity of the basal ganglia thalamocortical loop and non-AC5-dependent signaling pathways, preserves sensitivity to outcome value. This suggests two possible interpretations about the role of DA-mediated signaling mechanisms in instrumental learning. One is that DA signaling is critical to these behaviors but mediated through downstream effectors other than AC5. Although AC5KO mice show severely reduced DA receptor modulation of cAMP content, reduced D₁ receptor levels in the striatum, and severely diminished D₁-mediated activation of ERK1/2, AC5KO mice respond robustly to D₁ receptor stimulation in locomotor assays, suggesting the significance of alternative downstream signaling pathways in MSNs (Iwamoto et al., 2003). Recent studies have identified non-cAMP-dependent DA receptor signaling pathways in MSNs mediating DA-dependent behaviors (Beaulieu et al., 2005). Alternatively, DA-dependent signaling mechanisms may not be required for some forms of associative learning, as has been reported with genetically engineered DA-deficient mice (Robinson et al., 2005). Discriminating between these possibilities will require future experiments that manipulate other downstream effectors of DA signaling.

In conclusion, the present study demonstrates that a specific cAMP isoform, AC5, is required for appetitive pavlovian learning. Genetic deletion of AC5 abolishes the animal's ability to use environmental cues to predict reward availability. This deficit further impairs instrumental performance when the task includes a pavlovian component, that is, the need to use predictive cues. This demonstrates that the striatum-enriched AC5 plays as an important role in classical pavlovian learning.

Footnotes

This work was supported by National Institute of Mental Health Grants 1F31MH076422 (M.A.K.) and MH66216, National Institute on Drug Abuse Grants 1F32DA020427 (J.A.B.) and DA022269, National Heart, Lung, and Blood Institute Grant HL059139, National Institute of General Medical Sciences Grant GM067773 (Y.I.), and The Edward Mallinckrodt Jr Foundation (X.Z.). We thank Rui Costa, Linan Chen, Wei-Jen Tang, Jon Horvitz, Peter Balsam, Peggy Mason, and Cristianne Frazier for helpful discussions, and Zhen Fang Huang Cao and Stephanie Tang for technical assistance.
Correspondence should be addressed to Xiaoxi Zhuang, Department of Neurobiology, The University of Chicago, 924 East 57th Street, Knapp R214, Chicago, IL 60637. xzhuang{at}bsd.uchicago.edu

References

↵
1. Baldwin AE,
2. Sadeghian K,
3. Holahan MR,
4. Kelley AE
(2002) Appetitive instrumental learning is impaired by inhibition of cAMP-dependent protein kinase within the nucleus accumbens. Neurobiol Learn Mem 77:44–62.
OpenUrl CrossRef PubMed
↵
1. Beaulieu JM,
2. Sotnikova TD,
3. Marion S,
4. Lefkowitz RJ,
5. Gainetdinov RR,
6. Caron MG
(2005) An Akt/beta-arrestin 2/PP2A signaling complex mediates dopaminergic neurotransmission and behavior. Cell 122:261–273.
OpenUrl CrossRef PubMed
↵
1. Belin D,
2. Mar AC,
3. Dalley JW,
4. Robbins TW,
5. Everitt BJ
(2008) High impulsivity predicts the switch to compulsive cocaine-taking. Science 320:1352–1355.
OpenUrl Abstract/FREE Full Text
↵
1. Berridge KC
(2007) The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology (Berl) 191:391–431.
OpenUrl CrossRef PubMed
↵
1. Calabresi P,
2. Maj R,
3. Mercuri NB,
4. Bernardi G
(1992) Coactivation of D1 and D2 dopamine receptors is required for long-term synaptic depression in the striatum. Neurosci Lett 142:95–99.
OpenUrl CrossRef PubMed
↵
1. Calabresi P,
2. Picconi B,
3. Tozzi A,
4. Di Filippo M
(2007) Dopamine-mediated regulation of corticostriatal synaptic plasticity. Trends Neurosci 30:211–219.
OpenUrl CrossRef PubMed
↵
1. Cooper DM,
2. Karpen JW,
3. Fagan KA,
4. Mons NE
(1998) Ca²⁺-sensitive adenylyl cyclases. Adv Second Messenger Phosphoprotein Res 32:23–51.
OpenUrl PubMed
↵
1. Corbit LH,
2. Muir JL,
3. Balleine BW
(2001) The role of the nucleus accumbens in instrumental conditioning: evidence of a functional dissociation between accumbens core and shell. J Neurosci 21:3251–3260.
OpenUrl Abstract/FREE Full Text
↵
1. Dalley JW,
2. Chudasama Y,
3. Theobald DE,
4. Pettifer CL,
5. Fletcher CM,
6. Robbins TW
(2002) Nucleus accumbens dopamine and discriminated approach learning: interactive effects of 6-hydroxydopamine lesions and systemic apomorphine administration. Psychopharmacology (Berl) 161:425–433.
OpenUrl CrossRef PubMed
↵
1. Day JJ,
2. Roitman MF,
3. Wightman RM,
4. Carelli RM
(2007) Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat Neurosci 10:1020–1028.
OpenUrl CrossRef PubMed
↵
1. Di Ciano P,
2. Cardinal RN,
3. Cowell RA,
4. Little SJ,
5. Everitt BJ
(2001) Differential involvement of NMDA, AMPA/kainate, and dopamine receptors in the nucleus accumbens core in the acquisition and performance of pavlovian approach behavior. J Neurosci 21:9471–9477.
OpenUrl Abstract/FREE Full Text
↵
1. Pashler HE
1. Dickinson A,
2. Balleine BW
(2002) in Steven's handbook of experimental psychology, The role of learning in the operation of motivational systems, ed Pashler HE (Wiley, New York), Ed 3.
↵
1. Eyny YS,
2. Horvitz JC
(2003) Opposing roles of D₁ and D₂ receptors in appetitive conditioning. J Neurosci 23:1584–1587.
OpenUrl Abstract/FREE Full Text
↵
1. Gerfen CR,
2. Miyachi S,
3. Paletzki R,
4. Brown P
(2002) D₁ dopamine receptor supersensitivity in the dopamine-depleted striatum results from a switch in the regulation of ERK1/2/MAP kinase. J Neurosci 22:5042–5054.
OpenUrl Abstract/FREE Full Text
↵
1. Guillou JL,
2. Nakata H,
3. Cooper DM
(1999) Inhibition by calcium of mammalian adenylyl cyclases. J Biol Chem 274:35539–35545.
OpenUrl Abstract/FREE Full Text
↵
1. Pashler HE
1. Hall G
(2002) in Steven's handbook of experimental psychology, Associative structures in pavlovian and instrumental conditioning, ed Pashler HE (Wiley, New York), Ed 3.
↵
1. Hanoune J,
2. Defer N
(2001) Regulation and role of adenylyl cyclase isoforms. Annu Rev Pharmacol Toxicol 41:145–174.
OpenUrl CrossRef PubMed
↵
1. Iwami G,
2. Kawabe J,
3. Ebina T,
4. Cannon PJ,
5. Homcy CJ,
6. Ishikawa Y
(1995) Regulation of adenylyl cyclase by protein kinase A. J Biol Chem 270:12481–12484.
OpenUrl Abstract/FREE Full Text
↵
1. Iwamoto T,
2. Okumura S,
3. Iwatsubo K,
4. Kawabe J,
5. Ohtsu K,
6. Sakai I,
7. Hashimoto Y,
8. Izumitani A,
9. Sango K,
10. Ajiki K,
11. Toya Y,
12. Umemura S,
13. Goshima Y,
14. Arai N,
15. Vatner SF,
16. Ishikawa Y
(2003) Motor dysfunction in type 5 adenylyl cyclase-null mice. J Biol Chem 278:16936–16940.
OpenUrl Abstract/FREE Full Text
↵
1. Kandel ER
(2001) The molecular biology of memory storage: a dialogue between genes and synapses. Science 294:1030–1038.
OpenUrl Abstract/FREE Full Text
↵
1. Kelley AE
(2004) Ventral striatal control of appetitive motivation: role in ingestive behavior and reward-related learning. Neurosci Biobehav Rev 27:765–776.
OpenUrl CrossRef PubMed
↵
1. Kreitzer AC,
2. Malenka RC
(2007) Endocannabinoid-mediated rescue of striatal LTD and motor deficits in Parkinson's disease models. Nature 445:643–647.
OpenUrl CrossRef PubMed
↵
1. LeDoux JE
(2000) Emotion circuits in the brain. Annu Rev Neurosci 23:155–184.
OpenUrl CrossRef PubMed
↵
1. Matsuoka I,
2. Suzuki Y,
3. Defer N,
4. Nakanishi H,
5. Hanoune J
(1997) Differential expression of type I, II, and V adenylyl cyclase gene in the postnatal developing rat brain. J Neurochem 68:498–506.
OpenUrl CrossRef PubMed
↵
1. Nicol X,
2. Muzerelle A,
3. Bachy I,
4. Ravary A,
5. Gaspar P
(2005) Spatiotemporal localization of the calcium-stimulated adenylate cyclases, AC1 and AC8, during mouse brain development. J Comp Neurol 486:281–294.
OpenUrl CrossRef PubMed
↵
1. Paxinos G,
2. Franklin KBJ
(2001) The mouse brain in steroetaxic coordinates (Academic, San Diego), Ed 2.
↵
1. Reynolds JN,
2. Wickens JR
(2002) Dopamine-dependent plasticity of corticostriatal synapses. Neural Netw 15:507–521.
OpenUrl CrossRef PubMed
↵
1. Reynolds JN,
2. Hyland BI,
3. Wickens JR
(2001) A cellular mechanism of reward-related learning. Nature 413:67–70.
OpenUrl CrossRef PubMed
↵
1. Robinson S,
2. Sandstrom SM,
3. Denenberg VH,
4. Palmiter RD
(2005) Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards. Behav Neurosci 119:5–15.
OpenUrl CrossRef PubMed
↵
1. Schultz W
(1998) Predictive reward signal of dopamine neurons. J Neurophysiol 80:1–27.
OpenUrl Abstract/FREE Full Text
↵
1. Schultz W
(2002) Getting formal with dopamine and reward. Neuron 36:241–263.
OpenUrl CrossRef PubMed
↵
1. Schultz W
(2006) Behavioral theories and the neurophysiology of reward. Annu Rev Psychol 57:87–115.
OpenUrl CrossRef PubMed
↵
1. Schultz W,
2. Dayan P,
3. Montague PR
(1997) A neural substrate of prediction and reward. Science 275:1593–1599.
OpenUrl Abstract/FREE Full Text
↵
1. Self DW,
2. Genova LM,
3. Hope BT,
4. Barnhart WJ,
5. Spencer JJ,
6. Nestler EJ
(1998) Involvement of cAMP-dependent protein kinase in the nucleus accumbens in cocaine self-administration and relapse of cocaine-seeking behavior. J Neurosci 18:1848–1859.
OpenUrl Abstract/FREE Full Text
↵
1. Shiflett MW,
2. Martini RP,
3. Mauna JC,
4. Foster RL,
5. Peet E,
6. Thiels E
(2008) Cue-elicited reward-seeking requires extracellular signal-regulated kinase activation in the nucleus accumbens. J Neurosci 28:1434–1443.
OpenUrl Abstract/FREE Full Text
↵
1. Valjent E,
2. Pascoli V,
3. Svenningsson P,
4. Paul S,
5. Enslen H,
6. Corvol JC,
7. Stipanovich A,
8. Caboche J,
9. Lombroso PJ,
10. Nairn AC,
11. Greengard P,
12. Hervé D,
13. Girault JA
(2005) Regulation of a protein phosphatase cascade allows convergent dopamine and glutamate signals to activate ERK in the striatum. Proc Natl Acad Sci U S A 102:491–496.
OpenUrl Abstract/FREE Full Text
↵
1. Wang H,
2. Storm DR
(2003) Calmodulin-regulated adenylyl cyclases: cross-talk and plasticity in the central nervous system. Mol Pharmacol 63:463–468.
OpenUrl Abstract/FREE Full Text
↵
1. Wickens JR,
2. Begg AJ,
3. Arbuthnott GW
(1996) Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro. Neuroscience 70:1–5.
OpenUrl CrossRef PubMed
↵
1. Wickens JR,
2. Reynolds JN,
3. Hyland BI
(2003) Neural mechanisms of reward-related motor learning. Curr Opin Neurobiol 13:685–690.
OpenUrl CrossRef PubMed
↵
1. Wong ST,
2. Athos J,
3. Figueroa XA,
4. Pineda VV,
5. Schaefer ML,
6. Chavkin CC,
7. Muglia LJ,
8. Storm DR
(1999) Calcium-stimulated adenylyl cyclase activity is critical for hippocampus-dependent long-term memory and late phase LTP. Neuron 23:787–798.
OpenUrl CrossRef PubMed
↵
1. Wu ZL,
2. Thomas SA,
3. Villacres EC,
4. Xia Z,
5. Simmons ML,
6. Chavkin C,
7. Palmiter RD,
8. Storm DR
(1995) Altered behavior and long-term potentiation in type I adenylyl cyclase mutant mice. Proc Natl Acad Sci U S A 92:220–224.
OpenUrl Abstract/FREE Full Text
↵
1. Yin HH,
2. Knowlton BJ
(2006) The role of the basal ganglia in habit formation. Nat Rev Neurosci 7:464–476.
OpenUrl CrossRef PubMed
↵
1. Yin HH,
2. Ostlund SB,
3. Knowlton BJ,
4. Balleine BW
(2005) The role of the dorsomedial striatum in instrumental conditioning. Eur J Neurosci 22:513–523.
OpenUrl CrossRef PubMed

In this issue

View Full Page PDF

Citation Tools

Respond to this article

Request Permissions

Cited By...

Articles

Show more Articles

Behavioral/Systems/Cognitive

Show more Behavioral/Systems/Cognitive

[1] ↵
Baldwin AE,
Sadeghian K,
Holahan MR,
Kelley AE
(2002) Appetitive instrumental learning is impaired by inhibition of cAMP-dependent protein kinase within the nucleus accumbens. Neurobiol Learn Mem 77:44–62.
OpenUrl CrossRef PubMed

[2] Baldwin AE,

[3] Sadeghian K,

[4] Holahan MR,

[5] Kelley AE

[6] ↵
Beaulieu JM,
Sotnikova TD,
Marion S,
Lefkowitz RJ,
Gainetdinov RR,
Caron MG
(2005) An Akt/beta-arrestin 2/PP2A signaling complex mediates dopaminergic neurotransmission and behavior. Cell 122:261–273.
OpenUrl CrossRef PubMed

[7] Beaulieu JM,

[8] Sotnikova TD,

[9] Marion S,

[10] Lefkowitz RJ,

[11] Gainetdinov RR,

[12] Caron MG

[13] ↵
Belin D,
Mar AC,
Dalley JW,
Robbins TW,
Everitt BJ
(2008) High impulsivity predicts the switch to compulsive cocaine-taking. Science 320:1352–1355.
OpenUrl Abstract/FREE Full Text

[14] Belin D,

[15] Mar AC,

[16] Dalley JW,

[17] Robbins TW,

[18] Everitt BJ

[19] ↵
Berridge KC
(2007) The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology (Berl) 191:391–431.
OpenUrl CrossRef PubMed

[20] Berridge KC

[21] ↵
Calabresi P,
Maj R,
Mercuri NB,
Bernardi G
(1992) Coactivation of D1 and D2 dopamine receptors is required for long-term synaptic depression in the striatum. Neurosci Lett 142:95–99.
OpenUrl CrossRef PubMed

[22] Calabresi P,

[23] Maj R,

[24] Mercuri NB,

[25] Bernardi G

[26] ↵
Calabresi P,
Picconi B,
Tozzi A,
Di Filippo M
(2007) Dopamine-mediated regulation of corticostriatal synaptic plasticity. Trends Neurosci 30:211–219.
OpenUrl CrossRef PubMed

[27] Calabresi P,

[28] Picconi B,

[29] Tozzi A,

[30] Di Filippo M

[31] ↵
Cooper DM,
Karpen JW,
Fagan KA,
Mons NE
(1998) Ca²⁺-sensitive adenylyl cyclases. Adv Second Messenger Phosphoprotein Res 32:23–51.
OpenUrl PubMed

[32] Cooper DM,

[33] Karpen JW,

[34] Fagan KA,

[35] Mons NE

[36] ↵
Corbit LH,
Muir JL,
Balleine BW
(2001) The role of the nucleus accumbens in instrumental conditioning: evidence of a functional dissociation between accumbens core and shell. J Neurosci 21:3251–3260.
OpenUrl Abstract/FREE Full Text

[37] Corbit LH,

[38] Muir JL,

[39] Balleine BW

[40] ↵
Dalley JW,
Chudasama Y,
Theobald DE,
Pettifer CL,
Fletcher CM,
Robbins TW
(2002) Nucleus accumbens dopamine and discriminated approach learning: interactive effects of 6-hydroxydopamine lesions and systemic apomorphine administration. Psychopharmacology (Berl) 161:425–433.
OpenUrl CrossRef PubMed

[41] Dalley JW,

[42] Chudasama Y,

[43] Theobald DE,

[44] Pettifer CL,

[45] Fletcher CM,

[46] Robbins TW

[47] ↵
Day JJ,
Roitman MF,
Wightman RM,
Carelli RM
(2007) Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat Neurosci 10:1020–1028.
OpenUrl CrossRef PubMed

[48] Day JJ,

[49] Roitman MF,

[50] Wightman RM,

[51] Carelli RM

[52] ↵
Di Ciano P,
Cardinal RN,
Cowell RA,
Little SJ,
Everitt BJ
(2001) Differential involvement of NMDA, AMPA/kainate, and dopamine receptors in the nucleus accumbens core in the acquisition and performance of pavlovian approach behavior. J Neurosci 21:9471–9477.
OpenUrl Abstract/FREE Full Text

[53] Di Ciano P,

[54] Cardinal RN,

[55] Cowell RA,

[56] Little SJ,

[57] Everitt BJ

[58] ↵
Pashler HE
Dickinson A,
Balleine BW
(2002) in Steven's handbook of experimental psychology, The role of learning in the operation of motivational systems, ed Pashler HE (Wiley, New York), Ed 3.

[59] Pashler HE

[60] Dickinson A,

[61] Balleine BW

[62] ↵
Eyny YS,
Horvitz JC
(2003) Opposing roles of D₁ and D₂ receptors in appetitive conditioning. J Neurosci 23:1584–1587.
OpenUrl Abstract/FREE Full Text

[63] Eyny YS,

[64] Horvitz JC

[65] ↵
Gerfen CR,
Miyachi S,
Paletzki R,
Brown P
(2002) D₁ dopamine receptor supersensitivity in the dopamine-depleted striatum results from a switch in the regulation of ERK1/2/MAP kinase. J Neurosci 22:5042–5054.
OpenUrl Abstract/FREE Full Text

[66] Gerfen CR,

[67] Miyachi S,

[68] Paletzki R,

[69] Brown P

[70] ↵
Guillou JL,
Nakata H,
Cooper DM
(1999) Inhibition by calcium of mammalian adenylyl cyclases. J Biol Chem 274:35539–35545.
OpenUrl Abstract/FREE Full Text

[71] Guillou JL,

[72] Nakata H,

[73] Cooper DM

[74] ↵
Pashler HE
Hall G
(2002) in Steven's handbook of experimental psychology, Associative structures in pavlovian and instrumental conditioning, ed Pashler HE (Wiley, New York), Ed 3.

[75] Pashler HE

[76] Hall G

[77] ↵
Hanoune J,
Defer N
(2001) Regulation and role of adenylyl cyclase isoforms. Annu Rev Pharmacol Toxicol 41:145–174.
OpenUrl CrossRef PubMed

[78] Hanoune J,

[79] Defer N

[80] ↵
Iwami G,
Kawabe J,
Ebina T,
Cannon PJ,
Homcy CJ,
Ishikawa Y
(1995) Regulation of adenylyl cyclase by protein kinase A. J Biol Chem 270:12481–12484.
OpenUrl Abstract/FREE Full Text

[81] Iwami G,

[82] Kawabe J,

[83] Ebina T,

[84] Cannon PJ,

[85] Homcy CJ,

[86] Ishikawa Y

[87] ↵
Iwamoto T,
Okumura S,
Iwatsubo K,
Kawabe J,
Ohtsu K,
Sakai I,
Hashimoto Y,
Izumitani A,
Sango K,
Ajiki K,
Toya Y,
Umemura S,
Goshima Y,
Arai N,
Vatner SF,
Ishikawa Y
(2003) Motor dysfunction in type 5 adenylyl cyclase-null mice. J Biol Chem 278:16936–16940.
OpenUrl Abstract/FREE Full Text

[88] Iwamoto T,

[89] Okumura S,

[90] Iwatsubo K,

[91] Kawabe J,

[92] Ohtsu K,

[93] Sakai I,

[94] Hashimoto Y,

[95] Izumitani A,

[96] Sango K,

[97] Ajiki K,

[98] Toya Y,

[99] Umemura S,

[100] Goshima Y,

[101] Arai N,

[102] Vatner SF,

[103] Ishikawa Y

[104] ↵
Kandel ER
(2001) The molecular biology of memory storage: a dialogue between genes and synapses. Science 294:1030–1038.
OpenUrl Abstract/FREE Full Text

[105] Kandel ER

[106] ↵
Kelley AE
(2004) Ventral striatal control of appetitive motivation: role in ingestive behavior and reward-related learning. Neurosci Biobehav Rev 27:765–776.
OpenUrl CrossRef PubMed

[107] Kelley AE

[108] ↵
Kreitzer AC,
Malenka RC
(2007) Endocannabinoid-mediated rescue of striatal LTD and motor deficits in Parkinson's disease models. Nature 445:643–647.
OpenUrl CrossRef PubMed

[109] Kreitzer AC,

[110] Malenka RC

[111] ↵
LeDoux JE
(2000) Emotion circuits in the brain. Annu Rev Neurosci 23:155–184.
OpenUrl CrossRef PubMed

[112] LeDoux JE

[113] ↵
Matsuoka I,
Suzuki Y,
Defer N,
Nakanishi H,
Hanoune J
(1997) Differential expression of type I, II, and V adenylyl cyclase gene in the postnatal developing rat brain. J Neurochem 68:498–506.
OpenUrl CrossRef PubMed

[114] Matsuoka I,

[115] Suzuki Y,

[116] Defer N,

[117] Nakanishi H,

[118] Hanoune J

[119] ↵
Nicol X,
Muzerelle A,
Bachy I,
Ravary A,
Gaspar P
(2005) Spatiotemporal localization of the calcium-stimulated adenylate cyclases, AC1 and AC8, during mouse brain development. J Comp Neurol 486:281–294.
OpenUrl CrossRef PubMed

[120] Nicol X,

[121] Muzerelle A,

[122] Bachy I,

[123] Ravary A,

[124] Gaspar P

[125] ↵
Paxinos G,
Franklin KBJ
(2001) The mouse brain in steroetaxic coordinates (Academic, San Diego), Ed 2.

[126] Paxinos G,

[127] Franklin KBJ

[128] ↵
Reynolds JN,
Wickens JR
(2002) Dopamine-dependent plasticity of corticostriatal synapses. Neural Netw 15:507–521.
OpenUrl CrossRef PubMed

[129] Reynolds JN,

[130] Wickens JR

[131] ↵
Reynolds JN,
Hyland BI,
Wickens JR
(2001) A cellular mechanism of reward-related learning. Nature 413:67–70.
OpenUrl CrossRef PubMed

[132] Reynolds JN,

[133] Hyland BI,

[134] Wickens JR

[135] ↵
Robinson S,
Sandstrom SM,
Denenberg VH,
Palmiter RD
(2005) Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards. Behav Neurosci 119:5–15.
OpenUrl CrossRef PubMed

[136] Robinson S,

[137] Sandstrom SM,

[138] Denenberg VH,

[139] Palmiter RD

[140] ↵
Schultz W
(1998) Predictive reward signal of dopamine neurons. J Neurophysiol 80:1–27.
OpenUrl Abstract/FREE Full Text

[141] Schultz W

[142] ↵
Schultz W
(2002) Getting formal with dopamine and reward. Neuron 36:241–263.
OpenUrl CrossRef PubMed

[143] Schultz W

[144] ↵
Schultz W
(2006) Behavioral theories and the neurophysiology of reward. Annu Rev Psychol 57:87–115.
OpenUrl CrossRef PubMed

[145] Schultz W

[146] ↵
Schultz W,
Dayan P,
Montague PR
(1997) A neural substrate of prediction and reward. Science 275:1593–1599.
OpenUrl Abstract/FREE Full Text

[147] Schultz W,

[148] Dayan P,

[149] Montague PR

[150] ↵
Self DW,
Genova LM,
Hope BT,
Barnhart WJ,
Spencer JJ,
Nestler EJ
(1998) Involvement of cAMP-dependent protein kinase in the nucleus accumbens in cocaine self-administration and relapse of cocaine-seeking behavior. J Neurosci 18:1848–1859.
OpenUrl Abstract/FREE Full Text

[151] Self DW,

[152] Genova LM,

[153] Hope BT,

[154] Barnhart WJ,

[155] Spencer JJ,

[156] Nestler EJ

[157] ↵
Shiflett MW,
Martini RP,
Mauna JC,
Foster RL,
Peet E,
Thiels E
(2008) Cue-elicited reward-seeking requires extracellular signal-regulated kinase activation in the nucleus accumbens. J Neurosci 28:1434–1443.
OpenUrl Abstract/FREE Full Text

[158] Shiflett MW,

[159] Martini RP,

[160] Mauna JC,

[161] Foster RL,

[162] Peet E,

[163] Thiels E

[164] ↵
Valjent E,
Pascoli V,
Svenningsson P,
Paul S,
Enslen H,
Corvol JC,
Stipanovich A,
Caboche J,
Lombroso PJ,
Nairn AC,
Greengard P,
Hervé D,
Girault JA
(2005) Regulation of a protein phosphatase cascade allows convergent dopamine and glutamate signals to activate ERK in the striatum. Proc Natl Acad Sci U S A 102:491–496.
OpenUrl Abstract/FREE Full Text

[165] Valjent E,

[166] Pascoli V,

[167] Svenningsson P,

[168] Paul S,

[169] Enslen H,

[170] Corvol JC,

[171] Stipanovich A,

[172] Caboche J,

[173] Lombroso PJ,

[174] Nairn AC,

[175] Greengard P,

[176] Hervé D,

[177] Girault JA

[178] ↵
Wang H,
Storm DR
(2003) Calmodulin-regulated adenylyl cyclases: cross-talk and plasticity in the central nervous system. Mol Pharmacol 63:463–468.
OpenUrl Abstract/FREE Full Text

[179] Wang H,

[180] Storm DR

[181] ↵
Wickens JR,
Begg AJ,
Arbuthnott GW
(1996) Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro. Neuroscience 70:1–5.
OpenUrl CrossRef PubMed

[182] Wickens JR,

[183] Begg AJ,

[184] Arbuthnott GW

[185] ↵
Wickens JR,
Reynolds JN,
Hyland BI
(2003) Neural mechanisms of reward-related motor learning. Curr Opin Neurobiol 13:685–690.
OpenUrl CrossRef PubMed

[186] Wickens JR,

[187] Reynolds JN,

[188] Hyland BI

[189] ↵
Wong ST,
Athos J,
Figueroa XA,
Pineda VV,
Schaefer ML,
Chavkin CC,
Muglia LJ,
Storm DR
(1999) Calcium-stimulated adenylyl cyclase activity is critical for hippocampus-dependent long-term memory and late phase LTP. Neuron 23:787–798.
OpenUrl CrossRef PubMed

[190] Wong ST,

[191] Athos J,

[192] Figueroa XA,

[193] Pineda VV,

[194] Schaefer ML,

[195] Chavkin CC,

[196] Muglia LJ,

[197] Storm DR

[198] ↵
Wu ZL,
Thomas SA,
Villacres EC,
Xia Z,
Simmons ML,
Chavkin C,
Palmiter RD,
Storm DR
(1995) Altered behavior and long-term potentiation in type I adenylyl cyclase mutant mice. Proc Natl Acad Sci U S A 92:220–224.
OpenUrl Abstract/FREE Full Text

[199] Wu ZL,

[200] Thomas SA,

[201] Villacres EC,

[202] Xia Z,

[203] Simmons ML,

[204] Chavkin C,

[205] Palmiter RD,

[206] Storm DR

[207] ↵
Yin HH,
Knowlton BJ
(2006) The role of the basal ganglia in habit formation. Nat Rev Neurosci 7:464–476.
OpenUrl CrossRef PubMed

[208] Yin HH,

[209] Knowlton BJ

[210] ↵
Yin HH,
Ostlund SB,
Knowlton BJ,
Balleine BW
(2005) The role of the dorsomedial striatum in instrumental conditioning. Eur J Neurosci 22:513–523.
OpenUrl CrossRef PubMed

[211] Yin HH,

[212] Ostlund SB,

[213] Knowlton BJ,

[214] Balleine BW

Main menu

User menu

Search

A cAMP Pathway Underlying Reward Prediction in Associative Learning

Abstract

Introduction

Materials and Methods

Mice

Behavioral procedures

Instrumental conditioning.

Outcome devaluation.

Contingency degradation.

Appetitive pavlovian conditioning.

Aversive pavlovian conditioning.

Immunohistochemistry

Statistical analysis

Results

AC5KO mice exhibit altered distribution of goal-directed behaviors in operant tasks

AC5KO mice lack reward prediction in appetitive pavlovian conditioning

AC5KO mice form normal action–outcome contingencies

AC5KO mice show normal aversive pavlovian conditioning

AC5KO mice have impaired D₁ receptor-mediated ERK activation in the NAcc

Discussion

Footnotes

References

In this issue

Citation Manager Formats

Responses to this article

Jump to comment:

Related Articles

Cited By...

More in this TOC Section

Articles

Behavioral/Systems/Cognitive

Main menu

User menu

Search

A cAMP Pathway Underlying Reward Prediction in Associative Learning

Abstract

Introduction

Materials and Methods

Mice

Behavioral procedures

Instrumental conditioning.

Outcome devaluation.

Contingency degradation.

Appetitive pavlovian conditioning.

Aversive pavlovian conditioning.

Immunohistochemistry

Statistical analysis

Results

AC5KO mice exhibit altered distribution of goal-directed behaviors in operant tasks

AC5KO mice lack reward prediction in appetitive pavlovian conditioning

AC5KO mice form normal action–outcome contingencies

AC5KO mice show normal aversive pavlovian conditioning

AC5KO mice have impaired D1 receptor-mediated ERK activation in the NAcc

Discussion

Footnotes

References

In this issue

Citation Manager Formats

Jump to section

Responses to this article

Jump to comment:

Related Articles

Cited By...

More in this TOC Section

Articles

Behavioral/Systems/Cognitive

AC5KO mice have impaired D₁ receptor-mediated ERK activation in the NAcc